🫎 CHE1147: Chemical Data Science and Engineering

Our course combines {Data + Chemistry + Engineering}. We’ll explore how machine learning and data science can solve real chemical engineering problems with a mix of:

Lectures with chemical examples and datasets 📊
Hands-on sessions 👩‍💻
Group projects 💡

This repo is where lectures, tutorials, assignments, and project guidelines will live for our course.

🗂 Repo Map

Here’s where to find stuff:

lectures/ → demo notebooks
tutorials/ → in-class hands-on exercises
projects/ → group project information
data/ → small sample datasets used in tutorials

👨‍🏫 Lectures

Week	Topic	Slides
Week 01	Introduction to Machine Learning & Course Overview	Open in Google Slides
Week 02	Data, Representation, and Exploratory Data Analysis	Open in Google Slides
Week 03	Supervised Learning Workflow	Open in Google Slides
Week 04	Modelling well: complexity, regularization and model selection	Open in Google Slides
Week 05	Model Zoo: Different Ways of Learning from Data	Open in Google Slides
Week 06	Logistic Regression & Classification	Open in Google Slides
Week 07	Unsupervised Learning	Open in Google Slides

📚 Tutorials

Week	Tutorial	Colab Link
W01	1. Python Refresher
W01	2. Linear Algebra
W02	3. RDKit and EDA
W03–06	4. Supervised Learning — Regression
W07	5. Supervised Learning — Classification

⚙️ Setup

To reproduce the Python environment:

conda env create -f environment.yml
conda activate che1147
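
After activating the environment, a quick import check can confirm the install worked. The sketch below is only an illustration: the package names in `expected` are assumptions based on the tutorial topics (RDKit, EDA, supervised learning), not a copy of `environment.yml`, and `missing_packages` is a hypothetical helper that is not part of this repo.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be found in the active environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed package list; check environment.yml for the real dependencies.
    expected = ["numpy", "pandas", "sklearn", "rdkit"]
    missing = missing_packages(expected)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All expected packages were found.")
```

If anything is reported missing, re-running the `conda env create` step or checking that `che1147` is the active environment is a reasonable first thing to try.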

💬 Feedback, Suggestions, & Support

Tell me what to improve, or send any other request, using this totally anonymous form:

Or open a GitHub issue if you find a bug, typo, or broken link:

Found this useful? Please consider starring the repo 🌟 — it helps others discover the project and shows your support!

🤝 Contribute

We welcome:

🐛 Bug reports (broken notebook cells, path issues, typos)
📚 Content improvements (clearer explanations, new examples)
🧪 New exercises/tutorials/content (small, focused PRs work best)

🫸💥🫷Developers and Maintainers

This course is being created by the AI4ChemS team and TAs:

A shout-out as well to our friends at the Chemical Cognition Lab 👋.
They run CHE1148, which builds on this course. CHE1147 is the foundation; CHE1148 takes it further into neural networks and representation learning. We’ve been inspired by each other’s ideas along the way.

🙏 Acknowledgements

The content, examples, figures, and ideas are inspired by many textbooks and other open courses, which we will reference properly. The main references include:

Christopher Bishop’s Pattern Recognition and Machine Learning (Springer, 2006)
Simon Prince’s Understanding Deep Learning (Cambridge University Press, 2023)
Kevin M. Jablonka for ML-MolSim
