User Experience on mobile might not be great yet, but I'm working on it.

Your first time on this page? Allow me to give some explanations.

Awesome Dive into Machine Learning

Dive into Machine Learning with Python Jupyter notebook and scikit-learn!

Here you can see meta information about this topic like the time we last updated this page, the original creator of the awesome list and a link to the original GitHub repository.

Last Update: Sept. 21, 2021, 9:12 p.m.

Thank you hangtwenty & contributors
View Topic on GitHub:
hangtwenty/dive-into-machine-learning

Search for resources by name or description.
Simply type in what you are looking for and the results will be filtered on the fly.

Further filter the resources on this page by type (repository/other resource), number of stars on GitHub and time of last commit in months.

Dive into Machine Learning

My coworkers often ask me for Python learning resources. Here are some picks. Many skill levels, emphasis on beginner and intermediate.

353
80
2y 117d
CC-BY-4.0

Machine learning (ML) is the scientific study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on patterns and inference instead."

Tools you'll need

Python 3 is the best option.

provides a rich architecture for interactive computing.

Let's go!

A Few Useful Things to Know about Machine Learning

Quoting Domingos: "Suppose you’ve constructed the best set of features you can, but the classifiers you’re getting are still not accurate enough. What can you do now? There are two main choices: design a better learning algorithm, or gather more data. [...] As a rule of thumb, a dumb algorithm with lots and lots of data beats a clever one with modest amounts of it. (After all, machine learning is all about letting data do the heavy lifting.)"

Jargon note

Just about time for a break...

episode, and listen to that soon.** It supports what we read from Domingos. Ryan Adams talks about starting simple, as we discussed above. Adams also stresses the importance of feature engineering. Feature engineering is an exercise of the "knowledge" Domingos writes about. In a later episode, they share many concrete tips for feature engineering.

Play to learn

538 Election Forecasting Model

279
120
5y 9m
n/a

Repository of teaching materials, code, and data for my data analysis and machine learning projects.

4.79K
1.81K
1y 4m
n/a

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

20.26K
6.45K
2y 7m
n/a

Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

14.36K
4.03K
1y 24d
n/a

Using Titanic data, "Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques."

Machine Learning](https://www.coursera.org/learn/machine-learning) is a popular and esteemed free online course. I've seen it recommended often. And emphatically.**

Tips for studying

videos. This is just about how to study in general. In the course, they advocate the learn-by-doing approach, as we're doing here. You'll get various other tips that are easy to apply, but go a long way to make your time investment more effective.

Other courses

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

20.26K
6.45K
2y 7m
n/a

the best machine learning tutorials on the web

421
249
10y 5m
n/a

starts with what we've already covered, then continues on at a comfortable place. After the videos you could do Markham's General Assembly's Data Science course. Interactive. Markham's course is also offered in-person in Washington, DC.

online course based on Data 8 is now offered via edX too.

Includes Coursera's Data Science Specialization with 9 courses in it. The Specialization certificate isn't free, but you can take the courses 1-by-1 for free if you don't care about the certificate. The survey also covers Harvard CS109 which I've seen recommended elsewhere.

Supplement: Learning Pandas well

Supplement: Cheat Sheets

More Data Science materials

Bayesian Statistics and Machine Learning

A python tutorial on bayesian modeling techniques (PyMC3)

2.27K
393
4y 4m
n/a

A machine learning / bayesian inference engine assigning attributes to objects

14
6
1y 53d
MIT

_ Made with a "computation/understanding-first, mathematics-second point of view." It's available in print too!

Risks

Welcome to the Danger Zone

This guide can't tell you how you'll know you've "made it" into Machine Learning competence ... let alone expertise. It's hard to evaluate proficiency without schools or other institutions. This is a common problem for self-taught people.

Towards Expertise

Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1

3.71K
30.47K
5y 6m
n/a

Ask for Peer Review

:bow: A note about Machine Learning and User Experience (UX)

:bow: A note about Machine Learning and Security (InfoSec, AppSec)

Machine Learning for Cyber Security

4.25K
1.3K
8m
n/a

AISecurity

1.03K
154
1y 5m
n/a

A curated list of awesome adversarial machine learning resources

1.48K
268
10m
n/a

Deep Learning

ARCHIVED: Contains historical course materials/Homework materials for the FREE MOOC course on "Creative Applications of Deep Learning w/ Tensorflow" #CADL

1.48K
761
3y 4m
Apache-2.0

An interactive book about deep learning

courses on Deep Learning](https://www.coursera.org/specializations/deep-learning)!** There five courses, as part of the Deep Learning Specialization on Coursera. These courses are part of his new venture, deeplearning.ai

Google's fast-paced, practical introduction to machine learning.

Yet halfway through the book, it contains satisfying math content on how to think about actual deep learning.

answered by Greg Brockman (Co-Founder & CTO at OpenAI, previously CTO at Stripe)

"Big" Data?

Finding Open-Source Libraries

A curated list of awesome Machine Learning frameworks, libraries and software.

48.48K
12.13K
7m
n/a

Curated decibans of scientific programming resources in Python.

753
170
1y 21d
n/a

Lore makes machine learning approachable for Software Engineers and maintainable for Machine Learning Researchers

1.5K
120
1y 4m
MIT

Curated decibans of Julia programming language.

1.07K
179
7m
n/a

TensorFlow is an Open Source Software Library for Machine Intelligence

Alternative ways to "Dive into Machine Learning"

Repository of teaching materials, code, and data for my data analysis and machine learning projects.

4.79K
1.81K
1y 4m
n/a

A complete daily plan for studying to become a machine learning engineer.

24.75K
5.81K
9m
CC-BY-SA-4.0

by Sebastian Raschka. A selection of the core online courses and books for getting started with machine learning and gaining expert knowledge. It contextualizes Raschka's own book, Python Machine Learning (which I would have linked to anyway!) See also pattern_classification GitHub repository maintained by the author, which contains IPython notebooks about various machine learning algorithms and various data science related resources.

Google's fast-paced, practical introduction to machine learning.

Amazon have open up their internal training to the public and also offer certification. 30 courses - 45+ hours of content.

is another good introduction, perhaps better if you're more familiar with Java or Scala. It introduces machine learning for a developer audience using Smile, a machine learning library that can be used both in Java and Scala.

is a journal devoted to clear and interactive explanations of the lastest research in machine learning. They offer an alternative to traditional academic publishing that promotes accessibility and transparency in the field.