User Experience on mobile might not be great yet, but I'm working on it.

Your first time on this page? Allow me to give some explanations.

Awesome Dive into Machine Learning

Dive into Machine Learning with Python Jupyter notebook and scikit-learn!

Here you can see meta information about this topic like the time we last updated this page, the original creator of the awesome list and a link to the original GitHub repository.

Last Update: Dec. 1, 2020, 6:05 a.m.

Thank you hangtwenty & contributors
View Topic on GitHub:
hangtwenty/dive-into-machine-learning

Search for resources by name or description.
Simply type in what you are looking for and the results will be filtered on the fly.

Further filter the resources on this page by type (repository/other resource), number of stars on GitHub and time of last commit in months.

Dive into Machine Learning

My coworkers often ask me for Python learning resources. Here are some picks. Many skill levels, emphasis on beginner and intermediate.

353
80
1y 6m
CC-BY-4.0

Machine learning (ML) is the scientific study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on patterns and inference instead."

Tools you'll need

Python 3 is the best option.

provides a rich architecture for interactive computing.

Let's go!

A Few Useful Things to Know about Machine Learning

Quoting Domingos: "Suppose you’ve constructed the best set of features you can, but the classifiers you’re getting are still not accurate enough. What can you do now? There are two main choices: design a better learning algorithm, or gather more data. [...] As a rule of thumb, a dumb algorithm with lots and lots of data beats a clever one with modest amounts of it. (After all, machine learning is all about letting data do the heavy lifting.)"

Jargon note

Just about time for a break...

episode, and listen to that soon.** It supports what we read from Domingos. Ryan Adams talks about starting simple, as we discussed above. Adams also stresses the importance of feature engineering. Feature engineering is an exercise of the "knowledge" Domingos writes about. In a later episode, they share many concrete tips for feature engineering.

Play to learn

538 Election Forecasting Model

277
122
4y 12m
n/a

Repository of teaching materials, code, and data for my data analysis and machine learning projects.

4.79K
1.81K
7m
n/a

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

19.84K
6.32K
1y 9m
n/a

Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

14.36K
4.03K
94d
n/a

Using Titanic data, "Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques."

Machine Learning](https://www.coursera.org/learn/machine-learning) is a popular and esteemed free online course. I've seen it recommended often. And emphatically.**

Tips for studying

videos. This is just about how to study in general. In the course, they advocate the learn-by-doing approach, as we're doing here. You'll get various other tips that are easy to apply, but go a long way to make your time investment more effective.

Other courses

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

19.84K
6.32K
1y 9m
n/a

the best machine learning tutorials on the web

421
247
9y 7m
n/a

starts with what we've already covered, then continues on at a comfortable place. After the videos you could do Markham's General Assembly's Data Science course. Interactive. Markham's course is also offered in-person in Washington, DC.

online course based on Data 8 is now offered via edX too.

Includes Coursera's Data Science Specialization with 9 courses in it. The Specialization certificate isn't free, but you can take the courses 1-by-1 for free if you don't care about the certificate. The survey also covers Harvard CS109 which I've seen recommended elsewhere.

Supplement: Learning Pandas well

Supplement: Cheat Sheets

More Data Science materials

Bayesian Statistics and Machine Learning

A python tutorial on bayesian modeling techniques (PyMC3)

2.25K
388
3y 7m
n/a

_ Made with a "computation/understanding-first, mathematics-second point of view." It's available in print too!

Risks

Welcome to the Danger Zone

This guide can't tell you how you'll know you've "made it" into Machine Learning competence ... let alone expertise. It's hard to evaluate proficiency without schools or other institutions. This is a common problem for self-taught people.

Towards Expertise

Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1

3.65K
30.2K
4y 8m
n/a

Ask for Peer Review

:bow: A note about Machine Learning and User Experience (UX)

:bow: A note about Machine Learning and Security (InfoSec, AppSec)

Machine Learning for Cyber Security

4.07K
1.24K
15d
n/a

AISecurity

1.01K
152
7m
n/a

A curated list of awesome adversarial machine learning resources

1.45K
261
5d
n/a

Deep Learning

ARCHIVED: Contains historical course materials/Homework materials for the FREE MOOC course on "Creative Applications of Deep Learning w/ Tensorflow" #CADL

1.48K
766
2y 6m
Apache-2.0

An interactive book about deep learning

courses on Deep Learning](https://www.coursera.org/specializations/deep-learning)!** There five courses, as part of the Deep Learning Specialization on Coursera. These courses are part of his new venture, deeplearning.ai

Google's fast-paced, practical introduction to machine learning.

Yet halfway through the book, it contains satisfying math content on how to think about actual deep learning.

answered by Greg Brockman (Co-Founder & CTO at OpenAI, previously CTO at Stripe)

"Big" Data?

Finding Open-Source Libraries

A curated list of awesome Machine Learning frameworks, libraries and software.

47.43K
11.89K
20d
n/a

Curated decibans of scientific programming resources in Python.

753
170
91d
n/a

Lore makes machine learning approachable for Software Engineers and maintainable for Machine Learning Researchers

1.5K
120
6m
MIT

Curated decibans of Julia programming language.

1.04K
175
4d
n/a

TensorFlow is an Open Source Software Library for Machine Intelligence

Alternative ways to "Dive into Machine Learning"

Repository of teaching materials, code, and data for my data analysis and machine learning projects.

4.79K
1.81K
7m
n/a

A complete daily plan for studying to become a machine learning engineer.

24.47K
5.75K
61d
CC-BY-SA-4.0

by Sebastian Raschka. A selection of the core online courses and books for getting started with machine learning and gaining expert knowledge. It contextualizes Raschka's own book, Python Machine Learning (which I would have linked to anyway!) See also pattern_classification GitHub repository maintained by the author, which contains IPython notebooks about various machine learning algorithms and various data science related resources.

Google's fast-paced, practical introduction to machine learning.

Amazon have open up their internal training to the public and also offer certification. 30 courses - 45+ hours of content.

is another good introduction, perhaps better if you're more familiar with Java or Scala. It introduces machine learning for a developer audience using Smile, a machine learning library that can be used both in Java and Scala.

is a journal devoted to clear and interactive explanations of the lastest research in machine learning. They offer an alternative to traditional academic publishing that promotes accessibility and transparency in the field.