Awesome Software Engineering for Machine Learning

A curated list of articles that cover the software engineering best practices for building machine learning applications.

Last Update: Oct. 23, 2021, 12:06 a.m.

Thank you SE-ML & contributors
View Topic on GitHub: SE-ML/awesome-seml

Broad Overviews

Data Management

Model Training

Deployment and Operation

Social Aspects

Governance

Tooling

Algorithms for outlier, adversarial and drift detection

Stars: 715 · Forks: 71 · Last commit: 4m ago · License: Apache-2.0

Reproducible Rapid Research for Neural Architecture Search (NAS)

Stars: 174 · Forks: 40 · Last commit: 1y 9d ago · License: n/a

Always know what to expect from your data.

Stars: 2.86K · Forks: 315 · Last commit: 1y 9d ago · License: Apache-2.0

A thoughtful approach to hyperparameter management.

Stars: 117 · Forks: 5 · Last commit: 11m ago · License: MIT

Label Studio is a multi-type data labeling and annotation tool with a standardized output format.

Stars: 5.04K · Forks: 456 · Last commit: 8m ago · License: Apache-2.0

The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large-scale machine learning workflows.

Stars: 98 · Forks: 8 · Last commit: 1y 38d ago · License: BSD-2-Clause

A tool that leverages rich metadata and lineage information in MLMD to build a model card.

Stars: 86 · Forks: 14 · Last commit: 1y 10d ago · License: Apache-2.0

A Sklearn-like Framework for Hyperparameter Tuning and AutoML in Deep Learning projects. Finally have the right abstractions and design patterns to properly do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.

Stars: 363 · Forks: 40 · Last commit: 9m ago · License: Apache-2.0

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Stars: 8.34K · Forks: 956 · Last commit: 1y 42d ago · License: Apache-2.0

REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets (https://arxiv.org/abs/2004.07999)

Stars: 71 · Forks: 13 · Last commit: 13d ago · License: MIT

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models.

Stars: 2.35K · Forks: 517 · Last commit: 4m ago · License: Apache-2.0

A library for exploring and validating machine learning data.

Stars: 553 · Forks: 104 · Last commit: 4m ago · License: Apache-2.0

Airflow is a platform to programmatically author, schedule and monitor workflows.

DVC is a data and ML experiments management tool.

Robust visualizations to aid in understanding machine learning datasets.

Replaces large files such as datasets with text pointers inside Git.

A platform for data scientists who want to build and experiment with ML pipelines.

A platform to manage the ML lifecycle, including experimentation, reproducibility and deployment. It is framework and language agnostic; take a look at all the built-in integrations.

Experiment tracking tool bringing organization and collaboration to data science projects.

An inclusive movement to build an open, organized, online ecosystem for machine learning.

A machine learning framework for Spark.

TensorFlow's Visualization Toolkit.

An end-to-end platform for deploying production ML pipelines.

Experiment tracking, model optimization, and dataset versioning.