Machine Learning Systems

Principles and Practices of Engineering Artificially Intelligent Systems

📖 Read Online • Tiny🔥Torch • 📄 Download PDF • 📓 Download EPUB • 🌐 Explore Ecosystem

📚 Hardcopy edition coming 2026 with MIT Press.

Mission

The world is rushing to build AI systems. It is not engineering them.

That gap is what we mean by AI engineering.

AI engineering is the discipline of building efficient, reliable, safe, and robust intelligent systems that operate in the real world, not just models in isolation.

Our mission: Establish AI engineering as a foundational discipline, alongside software engineering and computer engineering, by teaching how to design, build, and evaluate end to end intelligent systems. The long term impact of AI will be shaped by engineers who can turn ideas into working, dependable systems.

What’s in this repo

This repository is the open learning stack for AI systems engineering.

It includes the textbook source, TinyTorch, hardware kits, and upcoming co-labs that connect principles to runnable code and real devices.

Start Here

Choose a path based on your goal.

READ Start with the textbook. Try Chapter 1 and the Benchmarking chapter.

BUILD Start TinyTorch with the getting started guide. Begin with Module 01 and work up from CNNs to transformers and the MLPerf benchmarks.

DEPLOY Pick a hardware kit and run the labs on Arduino, Raspberry Pi, and other edge devices.

CONNECT Say hello in Discussions. We will do our best to reply.

The Learning Stack

The learning stack below shows how the textbook connects to hands on work and deployment. Read the textbook, then pick your path:

┌───────────────────────────────────────────────────────────────────────────────┐
│                                                                               │
│                           MACHINE LEARNING SYSTEMS                            │
│                              Read the Textbook                                │
│                                                                               │
│                    Theory • Concepts • Best Practices                         │
│                                                                               │
└───────────────────────────────────────┬───────────────────────────────────────┘
                                        │
                          ┌─────────────┼─────────────┐
                          │             │             │
                          ▼             ▼             ▼
┌───────────────────────────────────────────────────────────────────────────────┐
│                            HANDS-ON ACTIVITIES                                │
│                           (pick one or all)                                   │
│                                                                               │
│     ┌─────────────────┐      ┌─────────────────┐      ┌─────────────────┐     │
│     │                 │      │                 │      │                 │     │
│     │    SOFTWARE     │      │    TINYTORCH    │      │    HARDWARE     │     │
│     │    CO-LABS      │      │    FRAMEWORK    │      │      LABS       │     │
│     │                 │      │                 │      │                 │     │
│     │ EXPLORE         │      │ BUILD           │      │ DEPLOY          │     │
│     │                 │      │                 │      │                 │     │
│     │ Run controlled  │      │ Understand      │      │ Engineer under  │     │
│     │ experiments on  │      │ frameworks by   │      │ real constraints│     │
│     │ latency, memory,│      │ implementing    │      │ memory, power,  │     │
│     │ energy, cost    │      │ them            │      │ timing, safety  │     │
│     │                 │      │                 │      │                 │     │
│     │ (coming 2026)   │      │                 │      │ Arduino, Pi     │     │
│     └─────────────────┘      └─────────────────┘      └─────────────────┘     │
│                                                                               │
│           EXPLORE                  BUILD                   DEPLOY             │
│                                                                               │
└───────────────────────────────────────┬───────────────────────────────────────┘
                                        │
                                        ▼
┌───────────────────────────────────────────────────────────────────────────────┐
│                                                                               │
│                                  AI OLYMPICS                                  │
│                                 Prove Mastery                                 │
│                                                                               │
│       Compete across all tracks • University teams • Public leaderboards      │
│                                                                               │
│                                (coming 2026)                                  │
│                                                                               │
└───────────────────────────────────────────────────────────────────────────────┘

	Component	What You Do	Link
READ	📖 Textbook	Understand ML systems concepts	book/
EXPLORE	🔮 Software Co-Labs	Run controlled experiments on latency, memory, energy, cost	Coming 2026
BUILD	🔥 TinyTorch	Understand frameworks by implementing them	tinytorch/
DEPLOY	🔧 Hardware Kits	Engineer under real constraints: memory, power, timing, safety	kits/
PROVE	🏆 AI Olympics	Compete and benchmark across all tracks	Coming 2026

What each path teaches:

EXPLORE teaches why — Understand tradeoffs. Change batch sizes, precision, model architectures and see how latency, memory, and accuracy shift.
BUILD teaches how — Understand internals. Implement autograd, optimizers, and attention from scratch to see how TensorFlow and PyTorch actually work.
DEPLOY teaches where — Understand constraints. Face real memory limits, power budgets, and latency requirements on actual hardware.

What You Will Learn

This textbook teaches you to think at the intersection of machine learning and systems engineering. Each chapter bridges algorithmic concepts with the infrastructure that makes them work in practice.

The ML ↔ Systems Bridge

ML Concept	Systems Concept	What You Learn
Model parameters	Memory constraints	How to fit large models on resource-limited devices
Inference latency	Hardware acceleration	How GPUs, TPUs, and accelerators execute neural networks
Training convergence	Compute efficiency	How mixed-precision and optimization techniques reduce cost
Model accuracy	Quantization and pruning	How to compress models while preserving performance
Data requirements	Pipeline infrastructure	How to build efficient data loading and preprocessing
Model deployment	MLOps practices	How to monitor, version, and update models in production
Privacy constraints	On-device learning	How to train and adapt models without sending data to the cloud

Book Structure

Part	Focus	Chapters
I. Foundations	Core concepts	Introduction, ML Systems, DL Primer, Architectures
II. Design	Building blocks	Workflow, Data Engineering, Frameworks, Training
III. Performance	Making it fast	Efficient AI, Optimizations, HW Acceleration, Benchmarking
IV. Deployment	Making it work	MLOps, On-device Learning, Privacy, Robustness
V. Trust	Making it right	Responsible AI, Sustainable AI, AI for Good
VI. Frontiers	What's next	Emerging trends and future directions

What Makes This Different

This is a living textbook. We keep it updated as the field grows, with community input along the way.

AI may feel like it is moving at lightning speed, but the engineering building blocks that make it work do not change as quickly as the headlines. This project is built around those stable foundations.

Think of it like LEGO. New sets arrive all the time, but the bricks themselves stay the same. Once you learn how the bricks fit together, you can build anything. Here, those "AI bricks" are the solid systems principles that make AI work.

Whether you are reading a chapter, running a lab, or sharing feedback, you are helping make these ideas more accessible to the next learner.

Research to Teaching Loop

We use the same loop for research and teaching: define the system problem, build a reference implementation, benchmark it, then turn it into curriculum and tooling so others can reproduce and extend it.

Loop Step	Research Artifacts	Teaching Artifacts
Measure	Benchmarks, suites, metrics	Benchmarking chapter, assignments
Build	Reference systems, compilers, runtimes	TinyTorch modules, co-labs
Deploy	Hardware targets, constraints, reliability	Hardware labs, kits

Support This Work

We are working toward 1 million learners by 2030 so that AI engineering becomes a shared, teachable discipline, not a collection of isolated practices. Every star, share, and contribution helps move this effort forward.

Why GitHub Stars Matter

What gets measured gets improved.

Each star is a learner, educator, or supporter who believes AI systems should be engineered with rigor and real world constraints in mind.

1 learner → 10 learners → 100 learners → 1,000 learners → 10,000 learners → 100,000 learners → 1M learners

Stars are not the goal. They are a signal.

A visible, growing community makes it easier for universities, foundations, and industry partners to adopt this material, donate hardware, and fund workshops. That momentum lowers the barrier for the next institution, the next classroom, and the next cohort of learners.

Support raised through this signal flows into Open Collective and funds concrete outcomes such as TinyML4D workshops, hardware kits for underserved classrooms, and the infrastructure required to keep this resource free and open.

One click can unlock the next classroom, the next contributor, and the next generation of AI engineers.

Fund the Mission

All contributions go to Open Collective, a transparent fund that supports educational outreach.

Community and Resources

Resource	Description
📖 Textbook	Interactive online textbook
🔥 TinyTorch	Build ML frameworks from scratch
🔧 Hardware Kits	Deploy to Arduino, Raspberry Pi, edge devices
🌐 Ecosystem	Resources, workshops, and community
💬 Discussions	Questions and ideas

Contributing

We welcome contributions to the book, TinyTorch, and hardware kits!

I want to...	Go here
Fix a typo or improve a chapter	book/docs/CONTRIBUTING.md
Add a TinyTorch module or fix a bug	tinytorch/CONTRIBUTING.md
Improve hardware labs	kits/README.md
Report an issue	GitHub Issues
Ask a question	GitHub Discussions

Citation & License

Citation

@inproceedings{reddi2024mlsysbook,
  title        = {MLSysBook.AI: Principles and Practices of Machine Learning Systems Engineering},
  author       = {Reddi, Vijay Janapa},
  booktitle    = {2024 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ ISSS)},
  pages        = {41--42},
  year         = {2024},
  organization = {IEEE},
  url          = {https://mlsysbook.org}
}

License

This project uses a dual-license structure:

Component	License	What It Means
Book content	CC BY-NC-ND 4.0	Share freely with attribution; no commercial use; no derivatives
TinyTorch code	Apache 2.0	Use, modify, and distribute freely; includes patent protection

The textbook content (chapters, figures, explanations) is educational material that should circulate with attribution and without commercial exploitation. The software framework is a tool designed to be easy for anyone to use, modify, or integrate into their own projects.

Contributors

Thanks goes to these wonderful people who have contributed to making this resource better for everyone!

Legend: 🪲 Bug Hunter · 🧑‍💻 Code Contributor · ✍️ Doc Wizard · 🎨 Design Artist · 🧠 Idea Spark · 🔎 Code Reviewer · 🧪 Test Tinkerer · 🛠️ Tool Builder

📖 Textbook Contributors

_{Vijay Janapa Reddi} 🪲 🧑‍💻 🎨 ✍️ 🧠 🔎 🧪 🛠️	_{Marcelo Rovai} 🧑‍💻 🎨 🧪	_{Gabriel Amazonas} 🪲 ✍️ 🧠	_{Kai Kleinbard} 🧑‍💻 🛠️	_{Didier Durand} ✍️ 🪲	_{Zeljko Hrcek} 🧑‍💻	_{Jason Jabbour} ✍️	_{Ikechukwu Uchendu} ✍️	_{Naeem Khoshnevis} ✍️
_{Sara Khosravi} ✍️	_{Douwe den Blanken} ✍️	_{Jeffrey Ma} ✍️	_{shanzehbatool} ✍️	_Elias ✍️	_{Jared Ping} ✍️	_{Itai Shapira} ✍️	_{Maximilian Lam} ✍️	_{Jayson Lin} ✍️
_{Sophia Cho} ✍️	_Andrea ✍️	_{Alex Rodriguez} ✍️	_{Korneel Van den Berghe} ✍️	_Nimo ✍️	_{Colby Banbury} ✍️	_{Zishen Wan} ✍️	_{Mark Mazumder} ✍️	_{Abdulrahman Mahmoud} ✍️
_{Divya Amirtharaj} ✍️	_{Srivatsan Krishnan} ✍️	_marin-llobet ✍️	_{Aghyad Deeb} ✍️	_{Haoran Qiu} ✍️	_{Emil Njor} ✍️	_{ELSuitorHarvard} ✍️	_kaiM0ves ✍️	_oishib ✍️
_{Jared Ni} ✍️	_{Aditi Raju} ✍️	_{Michael Schnebly} ✍️	_{Thuong Duong} ✍️	_{Yu-Shun Hsiao} ✍️	_{Henry Bae} ✍️	_{Eimhin Laverty} ✍️	_{Jae-Won Chung} ✍️	_{Shvetank Prakash} ✍️
_{Marco Zennaro} ✍️	_{Arya Tschand} ✍️	_{Andrew Bass} ✍️	_{Pong Trairatvorakul} ✍️	_{Eura Nofshin} ✍️	_{Matthew Stewart} ✍️	_{Emeka Ezike} ✍️	_jianqingdu ✍️	_{Jennifer Zhou} ✍️
_{The Random DIY} ✍️	_{Fatima Shah} ✍️	_{Bruno Scaglione} ✍️	_Allen-Kuang ✍️	_Tess314 ✍️	_{Tauno Erik} ✍️	_gnodipac886 ✍️	_{Sercan Aygün} ✍️	_{TheHiddenLayer} ✍️
_{Gauri Jain} ✍️	_{Fin Amin} ✍️	_{Alex Oesterling} ✍️	_{Abenezer Angamo} ✍️	_{Baldassarre Cesarano} ✍️	_{Jahnic Beck} ✍️	_{अरनव शुक्ला \| Arnav Shukla} ✍️	_Rin ✍️	_{Bilge Acun} ✍️
_{Andy Cheng} ✍️	_{Aritra Ghosh} ✍️	_{abigailswallow} ✍️	_{Yang Zhou} ✍️	_{JEON HYUNJUN(Luciano)} ✍️	_{Emmanuel Rassou} ✍️	_{Jason Yik} ✍️	_{Jessica Quaye} ✍️	_{Cursor Agent} ✍️
_{happyappledog} ✍️	_Snuggs ✍️	_{Sam Wilcock} ✍️	_{Shreya Johri} ✍️	_{Sonia Murthy} ✍️	_{Costin-Andrei Oncescu} ✍️	_{formlsysbookissue} ✍️	_{Annie Laurie Cook} ✍️	_{Parampreet Singh} ✍️
_{Vijay Edupuganti} ✍️	_{Jothi Ramaswamy} ✍️	_{Batur Arslan} ✍️	_{Curren Iyer} ✍️	_{Edward Jin} ✍️	_{Tess Watt} ✍️	_bluebaer7 ✍️	_yanjingl ✍️	_a-saraf ✍️
_songhan ✍️	_jvijay ✍️	_Zishen ✍️	_{Kristian Radoš} ✍️	_{Dang Truong} 🧑‍💻

🔥 TinyTorch Contributors

_{Vijay Janapa Reddi} 🪲 🧑‍💻 🎨 ✍️ 🧠 🔎 🧪 🛠️	_kai 🪲 🧑‍💻 🎨 ✍️ 🧪	_{Dang Truong} 🪲 🧑‍💻 ✍️ 🧪	_{Didier Durand} 🪲 🧑‍💻 ✍️	_{Karthik Dani} 🪲 🧑‍💻	_{Avik De} 🪲 🧪	_rnjema 🧑‍💻 🛠️	_joeswagson 🧑‍💻 🛠️	_{Amir Alasady} 🪲
_jettythek 🧑‍💻	_Takosaga 🪲	_wzz 🪲	_{Ng Bo Lin} ✍️	_keo-dara 🪲	_{Wayne Norman} 🪲	_{Ilham Rafiqin} 🪲

🛠️ Hardware Kits Contributors

_{Vijay Janapa Reddi}
🪲 🧑‍💻 🎨 ✍️ 🧪 🛠️

_{Marcelo Rovai}
✍️ 🧑‍💻 🎨

🧪 Labs Contributors

_{Vijay Janapa Reddi}
🧑‍💻 🎨 ✍️

⭐ Star us on GitHub • ✉️ Subscribe • 💬 Join discussions • 🌐 Visit mlsysbook.ai

Made with ❤️ for AI engineers
in the making, around the world 🌎

Name		Name	Last commit message	Last commit date
Latest commit History 9,403 Commits
.github		.github
.vale/styles/textbook		.vale/styles/textbook
README		README
_brand		_brand
binder		binder
book		book
kits		kits
labs		labs
tinytorch		tinytorch
.all-contributorsrc		.all-contributorsrc
.codespell-ignore-words.txt		.codespell-ignore-words.txt
.envrc		.envrc
.gitignore		.gitignore
.nojekyll		.nojekyll
.pre-commit-config.yaml		.pre-commit-config.yaml
.yamllint		.yamllint
CITATION.bib		CITATION.bib
CNAME		CNAME
LICENSE.md		LICENSE.md
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Systems

Mission

What’s in this repo

Start Here

The Learning Stack