Code Reference: The Project Catalog

The prose in this book teaches the why and the shape of each idea. The code — complete, runnable, tested — lives in a companion catalog of 52 from-scratch systems, linked throughout the chapters with the “Build it →” pattern. This appendix is the index to that catalog.

Each project implements one or more of the concepts a chapter develops, at production scale rather than illustration scale. Where a chapter says “Build it →”, it points here; where a project needs background, its README points back into these chapters. The full bidirectional map of which chapter ↔︎ which project lives in CONCEPT_TO_PROJECT_MAP.md.

All projects are under 06-real-world-projects/ in the repository. Python projects use FastAPI + pytest; Rust projects use trait-based design with Result-based error handling and Criterion benchmarks; the one Go system uses gRPC + protobuf.

Foundation & Backend (1–10)

Job queues, caching, microservices, data platforms, and ML training orchestration — the systems most chapters in Parts I, II, and IV build on.

01 · Distributed Job Queue
02 · Microservice Platform — the repo’s Go system (gRPC + Kong)
03 · High-Performance Cache
04 · ML Training Orchestrator
05 · SaaS Web Platform
06 · Async Runtime
07 · Data Lakehouse
08 · Streaming Platform
09 · Data Observability
10 · Warehouse Semantic Layer

Distributed Systems (11–20)

Consensus, networking, compilers, query engines, and SIMD analytics — the systems-programming tier, most heavily exercised by Part II’s Rust and C++ field guides.

11 · Distributed KV Store (Raft)
12 · Distributed Log System
13 · Service Mesh — mTLS + SPIFFE
14 · Network Stack
15 · Minimal OS Kernel
16 · CRDT Collaboration — TypeScript client
17 · Columnar Query Engine
18 · Compiler / Interpreter
19 · GPU Kernel Optimization
20 · SIMD Analytics Engine

ML / AI Core (21–37)

Embeddings, attention, RAG, model serving, parameter servers, and ML compilers. Part IV draws on the ML-systems members of this tier; the LLM/RAG members are the province of the companion AI Engineering book.

Advanced ML (38–49)

Distributed training, GPU memory and scheduling, inference engines, GNNs, and quantization — the heavy machinery behind Part IV’s Deep Learning, Distributed Training, and GPU chapters.

Data Infrastructure (50–52)

The feature platform, message queue, and time-series database that anchor Part II’s Data Engineering chapters.

Note

The catalog is broader than this book: several projects (notably the RAG, attention, and on-device-LLM systems, 21–27 and 41–47) implement concepts taught in the companion AI Engineering book rather than here. They are listed for completeness — this book links the ones whose concepts it teaches.

# Code Reference: The Project Catalog {.unnumbered} The prose in this book teaches the *why* and the shape of each idea. The **code** — complete, runnable, tested — lives in a companion catalog of 52 from-scratch systems, linked throughout the chapters with the **"Build it →"** pattern. This appendix is the index to that catalog. Each project implements one or more of the concepts a chapter develops, at production scale rather than illustration scale. Where a chapter says *"Build it →"*, it points here; where a project needs background, its README points back into these chapters. The full bidirectional map of which chapter ↔ which project lives in [`CONCEPT_TO_PROJECT_MAP.md`](https://github.com/jchu0/applied-cs-projects/blob/main/CONCEPT_TO_PROJECT_MAP.md). All projects are under [`06-real-world-projects/`](https://github.com/jchu0/applied-cs-projects) in the repository. Python projects use FastAPI + pytest; Rust projects use trait-based design with `Result`-based error handling and Criterion benchmarks; the one Go system uses gRPC + protobuf. --- ## Foundation & Backend (1–10) Job queues, caching, microservices, data platforms, and ML training orchestration — the systems most chapters in Parts I, II, and IV build on. - [01 · Distributed Job Queue](https://github.com/jchu0/applied-cs-projects/tree/main/01-distributed-job-queue) - [02 · Microservice Platform](https://github.com/jchu0/applied-cs-projects/tree/main/02-microservice-platform) — the repo's Go system (gRPC + Kong) - [03 · High-Performance Cache](https://github.com/jchu0/applied-cs-projects/tree/main/03-high-performance-cache) - [04 · ML Training Orchestrator](https://github.com/jchu0/applied-cs-projects/tree/main/04-ml-training-orchestrator) - [05 · SaaS Web Platform](https://github.com/jchu0/applied-cs-projects/tree/main/05-saas-web-platform) - [06 · Async Runtime](https://github.com/jchu0/applied-cs-projects/tree/main/06-async-runtime) - [07 · Data Lakehouse](https://github.com/jchu0/applied-cs-projects/tree/main/07-data-lakehouse) - [08 · Streaming Platform](https://github.com/jchu0/applied-cs-projects/tree/main/08-streaming-platform) - [09 · Data Observability](https://github.com/jchu0/applied-cs-projects/tree/main/09-data-observability) - [10 · Warehouse Semantic Layer](https://github.com/jchu0/applied-cs-projects/tree/main/10-warehouse-semantic-layer) ## Distributed Systems (11–20) Consensus, networking, compilers, query engines, and SIMD analytics — the systems-programming tier, most heavily exercised by Part II's Rust and C++ field guides. - [11 · Distributed KV Store (Raft)](https://github.com/jchu0/applied-cs-projects/tree/main/11-distributed-kv-raft) - [12 · Distributed Log System](https://github.com/jchu0/applied-cs-projects/tree/main/12-distributed-log-system) - [13 · Service Mesh](https://github.com/jchu0/applied-cs-projects/tree/main/13-service-mesh) — mTLS + SPIFFE - [14 · Network Stack](https://github.com/jchu0/applied-cs-projects/tree/main/14-network-stack) - [15 · Minimal OS Kernel](https://github.com/jchu0/applied-cs-projects/tree/main/15-minimal-os-kernel) - [16 · CRDT Collaboration](https://github.com/jchu0/applied-cs-projects/tree/main/16-crdt-collaboration) — TypeScript client - [17 · Columnar Query Engine](https://github.com/jchu0/applied-cs-projects/tree/main/17-columnar-query-engine) - [18 · Compiler / Interpreter](https://github.com/jchu0/applied-cs-projects/tree/main/18-compiler-interpreter) - [19 · GPU Kernel Optimization](https://github.com/jchu0/applied-cs-projects/tree/main/19-gpu-kernel-optimization) - [20 · SIMD Analytics Engine](https://github.com/jchu0/applied-cs-projects/tree/main/20-simd-analytics-engine) ## ML / AI Core (21–37) Embeddings, attention, RAG, model serving, parameter servers, and ML compilers. Part IV draws on the ML-systems members of this tier; the LLM/RAG members are the province of the companion *AI Engineering* book. - [21 · Custom Embedding Model](https://github.com/jchu0/applied-cs-projects/tree/main/21-custom-embedding-model) - [22 · Long-Context Attention](https://github.com/jchu0/applied-cs-projects/tree/main/22-long-context-attention) - [23 · LLM Agentic Runtime](https://github.com/jchu0/applied-cs-projects/tree/main/23-llm-agentic-runtime) - [24 · Synthetic Data Generator](https://github.com/jchu0/applied-cs-projects/tree/main/24-synthetic-data-generator) - [25 · RAG Baseline](https://github.com/jchu0/applied-cs-projects/tree/main/25-rag-baseline) - [26 · Advanced RAG](https://github.com/jchu0/applied-cs-projects/tree/main/26-advanced-rag) - [27 · Micro-Model Orchestrated RAG](https://github.com/jchu0/applied-cs-projects/tree/main/27-micro-model-orchestrated-rag) - [28 · AI Workflow Engine](https://github.com/jchu0/applied-cs-projects/tree/main/28-ai-workflow-engine) - [29 · Model Routing Layer](https://github.com/jchu0/applied-cs-projects/tree/main/29-model-routing-layer) - [30 · Parameter Server](https://github.com/jchu0/applied-cs-projects/tree/main/30-parameter-server) - [31 · ML Compiler](https://github.com/jchu0/applied-cs-projects/tree/main/31-ml-compiler) - [32 · Distributed Tensor Algebra](https://github.com/jchu0/applied-cs-projects/tree/main/32-distributed-tensor-algebra) - [33 · RL Physics Engine](https://github.com/jchu0/applied-cs-projects/tree/main/33-rl-physics-engine) - [34 · Distributed File System](https://github.com/jchu0/applied-cs-projects/tree/main/34-distributed-file-system) - [35 · Differentiable Programming](https://github.com/jchu0/applied-cs-projects/tree/main/35-differentiable-programming) - [36 · Distributed Streaming Analytics](https://github.com/jchu0/applied-cs-projects/tree/main/36-distributed-streaming-analytics) - [37 · Dynamic Graph Runtime](https://github.com/jchu0/applied-cs-projects/tree/main/37-dynamic-graph-runtime) ## Advanced ML (38–49) Distributed training, GPU memory and scheduling, inference engines, GNNs, and quantization — the heavy machinery behind Part IV's Deep Learning, Distributed Training, and GPU chapters. - [38 · Dynamic Graph Execution](https://github.com/jchu0/applied-cs-projects/tree/main/38-dynamic-graph-execution) - [39 · GPU Memory Manager](https://github.com/jchu0/applied-cs-projects/tree/main/39-gpu-memory-manager) - [40 · Distributed Autograd](https://github.com/jchu0/applied-cs-projects/tree/main/40-distributed-autograd) - [41 · Vector-Quantized LLM](https://github.com/jchu0/applied-cs-projects/tree/main/41-vector-quantized-llm) - [42 · GNN Runtime](https://github.com/jchu0/applied-cs-projects/tree/main/42-gnn-runtime) - [43 · Vector Index](https://github.com/jchu0/applied-cs-projects/tree/main/43-vector-index) - [44 · Autoregressive Inference](https://github.com/jchu0/applied-cs-projects/tree/main/44-autoregressive-inference) - [45 · Neural Compression](https://github.com/jchu0/applied-cs-projects/tree/main/45-neural-compression) - [46 · Multi-Tenant GPU Scheduler](https://github.com/jchu0/applied-cs-projects/tree/main/46-multi-tenant-gpu-scheduler) - [47 · On-Device LLM](https://github.com/jchu0/applied-cs-projects/tree/main/47-on-device-llm) - [48 · Multi-GPU Kernel Scheduler](https://github.com/jchu0/applied-cs-projects/tree/main/48-multi-gpu-kernel-scheduler) - [49 · AI Benchmark Suite](https://github.com/jchu0/applied-cs-projects/tree/main/49-ai-benchmark-suite) ## Data Infrastructure (50–52) The feature platform, message queue, and time-series database that anchor Part II's Data Engineering chapters. - [50 · Feature Engineering Platform](https://github.com/jchu0/applied-cs-projects/tree/main/50-feature-engineering-platform) - [51 · Message Queue](https://github.com/jchu0/applied-cs-projects/tree/main/51-message-queue) - [52 · Time-Series Database](https://github.com/jchu0/applied-cs-projects/tree/main/52-time-series-database) --- ::: {.callout-note} The catalog is broader than this book: several projects (notably the RAG, attention, and on-device-LLM systems, 21–27 and 41–47) implement concepts taught in the companion *AI Engineering* book rather than here. They are listed for completeness — this book links the ones whose concepts it teaches. :::