Resources

Essential repositories, libraries, and learning materials.

Learning from scratch

Resource	What
nn-zero-to-hero	Code for Karpathy’s “Neural Networks: Zero to Hero” video series
nanoGPT	Minimal GPT training code (a ~300-line training loop plus a ~300-line model definition)
minbpe	Minimal BPE tokenizer implementation
LLMs-from-scratch	Build a GPT step-by-step with PyTorch
d2l-en	“Dive into Deep Learning” interactive textbook
annotated_deep_learning_paper_implementations	60+ landmark papers reimplemented with line-by-line annotations

Core libraries

Library	What
pytorch	The dominant deep learning framework
scikit-learn	Classical ML (SVMs, random forests, clustering, preprocessing)
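transformers	Unified API for 400k+ pretrained models on the Hugging Face Hub
datasets	Loader for 100k+ Hub datasets with Arrow-backed, memory-mapped on-disk caching
peft	Parameter-efficient fine-tuning (LoRA, QLoRA, prefix tuning) for adapting large models on consumer hardware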
diffusers	Diffusion model library (Stable Diffusion, Flux, etc.)
jax	NumPy + autograd + XLA compilation for GPU/TPU

LLM inference

Tool	What
ollama	Run LLMs locally with one command
llama.cpp	LLM inference in C/C++; home of the GGUF model format and its quantized variants
vllm	High-throughput serving with PagedAttention and an OpenAI-compatible API

Agents and RAG

Framework	What
langgraph	Graph-based agent orchestration
llama_index	RAG pipelines and document agents (300+ integrations)
dspy	“Programming, not prompting”: compiles LLM pipelines
autogen	Multi-agent conversation framework
crewai	Role-based multi-agent framework

Claude / Anthropic

Resource	What
anthropic-sdk-python	Official Python SDK for the Claude API
anthropic-cookbook	Recipes for tool use, RAG, agents, multimodal
MCP python-sdk	Build MCP servers and clients
MCP servers	Reference MCP server implementations
courses	Official prompt engineering course materials

Reinforcement learning

Tool	What
Gymnasium	Standard RL environment API (successor to OpenAI Gym)
stable-baselines3	Reliable PPO, SAC, A2C, DQN implementations
cleanrl	Single-file RL algorithm implementations that are easy to read end to end; well suited to learning

MLOps

Tool	What
mlflow	Experiment tracking, model registry, deployment
pytorch-lightning	Removes PyTorch boilerplate (training loops, distributed training, logging)

Reference lists

Resource	What
awesome-machine-learning	Curated ML frameworks, tools, and resources across all languages
best-of-ml-python	Ranked Python ML libraries with activity metrics
TheAlgorithms/Python	Educational implementations of a wide range of classic algorithms in Python

Links