Resources
Essential repositories, libraries, and learning materials.
Learning from scratch
Core libraries
| Library | What |
|---|
| pytorch | The dominant deep learning framework |
| scikit-learn | Classical ML (SVMs, random forests, clustering, preprocessing) |
| transformers | 400k+ pretrained models via unified API |
| datasets | 100k+ datasets with smart caching |
| peft | LoRA, QLoRA, prefix tuning — fine-tune large models on consumer hardware |
| diffusers | Diffusion model library (Stable Diffusion, Flux, etc.) |
| jax | NumPy + autograd + XLA compilation for GPU/TPU |
LLM inference
| Tool | What |
|---|
| ollama | Run LLMs locally with one command |
| llama.cpp | LLM inference in C/C++ — the engine behind GGUF quantization |
| vllm | High-throughput serving with PagedAttention, OpenAI-compatible API |
Agents and RAG
| Framework | What |
|---|
| langgraph | Graph-based agent orchestration |
| llama_index | RAG pipelines and document agents (300+ integrations) |
| dspy | ”Programming not prompting” — compiles LLM pipelines |
| autogen | Multi-agent conversation framework |
| crewai | Role-based multi-agent framework |
Claude / Anthropic
Reinforcement learning
| Tool | What |
|---|
| Gymnasium | Standard RL environment API (successor to OpenAI Gym) |
| stable-baselines3 | Reliable PPO, SAC, A2C, DQN implementations |
| cleanrl | Single-file RL algorithms — best for learning |
MLOps
| Tool | What |
|---|
| mlflow | Experiment tracking, model registry, deployment |
| pytorch-lightning | Removes PyTorch boilerplate (training loops, distributed, logging) |
Reference lists
Links