Draft Outline
- When local models make sense: privacy, cost control, offline work, and experimentation
- Model selection tradeoffs: size, license, context length, reasoning quality, and domain fit
- Quantization, memory, batching, and latency basics
- Evaluation workflow before trusting a local model in a real application