Draft Outline
- Log queries, retrieved chunks, reranking decisions, prompts, outputs, and citations
- Separate retrieval misses from synthesis errors and instruction-following failures
- Track drift in documents, embeddings, and user query patterns
- Build review workflows around traces, examples, and recurring failure categories