All tracks
Intermediate~14 hours14 lessons

Production RAG Engineer

Build, evaluate, debug, and deploy real-world retrieval-augmented generation systems. Go beyond toy examples to handle chunking tradeoffs, hybrid retrieval, reranking, citation grounding, and RAG observability.

What you'll learn

Document ingestion pipelines
Chunking strategies and tradeoffs
Embedding model selection
Vector database setup and indexing
Hybrid retrieval (sparse + dense)
Reranking models and latency tradeoffs
Query rewriting and HyDE
Citations and grounded answers
Permission-aware retrieval
RAG evaluation datasets and metrics
Hallucination reduction techniques
RAG observability and tracing
Cost and latency optimization
Production RAG capstone project

Learning outcomes

1Design a production RAG pipeline end to end
2Choose chunking strategies for different document types
3Implement hybrid retrieval with BM25 and dense vectors
4Evaluate RAG quality with faithfulness, recall, and precision metrics
5Debug hallucinations and retrieval failures systematically

Related lessons

Browse all lessons

Up next after this track

Advanced
12h

LLMOps and AI Observability

Operate AI systems with tracing, evaluation pipelines, regression tests, cost monitoring, and production dashboards. Learn the engineering discipline behind reliable LLM products.

12 lessons

Prompt versioning and experiment trackingDataset versioning for evaluationEvaluation pipelines: automated and humanLLM-as-judge patterns and calibration+8 more
Preview track

Weekly newsletter

Get practical AI engineering insights in your inbox.

Weekly guides, interview prep, architecture breakdowns, and production lessons for engineers building with AI — free forever.

Subscribe free