The living reference for production AI systems
Interview-ready depth on RAG, agents, inference, and evaluation — rewritten for clarity, built to go deep.
foundations
A ground-up tour of tokens, embeddings, attention, and why transformers scale.
retrieval
Why retrieval-augmented generation works, and how to build a pipeline that actually grounds answers.
agents
From single LLM calls to autonomous agents: planning, tool use, memory, and the control loop.