SFT: Supervised Fine-Tuning—training a model on labeled examples to specialize it for a task
RL: Reinforcement Learning—training agents to maximize a reward signal through trial and error
Agentic RAG: Retrieval-Augmented Generation where the retrieval process is actively managed by an agent (e.g., via tool calls) rather than a fixed pipeline
DSPy: A framework for programming language models that optimizes prompts automatically
LangChain: A popular open-source framework for building applications with LLMs
Closed-source models: Proprietary models like GPT-4 or Claude whose weights and training data are not public
Human-in-the-loop: Systems designed so that a human must review, approve, or interact with the agent's output at critical steps
System-level design: Improving reliability through architecture (guardrails, retries, constraints) rather than improving the core AI model itself