RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Agentic RAG: RAG systems where the model acts as an agent, autonomously deciding when and what to retrieve in multiple steps rather than a single fixed retrieval step
Hypothetico-deductive reasoning: The scientific method used in clinical diagnosis: formulating hypotheses based on symptoms and testing them against evidence
SFT: Supervised Fine-Tuning—training a model on labeled examples
RL: Reinforcement Learning—training an agent to maximize a reward signal through trial and error
Case-Based Reasoning (CBR): Solving new problems based on the solutions of similar past problems (implemented here via the <match> action)
OOD: Out-of-Distribution—evaluating on data from sources or distributions not seen during training
Hit@N: A metric measuring if the correct information (e.g., diagnosis) appears in the top N retrieved items
PPA: Positive Percent Agreement—measure of consensus between evaluators