LLM-as-annotators: Using Large Language Models to label data with human-level quality, replacing manual crowd-sourcing
Golden Set: High-quality, manually annotated dataset created by expert raters used to evaluate LLM performance
Silver Set: Large-scale dataset annotated by the Teacher LLM, used to train smaller Student models
Knowledge Distillation: Training a small, fast model (Student) to mimic the output of a large, slow model (Teacher/LLM) to reduce latency
Personalized Restricted Retrieval: A recommendation strategy where the system restricts the search space to items with specific attributes based on predicted user intent
SCANN: Scalable Nearest Neighbors—an efficient algorithm for vector similarity search used in retrieval
Student DNN: Deep Neural Network—a lightweight model trained on LLM outputs to perform annotation at scale
F1 score: The harmonic mean of precision and recall, used to measure classification accuracy