reasoning factor graph: A structured graph where nodes represent queries, products, and 'reasoning factors' (like specific user needs or product utilities), and edges represent valid connections between them
bi-encoder: A retrieval architecture where query and item are encoded independently into vectors, allowing fast similarity search (high efficiency, lower accuracy)
cross-encoder: A retrieval architecture where query and item are processed together by the model to score relevance (high accuracy, low efficiency)
DSPy: A framework for programmatically optimizing LM prompts and pipelines
SFT: Supervised Fine-Tuning—training a model on labeled examples to adapt it to a specific task
LLM self-evaluation: A technique where the LLM is prompted to critique or verify its own previous outputs (e.g., 'Is this edge reasonable? Yes/No')
InfoNCE loss: A contrastive loss function used to learn representations by pulling positive pairs closer and pushing negative pairs apart
adapter: A small trainable module (here, an MLP) that projects embeddings from one space (LLM query embedding) to another (reasoning factor space)