Implicit Superlative Queries: Search queries seeking the 'best' of a category without explicitly stating the criteria (e.g., 'best shoes for trail running' implies grip, durability, ankle support)
SUPERB: Superlatives with Best relevance annotations—the authors' proposed dataset and 4-level labeling schema for superlative queries
Listwise Ranking: Prompting an LLM with a query and a list of multiple documents, asking it to output a ranked order of those documents
Pointwise Ranking: Prompting an LLM to score or label a single document at a time independently of others
Deliberated Prompting: A two-step prompting strategy where the LLM first generates reasoning (e.g., attributes of a 'best' product) before generating the final score or label
ESCI: Exact, Substitute, Complement, Irrelevant—a standard e-commerce relevance scale used in the source dataset
nDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items
RM3: A pseudo-relevance feedback model that expands the original query using terms from the top initially retrieved documents