OG: Online Grocery—low-priced, routine purchase items like vegetables and meats
GM: General Merchandise—higher-priced, discretionary items like cookware and electronics
Lift: A metric quantifying the strength of association between two items A and B, calculated as P(A,B) / (P(A)P(B)); a value above 1 means the items co-occur more often than they would if independent
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items in the list
Cross-Encoder: A model that processes two input texts simultaneously to output a relevance score, capturing deeper semantic interaction than separate embeddings
Market Basket Analysis: A technique for finding associations between items frequently bought together, typically using an algorithm such as Apriori
E5: A text embedding model used for semantic search and retrieval
MPNet: A sentence transformer model used here to generate title and product type embeddings for the ranker
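
The Lift and NDCG entries above can be made concrete with a small sketch. This is an illustration only, not the method used in the source: the function names (`lift`, `dcg`, `ndcg`) and the basket representation (a list of sets of item names) are assumptions, and the DCG shown uses the linear-gain form rel / log2(rank + 1), which is one common variant among several.

```python
import math

def lift(baskets, a, b):
    """Lift of items a and b: P(a, b) / (P(a) * P(b)).

    `baskets` is assumed to be a list of sets of item names
    (a hypothetical representation chosen for this sketch).
    """
    n = len(baskets)
    p_a = sum(a in basket for basket in baskets) / n
    p_b = sum(b in basket for basket in baskets) / n
    p_ab = sum(a in basket and b in basket for basket in baskets) / n
    if p_a == 0 or p_b == 0:
        return 0.0  # lift is undefined when an item never occurs; return 0 by convention here
    return p_ab / (p_a * p_b)

def dcg(relevances):
    """Discounted cumulative gain with linear gain (assumed variant)."""
    # Position i (0-based) is discounted by log2(i + 2), i.e. log2(rank + 1).
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances))

def ndcg(relevances, k=None):
    """DCG of the given ranking divided by the DCG of the ideal ordering."""
    k = len(relevances) if k is None else k
    ideal = dcg(sorted(relevances, reverse=True)[:k])
    return dcg(relevances[:k]) / ideal if ideal > 0 else 0.0
```

For example, with four baskets in which two items co-occur in two of them, `lift` returns a value above 1 when that co-occurrence exceeds what independence would predict, and `ndcg` returns 1.0 for any list already sorted by descending relevance.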