← Back to Paper List

LLM-Enhanced Reranking for Complementary Product Recommendation

Zekun Xu, Yudi Zhang
North Carolina State University, Iowa State University
arXiv (2025)
Recommendation Agent KG

📝 Paper Summary

Complementary Product Recommendation LLM-based Reranking
A model-agnostic framework that uses multi-agent LLM prompting to rerank complementary product candidates retrieved by graph neural networks, first enhancing diversity and then refining for accuracy.
Core Problem
Graph Neural Networks (GNNs) used for recommendation often favor popular, highly connected items, failing to capture the semantic nuance required for diverse, long-tail complementary products.
Why it matters:
  • Recommending complementary items (e.g., lens for a camera) drives significant e-commerce value but requires understanding functional relationships beyond simple co-occurrence.
  • Existing GNN approaches struggle with the accuracy-diversity tradeoff, often recommending repetitive or obvious items while missing novel but relevant complements.
  • Prior LLM integration methods typically require expensive retraining of the base recommender to incorporate augmented data.
Concrete Example: When recommending for an 'iPhone', a standard model might suggest popular 'iPhone' substitutes or generic accessories. The proposed system uses an LLM to explicitly reason that 'iPhone Case' is an accessory and 'Speaker Cables' complement 'Speaker Stands', reranking these semantically relevant but potentially less connected items higher.
Key Novelty
Two-Stage Multi-Agent LLM Reranking
  • Decomposes the reranking task into two sequential LLM agents: a 'Diversity Agent' that prioritizes different product genres from the candidate list, and an 'Accuracy Agent' that filters the result for strict relevance.
  • Utilizes a model-agnostic 'retrieve-then-rerank' pipeline where the LLM operates solely on textual metadata (titles) of candidates retrieved by any base GNN, avoiding model retraining.
Evaluation Highlights
  • Achieves ~22% improvement in Hit@1 on the Cell Phones dataset using SComGNN as the base retriever (1.087 → 1.326).
  • Improves diversity metrics (vocabulary size) by over 10% on Cell Phones with GraphSAGE base (19.5 → 21.2) while simultaneously boosting accuracy.
  • Consistently improves NDCG@1 across all four datasets (Cell Phones, Electronics, Grocery, Home) when applied to GraphSAGE, GAT, and SComGNN baselines.
Breakthrough Assessment
6/10
Effective application of LLMs to the specific problem of complementary recommendation with a clean, model-agnostic design. While the architectural novelty is moderate (prompt-based reranking), the demonstrated balance of accuracy and diversity is valuable.
×