← Back to Paper List

FACTER: Fairness-Aware Conformal Thresholding and Prompt Engineering for Enabling Fair LLM-Based Recommender Systems

Arya Fayyazi, Mehdi Kamal, Massoud Pedram
University of Southern California
arXiv (2025)
Recommendation P13N Benchmark

📝 Paper Summary

LLM-based Recommendation Systems Fairness and Bias Mitigation Conformal Prediction
FACTER uses conformal prediction to set adaptive fairness thresholds and iteratively refines prompts to mitigate bias in black-box LLM recommenders without retraining model parameters.
Core Problem
Generative LLMs in recommendation systems exhibit subtle biases where changing a sensitive attribute (e.g., gender) alters the recommendation, but determining a robust threshold for 'unfair' deviation is difficult without statistical guarantees.
Why it matters:
  • LLMs are often deployed as black boxes (APIs), preventing parameter-level bias mitigation like adversarial training.
  • Generative bias is subtler than classification bias; simple differences in output text style or sentiment can affect user perception.
  • Standard heuristic thresholds for bias detection lack statistical coverage guarantees, leading to either excessive false alarms or missed discrimination.
Concrete Example: If a prompt asks for a movie recommendation for a 'male teacher' versus a 'female teacher', the LLM might suggest action movies for the male and romance for the female. Existing methods struggle to decide if the semantic distance between these outputs is large enough to constitute a violation or just random noise.
Key Novelty
Conformal Fairness Thresholding with Violation-Triggered Prompt Repair
  • Uses conformal prediction to calculate a dynamic semantic variance threshold from a calibration set, ensuring a statistical guarantee on the rate of fairness violations.
  • When a recommendation's semantic distance exceeds this threshold (a violation), the system automatically updates the prompt with an adversarial example to steer the LLM back to fairness.
  • Operates entirely on the input/output level (black-box), avoiding the need to access or modify the LLM's internal weights.
Evaluation Highlights
  • Reduces fairness violations by up to 95.5% on MovieLens and Amazon datasets compared to standard prompting.
  • Maintains strong recommendation accuracy while significantly lowering the Sub-Network Similarity Ratio (SNSR), indicating reduced cross-group semantic gaps.
  • Demonstrates that semantic variance is a potent proxy for bias, allowing detection without requiring expensive human annotation for every output.
Breakthrough Assessment
7/10
Solid application of conformal prediction to the specific problem of generative fairness thresholds. Highly relevant for black-box API users, though the prompt repair mechanism is a standard technique applied in a novel loop.
×