Evaluation Setup
Inversion attack on two distinct recommendation domains (Movies, Books) using two different victim architectures
Benchmarks:
- Movie Scenario (Prompt Reconstruction) [New]
- Book Scenario (Prompt Reconstruction) [New]
Metrics:
- Item Recovery Rate (percentage of user-interacted items recovered)
- Attribute Inference Accuracy (percentage of age/gender correctly inferred)
- ROUGE scores (textual similarity, implied by context)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Combined Average (Movies & Books) |
Item Recovery Rate |
Not reported in the paper |
0.65 |
Not reported in the paper
|
| Combined Average (Movies & Books) |
Attribute Inference Accuracy |
Not reported in the paper |
0.87 |
Not reported in the paper
|
| Combined Average |
Reconstruction Fidelity (unspecified metric, likely ROUGE or Recovery Rate) |
Not reported in the paper |
Not reported in the paper |
+0.05 to +0.13
|
Main Takeaways
- LLM-empowered RecSys are highly vulnerable to inversion attacks, with attackers able to reconstruct a majority of user interaction history (65%) and demographics (87%).
- The similarity-guided refinement strategy significantly boosts attack performance (5-13%) by using the victim model's own logits to verify candidate reconstructions.
- Privacy leakage is largely insensitive to the recommendation performance of the victim model itself, meaning even poor recommenders can leak high-fidelity user data.
- The attack is effective across different domains (Movies, Books) and architectures (TallRec/Llama, CoLLM/Qwen) when trained on appropriate synthetic data.