Evaluation Setup
Personalized image generation using a test set of identity images
Benchmarks:
- Custom Evaluation Set (Identity-preserving generation) [New]
Metrics:
- DINO Score (Identity Fidelity)
- CLIP-I (Identity Fidelity)
- CLIP-T (Text/Semantic Consistency)
- Face Similarity (Face Sim)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Quantitative comparison against state-of-the-art tuning-free personalization methods. |
| Custom Evaluation Set |
DINO (Identity Fidelity) |
0.783 |
0.834 |
+0.051
|
| Custom Evaluation Set |
CLIP-T (Semantic Consistency) |
0.264 |
0.282 |
+0.018
|
| Custom Evaluation Set |
Face Sim |
0.686 |
0.795 |
+0.109
|
Main Takeaways
- Infinite-ID achieves a superior balance between ID fidelity and text consistency compared to PhotoMaker (good text, weak ID) and IP-Adapter (good ID, weak text).
- The method generalizes well to style transfer tasks, maintaining structure while applying style via the AdaIN-mean operation.
- Qualitative results show better preservation of facial details (e.g., gaze, expression) than competitors.