| Benchmark | Metric | Baseline | This Paper | ฮ |
|---|---|---|---|---|
| Critic Selection validation: Comparing the 'Task-Wise Best' selected critics against baselines (Single Best critic and Mean Critic) on predicting YouTube video popularity rankings. | ||||
| Studio C (Top vs. Bottom) | Accuracy | 73.5 | 80.0 | +6.5 |
| VLDL (Top vs. Bottom) | Accuracy | 70.0 | 75.0 | +5.0 |
| SNL (Top vs. Bottom) | Accuracy | 65.5 | 71.0 | +5.5 |
| Key & Peele (Top vs. Bottom) | Accuracy | 66.5 | 66.5 | 0.0 |
| Studio C (Top vs. Bottom) | Accuracy | 73.3 | 80.0 | +6.7 |