| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| The paper highlights a critical failure in prior work under frozen encoder evaluation. | ||||
| Standard ETC Datasets | Accuracy (Frozen Encoder) | 90.0 | 47.0 | -43.0 |
| FlowSem-MAE performance claims (specific numbers for FlowSem-MAE vs baselines are described qualitatively in abstract/intro text provided). | ||||
| Standard ETC Datasets | Accuracy | Not reported in the paper | Not reported in the paper | Not reported in the paper |