← Back to Paper List

Continual SFT Matches Multimodal RLHF with Negative Supervision

Ke Zhu, Yu Wang, Yanpeng Sun, Qiang Chen, Jiangjiang Liu, Gang Zhang, Jingdong Wang
arXiv (2024)
MM RL
📄

No Summary Available

This paper hasn't been summarized yet.

×