Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
Paper • 2506.11712 • Published
Wenqi Liu1,
Xuemeng Song2,
Jiaxi Li3,
Yinwei Wei1,
Zheng Na4,
Jianhua Yin1,
Liqiang Nie5
1Shandong University
2Southern University of Science and Technology
3University of Georgia
4National University of Singapore
5Harbin Institute of Technology, Shenzhen
We present SymMPO, a framework for mitigating hallucination in multimodal large language models (MLLMs). Our method introduces a theory-consistent symmetric multimodal preference optimization approach that addresses the hallucination problem from a principled perspective. This repository provides the official implementation, pretrained checkpoints, and evaluation scripts built on top of LLaVA.
If you find our work helpful, please consider citing:
@inproceedings{
liu2025mitigating,
title={Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization},
author={Wenqi Liu and Xuemeng Song and Jiaxi Li and Yinwei Wei and Na Zheng and Jianhua Yin and Liqiang Nie},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=tIW29IpCwG}
}
Base model
liuhaotian/llava-v1.5-13b