[ICLR 2026] Official repository of "Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs". Repo: https://github.com/pspdada/Uni-DPO
Peng Shangpin
psp-dada
AI & ML interests
Multimodal Large Language Models, Preference Optimization, Algorithm
Recent Activity
published
a model
about 1 hour ago
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4
published
a dataset
about 1 hour ago
psp-dada/Uni-DPO
published
a model
about 1 hour ago
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO
Organizations
None yet