arxiv:2502.07599
Xiliang Yang
NoManDeRY
AI & ML interests
None yet
Organizations
None yet
models 7
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-0.95
Text Generation • 8B • Updated
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-decrease_linear-1.0to0.95
Text Generation • 8B • Updated
• 3
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-increase_linear_0.95to1.0
Text Generation • 8B • Updated
• 6
NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0
Text Generation • 8B • Updated
• 2
NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-0.95
Text Generation • 8B • Updated
• 2
NoManDeRY/DPO-Shift-Qwen-2-7B-UltraChat200K-SFT
Text Generation • 8B • Updated
• 10
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-1.0
Text Generation • 8B • Updated
datasets 0
None public yet