-
oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst
Viewer • Updated • 59.9k • 5 -
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst
Viewer • Updated • 59.9k • 10 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst
Viewer • Updated • 59.9k • 5 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst
Viewer • Updated • 59.9k • 8
Hai Ye
oceanpty
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome upvoted a paper 20 days ago
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification upvoted a collection 27 days ago
MiroThinker-1.7Organizations
None yet