arxiv:2603.13033
Yanpeng Zhao
surprisal
·
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound authored a paper 3 days ago
Connecting the Dots between Audio and Text without Parallel Data through
Visual Knowledge Transfer authored a paper 3 days ago
ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language ModelsOrganizations
None yet