Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 12 days ago • 7