Vlaser
Collection
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
•
6 items
•
Updated
•
4
Computer Vision
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
fn.