MBZUAI/longshot-bench
Viewer • Updated • 2.08k • 16 • 2
Natural Language Processing, Machine Learning, and Computer Vision
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare
From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering