Multimodal Medical Reasoning Collection Multimodal reasoning tasks for the medical domain • 1 item • Updated 6 days ago
Zero Shot Medical Benchmarks Collection Popular medical benchmarks intended for zero shot evaluation (no training splits available). • 5 items • Updated 20 days ago
Zero Shot Medical Benchmarks Collection Popular medical benchmarks intended for zero shot evaluation (no training splits available). • 5 items • Updated 20 days ago
Free-Form Response Tasks Collection Medical tasks which do not have a fixed label set. Evaluation is typically done with token-f1 or other semantic similarity metrics. • 2 items • Updated 21 days ago