GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts Paper • 2604.12978 • Published 5 days ago • 5
GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts Paper • 2604.12978 • Published 5 days ago • 5
Insights from the ICLR Peer Review and Rebuttal Process Paper • 2511.15462 • Published Nov 19, 2025 • 7
CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs Paper • 2510.09871 • Published Oct 10, 2025 • 3
ADAM: A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning Paper • 2509.22991 • Published Sep 26, 2025 • 2
MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment Paper • 2508.17290 • Published Aug 24, 2025 • 8
The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments Paper • 2301.13771 • Published Jan 31, 2023
MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment Paper • 2508.17290 • Published Aug 24, 2025 • 8
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26, 2025 • 78
How Programming Concepts and Neurons Are Shared in Code Language Models Paper • 2506.01074 • Published Jun 1, 2025 • 4
Tracing Multilingual Factual Knowledge Acquisition in Pretraining Paper • 2505.14824 • Published May 20, 2025 • 4
ELAB: Extensive LLM Alignment Benchmark in Persian Language Paper • 2504.12553 • Published Apr 17, 2025 • 2
Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Paper • 2504.19056 • Published Apr 27, 2025 • 18
On Relation-Specific Neurons in Large Language Models Paper • 2502.17355 • Published Feb 24, 2025 • 10
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation Paper • 2502.08826 • Published Feb 12, 2025 • 17