Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 8 days ago • 63
Biased Tales: Cultural and Topic Bias in Generating Children's Stories Paper • 2509.07908 • Published Sep 9, 2025 • 1
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Paper • 2601.06021 • Published Jan 9 • 47
ACE Collection Ai2 Climate Emulator (ACE) is a family of fast ML models that simulate global atmospheric variability over time scales ranging from hours to centuries • 9 items • Updated 4 days ago • 12
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 16 days ago • 87
AgenticMath: Enhancing LLM Reasoning via Agentic-based Math Data Generation Paper • 2510.19361 • Published Oct 22, 2025 • 2
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code Paper • 2505.02881 • Published May 5, 2025 • 7
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 23
Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 120
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9, 2025 • 77