OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 11 days ago • 312
Flavors of Moonshine Collection A suite of tiny automatic speech recognition (ASR) models specialized for a range of underrepresented languages. • 6 items • Updated Sep 11, 2025 • 1
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published 10 days ago • 15
Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published 11 days ago • 8
Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? Paper • 2602.07055 • Published 11 days ago • 21
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 7 days ago • 39
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published 7 days ago • 26
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Paper • 2602.08990 • Published 6 days ago • 68
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published 20 days ago • 79
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Paper • 2601.11496 • Published about 1 month ago • 47
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 126
TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents Paper • 2601.05899 • Published Jan 9 • 4
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published Jan 12 • 114