ndl-core-collection Collection A collection of UK government structured datasets and textual sources for research, analysis, and AI applications. • 6 items • Updated Jan 12 • 3
view article Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 10 days ago • 17
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 11 days ago • 44
Datasets of AI Ecosystem Data Collection Datasets shared on the Hub to support research and investigation of the AI ecosystem • 3 items • Updated 13 days ago • 1
Visualizations of AI Ecosystem Data Collection Spaces and demos showing the evolution of the AI ecosystem • 6 items • Updated 13 days ago • 1
Research on AI Ecosystem Data Collection Research papers leveraging AI ecosystem data • 6 items • Updated 13 days ago • 1
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 21 days ago • 81
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 20 days ago • 182
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published Feb 27 • 88
view article Article easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem 28 days ago • 5
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 65