Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection Paper • 2404.16944 • Published Apr 25, 2024 • 1
The Appeal and Reality of Recycling LoRAs with Adaptive Merging Paper • 2602.12323 • Published Feb 12 • 1
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published Feb 24 • 31
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published Feb 11 • 16
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 43
The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving Paper • 2601.00747 • Published Jan 2 • 20
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published Dec 23, 2025 • 18
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
PuzzleCraft: Exploration-Aware Curriculum Learning for Puzzle-Based RLVR in VLMs Paper • 2512.14944 • Published Mar 13 • 36
supertoken Collection The initial checkpoints for the token comparison research. • 20 items • Updated May 22, 2025 • 2
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5, 2025 • 61