AzeroS: Extending LLM to Speech with Self-Generated Instruction-Free Tuning Paper • 2601.06086 • Published Dec 31, 2025 • 1
TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Paper • 2504.07053 • Published Apr 9, 2025 • 6
UniAudio 2.0: A Unified Audio Language Model with Text-Aligned Factorized Audio Tokenization Paper • 2602.04683 • Published 10 days ago • 2
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 9 days ago • 308
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research Paper • 2602.06540 • Published 8 days ago • 20