view article Article Arcade-3B: SLM Optimization via Orthogonal Decoupling of Latent State Spaces 9 days ago • 1
view article Article Exploring New Frontiers of LLMs: Adaptive Dual-Search Distillation (ADS) and the 30B Model Open Beta 23 days ago • 2
view article Article Shattering the Memory Wall: O(1) Inference and Causal Monoid State Compression in Spartacus-1B 27 days ago • 2