Submitted by yihongzhuang 59 LLaDA2.1: Speeding Up Text Diffusion via Token Editing inclusionAI 333 4
Submitted by Zhangxuan Gu 10 VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks inclusionAI 2
Submitted by Zhenglin Cheng (SII) 75 TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows inclusionAI 476 9
Submitted by Bingguang Hao 10 FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling inclusionAI 89 1
Submitted by taesiri 40 Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation inclusionAI 1
10 Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation inclusionAI 428
Submitted by Zhang Zhiqiang 84 Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation inclusionAI 2
Submitted by Xiaolong Wang 11 ARGenSeg: Image Segmentation with Autoregressive Image Generation Model inclusionAI 2
Submitted by taesiri 72 Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model inclusionAI 91 3
Submitted by zheng 76 Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer inclusionAI 136 3
Submitted by taesiri 12 MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks inclusionAI 2