SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks Paper • 2604.08865 • Published 5 days ago • 23
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published 29 days ago • 308
LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models Paper • 2410.09342 • Published Oct 12, 2024 • 39