PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference Paper • 2603.02479 • Published 4 days ago • 18
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization Paper • 2406.01171 • Published Jun 3, 2024 • 1
SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models Paper • 2506.01062 • Published Jun 1, 2025 • 5
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Paper • 2505.16421 • Published May 22, 2025 • 19
Data Contamination Report from the 2024 CONDA Shared Task Paper • 2407.21530 • Published Jul 31, 2024 • 10
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 35