Search-R1-v0.3 Collection RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 12 items • Updated Aug 12, 2025 • 4
Agentic Reinforcement Learning for Search is Unsafe Paper • 2510.17431 • Published Oct 20, 2025 • 5 • 2
Clinical knowledge in LLMs does not translate to human interactions Paper • 2504.18919 • Published Apr 26, 2025 • 26
Can sparse autoencoders be used to decompose and interpret steering vectors? Paper • 2411.08790 • Published Nov 13, 2024 • 8
Can sparse autoencoders be used to decompose and interpret steering vectors? Paper • 2411.08790 • Published Nov 13, 2024 • 8 • 2
Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction Paper • 2411.06424 • Published Nov 10, 2024 • 5
Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction Paper • 2411.06424 • Published Nov 10, 2024 • 5 • 2
Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering Paper • 2408.07888 • Published Aug 15, 2024 • 13 • 2
Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering Paper • 2408.07888 • Published Aug 15, 2024 • 13