4 4

YY

yy0514

AI & ML interests

None yet

Recent Activity

upvoted a collection 8 days ago

Search-R1-v0.3

upvoted a paper 4 months ago

Agentic Reinforcement Learning for Search is Unsafe

commented on a paper 4 months ago

Agentic Reinforcement Learning for Search is Unsafe

View all activity

Organizations

None yet

upvoted a collection 8 days ago

Search-R1-v0.3

Collection

RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 12 items • Updated Aug 12, 2025 • 4

upvoted a paper 4 months ago

Agentic Reinforcement Learning for Search is Unsafe

Paper • 2510.17431 • Published Oct 20, 2025 • 5

commented a paper 4 months ago

Agentic Reinforcement Learning for Search is Unsafe

Paper • 2510.17431 • Published Oct 20, 2025 • 5 •

upvoted a paper 10 months ago

Clinical knowledge in LLMs does not translate to human interactions

Paper • 2504.18919 • Published Apr 26, 2025 • 26

authored a paper over 1 year ago

Can sparse autoencoders be used to decompose and interpret steering vectors?

Paper • 2411.08790 • Published Nov 13, 2024 • 8

commented a paper over 1 year ago

Can sparse autoencoders be used to decompose and interpret steering vectors?

Paper • 2411.08790 • Published Nov 13, 2024 • 8 •

upvoted a paper over 1 year ago

Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction

Paper • 2411.06424 • Published Nov 10, 2024 • 5

commented 2 papers over 1 year ago

Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction

Paper • 2411.06424 • Published Nov 10, 2024 • 5 •

Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering

Paper • 2408.07888 • Published Aug 15, 2024 • 13 •

authored a paper over 1 year ago

Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering

Paper • 2408.07888 • Published Aug 15, 2024 • 13

updated 10 models about 2 years ago

YY

AI & ML interests

Recent Activity

Organizations

yy0514's activity