AnIdealRing
SmartDazi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
upvoted
a
paper
10 days ago
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents
upvoted
a
paper
22 days ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning