Jeff Gao

jeff-gao

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 14 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper 28 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published about 1 month ago • 150

upvoted 3 papers about 1 month ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 193

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 216

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 352

upvoted 2 papers 2 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 60

upvoted 3 papers 3 months ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 127

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published Jan 13 • 53

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published Jan 7 • 34

upvoted a paper 5 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 53

liked a model 6 months ago

ASLP-lab/Easy-Turn

Updated Oct 11, 2025 • 36 • 15

liked a model 8 months ago

inclusionAI/Rubicon-Preview

Text Generation • 31B • Updated Aug 19, 2025 • 86 • 25

upvoted a paper 8 months ago

Evaluating, Synthesizing, and Enhancing for Customer Support Conversation

Paper • 2508.04423 • Published Aug 6, 2025 • 9

upvoted a paper 10 months ago

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Paper • 2506.09827 • Published Jun 11, 2025 • 22

liked a model about 1 year ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • Updated Sep 17, 2025 • 55.9k • 1.61k

published a model about 1 year ago

jeff-gao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Feb 25, 2025

updated a model about 1 year ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24, 2025 • 2

published a model about 1 year ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24, 2025 • 2

liked a model over 1 year ago

jinaai/reader-lm-1.5b

Text Generation • 2B • Updated Jan 17, 2025 • 459 • 608
