Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper about 7 hours ago

Aletheia tackles FirstProof autonomously

upvoted a paper about 7 hours ago

Benchmark Test-Time Scaling of General LLM Agents

upvoted a paper about 7 hours ago

PyVision-RL: Forging Open Agentic Vision Models via RL

upvoted 4 papers about 7 hours ago

upvoted 6 papers 2 days ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published 3 days ago • 47

Agents of Chaos

Paper • 2602.20021 • Published 3 days ago • 25

CADEvolve: Creating Realistic CAD via Program Evolution

Paper • 2602.16317 • Published 8 days ago • 26

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Paper • 2602.14296 • Published 11 days ago • 47

Modeling Distinct Human Interaction in Web Agents

Paper • 2602.17588 • Published 7 days ago • 3

Discovering Multiagent Learning Algorithms with Large Language Models

Paper • 2602.16928 • Published 8 days ago • 14

upvoted 3 papers 7 days ago

Towards a Science of AI Agent Reliability

Paper • 2602.16666 • Published 8 days ago • 13

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

Paper • 2602.13964 • Published 11 days ago • 2

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published 13 days ago • 52

upvoted 2 papers 8 days ago

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

Paper • 2602.15112 • Published 10 days ago • 20

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 11 days ago • 67

upvoted 3 papers 9 days ago

BrowseComp-V^3: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Paper • 2602.12876 • Published 13 days ago • 8

Qute: Towards Quantum-Native Database

Paper • 2602.14699 • Published 10 days ago • 13

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published 11 days ago • 26

upvoted 2 papers 10 days ago

Multimodal Fact-Level Attribution for Verifiable Reasoning

Paper • 2602.11509 • Published 14 days ago • 4

P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling

Paper • 2602.12116 • Published 14 days ago • 4
