Chi's picture

Chi PRO

ChilleD

·

AI & ML interests

Natural Language Processing.

Recent Activity

upvoted a paper 14 days ago

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

upvoted a paper 15 days ago

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

new activity 23 days ago

Snowflake/AgentWorldModel-1K:Fix yaml block?

View all activity

Organizations

upvoted a paper 14 days ago

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published 15 days ago • 15

upvoted a paper 15 days ago

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Paper • 2602.16745 • Published 22 days ago • 8

upvoted a collection 24 days ago

Agent World Model

4 items • Updated 29 days ago • 9

upvoted 2 papers 29 days ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published 30 days ago • 51

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 69

upvoted a paper 30 days ago

Towards Agentic Intelligence for Materials Science

Paper • 2602.00169 • Published Jan 29 • 46

upvoted 4 papers 4 months ago

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 58

Adapting Web Agents with Synthetic Supervision

Paper • 2511.06101 • Published Nov 8, 2025 • 7

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 82

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 59

upvoted a paper 5 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

upvoted a paper 11 months ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published Apr 10, 2025 • 48

upvoted 3 papers over 1 year ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 51

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 26

upvoted 3 papers over 2 years ago

Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 16

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 84

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 53