Zhongzhi Li's picture

Zhongzhi Li

lizhongzhi2022

·

AI & ML interests

NLP

Organizations

None yet

upvoted a paper 3 months ago

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Paper • 2602.01745 • Published Feb 2 • 7

upvoted an article 4 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025

•

66

upvoted 2 papers 5 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published Nov 14, 2025 • 14

upvoted 2 papers 6 months ago

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

Paper • 2510.06014 • Published Oct 7, 2025 • 10

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

Paper • 2510.24514 • Published Oct 28, 2025 • 22

upvoted 4 papers 8 months ago

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Paper • 2508.13755 • Published Aug 19, 2025 • 14

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25, 2025 • 49

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119

upvoted a paper 9 months ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21, 2025 • 69

upvoted 5 papers 11 months ago

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

Paper • 2506.11924 • Published Jun 13, 2025 • 35

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10, 2025 • 14

From System 1 to System 2: A Survey of Reasoning Large Language Models

Paper • 2502.17419 • Published Feb 24, 2025 • 3

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

upvoted an article 11 months ago

Article

Exploring the Daily Papers Page on Hugging Face

Sep 23, 2024

•

67