Tony Congqian Wang's picture

Tony Congqian Wang

TonyCWang

AI & ML interests

None yet

Organizations

None yet

commented 2 papers 4 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229 •

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 44 •

commented 2 papers 5 months ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

New activity in timm/vit_little_patch16_reg4_gap_256.sbb_in1k 7 months ago

Loss exploding to nan

#1 opened 7 months ago by

commented 2 papers 8 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

New activity in timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k 8 months ago

Training recipe

#2 opened 8 months ago by

commented 2 papers 9 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •