Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tony Congqian Wang's picture
6 14 1

Tony Congqian Wang

TonyCWang

AI & ML interests

None yet

Organizations

None yet

commented 2 papers 4 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229 •
8

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 44 •
4
commented 2 papers 5 months ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57 •
4

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •
50
New activity in timm/vit_little_patch16_reg4_gap_256.sbb_in1k 7 months ago

Loss exploding to nan

31
#1 opened 7 months ago by
tony0278611
commented 2 papers 8 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •
50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •
50
New activity in timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k 8 months ago

Training recipe

#2 opened 8 months ago by
TonyCWang
commented 2 papers 9 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •
50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •
50
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs