Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Chenyan Xiong Research Group at CMU

university
https://www.cs.cmu.edu/~cx/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

yuzc19  updated a model 28 days ago
cx-cmu/repro-rephraser-4B
yuzc19  updated a model 28 days ago
cx-cmu/repro-rephraser-1B
zhongshsh  submitted a paper about 1 month ago
Agent Skills: A Data-Driven Analysis of Claude Skills for Extending Large Language Model Functionality
View all activity

Papers

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

View all Papers

Chenyan Xiong's profile pictureJingyuan He's profile pictureMahima Jagadeesh Patel's profile picturezhihan zhang's profile pictureCassandra Cohen's profile picture Zichun Yu's profile pictureKira Jones's profile pictureyujiang wu's profile pictureShanshan Zhong's profile pictureJoao Coelho's profile pictureEthan Ning's profile picture

cx-cmu 's collections 1

RePro
Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
  • cx-cmu/repro-rephraser-4B

    Text Generation • 196k • Updated 28 days ago • 2.72k • 2
  • cx-cmu/repro-rl-data

    Viewer • Updated Oct 18, 2025 • 41k • 16
  • RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

    Paper • 2510.10681 • Published Oct 12, 2025 • 6
  • cx-cmu/repro-rephrased-data-72B

    Viewer • Updated Oct 18, 2025 • 39M • 618
RePro
Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
  • cx-cmu/repro-rephraser-4B

    Text Generation • 196k • Updated 28 days ago • 2.72k • 2
  • cx-cmu/repro-rl-data

    Viewer • Updated Oct 18, 2025 • 41k • 16
  • RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

    Paper • 2510.10681 • Published Oct 12, 2025 • 6
  • cx-cmu/repro-rephrased-data-72B

    Viewer • Updated Oct 18, 2025 • 39M • 618
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs