Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Chenyan Xiong Research Group at CMU

university
https://www.cs.cmu.edu/~cx/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

ethanning  authored a paper 1 day ago
Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines
ethanning  submitted a paper 1 day ago
Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines
yuzc19  updated a model about 1 month ago
cx-cmu/repro-rephraser-4B
View all activity

Papers

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

View all Papers

Chenyan Xiong's profile pictureJingyuan He's profile pictureMahima Jagadeesh Patel's profile picturezhihan zhang's profile pictureCassandra Cohen's profile picture Zichun Yu's profile pictureKira Jones's profile pictureyujiang wu's profile pictureShanshan Zhong's profile pictureJoao Coelho's profile pictureEthan Ning's profile picture

cx-cmu 's collections 1

RePro
Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
  • cx-cmu/repro-rephraser-4B

    Text Generation • 196k • Updated Feb 18 • 66 • 2
  • cx-cmu/repro-rl-data

    Viewer • Updated Oct 18, 2025 • 41k • 22
  • RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

    Paper • 2510.10681 • Published Oct 12, 2025 • 6
  • cx-cmu/repro-rephrased-data-72B

    Viewer • Updated Oct 18, 2025 • 39M • 492
RePro
Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
  • cx-cmu/repro-rephraser-4B

    Text Generation • 196k • Updated Feb 18 • 66 • 2
  • cx-cmu/repro-rl-data

    Viewer • Updated Oct 18, 2025 • 41k • 22
  • RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

    Paper • 2510.10681 • Published Oct 12, 2025 • 6
  • cx-cmu/repro-rephrased-data-72B

    Viewer • Updated Oct 18, 2025 • 39M • 492
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs