Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models Paper • 2509.06949 • Published Sep 8, 2025 • 55
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 14, 2025 • 60
Efficient Continual Pre-training by Mitigating the Stability Gap Paper • 2406.14833 • Published Jun 21, 2024 • 20
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper • 2402.00159 • Published Jan 31, 2024 • 65
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision Paper • 2312.09390 • Published Dec 14, 2023 • 33