2 37 1

Jiarui Yao

FlippyDora

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper 4 days ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper 4 days ago

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

View all activity

Organizations

upvoted 4 papers 4 days ago

upvoted a paper 7 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 13 days ago • 119

upvoted a paper 11 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published 13 days ago • 13

upvoted a paper 12 days ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 13 days ago • 228

updated a model 20 days ago

jrtmp/preditive-mask

Updated 20 days ago

published a model 20 days ago

jrtmp/preditive-mask

Updated 20 days ago

updated a dataset 25 days ago

CorrectKLinRL/math500

Viewer • Updated 25 days ago • 500 • 34

published a dataset 25 days ago

CorrectKLinRL/math500

Viewer • Updated 25 days ago • 500 • 34

updated a model 27 days ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf

2B • Updated 27 days ago • 60

published a model 27 days ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf

2B • Updated 27 days ago • 60

updated a model 27 days ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf

2B • Updated 27 days ago • 18

published a model 27 days ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf

2B • Updated 27 days ago • 18

updated a dataset 27 days ago

CorrectKLinRL/olympiadbench

Viewer • Updated 27 days ago • 674 • 43

published a dataset 27 days ago

CorrectKLinRL/olympiadbench

Viewer • Updated 27 days ago • 674 • 43

updated a dataset 27 days ago

CorrectKLinRL/minerva_math

Viewer • Updated 27 days ago • 272 • 46

published a dataset 27 days ago

CorrectKLinRL/minerva_math

Viewer • Updated 27 days ago • 272 • 46

updated a dataset 27 days ago

CorrectKLinRL/amc23

Viewer • Updated 27 days ago • 40 • 35

Jiarui Yao

AI & ML interests

Recent Activity

Organizations

FlippyDora's activity