7 8

Sen Yang

ringos

https://ringos.github.io/

ringos

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

liked a Space 7 days ago

HuggingFaceFW/finephrase

upvoted a paper 6 months ago

ExGRPO: Learning to Reason from Experience

View all activity

Organizations

upvoted a paper 5 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published 7 days ago • 177

liked a Space 7 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

206

Explore synthetic data experiments as an interactive bookshelf

upvoted a paper 6 months ago

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 82

upvoted a paper 7 months ago

The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang

Paper • 2509.00425 • Published Aug 30, 2025 • 12

upvoted a paper 8 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 136

liked a dataset 10 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 13.4k • 715

upvoted a paper 11 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

liked a dataset about 1 year ago

di-zhang-fdu/AIME_1983_2024

Viewer • Updated Mar 3, 2025 • 933 • 5k • 40

upvoted 2 papers about 1 year ago

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19, 2025 • 36

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22, 2025 • 61

updated a dataset about 1 year ago

ringos/output_Llama-3.1-8B-simpleqa-0_1000-m_generation-n_128-t_1.0-k_50-p_0.95-l_128

Updated Dec 25, 2024 • 65

liked a Space over 1 year ago

Scaling test-time compute

📈

593

Run advanced search strategies to boost LLM problem solving

updated 5 datasets over 1 year ago

ringos/output_Llama-3.1-8B-simpleqa-0_-1-m_generation-n_128-t_1.0-k_50-p_0.95-l_128

Updated Dec 17, 2024 • 128

ringos/mistral_nemo_base-mmlu-val

Viewer • Updated Dec 16, 2024 • 18.7k • 18

ringos/llama-3_1-8b-mmlu-val

Viewer • Updated Dec 16, 2024 • 18.7k • 27

ringos/output_Mistral-Nemo-Base-2407-simpleqa-0_1000-m_generation-n_32-t_1.0-k_40-p_0.9-l_128

Viewer • Updated Dec 2, 2024 • 216 • 34

ringos/simple_qa

Viewer • Updated Dec 2, 2024 • 4.33k • 208

liked 2 datasets over 1 year ago

HuggingFaceTB/smoltalk

Viewer • Updated Feb 10, 2025 • 2.2M • 9.48k • 393

MingZhong/crosseval

Viewer • Updated Oct 1, 2024 • 1.4k • 12 • 6

updated a dataset over 1 year ago

ringos/bio-detailed-Llama-3.1-8B-gemma2-rm-gold_True-n_32

Viewer • Updated Nov 13, 2024 • 371 • 10