nanoverl (nanoverl)

koalazf99

authored 2 papers 7 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 48

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 49

koalazf99

updated a dataset 8 months ago

nanoverl/aime2025_repeated_8x

Viewer • Updated May 8, 2025 • 240 • 8

koalazf99

published a dataset 8 months ago

nanoverl/aime2025_repeated_8x

Viewer • Updated May 8, 2025 • 240 • 8

koalazf99

updated a dataset 8 months ago

nanoverl/aime2025

Viewer • Updated May 8, 2025 • 30 • 4

koalazf99

published a dataset 8 months ago

nanoverl/aime2025

Viewer • Updated May 8, 2025 • 30 • 4

koalazf99

in nanoverl/finqa 8 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

koalazf99

published a dataset 8 months ago

nanoverl/finqa

Viewer • Updated May 6, 2025 • 1.15k • 6

koalazf99

updated a dataset 8 months ago

nanoverl/finqa

Viewer • Updated May 6, 2025 • 1.15k • 6

koalazf99

authored a paper 9 months ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3, 2025 • 35

koalazf99

authored a paper 11 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18, 2025 • 19

koalazf99

updated 6 datasets 11 months ago

koalazf99

published 3 datasets 11 months ago

nanoverl/deepscaler

Viewer • Updated Feb 16, 2025 • 40.3k • 24

nanoverl/olympiad_bench

Viewer • Updated Feb 16, 2025 • 675 • 15

nanoverl/minerva

Viewer • Updated Feb 16, 2025 • 272 • 9

AI & ML interests

Team members 1

nanoverl's activity

[bot] Conversion to Parquet