WangShouli

WangSl2004

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

SWE-bench/SWE-bench

Viewer • Updated Apr 29, 2025 • 21.5k • 2.98k • 18

upvoted a paper 3 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 53

upvoted a paper 4 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

upvoted a paper 7 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 31

New activity in RLAIF-V/RLPR-Benchmarks 8 months ago

Create README.md

#2 opened 8 months ago by resilience

New activity in RLAIF-V/RLPR-Train-Dataset 8 months ago

Create README.md

#1 opened 8 months ago by resilience

New activity in RLAIF-V/RLPR-Qwen2.5-7B-Base 8 months ago

Update README.md

#1 opened 8 months ago by resilience

updated a dataset 8 months ago

RLAIF-V/RLPR-Benchmarks

Viewer • Updated Jun 22, 2025 • 638 • 66

published a dataset 8 months ago

RLAIF-V/RLPR-Benchmarks

Viewer • Updated Jun 22, 2025 • 638 • 66

updated a dataset 8 months ago

RLAIF-V/RLPR-Train-Dataset

Viewer • Updated Jun 22, 2025 • 77.7k • 19

published a dataset 8 months ago

RLAIF-V/RLPR-Train-Dataset

Viewer • Updated Jun 22, 2025 • 77.7k • 19

updated a model 8 months ago

RLAIF-V/RLPR-Qwen2.5-7B-Base

8B • Updated Jun 22, 2025 • 3 • 1

published a model 8 months ago

RLAIF-V/RLPR-Qwen2.5-7B-Base

8B • Updated Jun 22, 2025 • 3 • 1

upvoted a paper over 1 year ago

LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models

Paper • 2410.09342 • Published Oct 12, 2024 • 39
