Domain-Specific-Datasets Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published Jul 23 • 36
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published Jul 23 • 36
VLMs openbmb/MiniCPM-V-2_6 Image-Text-to-Text • 8B • Updated Jun 13 • 96.9k • 1.01k microsoft/Florence-2-large-ft Image-Text-to-Text • 0.8B • Updated Aug 4 • 36.8k • 374
Domain-Specific-Datasets Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published Jul 23 • 36
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published Jul 23 • 36
VLMs openbmb/MiniCPM-V-2_6 Image-Text-to-Text • 8B • Updated Jun 13 • 96.9k • 1.01k microsoft/Florence-2-large-ft Image-Text-to-Text • 0.8B • Updated Aug 4 • 36.8k • 374