MARSHAL - a nics-efc Collection


Papers from the NICS-EFFALG Team

MARSHAL

updated 26 days ago

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs

MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games

Paper • 2510.15414 • Published Oct 17 • 1

Note: This paper has been updated to v2 on arXiv, titled "MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs".

nics-efc/MARSHAL-Generalist-Qwen3-4B

Text Generation • 4B • Updated 26 days ago • 39
nics-efc/MARSHAL-Generalist-Qwen3-8B

Text Generation • 8B • Updated 26 days ago • 30
nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

Text Generation • 4B • Updated 26 days ago • 35
nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

Text Generation • 4B • Updated 26 days ago • 31
nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

Text Generation • 4B • Updated 26 days ago • 29