⚔️ BigCodeArena

Updated Oct 13

Unveiling More Reliable Human Preferences in Code Generation via Execution


  • BigCodeArena 🚀

    Space • Running • 37

    Compare two AI models by sending them code and seeing their responses


  • BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

    Paper • 2510.08697 • Published Oct 9 • 35

  • bigcode/bigcodearena-raw-14k

    Viewer • Updated Oct 13 • 14.1k • 47 • 1

  • bigcode/bigcodearena-preference-5k

    Viewer • Updated Oct 13 • 4.73k • 94 • 1

  • bigcode/bigcodereward

    Viewer • Updated Oct 15 • 4.73k • 162 • 2

  • bigcode/bigcodereward-experiment-results

    Viewer • Updated Oct 13 • 141k • 439

  • bigcode/autocodearena-v0

    Viewer • Updated Oct 15 • 600 • 108 • 2