Evaluate Comparison

https://github.com/huggingface/evaluate/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

lvwerra authored a paper 18 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

sasha authored a paper 24 days ago

Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model

sasha authored a paper 24 days ago

Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI

View all activity

lvwerra

authored a paper 18 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 23 days ago • 33

sasha

authored 3 papers 24 days ago

Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model

Paper • 2211.02001 • Published Nov 3, 2022

Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI

Paper • 2409.14160 • Published Sep 21, 2024 • 3

From Efficiency Gains to Rebound Effects: The Problem of Jevons' Paradox in AI's Polarized Environmental Debate

Paper • 2501.16548 • Published Jan 27

evaluate-bot

updated 2 Spaces about 1 month ago

McNemar

Exact Match

lvwerra

authored a paper 4 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 73

lvwerra

authored a paper 7 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 200

sasha

authored a paper 7 months ago

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4 • 35

lvwerra

authored a paper 9 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 243

lvwerra

authored a paper 10 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 63

lvwerra

authored a paper 12 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 24

lvwerra

updated 2 Spaces about 1 year ago

Exact Match

McNemar

lvwerra

authored 3 papers over 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 97

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 148

sasha

authored 2 papers almost 2 years ago

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Paper • 2311.16863 • Published Nov 28, 2023 • 6

What's in the Box? A Preliminary Analysis of Undesirable Content in the Common Crawl Corpus

Paper • 2105.02732 • Published May 6, 2021

lvwerra

authored a paper about 2 years ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122