EvalEval Coalition

community
https://evalevalai.com/
evaluatingevals
evaleval

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! The coalition is hosted by Hugging Face, the University of Edinburgh, and EleutherAI.

Recent Activity

  • evijit published a dataset 12 days ago: evaleval/every_eval_ever
  • felfri authored a paper 28 days ago: Measuring and Guiding Monosemanticity
  • iamgroot42 authored a paper 2 months ago: Exploiting Leaderboards for Large-Scale Distribution of Malicious Models

Members: Yacine Jernite, Jenny Chim, Alina Leidinger, Margaret Mitchell, Leshem Choshen, Irene Solaiman, Ali El Filali, Joseph [open/acc] Pollack, Felix Friedrich, Mowafak Allaham, Prajna Soni, Jennifer Mickel, Usman Gohar, Shubham Singh, Mubashara Akhtar, Avijit Ghosh, Anshuman Suri, Canyu Chen, Kevin Wei, Aurélien-Morgan CLAUDON, Levent Sagun, Monojit, wave, Amita Shukla, Jan Batzner, Andrew Tran

evaleval's collections (1)

Resources: Bias, Stereotypes, and Representational Harms
Linking collected resources for this category that have a dataset, model, or demo on Hugging Face or a paper on arXiv (linked through Hugging Face)
  • BiasDetection (Space, status: Runtime error): Analyze bias and toxicity in language models
  • StableBias (Space, status: Runtime error)
  • McGill-NLP/stereoset (dataset, updated Jan 23, 2024)
  • nyu-mll/crows_pairs (dataset, updated Jan 18, 2024)