We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.
Display AI evaluation results
Create and download an AI system evaluation scorecard
Generate a detailed AI evaluation report