Center for AI Safety

non-profit

https://www.safe.ai

Recent Activity

justinphan3110 updated a dataset 19 days ago

cais/hle-rolling

justinphan3110 new activity about 2 months ago

cais/hle: adds eval.yaml

justinphan3110 new activity about 2 months ago

cais/hle: Is it possible to add additional questions not currently in the dataset?

Papers

Humanity's Last Exam

Collections 2

Spaces 1

TextQuests

📟

How Good are LLMs at Text-Based Video Games?

Models 8

Datasets 12

cais/hle-rolling

Viewer • Updated 19 days ago • 2.62k • 1.03k • 14

cais/hle

Benchmark • Updated Jan 20 • 2.5k • 42.4k • 738

cais/rli-public-set

Updated Nov 3, 2025 • 302 • 5

cais/rli-example-deliverables

Viewer • Updated Nov 1, 2025 • 176 • 179

cais/wmdp-cyber-forget-corpus

Viewer • Updated May 29, 2025 • 1k • 765 • 3

cais/wmdp-bio-forget-corpus

Viewer • Updated May 29, 2025 • 24.5k • 2k • 1

cais/MASK

Viewer • Updated Mar 20, 2025 • 1k • 2.51k • 11

cais/imagenet-o

Viewer • Updated May 27, 2024 • 2k • 84

cais/wmdp

Viewer • Updated Apr 27, 2024 • 3.67k • 30.3k • 23

cais/wmdp-mmlu-auxiliary-corpora

Viewer • Updated Apr 25, 2024 • 8.88k • 559 • 4
