Run code and analyze data in a Jupyter notebook
A new open-source dataset for training VLMs
The Werewolf Benchmark tests LLMs’ social intelligence.
gradio chat app MCP and gpt-oss powered