Spaces

The AI App Directory

Running on Zero

Multimodal OCR3

nanonets2 / dots.ocr / olmOCR2 / chandraOCR

Running on Zero

Nanonets-OCR2-3B

Extract text from document images

Qwen Atari

Play Atari games using a vision-language model

Running on Zero

EmoAct MiMo

Controllable emotional TTS

DALL·E mini

Generate images from text prompts

CodeFormer

Enhance and restore old photos with faces

Video Face Swap

Video deepfake

Edge TTS Text To Speech

Generate speech from text using Microsoft Edge TTS

Running on Zero

FLUX.1 Kontext

Kontext image editing on FLUX[dev]

Running on Zero

Chatterbox-Multilingual-TTS

Chatterbox TTS supporting 23 languages

Running on Zero

Qwen3 VL HF Demo

Chat using Qwen3-VL for Image, Video, PDF, and GIF

Whisper Web

Convert spoken words into text

Running on CPU Upgrade

Open ASR Leaderboard

Display and request speech recognition model benchmarks

Running on CPU Upgrade

C4AI Command Models

Ask questions and get answers

Visualize Dataset (v2.0+ latest dataset format)

Visualize LeRobot Datasets

Running on Zero

Joy Caption Pre Alpha

Generate captions for images

Running on Zero

Flux.1-dev Upscaler

Upscale low-resolution images to high resolution

Running on Zero

Joy Caption Alpha Two

Generate captions for images in various styles

Running on Zero

vggt

VGGT (CVPR 2025)

WeShopAI Virtual Try On

WeShopAI Virtual Try On. Switch outfits with ease virtually.

Running on Zero

TRELLIS

Scalable and Versatile 3D Generation from images

Running on Zero

SDXL Text To Image

Generate images from text prompts

Wan2.2 14B Text2Video on AMD GPUs

Wan 2.2 14B

Running on Zero

Qwen Image Edit 2509

Generate edited images based on prompts and input images