Video vision - a carlizor Collection

Video vision

updated Jun 18

lmms-lab/LLaVA-NeXT-Video-32B-Qwen
Video-Text-to-Text • 33B • Updated Oct 4, 2024 • 20.3k • 15

lmms-lab/LLaVA-Video-72B-Qwen2
Text Generation • 73B • Updated Oct 25, 2024 • 406 • 20

tencent/DepthCrafter
Depth Estimation • Updated Jul 30 • 12k • 100

Vision-CAIR/LongVU_Qwen2_7B
Video-Text-to-Text • 8B • Updated Feb 28 • 140 • 73

facebook/vjepa2-vitl-fpc64-256
Video Classification • 0.3B • Updated Aug 11 • 121k • 160