Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
carlizor
's Collections
Agents
Multi lora spaces
TTS
Utilities
Document retrieval / chat
Flux
Image restoration
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium
Video vision
updated
Jun 18
Upvote
-
lmms-lab/LLaVA-NeXT-Video-32B-Qwen
Video-Text-to-Text
•
33B
•
Updated
Oct 4, 2024
•
20.3k
•
15
lmms-lab/LLaVA-Video-72B-Qwen2
Text Generation
•
73B
•
Updated
Oct 25, 2024
•
406
•
20
tencent/DepthCrafter
Depth Estimation
•
Updated
Jul 30
•
12k
•
100
Vision-CAIR/LongVU_Qwen2_7B
Video-Text-to-Text
•
8B
•
Updated
Feb 28
•
140
•
73
facebook/vjepa2-vitl-fpc64-256
Video Classification
•
0.3B
•
Updated
Aug 11
•
121k
•
160
Upvote
-
Share collection
View history
Collection guide
Browse collections