Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
carlizor 's Collections
Agents
Multi lora spaces
TTS
Utilities
Document retrieval / chat
Flux
Image restoration
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium

Video vision

updated Jun 18
Upvote
-

  • lmms-lab/LLaVA-NeXT-Video-32B-Qwen

    Video-Text-to-Text • 33B • Updated Oct 4, 2024 • 20.3k • 15

  • lmms-lab/LLaVA-Video-72B-Qwen2

    Text Generation • 73B • Updated Oct 25, 2024 • 406 • 20

  • tencent/DepthCrafter

    Depth Estimation • Updated Jul 30 • 12k • 100

  • Vision-CAIR/LongVU_Qwen2_7B

    Video-Text-to-Text • 8B • Updated Feb 28 • 140 • 73

  • facebook/vjepa2-vitl-fpc64-256

    Video Classification • 0.3B • Updated Aug 11 • 121k • 160
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs