Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
fs-tom 's Collections
music
inpainting
DAW
vision
code
chat
TTS
embed
music stem separation
3d
video
image
talking head
asr
benchmarks

vision

updated Jul 29, 2024
Upvote
-

  • Runtime error
    Featured
    37

    Paligemma Tracking

    🐨
    37


  • microsoft/Phi-3-vision-128k-instruct

    Text Generation • 4B • Updated Dec 10, 2025 • 35.2k • 970

  • Paused
    21

    Video Llava

    🐨
    21

    Generate descriptions by uploading images or videos

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs