Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
fs-tom
's Collections
music
inpainting
DAW
vision
code
chat
TTS
embed
music stem separation
3d
video
image
talking head
asr
benchmarks
vision
updated
Jul 29, 2024
Upvote
-
Runtime error
Featured
37
Paligemma Tracking
🐨
37
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
4B
•
Updated
Dec 10, 2025
•
35.2k
•
970
Paused
21
Video Llava
🐨
21
Generate descriptions by uploading images or videos
Upvote
-
Share collection
View history
Collection guide
Browse collections