Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
/
SmolVLM2-256M-Video-Instruct
like
83
Follow
Hugging Face Smol Models Research
2.98k
Image-Text-to-Text
Transformers
ONNX
Safetensors
12 datasets
English
smolvlm
conversational
arxiv:
2504.05299
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
13
Deploy
Use this model
1afb5e4
SmolVLM2-256M-Video-Instruct
/
onnx
/
vision_encoder_quantized.onnx
Commit History
Upload ONNX weights
1afb5e4
verified
Xenova
HF Staff
commited on
Feb 13