andito
·
AI & ML interests
Multimodal models, VLM and TTS
Recent Activity
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
Supercharge your OCR Pipelines with Open Models
view article
TimeScope: How Long Can Your Video Large Multimodal Model Go?
view article
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
view article
nanoVLM: The simplest repository to train your VLM in pure PyTorch
view article
Vision Language Models (Better, Faster, Stronger)