Mitsua/mitsua-japanese-clip-vit-b-16 Zero-Shot Image Classification β’ 0.2B β’ Updated Dec 9, 2024 β’ 4 β’ 7
Running 557 557 Talking Face Generation with Multilingual TTS π Generate a talking face video from text in multiple languages