Image-Text-to-Text
Safetensors
Japanese
llava
conversational