Visual Question Answering
Transformers
Safetensors
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model

How can I fine-tune the model?

#2 opened about 1 year ago by deleted

Add new task tag

#1 opened over 1 year ago by merve