meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 375k • 1.54k
💾🧠 Want to know how much VRAM you will need to train your model? Now you can use this app: enter a torch/tensorflow summary or the parameter count and get an estimate of the required memory! Try it at howmuchvram.com. Everything is open source, so you can contribute in the repo: https://github.com/AlexBodner/How_Much_VRAM Leave it a star ⭐
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133