Qwen/Qwen3-VL-235B-A22B-Instruct Image-Text-to-Text β’ 236B β’ Updated 24 days ago β’ 97.7k β’ β’ 304
Running 3.34k 3.34k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade 985 985 Model Memory Utility π Calculate vRAM needed for model training and inference
Running 14 14 Transformers Modular Refactor π» Interactive analyzer for modular models in Transformers lib