Running 3.55k The Ultra-Scale Playbook 🌌 3.55k The ultimate guide to training LLM on large GPU Clusters
DETRs Beat YOLOs on Real-time Object Detection Paper • 2304.08069 • Published Apr 17, 2023 • 15
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models Paper • 2407.01906 • Published Jul 2, 2024 • 45
Sleeping 1 Scientific Argument Recommender 📚 1 Analyse texts to identify arguments and relationships
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Apr 15, 2024 • 190