distilbert/distilbert-base-uncased-finetuned-sst-2-english Text Classification • 67M • Updated Dec 19, 2023 • 4.03M • 873
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28, 2025 • 32
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205