Diffusion Language Models are Super Data Learners Paper • 2511.03276 • Published 17 days ago • 116
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26 • 67
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 19
🔱 Sailor2 Language Models Collection • Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 3 days ago • 30
LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation Paper • 2410.13846 • Published Oct 17, 2024 • 2
Qwen1.5 Collection • Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Jul 21 • 211