Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale • Paper • 2509.14008 • Published Sep 17 • 87
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic • Paper • 2509.01363 • Published Sep 1 • 58
Train Long, Think Short: Curriculum Learning for Efficient Reasoning • Paper • 2508.08940 • Published Aug 12 • 26
An Embarrassingly Simple Defense Against LLM Abliteration Attacks • Paper • 2505.19056 • Published May 25 • 6
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think • Paper • 2504.20708 • Published Apr 29 • 23
Towards Data-Efficient Pretraining for Atomic Property Prediction • Paper • 2502.11085 • Published Feb 16 • 3
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society • Paper • 2303.17760 • Published Mar 31, 2023 • 1
Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right? • Paper • 2305.09275 • Published May 16, 2023 • 1
SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training? • Paper • 2402.01832 • Published Feb 2, 2024 • 6
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch • Paper • 2406.14563 • Published Jun 20, 2024 • 30
Mindstorms in Natural Language-Based Societies of Mind • Paper • 2305.17066 • Published May 26, 2023 • 3