PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 3 days ago • 44
Fanar Collection A powerful and versatile family of Arabic Large Language Models (LLMs) designed for a wide range of tasks. • 3 items • Updated Jun 10 • 8
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 546
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 • 78
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces Paper • 2410.13194 • Published Oct 17, 2024 • 1
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12, 2024 • 10
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 261
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16, 2024 • 81
SPAR: Personalized Content-Based Recommendation via Long Engagement Attention Paper • 2402.10555 • Published Feb 16, 2024 • 35
DPO vs KTO vs IPO Collection A collection of datasets and models used for the "Aligning LLMs with Direct Preference Optimization Methods" blog post • 2 items • Updated Jan 16, 2024 • 12