Uniform Discrete Diffusion with Metric Path for Video Generation Paper • 2510.24717 • Published 1 day ago • 35
Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 6 days ago • 7
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 9 days ago • 61
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20 • 18
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5 • 15
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Paper • 2509.23909 • Published Sep 28 • 30
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated Apr 30 • 21
MolmoAct Collection All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated Sep 6 • 29
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published about 1 month ago • 7
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published Sep 28 • 44
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions Paper • 2509.17177 • Published Sep 21 • 13
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 70
view article Article 📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models By nvidia and 1 other • Aug 18 • 5
Towards A Generalist Code Embedding Model Based On Massive Data Synthesis Paper • 2505.12697 • Published May 19 • 2