Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.14482

Theory and Representation learning

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23, 2025 • 31
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 48
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms

Paper • 2511.04217 • Published Nov 6, 2025 • 17
Large Language Models as Markov Chains

Paper • 2410.02724 • Published Oct 3, 2024 • 33

action-condition generation

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109
V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 34

DVPS Scientific Watch

Collection of external scientific material relevant to the project

HuggingFaceFW/finetranslations

Viewer • Updated Jan 9 • 3.33B • 117k • 288
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Paper • 2411.00136 • Published Oct 31, 2024
The Illusion of Readiness in Health AI

Paper • 2509.18234 • Published Sep 22, 2025 • 1
The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?

Paper • 2601.07220 • Published Jan 12

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 104
An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark

Paper • 2405.19707 • Published May 30, 2024 • 9
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Paper • 2410.08049 • Published Oct 10, 2024 • 8

Theory and Representation learning

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 34
Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published Sep 25, 2025 • 11

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 324
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Paper • 2512.25075 • Published Dec 31, 2025 • 16
Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Paper • 2512.24176 • Published Dec 30, 2025 • 8

Representation Learning

End-to-End Vision Tokenizer Tuning

Paper • 2505.10562 • Published May 15, 2025 • 22
Global and Local Entailment Learning for Natural World Imagery

Paper • 2506.21476 • Published Jun 26, 2025 • 1
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 307
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 62

Theory and Representation learning

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23, 2025 • 31
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 48
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms

Paper • 2511.04217 • Published Nov 6, 2025 • 17
Large Language Models as Markov Chains

Paper • 2410.02724 • Published Oct 3, 2024 • 33

Theory and Representation learning

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 34
Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published Sep 25, 2025 • 11

action-condition generation

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109
V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 34

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 324
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Paper • 2512.25075 • Published Dec 31, 2025 • 16
Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Paper • 2512.24176 • Published Dec 30, 2025 • 8

DVPS Scientific Watch

Collection of external scientific material relevant to the project

HuggingFaceFW/finetranslations

Viewer • Updated Jan 9 • 3.33B • 117k • 288
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Paper • 2411.00136 • Published Oct 31, 2024
The Illusion of Readiness in Health AI

Paper • 2509.18234 • Published Sep 22, 2025 • 1
The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?

Paper • 2601.07220 • Published Jan 12

Representation Learning

End-to-End Vision Tokenizer Tuning

Paper • 2505.10562 • Published May 15, 2025 • 22
Global and Local Entailment Learning for Natural World Imagery

Paper • 2506.21476 • Published Jun 26, 2025 • 1
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 307
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 62

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 104
An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark

Paper • 2405.19707 • Published May 30, 2024 • 9
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Paper • 2410.08049 • Published Oct 10, 2024 • 8

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs