Collections

Collections including paper arxiv:2510.13998

inference optimization

Low-Rank Adapters Meet Neural Architecture Search for LLM Compression

Paper • 2501.16372 • Published Jan 23 • 12
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published Jan 28 • 7
Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 32
Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published Feb 28 • 8

Interesting Papers

BitNet Distillation

Paper • 2510.13998 • Published 12 days ago • 49

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published 12 days ago • 30
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published 11 days ago • 36
BitNet Distillation

Paper • 2510.13998 • Published 12 days ago • 49
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published 5 days ago • 39

Run on CPU Optimizations

BitNet Distillation

Paper • 2510.13998 • Published 12 days ago • 49

Read Later Stack

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 14 days ago • 31
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published 18 days ago • 9
Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published 21 days ago • 22
DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 14 days ago • 26

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 28
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 88
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 22
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13 • 8

Paper2Web: Let's Make Your Paper Alive!

Paper • 2510.15842 • Published 10 days ago • 24
Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published 21 days ago • 106
BitNet Distillation

Paper • 2510.13998 • Published 12 days ago • 49

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published about 1 month ago • 76
Robot Learning: A Tutorial

Paper • 2510.12403 • Published 13 days ago • 88
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published 12 days ago • 60
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published 20 days ago • 51

BitNet Distillation

Paper • 2510.13998 • Published 12 days ago • 49

Large Language Model (LLM) and NLP related papers.

LoRA+: Efficient Low Rank Adaptation of Large Models

Paper • 2402.12354 • Published Feb 19, 2024 • 6
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 23
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Paper • 2402.13249 • Published Feb 20, 2024 • 13
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69
