Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.08127

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 58
Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 83
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation

Paper • 2504.13072 • Published Apr 17 • 13
What are you sinking? A geometric approach on attention sink

Paper • 2508.02546 • Published Aug 4 • 1

Transformers model

sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • 22.7M • Updated Mar 6 • 136M • • 4.06k
sentence-transformers/paraphrase-xlm-r-multilingual-v1

Sentence Similarity • 0.3B • Updated Mar 6 • 228k • • 69
rtzr/ko-gemma-2-9b-it

Text Generation • 9B • Updated Jul 15, 2024 • 4.24k • • 91
Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 221k • 1.81k

Reasoning, Thinking, RL and Test-Time Scaling

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39
Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46
Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 47

PDFTriage: Question Answering over Long, Structured Documents

Paper • 2309.08872 • Published Sep 16, 2023 • 53
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 81
Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 41
Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 17

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 237
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 37
BLINK: Multimodal Large Language Models Can See but Not Perceive

Paper • 2404.12390 • Published Apr 18, 2024 • 26
RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 39

How_LLMS_Think _and_Reason_Papers

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 58
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 421

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 31
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107
Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 26
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 104

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

Running

226

226

BigCodeBench Leaderboard

🥇

Explore and analyze code completion benchmarks
Running

1.21k

1.21k

UGI Leaderboard

📢

Uncensored General Intelligence Leaderboard
Running

4.65k

4.65k

LMArena Leaderboard

🏆

Display LMArena Leaderboard
Running on CPU Upgrade

6.63k

6.63k

MTEB Leaderboard

🥇

Embedding Leaderboard

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 58
Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 83
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation

Paper • 2504.13072 • Published Apr 17 • 13
What are you sinking? A geometric approach on attention sink

Paper • 2508.02546 • Published Aug 4 • 1

How_LLMS_Think _and_Reason_Papers

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 58
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 421

Transformers model

sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • 22.7M • Updated Mar 6 • 136M • • 4.06k
sentence-transformers/paraphrase-xlm-r-multilingual-v1

Sentence Similarity • 0.3B • Updated Mar 6 • 228k • • 69
rtzr/ko-gemma-2-9b-it

Text Generation • 9B • Updated Jul 15, 2024 • 4.24k • • 91
Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 221k • 1.81k

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 31
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107
Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 26
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 104

Reasoning, Thinking, RL and Test-Time Scaling

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39
Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46
Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 47

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

PDFTriage: Question Answering over Long, Structured Documents

Paper • 2309.08872 • Published Sep 16, 2023 • 53
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 81
Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 41
Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 17

Running

226

226

BigCodeBench Leaderboard

🥇

Explore and analyze code completion benchmarks
Running

1.21k

1.21k

UGI Leaderboard

📢

Uncensored General Intelligence Leaderboard
Running

4.65k

4.65k

LMArena Leaderboard

🏆

Display LMArena Leaderboard
Running on CPU Upgrade

6.63k

6.63k

MTEB Leaderboard

🥇

Embedding Leaderboard

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 237
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 37
BLINK: Multimodal Large Language Models Can See but Not Perceive

Paper • 2404.12390 • Published Apr 18, 2024 • 26
RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 39

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs