Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
oeohomos 's Collections
Embedding
Inbox
Multimode
Reasoning
Qwen
Deepseek Papers
RAG

Reasoning

updated Mar 13
Upvote
-

  • START: Self-taught Reasoner with Tools

    Paper • 2503.04625 • Published Mar 6 • 113

  • LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

    Paper • 2503.04724 • Published Mar 6 • 72

  • LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

    Paper • 2503.07536 • Published Mar 10 • 88

  • Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

    Paper • 2503.07572 • Published Mar 10 • 47

  • Gemini Embedding: Generalizable Embeddings from Gemini

    Paper • 2503.07891 • Published Mar 10 • 44
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs