Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dongguanting 's Collections
AEPO
ARPO
Tool-Star
RAG-Critic

AEPO

updated 6 days ago

The official datasets and model checkpoints of AEPO

Upvote
3

  • Agentic Entropy-Balanced Policy Optimization

    Paper • 2510.14545 • Published 11 days ago • 96

  • dongguanting/Qwen3-8B-AEPO-DeepSearch

    Text Generation • 8B • Updated about 6 hours ago • 54

  • dongguanting/Qwen3-14B-AEPO-DeepSearch

    Robotics • 15B • Updated 6 days ago • 44 • 1

  • dongguanting/Qwen2.5-7B-AEPO

    Text Generation • 8B • Updated about 6 hours ago • 44
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs