Guanhua Huang's picture

1 3

Guanhua Huang

Carlanlarkk

AI & ML interests

None yet

Recent Activity

authored a paper 23 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

upvoted a paper 23 days ago

Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning

upvoted a paper 23 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

View all activity

Organizations

None yet

authored a paper 23 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published 30 days ago • 45

upvoted 2 papers 23 days ago

Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning

Paper • 2509.25052 • Published Sep 29 • 4

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published 30 days ago • 45

commented a paper 23 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published 30 days ago • 45 •

upvoted a paper 24 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 52

authored 6 papers 24 days ago

Are AI-Generated Text Detectors Robust to Adversarial Perturbations?

Paper • 2406.01179 • Published Jun 3, 2024

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 52

AGILE: A Novel Reinforcement Learning Framework of LLM Agents

Paper • 2405.14751 • Published May 23, 2024

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Paper • 2507.04952 • Published Jul 7 • 9

Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework

Paper • 2507.06829 • Published Jul 9

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 67