arxiv:2503.03588
Zujie Liang
jokieleung
AI & ML interests
LLM/VLM Agents, reasoning
Recent Activity
upvoted a paper 23 days ago
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
upvoted a paper about 1 month ago
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
upvoted a paper about 2 months ago
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents