Hu Xiaoyan

Yann1021

https://yannxiaoyanhu.github.io

AI & ML interests

Reinforcement learning

Recent Activity

authored a paper about 2 months ago

Provably Efficient CVaR RL in Low-rank MDPs

authored a paper about 2 months ago

PAK-UCB Contextual Bandit: An Online Learning Approach to Prompt-Aware Selection of Generative Models and LLMs

authored a paper about 2 months ago

A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models

Organizations

None yet

Papers 3

arxiv:2410.13287

arxiv:2406.07451

arxiv:2311.11965

Models 0

None public yet

Datasets 0

None public yet