Zikang Shan's picture

Zikang Shan PRO

zkshan2002
·

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset 1 minute ago
zktmp/vpt_gen1-8b-distill-lam0.95-gen_critic
updated a dataset 3 minutes ago
zktmp/vpt_gen1-8b-distill-gen_critic
updated a model about 3 hours ago
zktmp/sft-bs32-lr5e-5-step734
View all activity

Organizations

Reinforced Token Optimization's profile picture zktmp's profile picture