Zikang Shan PRO
zkshan2002
·
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a dataset
1 minute ago
zktmp/vpt_gen1-8b-distill-lam0.95-gen_critic
updated
a dataset
3 minutes ago
zktmp/vpt_gen1-8b-distill-gen_critic
updated
a model
about 3 hours ago
zktmp/sft-bs32-lr5e-5-step734