Arham
arhamk
AI & ML interests
Machine Learning, LLM
Models (15)
arhamk/ppo-LunarLander-v2-2 • Reinforcement Learning
arhamk/a2c-PandaReachDense-v2 • Reinforcement Learning • 2
arhamk/llama2-qlora-sft • 1
arhamk/llama2-finance-sft • Text Generation
arhamk/vizdoom_health_gathering_supreme • Reinforcement Learning
arhamk/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning
arhamk/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • 5
arhamk/q-Taxi-v3 • Reinforcement Learning
arhamk/a2c-AntBulletEnv-v0 • Reinforcement Learning • 1
arhamk/ppo-Pyramids • Reinforcement Learning • 11