AI & ML interests
None yet
Recent Activity
Organizations
None yet
lhl616/Qwen3-8B-axon-error-aware-128-8-ratio
8B
•
Updated
•
1
lhl616/Qwen3-8B-axon-error-aware-128-8-mixed
8B
•
Updated
•
2
lhl616/Qwen3-8B-Base-axon-ppo
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-grpo-step-128-8
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-ratio-new
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-passk
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-step-2
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-start-relu
8B
•
Updated
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-start
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-relu-normal-real
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-normal-mode-fixed0.4-relu
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-0.5-0.8-start-relu
8B
•
Updated
•
1
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-0.5-0.8-start
8B
•
Updated
•
1
lhl616/Qwen3-4B-error-aware-128-8-standard-0.5-0.8-start-new
4B
•
Updated
•
1
lhl616/Qwen3-4B-error-aware-128-8-dense-nstd-0.5-0.8-start-new
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-grpo-step-128-8
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-grpo-nstd-step-128-8-fixed_denominator
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-ratio
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-mixed
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-relu-0.5-0.8
4B
•
Updated
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-passk
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-start
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-relu-normal-real
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-relu-normal-new
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-normal-new
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-normal-fixed-denominator
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-nstd-0.5-0.8-new-relu
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-0.5-0.8-start
4B
•
Updated
•
1
lhl616/Qwen3-4B-axon-error-aware-128-8-dense-0.5-0.8-penalty-2
4B
•
Updated
•
1
4B
•
Updated
•
1