AI & ML interests
None yet
Organizations
None yet
hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4_plus
Text Generation
• 2B • Updated hdong0/deepseek-Qwen-7B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4
Text Generation
• 8B • Updated • 1
hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4
Text Generation
• 2B • Updated • 3
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_8192_to_16384_nokl
Text Generation
• 8B • Updated • 6
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_to_16384_nokl
Text Generation
• 8B • Updated • 2
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_to_16384_nokl
Text Generation
• 8B • Updated • 6
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl
Text Generation
• 8B • Updated • 10
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_nokl
Text Generation
• 8B • Updated • 5
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_nokl
Text Generation
• 8B • Updated • 9
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_dapo_acc_4096_nokl
Text Generation
• 2B • Updated • 2
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_16384_nokl
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_4096_nokl
2B • Updated • 2
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_8192_nokl
Text Generation
• 2B • Updated • 2
hdong0/Qwen3-1.7B-Open-R1-GRPO_deepscaler_acc_16384
Text Generation
• 2B • Updated • 3
hdong0/Qwen3-1.7B-Open-R1-GRPO_deepscaler_acc_2048
Text Generation
• 2B • Updated • 4
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_16384
Text Generation
• 2B • Updated • 3
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_2048
Text Generation
• 2B • Updated • 5
hdong0/Qwen3-1.7B-Open-R1-GRPO_deepscaler_acc_8192
Text Generation
• 2B • Updated • 1
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_4096
Text Generation
• 2B • Updated • 3
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_8192
Text Generation
• 2B • Updated • 5
hdong0/Qwen3-1.7B-Open-R1-GRPO_deepscaler_acc_4096
Text Generation
• 2B • Updated • 3
hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_acc_4096
Text Generation
• 2B • Updated • 4
hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_acc_2048
Text Generation
• 2B • Updated • 6
hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_acc_16384
Text Generation
• 2B • Updated • 7
hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_acc_8196
Text Generation
• 2B • Updated • 3
hdong0/deepseek-Qwen2.5-7B-Open-R1-GRPO_deepscaler_acc_8196
Updated
hdong0/deepseek-Qwen2.5-7B-Open-R1-GRPO_deepscaler_acc_4096
Updated
Text Generation
• 15B • Updated • 1
Text Generation
• 9B • Updated • 1
Text Generation
• 9B • Updated