ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 • Updated Feb 27, 2025 • 29.3k • 6
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1 • Updated Feb 27, 2025 • 29.3k • 5 • 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1 • Updated Feb 22, 2025 • 29.3k • 10
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered • Updated Feb 19, 2025 • 28.9k • 17
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 • Updated Feb 19, 2025 • 29.3k • 5
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd • Updated Feb 19, 2025 • 29.3k • 19
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered • Updated Feb 19, 2025 • 29.1k
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 • Updated Feb 18, 2025 • 29.3k • 28
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 • Updated Feb 18, 2025 • 29.3k • 2
ZHLiu627/ultrafeedback_binarized_with_response_full_part1 • Updated Mar 8, 2024 • 20k • 16 • 1