beyoru
/

Cery-rc-M

Text Generation

text-generation-inference

Model card Files Files and versions

Research model

Small training in T4 kaggle

Evaluation model from ACEBench

Compare 3 models Cery (SFT), Cery-M(GRPO), Cery-High(SFT+GRPO)

Details:

Fix the chat template for instruct generation.
GRPO training process. (focus on calling a tool)

Config LoRA

rank 16
alpha 32
epoch 1

Downloads last month: 5

Safetensors

Model size

2B params

Tensor type

BF16

·

Model tree for beyoru/Cery-rc-M

Base model

Qwen/Qwen3-1.7B-Base

Finetuned

Qwen/Qwen3-1.7B

Finetuned

(308)

this model

Quantizations

Dataset used to train beyoru/Cery-rc-M