Research model

Small-scale training on a Kaggle T4 GPU.


Model evaluation on ACEBench.

Comparison of three models: Cery (SFT), Cery-M (GRPO), and Cery-High (SFT+GRPO).

[Figure: comparison of the three models]


Details:

  • Fixed the chat template for instruct generation.
  • GRPO training process, focused on tool calling.
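
To make the tool-calling focus of the GRPO step concrete, here is a minimal sketch of a reward function and the group-relative advantage that GRPO builds on. Everything in it is an assumption for illustration: the `<tool_call>` JSON format, the `tool_call_reward` scoring scheme, and the use of the population standard deviation are not taken from this card, and the actual training code may differ.

```python
import json
import re
from statistics import mean, pstdev

# Assumed output format: a JSON object wrapped in <tool_call> tags.
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(\{.*?\})\s*</tool_call>", re.DOTALL)


def tool_call_reward(completion: str, expected_name: str) -> float:
    """Hypothetical reward: 1.0 for a well-formed call to the expected tool,
    0.5 for a well-formed call to a different tool, 0.0 otherwise."""
    match = TOOL_CALL_RE.search(completion)
    if match is None:
        return 0.0
    try:
        call = json.loads(match.group(1))
    except json.JSONDecodeError:
        return 0.0
    if not isinstance(call, dict) or "name" not in call:
        return 0.0
    return 1.0 if call["name"] == expected_name else 0.5


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions for the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; implementations differ here
    return [(r - mu) / (sigma + eps) for r in rewards]


# One group of sampled completions for a single prompt (made-up examples).
group = [
    '<tool_call>{"name": "get_weather", "arguments": {"city": "Hanoi"}}</tool_call>',
    '<tool_call>{"name": "search", "arguments": {"q": "Hanoi weather"}}</tool_call>',
    "The weather in Hanoi is probably nice.",
    '<tool_call>{"name": get_weather}</tool_call>',  # malformed JSON
]
rewards = [tool_call_reward(c, "get_weather") for c in group]
advantages = group_relative_advantages(rewards)
```

Completions that score above the group mean get a positive advantage and are reinforced; those below the mean are pushed down, so no separate value model is needed.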

LoRA config

rank: 16
alpha: 32
epochs: 1
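
As a rough illustration of what these values imply, the sketch below records them in a small dataclass and derives the standard LoRA scaling factor (alpha / rank) and the adapter parameter count for one weight matrix. The `LoraSettings` class and the 2048 x 2048 matrix shape are hypothetical and not taken from this card. In the Hugging Face `peft` library the first two values would typically map to `LoraConfig(r=16, lora_alpha=32)`, though the card does not say which library was used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container for the values listed in this card."""
    rank: int = 16
    alpha: int = 32
    epochs: int = 1

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update B @ A by alpha / rank.
        return self.alpha / self.rank

    def adapter_params(self, d_in: int, d_out: int) -> int:
        # A has shape (rank, d_in) and B has shape (d_out, rank).
        return self.rank * (d_in + d_out)


cfg = LoraSettings()
print(cfg.scaling)                     # 2.0
print(cfg.adapter_params(2048, 2048))  # 65536 (illustrative matrix shape)
```

With alpha set to twice the rank, the low-rank update is applied at a scale of 2.0.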

Model size: 2B params
Tensor type: BF16 (Safetensors)

Model tree for beyoru/Cery-rc-M

Fine-tuned from: Qwen/Qwen3-1.7B
Quantizations: 2 models

Dataset used to train beyoru/Cery-rc-M