Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
dmnsh
/
Qwen3-4b-W0-GenRM
like
0
Reinforcement Learning
PyTorch
SAA-Lab/LitBench-Train
qwen3
RewardModel
GenerativeRewardModel
W0
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
README.md exists but content is empty.
Downloads last month
-
Video Preview
Reinforcement Learning
loading
Model tree for
dmnsh/Qwen3-4b-W0-GenRM
Base model
Qwen/Qwen3-4B-Base
Finetuned
PrimeIntellect/Qwen3-4B
Finetuned
dmnsh/Qwen3-4B-W0-LitBench-SFT
Finetuned
(
1
)
this model
Dataset used to train
dmnsh/Qwen3-4b-W0-GenRM
SAA-Lab/LitBench-Train
Viewer
•
Updated
Jul 7, 2025
•
43.8k
•
1.43k
•
3