Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
berkeley-nest
/
Starling-RM-7B-alpha
like
103
Follow
Berkeley-Nest
78
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
arxiv:
2203.02155
arxiv:
2301.11270
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
8
Deploy
Use this model
main
Starling-RM-7B-alpha
26.7 GB
6 contributors
History:
16 commits
evan-nexusflow
Create config.json
6c6b4d5
verified
over 1 year ago
.gitattributes
1.52 kB
Duplicate from banghua/n_rm
about 2 years ago
README.md
6.73 kB
Update README.md
over 1 year ago
config.json
621 Bytes
Create config.json
over 1 year ago
latest
15 Bytes
Duplicate from banghua/n_rm
about 2 years ago
pytorch_model.bin
26.7 GB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_0.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_1.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_2.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_3.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_4.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_5.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_6.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_7.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
trainer_state.json
18.9 kB
Duplicate from banghua/n_rm
about 2 years ago
training_args.bin
5.31 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
zero_to_fp32.py
24.2 kB
Duplicate from banghua/n_rm
about 2 years ago