5456es/implicit_reward_Llama-3.2-3B-Instruct_prune_0.3-sigmoid
Tags: Safetensors, llama, dpo, preference-learning, implicit, pruned
License: apache-2.0
Size: 609 kB, 1 contributor, History: 8 commits
Latest commit: "Upload rng_state_4.pth with huggingface_hub" by 5456es (f6403b7, verified), 3 months ago
File                 Size             Last commit                                       Updated
.gitattributes       1.52 kB          initial commit                                    3 months ago
config.json          873 Bytes        Upload config.json with huggingface_hub           3 months ago
latest               16 Bytes         Upload latest with huggingface_hub                3 months ago
rng_state_1.pth      16.4 kB (xet)    Upload rng_state_1.pth with huggingface_hub       3 months ago
rng_state_4.pth      16.4 kB (xet)    Upload rng_state_4.pth with huggingface_hub       3 months ago
rng_state_5.pth      16.4 kB (xet)    Upload rng_state_5.pth with huggingface_hub       3 months ago
trainer_state.json   548 kB           Upload trainer_state.json with huggingface_hub    3 months ago
training_args.bin    8.85 kB (xet)    Upload training_args.bin with huggingface_hub     3 months ago