5456es/implicit_reward_Llama-3.2-3B-Instruct_prune_0.3-sigmoid
Safetensors
llama
dpo
preference-learning
implicit
pruned
License: apache-2.0
Commit History
Upload rng_state_4.pth with huggingface_hub
f6403b7
verified
5456es
committed on
Sep 8
Upload rng_state_5.pth with huggingface_hub
2b462c5
verified
5456es
committed on
Sep 8
Upload latest with huggingface_hub
bde420b
verified
5456es
committed on
Sep 8
Upload training_args.bin with huggingface_hub
dea5300
verified
5456es
committed on
Sep 8
Upload rng_state_1.pth with huggingface_hub
0709201
verified
5456es
committed on
Sep 8
Upload config.json with huggingface_hub
f3b1e3b
verified
5456es
committed on
Sep 8
Upload trainer_state.json with huggingface_hub
5fc753a
verified
5456es
committed on
Sep 8
initial commit
7531bfa
verified
5456es
committed on
Sep 8