5456es/implicit_reward_Llama-3.2-3B-Instruct_prune_0.3-sigmoid

Tags: Safetensors, llama, dpo, preference-learning, implicit, pruned
Total size: 609 kB
  • 1 contributor
History: 8 commits
Latest commit: f6403b7 (verified), 3 months ago, by 5456es: Upload rng_state_4.pth with huggingface_hub
  • .gitattributes (1.52 kB): initial commit, 3 months ago
  • config.json (873 Bytes): Upload config.json with huggingface_hub, 3 months ago
  • latest (16 Bytes): Upload latest with huggingface_hub, 3 months ago
  • rng_state_1.pth (16.4 kB, xet): Upload rng_state_1.pth with huggingface_hub, 3 months ago
  • rng_state_4.pth (16.4 kB, xet): Upload rng_state_4.pth with huggingface_hub, 3 months ago
  • rng_state_5.pth (16.4 kB, xet): Upload rng_state_5.pth with huggingface_hub, 3 months ago
  • trainer_state.json (548 kB): Upload trainer_state.json with huggingface_hub, 3 months ago
  • training_args.bin (8.85 kB, xet): Upload training_args.bin with huggingface_hub, 3 months ago