jz666/simpo-train-small-correct
Tags: Text Generation, Transformers, Safetensors, jz666/gemma2-ultrafeedback-ppl-split, gemma2, alignment-handbook, trl, simpo, Generated from Trainer, conversational, text-generation-inference
License: gemma
simpo-train-small-correct: 18.5 GB, 1 contributor, History: 2 commits
Latest commit: jz666, "Training in progress, step 137", bc4984e (verified), 19 days ago
File                              Scan    Size       Storage  Last commit                     Updated
.gitattributes                    Safe    1.57 kB    -        Training in progress, step 137  19 days ago
config.json                       -       896 Bytes  -        Training in progress, step 137  19 days ago
model-00001-of-00004.safetensors  -       4.9 GB     xet      Training in progress, step 137  19 days ago
model-00002-of-00004.safetensors  -       4.95 GB    xet      Training in progress, step 137  19 days ago
model-00003-of-00004.safetensors  -       4.96 GB    xet      Training in progress, step 137  19 days ago
model-00004-of-00004.safetensors  -       3.67 GB    xet      Training in progress, step 137  19 days ago
model.safetensors.index.json      Safe    39.1 kB    -        Training in progress, step 137  19 days ago
special_tokens_map.json           Safe    636 Bytes  -        Training in progress, step 137  19 days ago
tokenizer.json                    Safe    17.5 MB    xet      Training in progress, step 137  19 days ago
tokenizer_config.json             Safe    47 kB      -        Training in progress, step 137  19 days ago
training_args.bin                 pickle  7.51 kB    xet      Training in progress, step 137  19 days ago

Detected Pickle imports (13) in training_args.bin:
    "transformers.training_args.OptimizerNames"
    "accelerate.state.PartialState"
    "torch.device"
    "transformers.trainer_utils.SchedulerType"
    "accelerate.utils.dataclasses.DeepSpeedPlugin"
    "transformers.trainer_utils.IntervalStrategy"
    "transformers.trainer_pt_utils.AcceleratorConfig"
    "transformers.integrations.deepspeed.HfDeepSpeedConfig"
    "simpo_config.SimPOConfig"
    "transformers.trainer_utils.HubStrategy"
    "accelerate.utils.dataclasses.DistributedType"
    "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
    "torch.bfloat16"
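
The listing flags training_args.bin as a pickle and reports the imports it references. The scanner that produced that report is not shown here, so the following is only a minimal sketch of how such a list could be derived with the standard library, walking the pickle opcodes without ever unpickling (and therefore without executing) the file. The function name list_pickle_imports is hypothetical, and the memo tracking is a simplification that assumes the module and attribute strings are the two values pushed immediately before each STACK_GLOBAL opcode, which is how the standard pickler emits them.

```python
import pickle
import pickletools
from collections import Counter, OrderedDict


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals a pickle stream references.

    The stream is only disassembled, never loaded, so no code in it runs.
    """
    found = set()
    memo = {}    # memo index -> string value (or None for non-strings)
    recent = []  # values pushed by the most recent stack-affecting opcodes

    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        if name in ("PROTO", "FRAME"):
            continue  # framing only, nothing is pushed
        if name == "MEMOIZE":
            memo[len(memo)] = recent[-1] if recent else None
            continue
        if name in ("PUT", "BINPUT", "LONG_BINPUT"):
            memo[arg] = recent[-1] if recent else None
            continue

        value = None
        if name in ("GLOBAL", "INST"):
            # Older protocols carry "module name" as a single operand.
            module, attr = arg.split(" ", 1)
            found.add(f"{module}.{attr}")
        elif name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two preceding strings.
            if len(recent) >= 2 and all(isinstance(v, str) for v in recent[-2:]):
                found.add(f"{recent[-2]}.{recent[-1]}")
        elif name in ("GET", "BINGET", "LONG_BINGET"):
            value = memo.get(arg)  # a previously memoized string, if any
        elif isinstance(arg, str):
            value = arg
        recent.append(value)

    return sorted(found)


# Usage on an in-memory pickle (illustrative data, not the file listed above).
payload = pickle.dumps([OrderedDict(a=1), Counter("ab")], protocol=4)
print(list_pickle_imports(payload))
```

Applying this to training_args.bin itself would need one extra step that is left out here: files written by recent versions of torch.save are usually zip archives, so the pickle stream would first have to be read from the archive member ending in data.pkl.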