JayHyeon
/

pythia-2.8b-rDPO_5e-7_1.0vpo_constant-1ep_0.3label_smoothing

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

pythia-2.8b-rDPO_5e-7_1.0vpo_constant-1ep_0.3label_smoothing / runs

19.8 kB

1 contributor

History: 1 commit

JayHyeon's picture

Training in progress, step 970

721e795 verified 4 months ago

Aug05_20-50-53_01933a260f36
Training in progress, step 970 4 months ago