mistral7b-pissa-coding-11-v1 / all_results.json
chansung's picture
End of training
7f12d6f verified
raw
history blame contribute delete
397 Bytes
{
"epoch": 1.0,
"eval_loss": 1.372731328010559,
"eval_runtime": 0.5647,
"eval_samples": 16,
"eval_samples_per_second": 19.479,
"eval_steps_per_second": 1.771,
"total_flos": 9.06748200373715e+17,
"train_loss": 0.9395653671688504,
"train_runtime": 729.0361,
"train_samples": 116368,
"train_samples_per_second": 56.83,
"train_steps_per_second": 0.296
}