s0han24 commited on
Commit
793e2b0
·
verified ·
1 Parent(s): 960be0d

End of training

Browse files
Files changed (3) hide show
  1. README.md +11 -11
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,12 +18,12 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.9320
22
- - Rouge1: 19.4342
23
- - Rouge2: 3.1111
24
- - Rougel: 15.4989
25
- - Rougelsum: 16.1892
26
- - Gen Len: 20.0
27
 
28
  ## Model description
29
 
@@ -42,9 +42,9 @@ More information needed
42
  ### Training hyperparameters
43
 
44
  The following hyperparameters were used during training:
45
- - learning_rate: 2e-05
46
- - train_batch_size: 2
47
- - eval_batch_size: 2
48
  - seed: 42
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
@@ -55,8 +55,8 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
58
- | No log | 1.0 | 5 | 2.9732 | 19.536 | 3.1111 | 14.9596 | 16.2927 | 20.0 |
59
- | No log | 2.0 | 10 | 2.9320 | 19.4342 | 3.1111 | 15.4989 | 16.1892 | 20.0 |
60
 
61
 
62
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 2.2337
22
+ - Rouge1: 24.3779
23
+ - Rouge2: 4.0979
24
+ - Rougel: 18.8773
25
+ - Rougelsum: 18.8739
26
+ - Gen Len: 19.3
27
 
28
  ## Model description
29
 
 
42
  ### Training hyperparameters
43
 
44
  The following hyperparameters were used during training:
45
+ - learning_rate: 0.0005
46
+ - train_batch_size: 4
47
+ - eval_batch_size: 4
48
  - seed: 42
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
58
+ | No log | 1.0 | 3 | 2.3551 | 23.7776 | 2.9006 | 17.5796 | 17.5488 | 20.0 |
59
+ | No log | 2.0 | 6 | 2.2337 | 24.3779 | 4.0979 | 18.8773 | 18.8739 | 19.3 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f6bd9ddec4944eba516c9c187e33c7ffbb27bf9fff5f9f6ab0f5ba8be746130d
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c96d378ff6de78a6f717bedfc07d8776f6959a8a3024fbdfc0404f2128cfe01
3
  size 891644712
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5c6ba67256b3a96c05f17a25b09106d6685bc14c78faf88ed334177a2a9fe246
3
  size 5496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e523004dfe5a3f2534527ab8b2f5a4e42a0e0345aa00152881a0ccced164bd26
3
  size 5496