annay3294 committed
Commit 591c437 · verified · 1 parent: 5a58d69

End of training

Files changed (3)
  1. README.md +2 -2
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -2,7 +2,7 @@
 base_model: HuggingFaceTB/SmolLM-135M-Instruct
 datasets: HumanLLMs/Human-Like-DPO-Dataset
 library_name: transformers
-model_name: trainer_output
+model_name: annay3294/llm-course-hw2-reward-model
 tags:
 - generated_from_trainer
 - trl
@@ -10,7 +10,7 @@ tags:
 licence: license
 ---
 
-# Model Card for trainer_output
+# Model Card for annay3294/llm-course-hw2-reward-model
 
 This model is a fine-tuned version of [HuggingFaceTB/SmolLM-135M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM-135M-Instruct) on the [HumanLLMs/Human-Like-DPO-Dataset](https://huggingface.co/datasets/HumanLLMs/Human-Like-DPO-Dataset) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6317661ca165840e367ee13a01154b634e8de2d052b12e31e3044d9ef5b27c01
+oid sha256:65533307bfe7df64c014f52a265af4e616bacff0fc092ec1d58b095ff6148c7e
 size 538092792
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4168e6b7bb5ada750c3145e49b95caaa73ead5bb36a47c578dbc9d6329ae4b24
+oid sha256:bea79283a2116ad3a2e77acd224a666ac52ded02dacf45e2c68f140f8a100355
 size 5432
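
Note on the two binary files: the diff shows only Git LFS pointer files, so the change is visible as a new `oid` with an unchanged `size`. The sketch below is one possible way to check a downloaded file against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical (not part of Git LFS or any Hugging Face library), and the pointer text is copied from the new `training_args.bin` entry in this commit.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if `data` has the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer content for the new training_args.bin, as shown in this commit.
NEW_TRAINING_ARGS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:bea79283a2116ad3a2e77acd224a666ac52ded02dacf45e2c68f140f8a100355
size 5432
"""

pointer = parse_lfs_pointer(NEW_TRAINING_ARGS_POINTER)
```

With a local copy of the file, `matches_pointer(open("training_args.bin", "rb").read(), pointer)` would report whether it corresponds to this commit. For the roughly 538 MB `model.safetensors`, hashing in chunks rather than reading the whole file into memory is likely the better choice.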