hudsongouge committed on
Commit e31bf23 · verified · 1 Parent(s): cd5dc1c

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -18,7 +18,8 @@ It was trained on a set of Discord chat data, public domain books, and English B
 
 ## Training Data
 
-As the smallest DAT Byte model, this version was trained on a reduced dataset, composed exclusively of the following sources:
+As the smallest DAT Byte model, this version was trained on less data than its larger family members.
+The training data was composed exclusively of the following sources:
 
 - [**Gutenberg English**](https://huggingface.co/datasets/sedthh/gutenberg_english) — English books in the public domain
 - [**OpenDiscord**](https://huggingface.co/datasets/hudsongouge/Open-Discord) — Discord dumps in ChatML format