tinyllamas / stories260K /tok512.bin
karpathy's picture
Changed the schema for the tokenizer.bin files, so overwriting with the new format
0bd21da
This file is stored with Xet . It is too big to display, but you can still download it.

Xet Pointer Details

( Raw pointer file )
Xet hash:
9320ac4e92dbdab1cf27343eb2c7ac620504689b2d3a614806a39526b4442d48
Size of remote file:
6.23 kB
·
SHA256:
037cb335abb25d1fa9e8ecae30ed2a3a8ace9302862ebcdc05d51a6bbb10c312

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.