alea-institute/kl3m-multi-word-002-32k

Tags: Fill-Mask, Transformers, English, tokenizer, legal, bpe, byte-pair-encoding, multi-word, kl3m, legal-domain, hierarchical
Repository size: 2.25 MB
  • 1 contributor
History: 4 commits
Latest commit: d73516e (verified), by alea-institute, 8 days ago: "Upload KL3M multi-word tokenizer v2 (32K) - Update README"
  • .gitattributes (1.52 kB): initial commit, 8 days ago
  • README.md (12.5 kB): Upload KL3M multi-word tokenizer v2 (32K) - Update README, 8 days ago
  • special_tokens_map.json (189 Bytes): Upload KL3M multi-word tokenizer v2 (32K), 8 days ago
  • tokenizer.json (2.24 MB): Upload KL3M multi-word tokenizer v2 (32K), 8 days ago
  • tokenizer_config.json (1.55 kB): Upload KL3M multi-word tokenizer v2 (32K), 8 days ago