Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
5
7
14
Catherine Arnett
catherinearnett
Follow
HasanOJ's profile picture
vargis93's profile picture
sfeucht's profile picture
108 followers
·
37 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
updated
a dataset
about 1 month ago
catherinearnett/bilingual-tokenizer-training-data
published
a dataset
about 1 month ago
catherinearnett/bilingual-tokenizer-training-data
liked
a dataset
about 1 month ago
commoncrawl/CommonLID
View all activity
Organizations
catherinearnett
's models
18
Sort: Recently updated
catherinearnett/B-GPT_pl_en_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
8
catherinearnett/B-GPT_en_pl_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
336
catherinearnett/B-GPT_pl_en_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
2
catherinearnett/B-GPT_en_pl_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
2
catherinearnett/B-GPT_el_en_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
5
catherinearnett/B-GPT_en_el_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
370
catherinearnett/B-GPT_el_en_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
1
catherinearnett/B-GPT_en_el_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
51
catherinearnett/B-GPT_es_en_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
138
catherinearnett/B-GPT_en_es_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
356
catherinearnett/B-GPT_es_en_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
86
catherinearnett/B-GPT_en_es_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
9
catherinearnett/B-GPT_nl_en_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
8
catherinearnett/B-GPT_en_nl_sequential
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
340
catherinearnett/B-GPT_nl_en_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
48
catherinearnett/B-GPT_en_nl_simultaneous
Text Generation
•
0.1B
•
Updated
Jun 12, 2025
•
3
catherinearnett/pythia-1b-bigram_masked
Updated
May 1, 2025
catherinearnett/pythia-160m-bigram_masked
Updated
May 1, 2025