Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
15
4
9
Pietro Lesci
pietrolesci
Follow
charlesdedampierre's profile picture
kamaludeen's profile picture
yjernite's profile picture
17 followers
·
33 following
https://pietrolesci.github.io/
pietro_lesci
pietrolesci
pietrolesci
pietrolesci.bsky.social
AI & ML interests
I like developing and applying causal methods to study the effect of training choices on models’ behaviour, including memorisation, shortcut learning, and tokenisation.
Recent Activity
upvoted
an
article
about 1 month ago
There is no such thing as a tokenizer-free lunch
updated
a model
about 1 month ago
pietrolesci/tokenisers
updated
a model
about 1 month ago
pietrolesci/tokenizers
View all activity
Organizations
pietrolesci
's datasets
56
Sort: Recently updated
pietrolesci/unimixlm
Viewer
•
Updated
Jul 25
•
81.9M
•
13
pietrolesci/me-minipile-evals
Viewer
•
Updated
Jun 3
•
1.22M
•
16
pietrolesci/pile-deduped
Viewer
•
Updated
May 5
•
748M
•
32
pietrolesci/pythia-deduped-memorisation-profiles
Viewer
•
Updated
Apr 9
•
2.13M
•
101
pietrolesci/pile-validation
Viewer
•
Updated
Apr 9
•
429k
•
25
pietrolesci/pile-deduped-subset
Viewer
•
Updated
Apr 9
•
16.3k
•
11
pietrolesci/pythia-deduped-stats
Viewer
•
Updated
Apr 9
•
16.3M
•
139
pietrolesci/pythia-deduped-stats-raw
Viewer
•
Updated
Apr 9
•
14.9M
•
47.2k
pietrolesci/agnews
Viewer
•
Updated
Apr 9
•
510k
•
15
pietrolesci/amazoncat-13k
Viewer
•
Updated
Apr 9
•
5.99M
•
183
•
1
pietrolesci/wikitoxic
Viewer
•
Updated
Apr 9
•
894k
•
17
•
1
pietrolesci/multiwoz_all_versions
Viewer
•
Updated
Apr 9
•
82k
•
21
•
1
pietrolesci/anchoral-paper-artefacts
Viewer
•
Updated
Apr 9
•
2.78M
•
34
pietrolesci/pile-deduped-pythia-preshuffled
Viewer
•
Updated
Mar 25
•
244M
•
70
pietrolesci/pile-deduped-pythia-tokfreq
Viewer
•
Updated
Mar 17
•
50.1k
•
4
pietrolesci/finewebedu-20B
Viewer
•
Updated
Mar 16
•
40.4M
•
118
pietrolesci/minipile
Viewer
•
Updated
Feb 27
•
6.06M
•
56
pietrolesci/opus-5langs-1M
Viewer
•
Updated
Dec 10, 2024
•
5M
•
18
pietrolesci/opus-raw
Viewer
•
Updated
Nov 27, 2024
•
4.06B
•
280
pietrolesci/pythia-pile-stats
Viewer
•
Updated
Sep 23, 2024
•
113M
•
4
pietrolesci/slim-pajama-eval
Viewer
•
Updated
Sep 16, 2024
•
1.84M
•
9
•
1
pietrolesci/pile-subset
Updated
Sep 13, 2024
•
33
pietrolesci/cmnist
Viewer
•
Updated
Jul 29, 2024
•
308k
•
41
pietrolesci/celeba-wilds
Viewer
•
Updated
Jul 2, 2024
•
203k
•
11
•
1
pietrolesci/civilcomments-wilds
Viewer
•
Updated
Jul 2, 2024
•
893k
•
13
pietrolesci/mnli-stats
Viewer
•
Updated
May 13, 2024
•
785k
•
3
pietrolesci/mnli-embeddings
Viewer
•
Updated
Mar 22, 2024
•
785k
•
3
pietrolesci/_mnli-stats
Viewer
•
Updated
Mar 20, 2024
•
15.7M
•
11
pietrolesci/wikitext-103-raw-v1_gpt2-20k
Viewer
•
Updated
Nov 16, 2023
•
8.01M
•
57
pietrolesci/yahoo_answers_topics
Viewer
•
Updated
Sep 25, 2023
•
2.92M
•
19
Previous
1
2
Next