Pietro Lesci PRO
pietrolesci
AI & ML interests
I like developing and applying causal methods to study the effect of training choices on models’ behaviour, including memorisation, shortcut learning, and tokenisation.
Recent Activity
updated
a dataset
5 days ago
pietrolesci/pythia-deduped-memorisation-profiles
updated
a dataset
5 days ago
pietrolesci/pile-validation
updated
a dataset
5 days ago
pietrolesci/pile-deduped-subset
Organizations
Collections
10
spaces
1
models
21

pietrolesci/me100M_finewebedu-20B_bpe32000minipile
Updated
•
52

pietrolesci/me100M-tied_finewebedu-20B_bpe32000minipile
Updated
•
51

pietrolesci/me850M_minipile_bpe32000minipile
Updated

pietrolesci/me340M-tied_minipile_bpe32000minipile
Updated

pietrolesci/me57M-tied_minipile_bpe2wp32000minipile
Updated

pietrolesci/me57M-tied_minipile_bpe128000minipile
Updated

pietrolesci/me57M-tied_minipile_wordpiece32000minipile
Updated

pietrolesci/me57M-tied_minipile_bpe8064minipile
Updated

pietrolesci/me57M-tied_minipile_bpe32000minipile
Updated

pietrolesci/tokenisers
Updated
datasets
54
pietrolesci/pythia-deduped-memorisation-profiles
Viewer
•
Updated
•
2.13M
•
27
pietrolesci/pile-validation
Viewer
•
Updated
•
429k
•
108
pietrolesci/pile-deduped-subset
Viewer
•
Updated
•
16.3k
•
26
pietrolesci/pythia-deduped-stats
Viewer
•
Updated
•
16.3M
•
1.2k
pietrolesci/pythia-deduped-stats-raw
Viewer
•
Updated
•
14.9M
•
16.3k
pietrolesci/agnews
Viewer
•
Updated
•
510k
•
96
pietrolesci/amazoncat-13k
Viewer
•
Updated
•
5.99M
•
308
•
1
pietrolesci/wikitoxic
Viewer
•
Updated
•
894k
•
108
•
1
pietrolesci/multiwoz_all_versions
Viewer
•
Updated
•
82k
•
36
•
1
pietrolesci/anchoral-paper-artefacts
Viewer
•
Updated
•
2.78M
•
4.4k