Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Catherine Arnett's picture
3 5 7

Catherine Arnett

catherinearnett
lunarflu's profile picture shirkey's profile picture kw1ntti's profile picture
·
https://catherinearnett.github.io/
  • linguist_cat
  • catherinearnett
  • catherinearnett.bsky.social

AI & ML interests

multilingual NLP, tokenization

Recent Activity

authored a paper 21 days ago
BPE Stays on SCRIPT: Structured Encoding for Robust Multilingual Pretokenization
authored a paper 21 days ago
Evaluating Morphological Alignment of Tokenizers in 70 Languages
liked a dataset 22 days ago
classla/ParlaSpeech-PL
View all activity

Organizations

Blog-explorers's profile picture Language and Cognition Lab (UCSD)'s profile picture

catherinearnett 's datasets 1

catherinearnett/morphscore

Viewer • Updated 26 days ago • 5.09M • 363 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs