Hugging Face
Artidoro Pagnoni
artidoro
23 followers · 3 following
https://artidoro.github.io/
ArtidoroPagnoni
artidoro
AI & ML interests
NLP, generation, factuality, disinformation.
Recent Activity
Liked a model 7 days ago: facebook/blt
Reacted to Jaward's post with ❤️ 17 days ago:
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 layers deep (n_layers_encoder, n_layers_latent, n_layers_decoder) and is trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
Upvoted a paper about 1 month ago: SuperBPE: Space Travel for Language Models
Organizations
Articles
1
Article
140
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
Papers
3
arxiv:2412.09871
arxiv:2305.14314
arxiv:2212.10449
models
3
artidoro/model-tvergho (updated Nov 18, 2023)
artidoro/model-vinaic (updated Nov 18, 2023)
artidoro/model-vinaia (updated Nov 18, 2023)
datasets
None public yet