Artidoro Pagnoni

artidoro

AI & ML interests

NLP, generation, factuality, disinformation.

Recent Activity

Organizations

University of Washington NLP's profile picture

artidoro's activity

liked a model about 2 months ago
reacted to Jaward's post with ❤️ 2 months ago
view post
Post
3157
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
New activity in open-llm-leaderboard/open_llm_leaderboard almost 2 years ago
New activity in uwnlp/guanaco-playground-tgi about 2 years ago
New activity in uwnlp/guanaco-playground-tgi about 2 years ago

Guanaco-13B

3
#3 opened about 2 years ago by
dmeight