Nathan Godey

nthngdy
https://nathangodey.github.io/
  • nthngdy
  • NathanGodey

Organizations

ALMAnaCH (Inria), huggingPartyParis

authored 3 papers over 1 year ago

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4

On the Scaling Laws of Geographical Representation in Language Models

Paper • 2402.19406 • Published Feb 29, 2024

Anisotropy Is Inherent to Self-Attention in Transformers

Paper • 2401.12143 • Published Jan 22, 2024
authored 2 papers almost 2 years ago

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling

Paper • 2212.07284 • Published Dec 14, 2022

Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Paper • 2309.08351 • Published Sep 15, 2023 • 3