Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mitchell Wortsman's picture
7

Mitchell Wortsman

mitchellw
naturelizer's profile picture thomwolf's profile picture 21world's profile picture
·
https://mitchellnw.github.io/
  • mitchnw
  • mitchellnw

AI & ML interests

None yet

Organizations

LAION eV's profile picture openflamingo's profile picture ML Foundations's profile picture

authored a paper about 1 year ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 54
authored 3 papers over 1 year ago

Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13, 2024 • 15

Editing Models with Task Arithmetic

Paper • 2212.04089 • Published Dec 8, 2022 • 7

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Paper • 2203.05482 • Published Mar 10, 2022 • 7
authored 2 papers almost 2 years ago

Small-scale proxies for large-scale Transformer training instabilities

Paper • 2309.14322 • Published Sep 25, 2023 • 21

Replacing softmax with ReLU in Vision Transformers

Paper • 2309.08586 • Published Sep 15, 2023 • 17
authored a paper about 2 years ago

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

Paper • 2308.01390 • Published Aug 2, 2023 • 33
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs