Hugging Face

Daniel Wang

DanielWang

AI & ML interests

Natural Language Processing, Machine Learning

Organizations

BitNoteGroup

upvoted 2 papers 8 months ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published Jan 3 • 34

KV Shifting Attention Enhances Language Modeling

Paper • 2411.19574 • Published Nov 29, 2024 • 9

upvoted 6 papers over 1 year ago

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Paper • 2403.12015 • Published Mar 18, 2024 • 69

Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13, 2024 • 15

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 52

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 44

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 92

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66