Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
AndrewB's picture
1 7 193

AndrewB

aboundy
dvilasuero's profile picture 21world's profile picture
·

AI & ML interests

None yet

Organizations

GPU MODE's profile picture

aboundy's activity

upvoted a paper 3 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 118
upvoted a collection 3 months ago

Deepseek Papers

Collection
Deepseek papers collection • 24 items • Updated 1 day ago • 247
upvoted an article 4 months ago
view article
Article

Open-R1: Update #1

By open-r1 and 7 others •
Feb 2
• 305
upvoted a paper 10 months ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57
upvoted 2 papers about 1 year ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 65

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 42
upvoted a collection over 1 year ago

Model Merging

Collection
Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 238
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs