Collections
Discover the best community collections!
Collections including paper arxiv:2402.17764
- stabilityai/stable-diffusion-3-medium (Text-to-Image • Updated • 89.8k • 4.28k)
- Llama 2: Open Foundation and Fine-Tuned Chat Models (Paper • 2307.09288 • Published • 239)
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone (Paper • 2404.14219 • Published • 250)
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory (Paper • 2312.11514 • Published • 256)

- AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration (Paper • 2306.00978 • Published • 8)
- GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers (Paper • 2210.17323 • Published • 7)
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (Paper • 2402.17764 • Published • 590)

- You Only Cache Once: Decoder-Decoder Architectures for Language Models (Paper • 2405.05254 • Published • 8)
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (Paper • 2402.17764 • Published • 590)
- BitsFusion: 1.99 bits Weight Quantization of Diffusion Model (Paper • 2406.04333 • Published • 36)

- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (Paper • 2402.17764 • Published • 590)
- Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding (Paper • 2404.16710 • Published • 57)
- Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory (Paper • 2405.08707 • Published • 27)
- Token-Scaled Logit Distillation for Ternary Weight Generative Language Models (Paper • 2308.06744 • Published • 1)