Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

HF1BitLLM
/
Llama3-8B-1.58-100B-tokens

Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
8-bit precision
bitnet
Model card Files Files and versions Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

ram usage

#14 opened about 1 month ago by
vqvtony

run

#13 opened about 1 month ago by
rakmik

added missing imports

#12 opened 6 months ago by
bitsTobyte

Triton error while running demo code

2
#11 opened 7 months ago by
chiauho

Slower than standard Llama 8b?

1
#10 opened 7 months ago by
Sijuade

I found some errors when building on a rpi 5

1
#9 opened 7 months ago by
eddieoz

You can try to convert DeepSeek-V2.5 or Llama-3.1-Nemotron-70B-Instruct-HF?

πŸ‘ 3
2
#8 opened 7 months ago by
win10

Finetuning this model

πŸ‘ πŸ‘€ 2
9
#7 opened 8 months ago by
Andrefty

GGUF conversion

πŸ‘ 4
11
#3 opened 8 months ago by
compilade
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs