Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen3-32B-FP8
like
53
Follow
Qwen
37.5k
Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
fp8
arxiv:
2309.00071
arxiv:
2505.09388
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
7
Train
Deploy
Use this model
refs/pr/6
Qwen3-32B-FP8
Commit History
Update README.md
c068d63
verified
medmekk
HF Staff
commited on
May 6
Remove vLLM FP8 Limitation (
#3
)
98a6390
verified
jklj077
simon-mo
commited on
Apr 30
Update README.md
37f3f67
verified
yangapku
commited on
Apr 29
Update README.md
6e71f0f
verified
yangapku
commited on
Apr 28
Update README.md
48dd627
verified
littlebird13
commited on
Apr 28
Update README.md
6913646
verified
jklj077
commited on
Apr 28
Delete special_tokens_map.json
dcadc0d
verified
littlebird13
commited on
Apr 28
Delete added_tokens.json
87b3d4d
verified
littlebird13
commited on
Apr 28
Update README.md
49e5bc4
verified
littlebird13
commited on
Apr 28
Update generation_config.json
3ca9f67
verified
littlebird13
commited on
Apr 28
Update README.md
8404e43
verified
littlebird13
commited on
Apr 28
Upload folder using huggingface_hub
6e2312b
verified
littlebird13
commited on
Apr 28
initial commit
36c62d7
verified
littlebird13
commited on
Apr 28