Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen3-32B-FP8
like
53
Follow
Qwen
37.5k
Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
fp8
arxiv:
2309.00071
arxiv:
2505.09388
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
7
Train
Deploy
Use this model
main
Qwen3-32B-FP8
Commit History
Update README.md
c2d5a15
verified
littlebird13
commited on
May 21
update tokenizer_config.json
d0a17c0
feihu.hf
commited on
May 19
Remove vLLM FP8 Limitation (
#3
)
98a6390
verified
jklj077
simon-mo
commited on
Apr 30
Update README.md
37f3f67
verified
yangapku
commited on
Apr 29
Update README.md
6e71f0f
verified
yangapku
commited on
Apr 28
Update README.md
48dd627
verified
littlebird13
commited on
Apr 28
Update README.md
6913646
verified
jklj077
commited on
Apr 28
Delete special_tokens_map.json
dcadc0d
verified
littlebird13
commited on
Apr 28
Delete added_tokens.json
87b3d4d
verified
littlebird13
commited on
Apr 28
Update README.md
49e5bc4
verified
littlebird13
commited on
Apr 28
Update generation_config.json
3ca9f67
verified
littlebird13
commited on
Apr 28
Update README.md
8404e43
verified
littlebird13
commited on
Apr 28
Upload folder using huggingface_hub
6e2312b
verified
littlebird13
commited on
Apr 28
initial commit
36c62d7
verified
littlebird13
commited on
Apr 28