Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
3
Jerry Zhang
jerryzh168
Follow
medmekk's profile picture
clem's profile picture
2 followers
·
4 following
jerryzh168
AI & ML interests
None yet
Recent Activity
reacted
to
cbensimon
's
post
with 🚀
3 days ago
🚀 ZeroGPU now supports PyTorch native quantization via `torchao` While it hasn’t been battle-tested yet, `Int8WeightOnlyConfig` is already working flawlessly in our tests. Let us know if you run into any issues — and we’re excited to see what the community will build! ```python import spaces from diffusers import FluxPipeline from torchao.quantization.quant_api import Int8WeightOnlyConfig, quantize_ pipeline = FluxPipeline.from_pretrained(...).to('cuda') quantize_(pipeline.transformer, Int8WeightOnlyConfig()) # Or any other component(s) @spaces.GPU def generate(prompt: str): return pipeline(prompt).images[0] ```
updated
a model
7 days ago
jerryzh168/Qwen3-4B-untied-weights
published
a model
7 days ago
jerryzh168/Qwen3-4B-untied-weights
View all activity
Organizations
jerryzh168
's models
37
Sort: Recently updated
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128
Updated
Jan 13
•
6
jerryzh168/Meta-Llama-3-8B-torchao-int8_weight_only
Updated
Jan 13
•
43
jerryzh168/llama3-8B
Updated
Oct 4, 2024
•
4
jerryzh168/llama3-int4wo-128
Updated
Sep 13, 2024
•
6
jerryzh168/flux-schnell-int8wo
Updated
Sep 5, 2024
•
3
jerryzh168/int8_wo_llama2
Updated
Aug 8, 2024
•
11
•
1
jerryzh168/Llama-2-7b-chat-hf-torchao-int4-tinygemm
Text Generation
•
Updated
Aug 2, 2024
•
7
Previous
1
2
Next