Spaces: fbaldassarri/woq-inference
fbaldassarri committed on Apr 30

Commit f928e33 (verified) · 1 Parent(s): 54d6eb5

Update README.md

Files changed (1)

README.md CHANGED

@@ -1,3 +1,13 @@
 # PyTorch Weights-only-Quantization (WoQ)
 Inference scripts for pytorch weights-only-quantization
@@ -28,5 +38,4 @@ For example:
 ```
 python teq_inference.py --base meta-llama/Llama-3.2-1B --model_dir ./meta-llama_Llama-3.2-1B-TEQ-int4-gs128-asym --weights_file quantized_weight.pt --config_file qconfig.json --prompt "Tell me a joke" --device cpu
-```

+---
+license: apache-2.0
+title: PyTorch Weights-only-Quantization (WoQ)
+sdk: gradio
+emoji: 📉
+colorFrom: green
+colorTo: yellow
+pinned: false
+short_description: Inference scripts for pytorch weights-only-quantization
+---
 # PyTorch Weights-only-Quantization (WoQ)
 Inference scripts for pytorch weights-only-quantization
 ```
 python teq_inference.py --base meta-llama/Llama-3.2-1B --model_dir ./meta-llama_Llama-3.2-1B-TEQ-int4-gs128-asym --weights_file quantized_weight.pt --config_file qconfig.json --prompt "Tell me a joke" --device cpu
+```
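
The block this commit adds at the top of README.md is a `---`-delimited front matter of `key: value` lines. As a rough sketch of how such a block can be read back, the hypothetical helper `parse_front_matter` below splits it from the Markdown body. It assumes flat, single-line values and keeps everything as strings, so it is not a substitute for a real YAML parser and is not how the Spaces platform itself reads the file.

```python
def parse_front_matter(text):
    """Return (metadata, body) for a README that starts with a '---' block.

    Hypothetical helper for illustration: handles only flat 'key: value'
    lines and returns every value as a plain string.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text  # no front matter present
    meta = {}
    for i, line in enumerate(lines[1:], start=1):
        if line.strip() == "---":
            # Closing delimiter: everything after it is the Markdown body.
            return meta, "\n".join(lines[i + 1:])
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return {}, text  # opening delimiter without a closing one


# Front matter copied from the added lines of this commit.
README = """---
license: apache-2.0
title: PyTorch Weights-only-Quantization (WoQ)
sdk: gradio
emoji: 📉
colorFrom: green
colorTo: yellow
pinned: false
short_description: Inference scripts for pytorch weights-only-quantization
---
# PyTorch Weights-only-Quantization (WoQ)
Inference scripts for pytorch weights-only-quantization
"""

meta, body = parse_front_matter(README)
print(meta["sdk"], meta["license"])
```

Because values stay as strings, `meta["pinned"]` is the text `"false"` rather than a boolean; a full YAML loader would be the safer choice for anything beyond a quick check.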