finished final choice
Documentation/Technical_Report.md
CHANGED
@@ -62,7 +62,7 @@ Since I am deploying on HuggingFace Spaces, I will be utilizing their **ZeroGPU*

 ### Final Choice

-For my final choice, in order to fully demonstrate the capabilities of the Gemma 3 family, I have selected the Gemma 3-27B model with Key-Value caching enabled. This setup leverages the full compute and memory bandwidth of the NVIDIA H200 (70GB VRAM) provided by HuggingFace's ZeroGPU environment. Overall, this configuration strikes a strong balance between maximum model capability and inference efficiency, ensuring that the demo remains smooth, accurate, and production-ready
+For my final choice, in order to fully demonstrate the capabilities of the Gemma 3 family, I have selected the Gemma 3-27B model with Key-Value caching enabled. This setup leverages the full compute and memory bandwidth of the NVIDIA H200 (70GB VRAM) provided by HuggingFace's ZeroGPU environment. Overall, this configuration strikes a strong balance between maximum model capability and inference efficiency, ensuring that the demo remains smooth, accurate, and production-ready—even when scaling to complex or multimodal inputs.

 ## User Research

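
As a rough sanity check on the memory claim in the changed paragraph, the arithmetic below estimates how much of the quoted 70GB VRAM the model weights alone would occupy. It is only a sketch under stated assumptions: the parameter count is taken as the nominal 27 billion implied by the model name, and the weights are assumed to be stored in bfloat16 (2 bytes per parameter). Neither figure is stated in the report. The helper `weight_memory_gb` is a hypothetical name introduced for illustration.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Approximate memory for the model weights alone, in decimal gigabytes."""
    return n_params * bytes_per_param / 1e9


N_PARAMS = 27e9   # assumption: nominal count implied by the "27B" name
BYTES_BF16 = 2    # assumption: weights held in bfloat16
VRAM_GB = 70.0    # VRAM figure quoted in the report

weights_gb = weight_memory_gb(N_PARAMS, BYTES_BF16)
headroom_gb = VRAM_GB - weights_gb  # left for KV cache, activations, overhead

print(f"weights ~ {weights_gb:.0f} GB, headroom ~ {headroom_gb:.0f} GB")
```

Under these assumptions the weights take roughly 54 GB, leaving about 16 GB for the Key-Value cache, activations, and framework overhead. The real footprint depends on the actual parameter count, the chosen precision, and the context length, so the headroom should be treated as an estimate only.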