Onnxruntime CPU GenAI
Collection
Model Powered by Onnxruntime CPU GenAI
•
8 items
•
Updated
We measured the performance of CPU-INT4-RTN-BLOCK-32-ACC-LEVEL-4 on AMD Ryzen 9 7940HS /w Radeon 78
| Prompt Length | Generation Length | Average Throughput (tps) |
|---|---|---|
| 128 | 128 | - |
| 128 | 256 | - |
| 128 | 512 | - |
| 128 | 1024 | - |
| 256 | 128 | - |
| 256 | 256 | - |
| 256 | 512 | - |
| 256 | 1024 | - |
| 512 | 128 | - |
| 512 | 256 | - |
| 512 | 512 | - |
| 512 | 1024 | - |
| 1024 | 128 | - |
| 1024 | 256 | - |
| 1024 | 512 | - |
| 1024 | 1024 | - |