sebastavar commited on
Commit
302f188
·
verified ·
1 Parent(s): 4350e8a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -55,7 +55,7 @@ print(generate(
55
 
56
  ## Performance (Apple Silicon, real-world)
57
 
58
- LM Studio and CLI (MLX, Q6 gs32): ~6372 tok/s, TTFB ~0.3–0.4 s (2k-token responses)
59
  - tested on on M1 Max 32 GB (short runs show lower t/s due to startup overhead)
60
 
61
  Throughput varies with Mac model, context, and sampler settings.
 
55
 
56
  ## Performance (Apple Silicon, real-world)
57
 
58
+ LM Studio and CLI (MLX, Q6 gs32): ~4955 tok/s, TTFB ~0.35–0.45 s (2k-token responses)
59
  - tested on on M1 Max 32 GB (short runs show lower t/s due to startup overhead)
60
 
61
  Throughput varies with Mac model, context, and sampler settings.