Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference? Paper • 2310.05079 • Published Oct 8, 2023