Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval By aamirshakir and 2 others • Mar 22, 2024 • 90
Making LLMs lighter with AutoGPTQ and transformers By marcsun13 and 5 others • Aug 23, 2023 • 53
Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳 By merve • Aug 25, 2023 • 31