view article Article Unlocking Longer Generation with Key-Value Cache Quantization By RaushanTurganbay β’ May 16, 2024 β’ 49
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 860
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 265
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques π π By Isayoften β’ Aug 26, 2024 β’ 63
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29, 2024 β’ 321
view article Article π€ PEFT welcomes new merging methods By smangrul and 1 other β’ Feb 19, 2024 β’ 19
view article Article Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning By Andyrasika β’ Jan 19, 2024 β’ 17
view article Article π€ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other β’ Feb 10, 2023 β’ 82
view article Article RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples By airabbitX β’ Aug 16, 2024 β’ 7
phamvanlinh143/TinyLlama-1.1B-Chat-v1.0-GPTQ-4bit-gs128 Text Generation β’ Updated Aug 16, 2024 β’ 9
view article Article Making LLMs lighter with AutoGPTQ and transformers By marcsun13 and 5 others β’ Aug 23, 2023 β’ 53