view article Article Unlocking Longer Generation with Key-Value Cache Quantization By RaushanTurganbay β’ May 16, 2024 β’ 49
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 860
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 265
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques π π By Isayoften β’ Aug 26, 2024 β’ 63
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29, 2024 β’ 321
view article Article π€ PEFT welcomes new merging methods By smangrul and 1 other β’ Feb 19, 2024 β’ 19
view article Article Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning By Andyrasika β’ Jan 19, 2024 β’ 17
view article Article π€ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other β’ Feb 10, 2023 β’ 82
view article Article RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples By airabbitX β’ Aug 16, 2024 β’ 7
view article Article Making LLMs lighter with AutoGPTQ and transformers By marcsun13 and 5 others β’ Aug 23, 2023 β’ 53
view article Article Overview of natively supported quantization schemes in π€ Transformers By ybelkada and 4 others β’ Sep 12, 2023 β’ 12
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others β’ May 24, 2023 β’ 148
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper β’ 2406.06525 β’ Published Jun 10, 2024 β’ 71
TextGrad: Automatic "Differentiation" via Text Paper β’ 2406.07496 β’ Published Jun 11, 2024 β’ 32
The Prompt Report: A Systematic Survey of Prompting Techniques Paper β’ 2406.06608 β’ Published Jun 6, 2024 β’ 64
An Image is Worth 32 Tokens for Reconstruction and Generation Paper β’ 2406.07550 β’ Published Jun 11, 2024 β’ 60