Assisted Generation: a new direction toward low-latency text generation By joaogante • May 11, 2023 • 66
Introduction to Quantization cooked in 🤗 with 🧑‍🍳 By merve • Aug 25, 2023 • 33
Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 343
Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 693
🕳️ Attention Sinks in LLMs for endless fluency By tomaarsen • Oct 9, 2023 • 12
Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 66
Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 285
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others • May 24, 2023 • 155