KV Caching Explained: Optimizing Transformer Inference Efficiency. By not-lain • Jan 30 • 78
Decoding Strategies in Large Language Models. By mlabonne • Oct 29, 2024 • 67
Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth. By mlabonne • Jul 29, 2024 • 331
StackLLaMA: A hands-on guide to train LLaMA with RLHF. By edbeeching and 6 others • Apr 5, 2023 • 38
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU. By edbeeching and 5 others • Mar 9, 2023 • 55
Fine-Tuning Gemma Models in Hugging Face. By svaibhav and 3 others • Feb 23, 2024 • 35
Illustrating Reinforcement Learning from Human Feedback (RLHF). By natolambert and 3 others • Dec 9, 2022 • 272