view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 491
view article Article Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization By TuringsSolutions • Feb 8, 2024 • 14
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 181
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 106
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 60
view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 63
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 352