Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning Paper • 2306.00477 • Published Jun 1, 2023 • 1
ApiQ: Finetuning of 2-Bit Quantized Large Language Model Paper • 2402.05147 • Published Feb 7, 2024
Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token Paper • 2211.04898 • Published Nov 9, 2022
3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability Paper • 2409.00119 • Published Aug 28, 2024
Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation Paper • 2505.06027 • Published May 9 • 18
Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 50
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published Jan 31 • 40 • 4