Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11 • 89
Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement about 12 hours ago • 3
📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important Than You Think 4 days ago • 3
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11 • 89
Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement about 12 hours ago • 3
📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important Than You Think 4 days ago • 3