Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback Paper β’ 2507.02321 β’ Published 2 days ago β’ 33
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization Paper β’ 2505.20975 β’ Published May 27 β’ 36
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper β’ 2506.06395 β’ Published 29 days ago β’ 126
Image Reconstruction as a Tool for Feature Analysis Paper β’ 2506.07803 β’ Published 26 days ago β’ 28
ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models Paper β’ 2505.22569 β’ Published May 28 β’ 56
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper β’ 2505.22914 β’ Published May 28 β’ 35
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper β’ 2504.06261 β’ Published Apr 8 β’ 110
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper β’ 2503.16660 β’ Published Mar 20 β’ 73
Running on Zero 8 8 Unboxing SDXL with SAEs π Generate and modify images using prompts and features
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper β’ 2503.13358 β’ Published Mar 17 β’ 96
A Primer on the Inner Workings of Transformer-based Language Models Paper β’ 2405.00208 β’ Published Apr 30, 2024 β’ 10
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper β’ 2502.15007 β’ Published Feb 20 β’ 175
Running 2.75k 2.75k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper β’ 2502.06394 β’ Published Feb 10 β’ 90
view article Article Finally, a Replacement for BERT: Introducing ModernBERT By bclavie and 14 others β’ Dec 19, 2024 β’ 660