Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper β’ 2504.06261 β’ Published Apr 8 β’ 110
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper β’ 2503.16660 β’ Published Mar 20 β’ 73
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper β’ 2503.13358 β’ Published Mar 17 β’ 96
A Primer on the Inner Workings of Transformer-based Language Models Paper β’ 2405.00208 β’ Published Apr 30, 2024 β’ 10
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper β’ 2502.15007 β’ Published Feb 20 β’ 175
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper β’ 2502.06394 β’ Published Feb 10 β’ 90
view article Article Finally, a Replacement for BERT: Introducing ModernBERT By bclavie and 14 others β’ Dec 19, 2024 β’ 634
CLEAR: Character Unlearning in Textual and Visual Modalities Paper β’ 2410.18057 β’ Published Oct 23, 2024 β’ 210
Mechanistic Permutability: Match Features Across Layers Paper β’ 2410.07656 β’ Published Oct 10, 2024 β’ 19
π Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized β’ 110 items β’ Updated 1 day ago β’ 102
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model By merve and 2 others β’ May 14, 2024 β’ 252
Layerwise Recurrent Router for Mixture-of-Experts Paper β’ 2408.06793 β’ Published Aug 13, 2024 β’ 33
Linear Transformers with Learnable Kernel Functions are Better In-Context Models Paper β’ 2402.10644 β’ Published Feb 16, 2024 β’ 82
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing Paper β’ 2406.10601 β’ Published Jun 15, 2024 β’ 70
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models Paper β’ 2404.07004 β’ Published Apr 10, 2024 β’ 6
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Paper β’ 2406.08973 β’ Published Jun 13, 2024 β’ 90
Learn Your Reference Model for Real Good Alignment Paper β’ 2404.09656 β’ Published Apr 15, 2024 β’ 87