29 31

Nikita

PQlet

AI & ML interests

None yet

Organizations

None yet

upvoted an article 6 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.1k

upvoted 2 papers 7 months ago

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published Aug 15, 2025 • 40

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 47

upvoted a paper 8 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 94

upvoted 5 papers 9 months ago

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Paper • 2507.02321 • Published Jul 3, 2025 • 39

DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Paper • 2505.20975 • Published May 27, 2025 • 36

upvoted a paper 10 months ago

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28, 2025 • 37

upvoted 2 papers 12 months ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8, 2025 • 110

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20, 2025 • 72

upvoted 4 papers about 1 year ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17, 2025 • 95

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 12

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 175

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10, 2025 • 89

upvoted an article about 1 year ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

737

upvoted a paper over 1 year ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 209

upvoted an article over 1 year ago

Article

Understanding InstaFlow/Rectified Flow

Oct 6, 2023

•

upvoted a paper over 1 year ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 20

Nikita

AI & ML interests

Organizations

PQlet's activity

Mixture of Experts Explained

Finally, a Replacement for BERT: Introducing ModernBERT

Understanding InstaFlow/Rectified Flow