
Gabriele Sarti

gsarti

https://gsarti.com

AI & ML interests

Interpretability for generative language models

Recent Activity

updated a collection about 14 hours ago

🔍 Interpretability & Analysis of LMs

upvoted a paper about 14 hours ago

Can Interpretation Predict Behavior on Unseen Data?

liked a dataset about 18 hours ago

sardinelab/MF2


upvoted a paper about 14 hours ago

Can Interpretation Predict Behavior on Unseen Data?

Paper • 2507.06445 • Published 2 days ago • 1

upvoted a paper about 21 hours ago

Thought Anchors: Which LLM Reasoning Steps Matter?

Paper • 2506.19143 • Published 17 days ago • 11

upvoted an article 9 days ago

Bringing Fusion Down to Earth: ML for Stellarator Optimization

Article • 9 days ago • 64

upvoted a paper 10 days ago

TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs

Paper • 2506.23423 • Published 11 days ago • 1

upvoted a paper 11 days ago

Stochastic Parameter Decomposition

Paper • 2506.20790 • Published 15 days ago • 1

upvoted a collection 23 days ago

ELI-Why

Collection

🧠 ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations ACL Findings 2025 • 4 items • Updated about 1 month ago • 3

upvoted an article 24 days ago

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

Article • and 6 others • 29 days ago • 112

upvoted a paper 27 days ago

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Paper • 2506.10920 • Published 29 days ago • 6

upvoted a paper about 1 month ago

From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit

Paper • 2506.03093 • Published Jun 3 • 2

upvoted a collection about 1 month ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 159

upvoted a paper about 1 month ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 168

upvoted 2 articles about 1 month ago

The Transformers Library: standardizing model definitions

Article • and 3 others • May 15 • 115

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

Article • and 1 other • Jun 2 • 24

upvoted a collection about 1 month ago

FAMA

Collection

The First Large-Scale Open-Science Speech Foundation Model for English and Italian • 5 items • Updated May 30 • 9

upvoted 4 papers about 1 month ago

Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

Paper • 2505.23183 • Published May 29 • 2

upvoted a paper about 2 months ago

Steering Large Language Models for Machine Translation Personalization

Paper • 2505.16612 • Published May 22 • 6

upvoted a paper 2 months ago

Contrastive Explanations That Anticipate Human Misconceptions Can Improve Human Decision-Making Skills

Paper • 2410.04253 • Published Oct 5, 2024 • 1
