Gabriele Sarti's picture

Gabriele Sarti

gsarti

·

https://gsarti.com

AI & ML interests

Interpretability for generative language models

Recent Activity

updated a collection about 14 hours ago

🔍 Interpretability & Analysis of LMs

upvoted a paper about 14 hours ago

Can Interpretation Predict Behavior on Unseen Data?

liked a dataset about 18 hours ago

sardinelab/MF2

View all activity

Organizations

updated a collection about 14 hours ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 121 items • Updated about 14 hours ago • 107

upvoted a paper about 14 hours ago

Can Interpretation Predict Behavior on Unseen Data?

Paper • 2507.06445 • Published 2 days ago • 1

liked a dataset about 18 hours ago

sardinelab/MF2

Viewer • Updated May 15 • 868 • 233 • 3

updated a collection about 21 hours ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 121 items • Updated about 14 hours ago • 107

upvoted a paper about 21 hours ago

Thought Anchors: Which LLM Reasoning Steps Matter?

Paper • 2506.19143 • Published 17 days ago • 11

liked a dataset 1 day ago

uzaymacar/math-rollouts

Updated 16 days ago • 804 • 2

liked a model 2 days ago

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated about 17 hours ago • 12k • • 299

updated a Space 5 days ago

MIRAGE

Model Internals to generate RAG citations

updated a Space 9 days ago

DivEMT Explorer

Explore translations, edits and errors in the DivEMT dataset

upvoted an article 9 days ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

By

•

9 days ago

• 64

updated a collection 10 days ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 121 items • Updated about 14 hours ago • 107

upvoted a paper 10 days ago

TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs

Paper • 2506.23423 • Published 11 days ago • 1

updated a Space 10 days ago

gradio_highlightedtextbox

Gradio component - Editable textarea supporting highlighting

New activity in gsarti/gradio_highlightedtextbox 10 days ago

Update Dockerfile

#8 opened 11 days ago by

New activity in gsarti/gradio_highlightedtextbox 11 days ago

run gradio cc build

#7 opened 11 days ago by

updated a collection 11 days ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 121 items • Updated about 14 hours ago • 107

upvoted a paper 11 days ago

Stochastic Parameter Decomposition

Paper • 2506.20790 • Published 15 days ago • 1

New activity in gsarti/gradio_highlightedtextbox 11 days ago

🚩 Report: Not working

#3 opened 11 months ago by

tip + patch to solve typing

#2 opened about 1 year ago by

fix(parser): Correctly handle literal less-than signs in text

#6 opened 11 days ago by