Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics Paper • 2410.05183 • Published Oct 7, 2024 • 1
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering Paper • 2503.14996 • Published Mar 19 • 3
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 25 days ago • 623
Stress-Testing MGT Detectors via Stylistic Alignment Collection Dataset and Models for the ACL 2025 paper "Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors" • 10 items • Updated 30 days ago • 1
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization Paper • 2506.10920 • Published Jun 12 • 6
Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Paper • 2505.24523 • Published May 30 • 9
Evaluating Lexical Proficiency in Neural Language Models Collection Public collection for our paper: "Evaluating Lexical Proficiency in Neural Language Models", C. Ciaccio, A. Miaschi, F. Dell'Orletta (ACL 2025) • 5 items • Updated May 26 • 2
Steering Large Language Models for Machine Translation Personalization Paper • 2505.16612 • Published May 22 • 6
ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models Paper • 2505.13180 • Published May 19 • 13
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper • 2504.17025 • Published Apr 23 • 17
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated Mar 10 • 13