12 5 12

Sara Papi

spapi

https://sarapapi.github.io/

AI & ML interests

speech processing, speech translation

Recent Activity

upvoted a paper 15 days ago

MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks

commented on a paper 15 days ago

MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks

new activity 2 months ago

FBK-MT/mosel:Add missing Croatian data

View all activity

Organizations

authored 2 papers 3 months ago

NUTSHELL: A Dataset for Abstract Generation from Scientific Talks

Paper • 2502.16942 • Published Feb 24

Granary: Speech Recognition and Translation Dataset in 25 European Languages

Paper • 2505.13404 • Published May 19 • 1

authored 3 papers 8 months ago

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation

Paper • 2406.14177 • Published Jun 20, 2024

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not

Paper • 2409.17044 • Published Sep 25, 2024 • 1

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1, 2024 • 16

authored a paper 11 months ago

What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study

Paper • 2410.00545 • Published Oct 1, 2024 • 5

authored 4 papers about 1 year ago

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection

Paper • 2406.06097 • Published Jun 10, 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling

Paper • 2405.10741 • Published May 17, 2024

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

Paper • 2402.13208 • Published Feb 20, 2024

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

Paper • 2402.12025 • Published Feb 19, 2024 • 1

authored 10 papers over 1 year ago

Attention as a Guide for Simultaneous Speech Translation

Paper • 2212.07850 • Published Dec 15, 2022 • 1

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation

Paper • 2305.11408 • Published May 19, 2023 • 1

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora

Paper • 2209.10608 • Published Sep 21, 2022 • 1

Joint Speech Translation and Named Entity Recognition

Paper • 2210.11987 • Published Oct 21, 2022 • 1

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Paper • 2106.12607 • Published Jun 23, 2021 • 1

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

Paper • 2310.15752 • Published Oct 24, 2023 • 1

Sara Papi

AI & ML interests

Recent Activity

Organizations

spapi's activity