arxiv:1710.03743

Confidence through Attention

Published on Oct 10, 2017

Authors:

Matīss Rikters, Mark Fishel

Abstract

Attention distributions of the generated translations are a useful by-product of attention-based recurrent neural network translation models and can be treated as soft alignments between the input and output tokens. In this work, we use attention distributions as a confidence metric for output translations. We present two strategies for using the attention distributions: filtering out bad translations from a large back-translated corpus, and selecting the best translation in a hybrid setup of two different translation systems. While manual evaluation indicated only a weak correlation between our confidence score and human judgments, the use cases showed improvements of up to 2.22 BLEU points for filtering and 0.99 points for hybrid translation, tested on English<->German and English<->Latvian translation.
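The two strategies named in the abstract can be illustrated with a minimal sketch. The abstract does not state the scoring formula, so the score below is an assumption made purely for illustration: the negative mean entropy of each output token's attention row, so that sharp, alignment-like attention scores near 0 and diffuse attention scores lower. The function names `attention_confidence`, `filter_corpus`, and `select_best`, and the `threshold` parameter, are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple

# One attention matrix per translation: rows are output tokens,
# columns are input tokens, and each row is a probability distribution.
Attention = Sequence[Sequence[float]]


def attention_confidence(attention: Attention) -> float:
    """Illustrative stand-in score (an assumption, not the paper's metric).

    Returns the negative mean entropy of the attention rows:
    0.0 for perfectly sharp attention, more negative for diffuse attention.
    """
    if not attention:
        raise ValueError("attention matrix is empty")
    total_entropy = 0.0
    for row in attention:
        # Skip zero weights, since p * log(p) tends to 0 as p tends to 0.
        total_entropy += -sum(p * math.log(p) for p in row if p > 0.0)
    return -total_entropy / len(attention)


def filter_corpus(
    pairs: Sequence[Tuple[str, str]],
    attentions: Sequence[Attention],
    threshold: float,
) -> List[Tuple[str, str]]:
    """Keep sentence pairs whose score is at least `threshold` (hypothetical cutoff)."""
    return [
        pair
        for pair, attention in zip(pairs, attentions)
        if attention_confidence(attention) >= threshold
    ]


def select_best(candidates: Sequence[Tuple[str, Attention]]) -> str:
    """Hybrid selection: return the candidate translation with the highest score."""
    best_translation, _ = max(candidates, key=lambda c: attention_confidence(c[1]))
    return best_translation
```

In a setup like this, `filter_corpus` would be applied to back-translated sentence pairs before training, and `select_best` would receive one candidate per translation system for each input sentence. Any real cutoff would have to be tuned on held-out data.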
