LCFO: Long Context and Long Form Output Dataset and Benchmarking Paper • 2412.08268 • Published Dec 11, 2024
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 15
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification Paper • 2311.13937 • Published Nov 23, 2023 • 1
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation Paper • 2502.04314 • Published Feb 6
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better Paper • 2212.08597 • Published Dec 16, 2022 • 1
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 14
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Paper • 2311.06532 • Published Nov 11, 2023
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation Paper • 2308.11596 • Published Aug 22, 2023 • 1
Text Detoxification using Large Pre-trained Neural Models Paper • 2109.08914 • Published Sep 18, 2021
Methods for Detoxification of Texts for the Russian Language Paper • 2105.09052 • Published May 19, 2021 • 1
Studying the role of named entities for content preservation in text style transfer Paper • 2206.09676 • Published Jun 20, 2022
The first neural machine translation system for the Erzya language Paper • 2209.09368 • Published Sep 19, 2022 • 1
SpeechAlign: a Framework for Speech Translation Alignment Evaluation Paper • 2309.11585 • Published Sep 20, 2023