ymoslem commited on
Commit
6757f43
·
verified ·
1 Parent(s): cccf359

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md CHANGED
@@ -236,3 +236,22 @@ The following hyperparameters were used during training:
236
  - Pytorch 2.2.0+cu121
237
  - Datasets 2.20.0
238
  - Tokenizers 0.19.1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
236
  - Pytorch 2.2.0+cu121
237
  - Datasets 2.20.0
238
  - Tokenizers 0.19.1
239
+
240
+
241
+ ## Citation
242
+
243
+ ```
244
+ @inproceedings{moslem-2024-leveraging,
245
+ title = "Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation",
246
+ author = "Moslem, Yasmin",
247
+ booktitle = "Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024)",
248
+ month = aug,
249
+ year = "2024",
250
+ address = "Bangkok, Thailand (in-person and online)",
251
+ publisher = "Association for Computational Linguistics",
252
+ url = "https://aclanthology.org/2024.iwslt-1.31/",
253
+ doi = "10.18653/v1/2024.iwslt-1.31",
254
+ pages = "265--273",
255
+ abstract = "This paper describes our system submission to the International Conference on Spoken Language Translation (IWSLT 2024) for Irish-to-English speech translation. We built end-to-end systems based on Whisper, and employed a number of data augmentation techniques, such as speech back-translation and noise augmentation. We investigate the effect of using synthetic audio data and discuss several methods for enriching signal diversity."
256
+ }
257
+ ```