hi-paris
/

ssml-breaks2ssml-fr-lora

@@ -1,4 +1,6 @@
 ---
 license: apache-2.0
 base_model: Qwen/Qwen2.5-7B
 library_name: peft
@@ -8,8 +10,12 @@ tags:
 - french
 - qwen2.5
 - lora
 ---
-# ssml-break2ssml-fr-lora
 This is the second-stage LoRA adapter for **French SSML generation**, converting *pause-annotated text* into full SSML markup with `<break>` tags.
@@ -52,6 +58,9 @@ Output: Bonjour<break time="250ms"/> comment vas-tu ?
 ---
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM
 from peft import PeftModel
@@ -69,8 +78,10 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))```
 ---
 ## 🧪 Evaluation Summary
 | Metric                    | Value         |
 |--------------------------|---------------|
 | Pause Insertion Accuracy | 87.3%         |
@@ -79,9 +90,12 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))```
 Evaluation was performed on a held-out French validation set with annotated SSML pauses. Mean Opinion Score (MOS) improvements were assessed using TTS outputs rendered with Azure Henri voice and rated by 30 native French speakers.
 ---
-## �� Training Data
 This LoRA adapter was trained on a corpus of ~4,500 French utterances. Input texts were annotated with symbolic pause indicators (e.g., `#250` for 250ms), automatically aligned using a combination of Whisper-Kyutai timestamping and F0/syntactic heuristics.
@@ -111,7 +125,7 @@ Training was performed using the [Unsloth](https://github.com/unslothai/unsloth)
 ---
-## ⚠️ Limitations
 - Only `<break>` tags are supported; no pitch, rate, or emphasis control yet.
 - Pause accuracy is sensitive to punctuation and malformed inputs.
@@ -120,6 +134,8 @@ Training was performed using the [Unsloth](https://github.com/unslothai/unsloth)
   🔗 [`nassimaODL/ssml-text2breaks-fr-lora`](https://huggingface.co/nassimaODL/ssml-text2breaks-fr-lora)
 ---
 @inproceedings{ould-ouali2025improving,
   author    = {Nassima Ould-Ouali and Awais Sani and Tim Luka Horstmann and Jonah Dauvet and Ruben Bueno and Éric Moulines},
   title     = {Improving French Synthetic Speech Quality via SSML Prosody Control},

 ---
 license: apache-2.0
 base_model: Qwen/Qwen2.5-7B
 library_name: peft
 - french
 - qwen2.5
 - lora
 ---
+# 🗣️ ssml-break2ssml-fr-lora
 This is the second-stage LoRA adapter for **French SSML generation**, converting *pause-annotated text* into full SSML markup with `<break>` tags.
 ---
+### How to run the code
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM
 from peft import PeftModel
 ---
 ## 🧪 Evaluation Summary
 | Metric                    | Value         |
 |--------------------------|---------------|
 | Pause Insertion Accuracy | 87.3%         |
 Evaluation was performed on a held-out French validation set with annotated SSML pauses. Mean Opinion Score (MOS) improvements were assessed using TTS outputs rendered with Azure Henri voice and rated by 30 native French speakers.
 ---
+## 📚  Training Data
 This LoRA adapter was trained on a corpus of ~4,500 French utterances. Input texts were annotated with symbolic pause indicators (e.g., `#250` for 250ms), automatically aligned using a combination of Whisper-Kyutai timestamping and F0/syntactic heuristics.
 ---
+## ⚠️  Limitations
 - Only `<break>` tags are supported; no pitch, rate, or emphasis control yet.
 - Pause accuracy is sensitive to punctuation and malformed inputs.
   🔗 [`nassimaODL/ssml-text2breaks-fr-lora`](https://huggingface.co/nassimaODL/ssml-text2breaks-fr-lora)
 ---
+## 📖 Citation
 @inproceedings{ould-ouali2025improving,
   author    = {Nassima Ould-Ouali and Awais Sani and Tim Luka Horstmann and Jonah Dauvet and Ruben Bueno and Éric Moulines},
   title     = {Improving French Synthetic Speech Quality via SSML Prosody Control},