nassimaODL commited on
Commit
1b839a9
·
1 Parent(s): aaba749

update model

Browse files
Files changed (1) hide show
  1. README.md +36 -6
README.md CHANGED
@@ -16,16 +16,46 @@ pipeline_tag: text-generation
16
 
17
  # 🗣️ ssml-text2breaks-fr-lora
18
 
19
- This is the first-stage LoRA adapter for French SSML prosody generation, responsible for inserting symbolic pause markers (e.g., #250, #500) at linguistically appropriate positions in raw French text.
20
 
21
- This model is part of the SSML generation cascade described in the accepted paper:
22
 
23
- **"Improving French Synthetic Speech Quality via SSML Prosody Control"**
24
- Nassima Ould-Ouali, Éric Moulines – ICNLSP 2025 (Springer LNCS) [accepted].
25
 
26
- It performs linguistically-aware break prediction, identifying phrasing and syntactic pause points, and outputs text marked with symbolic durations that can then be converted to valid SSML tags (e.g., <break time="250ms"/>) by the second-stage model.
 
 
 
 
 
 
 
 
 
 
 
 
27
 
28
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  ## Model Details
30
 
31
  - **Base Model**: Qwen/Qwen2.5-7B
@@ -33,7 +63,7 @@ It performs linguistically-aware break prediction, identifying phrasing and synt
33
  - **Task**: Text-to-SSML conversion with pause prediction
34
  - **Languages**: English, French (and others supported by base model)
35
 
36
- ## Usage
37
 
38
  ```python
39
  import torch
 
16
 
17
  # 🗣️ ssml-text2breaks-fr-lora
18
 
19
+ # 🇫🇷 ssml-text2breaks-fr-lora Symbolic Pause Prediction for French TTS
20
 
21
+ **ssml-text2breaks-fr-lora** is a LoRA adapter built on top of `Qwen/Qwen2.5-7B`, trained to predict **symbolic pause markers** (e.g., `#250`, `#500`) in raw French text. These symbolic tags indicate appropriate prosodic boundaries for speech synthesis systems.
22
 
23
+ This model is the **first stage** in the cascaded pipeline presented in:
 
24
 
25
+ > **"Improving French Synthetic Speech Quality via SSML Prosody Control"**
26
+ > *Nassima Ould-Ouali, Éric Moulines* – ICNLSP 2025 (*Springer LNCS*, accepted)
27
+
28
+ It is designed to be followed by [`ssml-break2ssml-fr-lora`](https://huggingface.co/nassimaODL/ssml-break2ssml-fr-lora), which converts symbolic markers into valid SSML tags.
29
+
30
+ ---
31
+
32
+ ## 🧩 Pipeline Overview
33
+
34
+ | Stage | Model Name | Description |
35
+ |-------|------------|-------------|
36
+ | 1️⃣ | `ssml-text2breaks-fr-lora` | Predicts symbolic pause markers such as `#250`, `#500` |
37
+ | 2️⃣ | `ssml-break2ssml-fr-lora` | Converts symbolic markers into `<break time="..."/>` SSML tags |
38
 
39
  ---
40
+
41
+ ## ✨ Example
42
+
43
+ **Input:**
44
+
45
+ ```text
46
+ Bonjour je m'appelle Bertrand Perier. Je suis avocat à la cour.
47
+
48
+ ```
49
+
50
+ **Output**
51
+ ```text
52
+ Bonjour#250 je m'appelle Bertrand Perier.#500 Je suis avocat à la cour.
53
+
54
+ ```
55
+
56
+
57
+
58
+
59
  ## Model Details
60
 
61
  - **Base Model**: Qwen/Qwen2.5-7B
 
63
  - **Task**: Text-to-SSML conversion with pause prediction
64
  - **Languages**: English, French (and others supported by base model)
65
 
66
+ ## 🚀 How to Use
67
 
68
  ```python
69
  import torch