asif00
/

orpheus-bangla-tts

text-generation

text-generation-inference

Model card Files Files and versions Community

asif00 commited on Mar 28

Commit

2254b9c

·

verified ·

1 Parent(s): 740e0ff

--update readme

Files changed (1) hide show

README.md +41 -11

README.md CHANGED Viewed

@@ -1,22 +1,52 @@
 ---
-base_model: unsloth/orpheus-3b-0.1-pretrained-unsloth-bnb-4bit
 tags:
-- text-generation-inference
 - transformers
-- unsloth
 - llama
-- trl
 license: apache-2.0
 language:
-- en
 ---
-# Uploaded  model
-- **Developed by:** asif00
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/orpheus-3b-0.1-pretrained-unsloth-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+base_model:
+- canopylabs/orpheus-3b-0.1-pretrained
 tags:
 - transformers
 - llama
+- gguf
+- text-to-speech
 license: apache-2.0
 language:
+- bn
+datasets:
+- SUST-CSE-Speech/banspeech
+pipeline_tag: text-to-speech
 ---
+# Orpheus Bangla (16 bit)
+## Model Description
+This model is a proof-of-concept fine-tuned version of the Orpheus 3B TTS (Text-to-Speech) model for Bengali language support. The model has been trained using the `SUST-CSE-Speech/banspeech` dataset, which contains 955 audio samples split from audiobooks. This fine-tuning was performed for 10 epochs on a single Google Colab instance equipped with a T4 GPU.
+Please note that this model is currently in the proof-of-concept phase and is **not recommended for production use**.
+## Intended Use
+This model can be used for generating Bengali speech from text. It is ideal for experimenting with TTS systems for Bengali, particularly for audiobooks, conversational AI, or speech synthesis tasks.
+## Model Training
+- **Dataset**: `SUST-CSE-Speech/banspeech` (955 audiobook audio samples)
+- **Training Epochs**: 10 epochs
+- **Hardware**: Google Colab (single T4 GPU)
+- **Training Script**: A modified Unsloth fine-tuning script was used for the training. The script is available on GitHub: [Orpheus TTS Training Script](https://github.com/asiff00/Training-TTS/blob/main/orpheus/orpheus.ipynb).
+## Limitations
+- This model was trained on a small dataset and for a limited number of epochs, which may lead to less natural or less accurate speech synthesis.
+- Since this is a proof-of-concept model, the synthesis quality may vary based on input text and different conditions. It is not optimized for production environments.
+## Model Usage
+```
+```
+## Training Resources:
+- [TTS Training: Style-TTS2](https://github.com/asiff00/Training-TTS/tree/main/style-tts2)
+- [TTS Training: VIT-TTS](https://github.com/asiff00/Training-TTS/tree/main/vit-tts)
+- [On-Device Speech-to-Speech Conversational AI](https://github.com/asiff00/On-Device-Speech-to-Speech-Conversational-AI)
+- [Bangla Llama](https://github.com/asiff00/Bangla-Llama)
+- [Bangla RAG Pipeline, PoRAG](https://github.com/Bangla-RAG/PoRAG)