YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Gujarati AI Poetry Generator

๐ŸŒŸ Introduction

This project explores the intersection of Artificial Intelligence and Satsang literature by generating Gujarati poetry inspired by the works of great poets like Nishkulanand Swami. Using AI models, this tool transforms English prompts into meaningful, poetic Gujarati verses.

โœจ Features

  • Gujarati BPE Tokenizer: Custom-built tokenizer for efficient text generation.
  • Fine-tuned GPT Model: Leveraging GPT-J/GPT-2 for Gujarati poetry generation.
  • Dataset: Based on structured English-Gujarati poetry pairs.
  • Use Cases in Satsang: AI-driven tools for kirtan composition, scripture analysis, and poetic expressions.

๐Ÿ› ๏ธ Installation

# Clone the repository
git clone https://github.com/yourusername/gujarati-ai-poetry.git
cd gujarati-ai-poetry

# Install dependencies
pip install -r requirements.txt

๐Ÿ“œ Dataset Structure

The training dataset follows this format:

{
    "gujarati": [
        {
            "title_gu": "เชชเซเชฐเช•เชฐเชฃ เซง: เชฎเช‚เช—เชณเชพเชšเชฐเชฃ",
            "title_trans": "Prakaran 1: Mangaแธทฤcharaแน‡",
            "target": "เชฎเช‚เช—เชฒเชฎเซ‚เชฐเซเชคเชฟ เชฎเชนเชพเชชเซเชฐเชญเซ, เชถเซเชฐเซ€เชธเชนเชœเชพเชจเช‚เชฆ เชธเซเช–เชฐเซ‚เชช...",
            "source": "The divine Lord is the embodiment of joy and beauty..."
        }
    ]
}

๐Ÿš€ Training & Fine-Tuning

To train the model:

python train.py --model gpt-j --dataset cleaned_final_data.json

To fine-tune GPT-2 from scratch:

python train_gpt2.py --epochs 5 --batch_size 8

๐Ÿ”ฅ Inference Example

from model import generate_poetry
prompt = "The divine Lord is the embodiment of joy and beauty."
response = generate_poetry(prompt)
print(response)

๐ŸŽฏ Future Enhancements

  • Improving rhyme and meter matching.
  • Expanding dataset with more scriptures & kirtans.
  • Deploying a web-based interface for real-time poetry generation.

๐ŸŒŸ Contribute

Feel free to open issues and pull requests! Your contributions can help refine AI-driven Gujarati poetry generation.

๐Ÿ“œ License

This project is licensed under the MIT License.


๐Ÿ”— For more AI & Satsang insights, connect with me on LinkedIn!


license: mit

Downloads last month
320
Safetensors
Model size
1.21B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support