CryptoMiner16 commited on
Commit
d16f5a2
·
verified ·
1 Parent(s): 72b6faf

Update README.md

Browse files

Hi, this is Gemma3-1B-IT model trained on Bhagwad_Gita dataset https://www.kaggle.com/datasets/a2m2a2n2/bhagwad-gita-dataset . Trained on Unsloth on local Ubuntu machine with RTX 3060 12GB. Time taken for training was ~2.36min.
training_loss=5.790790309906006, metrics={'train_runtime': 141.6324, 'train_samples_per_second': 11.297, 'train_steps_per_second': 1.412, 'total_flos': 335526094897152.0, 'train_loss': 5.790790309906006})

Files changed (1) hide show
  1. README.md +11 -3
README.md CHANGED
@@ -1,3 +1,11 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ - hi
6
+ base_model:
7
+ - google/gemma-3-1b-it
8
+ pipeline_tag: text-generation
9
+ tags:
10
+ - text-generation-inference
11
+ ---