lmajnaric commited on
Commit
ea0c9f9
·
verified ·
1 Parent(s): 09249f2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -13
README.md CHANGED
@@ -14,22 +14,13 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # paligemma-architecture
16
 
17
- This model is a fine-tuned version of [google/paligemma2-3b-pt-448](https://huggingface.co/google/paligemma2-3b-pt-448) on the None dataset.
18
 
19
- ## Model description
20
-
21
- More information needed
22
-
23
- ## Intended uses & limitations
24
-
25
- More information needed
26
-
27
- ## Training and evaluation data
28
-
29
- More information needed
30
 
31
  ## Training procedure
32
 
 
 
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
@@ -46,7 +37,10 @@ The following hyperparameters were used during training:
46
 
47
  ### Training results
48
 
49
-
 
 
 
50
 
51
  ### Framework versions
52
 
 
14
 
15
  # paligemma-architecture
16
 
17
+ This model is a fine-tuned version of [google/paligemma2-3b-pt-448](https://huggingface.co/google/paligemma2-3b-pt-448) on a custom architecture dataset.
18
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
  ## Training procedure
21
 
22
+ Followed the [notebook from smol-vision](https://github.com/merveenoyan/smol-vision/blob/main/Fine_tune_PaliGemma.ipynb), adjusted dataset loading and some parameters.
23
+
24
  ### Training hyperparameters
25
 
26
  The following hyperparameters were used during training:
 
37
 
38
  ### Training results
39
 
40
+ TrainOutput(global_step=352, training_loss=7.797419488430023,
41
+ metrics={'train_runtime': 1653.6164, 'train_samples_per_second': 1.705,
42
+ 'train_steps_per_second': 0.213, 'total_flos': 5.772661476596784e+16,
43
+ 'train_loss': 7.797419488430023, 'epoch': 3.9645390070921986})
44
 
45
  ### Framework versions
46