lmajnaric commited on
Commit
1d1bc6b
·
verified ·
1 Parent(s): 96088fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -2
README.md CHANGED
@@ -7,6 +7,8 @@ tags:
7
  model-index:
8
  - name: paligemma-architecture-styles
9
  results: []
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -46,11 +48,46 @@ The following hyperparameters were used during training:
46
 
47
  ### Training results
48
 
49
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  ### Framework versions
52
 
53
  - Transformers 4.50.0.dev0
54
  - Pytorch 2.6.0+cu124
55
  - Datasets 3.4.0
56
- - Tokenizers 0.21.0
 
7
  model-index:
8
  - name: paligemma-architecture-styles
9
  results: []
10
+ language:
11
+ - en
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
48
 
49
  ### Training results
50
 
51
+ TrainOutput(global_step=261, training_loss=1.761135561912681,
52
+ metrics={'train_runtime': 1063.4627, 'train_samples_per_second': 1.975,
53
+ 'train_steps_per_second': 0.245, 'total_flos': 3.156513684279552e+16,
54
+ 'train_loss': 1.761135561912681, 'epoch': 2.9714285714285715})
55
+
56
+ ### Evals on base vs fine-tune
57
+
58
+ Base model:
59
+
60
+ Evaluation complete - Accuracy: 0.2400 (240/1000)
61
+
62
+ Performance by style:
63
+ Ancient Egyptian architecture: 0.09 (5/57)
64
+ Art Deco architecture: 0.23 (17/75)
65
+ Art Nouveau architecture: 0.01 (1/73)
66
+ Baroque architecture: 0.26 (15/58)
67
+ Bauhaus architecture: 0.00 (0/58)
68
+ Brutalism: 0.00 (0/38)
69
+ Byzantine architecture: 0.34 (17/50)
70
+ Chicago school architecture: 0.06 (3/51)
71
+ Colonial architecture: 0.30 (27/89)
72
+ Deconstructivism: 0.00 (0/38)
73
+ Gothic architecture: 0.98 (59/60)
74
+ Greek Revival architecture: 0.45 (26/58)
75
+ International style: 0.00 (0/66)
76
+ Neoclassicism: 0.14 (18/125)
77
+ Postmodern architecture: 0.94 (47/50)
78
+ Romanesque architecture: 0.09 (5/54)
79
+ Base model results saved to paligemma448_arch_finetune_styles/base_model_folder_eval_20250316_183525.csv
80
+
81
+ === EVALUATION RESULTS COMPARISON ===
82
+ Fine-tuned model accuracy: 0.8440
83
+ Base model accuracy: 0.2400
84
+ Improvement: 0.6040
85
+
86
+ The checkpoint-176 performs better than the latest checkpoint by .02, even though the training loss is lower on the latest checkpoint.
87
 
88
  ### Framework versions
89
 
90
  - Transformers 4.50.0.dev0
91
  - Pytorch 2.6.0+cu124
92
  - Datasets 3.4.0
93
+ - Tokenizers 0.21.0