Update README.md
Browse files
README.md
CHANGED
@@ -7,6 +7,8 @@ tags:
|
|
7 |
model-index:
|
8 |
- name: paligemma-architecture-styles
|
9 |
results: []
|
|
|
|
|
10 |
---
|
11 |
|
12 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -46,11 +48,46 @@ The following hyperparameters were used during training:
|
|
46 |
|
47 |
### Training results
|
48 |
|
49 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
51 |
### Framework versions
|
52 |
|
53 |
- Transformers 4.50.0.dev0
|
54 |
- Pytorch 2.6.0+cu124
|
55 |
- Datasets 3.4.0
|
56 |
-
- Tokenizers 0.21.0
|
|
|
7 |
model-index:
|
8 |
- name: paligemma-architecture-styles
|
9 |
results: []
|
10 |
+
language:
|
11 |
+
- en
|
12 |
---
|
13 |
|
14 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
48 |
|
49 |
### Training results
|
50 |
|
51 |
+
TrainOutput(global_step=261, training_loss=1.761135561912681,
|
52 |
+
metrics={'train_runtime': 1063.4627, 'train_samples_per_second': 1.975,
|
53 |
+
'train_steps_per_second': 0.245, 'total_flos': 3.156513684279552e+16,
|
54 |
+
'train_loss': 1.761135561912681, 'epoch': 2.9714285714285715})
|
55 |
+
|
56 |
+
### Evals on base vs fine-tune
|
57 |
+
|
58 |
+
Base model:
|
59 |
+
|
60 |
+
Evaluation complete - Accuracy: 0.2400 (240/1000)
|
61 |
+
|
62 |
+
Performance by style:
|
63 |
+
Ancient Egyptian architecture: 0.09 (5/57)
|
64 |
+
Art Deco architecture: 0.23 (17/75)
|
65 |
+
Art Nouveau architecture: 0.01 (1/73)
|
66 |
+
Baroque architecture: 0.26 (15/58)
|
67 |
+
Bauhaus architecture: 0.00 (0/58)
|
68 |
+
Brutalism: 0.00 (0/38)
|
69 |
+
Byzantine architecture: 0.34 (17/50)
|
70 |
+
Chicago school architecture: 0.06 (3/51)
|
71 |
+
Colonial architecture: 0.30 (27/89)
|
72 |
+
Deconstructivism: 0.00 (0/38)
|
73 |
+
Gothic architecture: 0.98 (59/60)
|
74 |
+
Greek Revival architecture: 0.45 (26/58)
|
75 |
+
International style: 0.00 (0/66)
|
76 |
+
Neoclassicism: 0.14 (18/125)
|
77 |
+
Postmodern architecture: 0.94 (47/50)
|
78 |
+
Romanesque architecture: 0.09 (5/54)
|
79 |
+
Base model results saved to paligemma448_arch_finetune_styles/base_model_folder_eval_20250316_183525.csv
|
80 |
+
|
81 |
+
=== EVALUATION RESULTS COMPARISON ===
|
82 |
+
Fine-tuned model accuracy: 0.8440
|
83 |
+
Base model accuracy: 0.2400
|
84 |
+
Improvement: 0.6040
|
85 |
+
|
86 |
+
The checkpoint-176 performs better than the latest checkpoint by .02, even though the training loss is lower on the latest checkpoint.
|
87 |
|
88 |
### Framework versions
|
89 |
|
90 |
- Transformers 4.50.0.dev0
|
91 |
- Pytorch 2.6.0+cu124
|
92 |
- Datasets 3.4.0
|
93 |
+
- Tokenizers 0.21.0
|