Camais03 commited on
Commit
e563308
·
verified ·
1 Parent(s): eea8cbe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ An advanced deep learning model for automatically tagging anime/manga illustrati
14
  ## 🚀 Updates (April-May 2025)
15
  Apologies for the lack of updates. I've been busy with Uni and unfortunately the extra training (1 epoch) I made the mistake of only loading the model weights and not the optimizer state as well. This is important because the optimizer state maintains a tuned learning rate for each parameter. The reason I had loaded weights only was because training time was much slower so I overlooked the importance of loading the optimizer state.
16
 
17
- The good news is that I should soon be getting an RTX 3090. This means that I should be able to train the entire model in the GPU VRAM (no offloading) and take advantage of the 3x memory bandwidth the 3090 provides. This should mean massive speedups for training time!
18
 
19
  I've also been doing some LLM experiments but they are far from release.
20
 
 
14
  ## 🚀 Updates (April-May 2025)
15
  Apologies for the lack of updates. I've been busy with Uni and unfortunately the extra training (1 epoch) I made the mistake of only loading the model weights and not the optimizer state as well. This is important because the optimizer state maintains a tuned learning rate for each parameter. The reason I had loaded weights only was because training time was much slower so I overlooked the importance of loading the optimizer state.
16
 
17
+ The good news is that I should soon be getting an RTX 3090 soon. This means that I should be able to train the entire model in the GPU VRAM (no offloading) and take advantage of the 3x memory bandwidth the 3090 provides. This should mean massive speedups for training time!
18
 
19
  I've also been doing some LLM experiments but they are far from release.
20