Update README.md
Browse files
README.md
CHANGED
@@ -16,6 +16,8 @@ Apologies for the lack of updates. I've been busy with Uni and unfortunately the
|
|
16 |
|
17 |
The good news is that I should soon be getting an RTX 3090. This means that I should be able to train the entire model in the GPU VRAM (no offloading) and take advantage of the 3x memory bandwidth the 3090 provides. This should mean massive speedups for training time!
|
18 |
|
|
|
|
|
19 |
## 🔑 Key Highlights
|
20 |
|
21 |
- **Efficient Training**: Completed on just a single RTX 3060 GPU (12GB VRAM)
|
|
|
16 |
|
17 |
The good news is that I should soon be getting an RTX 3090. This means that I should be able to train the entire model in the GPU VRAM (no offloading) and take advantage of the 3x memory bandwidth the 3090 provides. This should mean massive speedups for training time!
|
18 |
|
19 |
+
I've also been doing some LLM experiments but they are far from release.
|
20 |
+
|
21 |
## 🔑 Key Highlights
|
22 |
|
23 |
- **Efficient Training**: Completed on just a single RTX 3060 GPU (12GB VRAM)
|