Update README.md
README.md
CHANGED
@@ -49,6 +49,8 @@ MEL is trained on a **curated corpus** of **5.52 million legal texts (~92.7GB)**
 
 To ensure high-quality text processing, documents were preprocessed by **removing unwanted characters, normalizing spacing, chunking texts, and filtering non-Spanish content**.
 
+**Cutoff date:** February 2024
+
 ### Training Configuration
 - **GPU:** NVIDIA A100 80GB PCIe
 - **Training Time:** 13.9 days (~7 days per epoch, 2 epochs total)
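
The README names four preprocessing steps (removing unwanted characters, normalizing spacing, chunking texts, filtering non-Spanish content) but does not show how they were implemented. The sketch below is one possible reading of those steps, not the authors' pipeline. Every function name, the stopword list, the 200-word chunk size, and the 0.15 threshold are assumptions made for illustration; a real pipeline would more likely use a trained language-identification model than a stopword heuristic.

```python
import re
import unicodedata
from typing import Iterable, List

# Hypothetical stopword list used as a crude Spanish detector (assumption).
SPANISH_STOPWORDS = {
    "el", "la", "los", "las", "de", "del", "que", "y", "en", "por",
    "para", "con", "se", "no", "un", "una", "al", "su", "es",
}


def clean_text(text: str) -> str:
    """Drop control/format characters and collapse runs of whitespace."""
    text = unicodedata.normalize("NFC", text)
    # Keep whitespace (collapsed below); drop other Unicode "C*" categories.
    text = "".join(
        ch for ch in text
        if ch.isspace() or unicodedata.category(ch)[0] != "C"
    )
    return re.sub(r"\s+", " ", text).strip()


def chunk_words(text: str, max_words: int = 200) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


def looks_spanish(text: str, min_ratio: float = 0.15) -> bool:
    """Heuristic: share of tokens that are common Spanish stopwords."""
    tokens = re.findall(r"[a-záéíóúüñ]+", text.lower())
    if not tokens:
        return False
    hits = sum(1 for t in tokens if t in SPANISH_STOPWORDS)
    return hits / len(tokens) >= min_ratio


def preprocess(docs: Iterable[str], max_words: int = 200) -> List[str]:
    """Clean each document, keep the Spanish ones, and chunk them."""
    chunks: List[str] = []
    for doc in docs:
        cleaned = clean_text(doc)
        if looks_spanish(cleaned):
            chunks.extend(chunk_words(cleaned, max_words))
    return chunks
```

Filtering runs on the whole cleaned document before chunking, so a short chunk with few stopwords is not discarded by accident. The order of steps in the original pipeline is not stated in the README.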