Update README.md
Browse files
README.md
CHANGED
@@ -53,7 +53,7 @@ This variant is trained synthetic data and on the RAGTruth dataset for hallucina
|
|
53 |
## Training Procedure
|
54 |
|
55 |
- Tokenizer: AutoTokenizer; DataCollatorForTokenClassification; label pad −100
|
56 |
-
- Max length: 8k; batch size:
|
57 |
- Optimizer: AdamW (lr 1e‑5, weight_decay 0.01)
|
58 |
- Hardware: Single A100 80GB
|
59 |
|
|
|
53 |
## Training Procedure
|
54 |
|
55 |
- Tokenizer: AutoTokenizer; DataCollatorForTokenClassification; label pad −100
|
56 |
+
- Max length: 8k; batch size: 16; epochs: 5
|
57 |
- Optimizer: AdamW (lr 1e‑5, weight_decay 0.01)
|
58 |
- Hardware: Single A100 80GB
|
59 |
|