Update README.md
README.md
2. **Fine Tuning**:

Our fine-tuning process primarily leveraged the Low-Rank Adaptation (LoRA) technique, incorporating specific hyperparameters and strategies to ensure optimal model performance. A masking and function-shuffling technique was applied during training. The training setup is as follows:

- **LoRA Rank**: 32
- **Learning Rate**: 5e-5
- **Warmup Steps**: 100
- **LR Scheduler Type**: Cosine
- **Batch Size**: 4
- **Gradient Accumulation Steps**: 2
- **Hardware**: 4x A100 (80G) GPUs
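For reference, the sketch below shows one way this setup could map onto a Hugging Face `transformers` + `peft` training script. Only the hyperparameters listed above come from this repository; the base model name, `lora_alpha`, dropout, target modules, and epoch count are illustrative placeholders, and the actual training code may differ.

```python
# Minimal LoRA fine-tuning sketch under the setup listed above.
# Values marked "assumed" are NOT taken from this repository.
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from peft import LoraConfig, get_peft_model

base_model = "base-model-name"  # placeholder; see the model card for the actual base model

model = AutoModelForCausalLM.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model)

lora_config = LoraConfig(
    r=32,                                  # LoRA rank from the setup above
    lora_alpha=64,                         # assumed, not stated in the README
    lora_dropout=0.05,                     # assumed, not stated in the README
    target_modules=["q_proj", "v_proj"],   # assumed attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

training_args = TrainingArguments(
    output_dir="lora-finetune",
    per_device_train_batch_size=4,     # batch size from the setup above
    gradient_accumulation_steps=2,     # gradient accumulation from the setup above
    learning_rate=5e-5,
    warmup_steps=100,
    lr_scheduler_type="cosine",
    num_train_epochs=3,                # assumed; the epoch count is not stated
    bf16=True,                         # A100 GPUs support bfloat16
)
```

With a per-device batch size of 4, gradient accumulation of 2, and 4 GPUs, the effective global batch size works out to 32. The masking and function-shuffling step is part of data preprocessing and is not shown here.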
3. **Inference**: