Update README.md
README.md (CHANGED)
@@ -53,58 +53,7 @@ The **Lumo-DeepSeek-R1-8B** model is a fine-tuned version of DeepSeek-R1-Distill

  ### **Training Workflow**
  The model was fine-tuned using parameter-efficient methods with **LoRA** to adapt to the Solana-specific domain. Below is a visualization of the training process:

- ```mermaid
- graph TD
-     %% Base Model Section
-     A[Base Model: DeepSeek-R1-Distill-Llama-8B]
-     style A fill:#f9f,stroke:#333,stroke-width:4px
-
-     %% Architecture Details
-     A -->|Architecture Details| B[Model Architecture]
-     B --> B1[8B Parameters]
-     B --> B2[4-bit Quantization]
-     B --> B3[NF4 Quant Type]
-     B --> B4[FP16 Compute]
-
-     %% LoRA Configuration
-     A -->|LoRA Config| C[LoRA Parameters]
-     C --> C1[Rank: 8]
-     C --> C2[Alpha: 32]
-     C --> C3[Dropout: 0.01]
-     C --> C4[Adapter Size: ~10MB]
-
-     %% Training Configuration
-     A -->|Training Setup| D[Training Config]
-     D --> D1[Learning Rate: 3e-4]
-     D --> D2[Batch Size: 1]
-     D --> D3[Gradient Accum: 4]
-     D --> D4[Epochs: 2]
-
-     %% Optimization Flow
-     D -->|Optimization| E[Training Process]
-     E --> E1[AdamW Optimizer]
-     E --> E2[StepLR Scheduler]
-     E --> E3[FP16 Training]
-     E --> E4[Fast Kernels: SDPA]
-
-     %% Final Model
-     E -->|Results In| F[Lumo-DeepSeek-R1-8B]
-     style F fill:#9ef,stroke:#333,stroke-width:4px
-
-     %% Technical Implementation
-     F -->|Implementation| G[Technical Features]
-     G --> G1[BitsAndBytes 4-bit]
-     G --> G2[Auto Device Mapping]
-     G --> G3[Gradient Checkpointing]
-     G --> G4[Packing Strategy]
-
-     classDef default fill:#f9f9f9,stroke:#333,stroke-width:2px;
-     classDef highlight fill:#e1f5fe,stroke:#01579b,stroke-width:2px;
-     classDef config fill:#fff3e0,stroke:#e65100,stroke-width:2px;
-
-     class B,C,D,E config;
-     class F highlight;
- ```
+ [image: training workflow diagram, replacing the Mermaid source above]

  ### **Dataset Sources**
  The dataset comprises curated documentation, cookbooks, and API references from the following sources:
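
For reference, the settings enumerated in the removed diagram map naturally onto the Hugging Face `transformers`/`peft`/`bitsandbytes` stack that its labels point to (BitsAndBytes 4-bit, LoRA, SDPA). Below is a minimal sketch under that assumption; the output path, the StepLR `step_size`/`gamma`, and the dataset wiring are illustrative placeholders, not values taken from this repository.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig, TrainingArguments
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization with FP16 compute, per the diagram's
# "Model Architecture" branch.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

model = AutoModelForCausalLM.from_pretrained(
    "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",  # base model named in the diagram
    quantization_config=bnb_config,
    device_map="auto",           # "Auto Device Mapping"
    attn_implementation="sdpa",  # "Fast Kernels: SDPA"
)
# Prepares the quantized model for training; enables gradient checkpointing
# ("Gradient Checkpointing") by default.
model = prepare_model_for_kbit_training(model)

# LoRA adapter: rank 8, alpha 32, dropout 0.01; at rank 8 the saved adapter
# lands around the ~10 MB the diagram quotes.
lora_config = LoraConfig(r=8, lora_alpha=32, lora_dropout=0.01, task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)

# Training setup from the diagram: LR 3e-4, batch size 1 with 4-step gradient
# accumulation (effective batch 4), 2 epochs, FP16 training.
training_args = TrainingArguments(
    output_dir="lumo-deepseek-r1-8b",  # illustrative output path
    learning_rate=3e-4,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    num_train_epochs=2,
    fp16=True,
)

# TrainingArguments has no built-in StepLR schedule, so AdamW + StepLR are
# constructed explicitly; step_size and gamma here are illustrative guesses.
optimizer = torch.optim.AdamW(model.parameters(), lr=training_args.learning_rate)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=100, gamma=0.9)

# A Trainer (or an SFT trainer with packing enabled, matching the diagram's
# "Packing Strategy" node) would then be built with the tokenized dataset and
# optimizers=(optimizer, scheduler).
```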