Update README.md
README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 language: en
-license:
+license: apache-2.0
 tags:
 - dora
 - peft
@@ -12,6 +12,10 @@ tags:
 - healthcare
 base_model:
 - Qwen/Qwen3-32B
+datasets:
+- TachyHealth/structured_medical
+pipeline_tag: text-generation
+library_name: transformers
 ---
 
 # Gazal-R1-32B-sft-merged-preview
@@ -82,6 +86,17 @@ response = tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special
 print(response)
 ```
 
-##
+## Performance Results
 
-
+Gazal-R1 achieves exceptional performance across standard medical benchmarks:
+
+| Model | Size | MMLU Pro (Medical) | MedMCQA | MedQA | PubMedQA |
+|-------|------|-------------------|---------|-------|----------|
+| [**Gazal-R1 (Final)**](https://huggingface.co/TachyHealth/Gazal-R1-32B-GRPO-preview) | **32B** | **81.6** | **71.9** | **87.1** | **79.6** |
+| Gazal-R1 (SFT-only) | 32B | 79.3 | 72.3 | 86.9 | 77.6 |
+| Llama 3.1 405B Instruct | 405B | 70.2 | 75.8 | 81.9 | 74.6 |
+| Qwen 2.5 72B Instruct | 72B | 72.1 | 66.2 | 72.7 | 71.7 |
+| Med42-Llama3.1-70B | 70B | 66.1 | 72.4 | 80.4 | 77.6 |
+| Llama 3.1 70B Instruct | 70B | 74.5 | 72.5 | 78.4 | 78.5 |
+| QwQ 32B | 32B | 70.1 | 65.6 | 72.3 | 73.7 |
+| Qwen 3 32B | 32B | 78.4 | 71.6 | 84.4 | 76.7 |