AhmedMostafa committed on
Commit 6b6722c · verified · 1 Parent(s): 0922348

Update README.md

Files changed (1)
  1. README.md +18 -3
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  language: en
- license: agpl-3.0
+ license: apache-2.0
  tags:
  - dora
  - peft
@@ -12,6 +12,10 @@ tags:
  - healthcare
  base_model:
  - Qwen/Qwen3-32B
+ datasets:
+ - TachyHealth/structured_medical
+ pipeline_tag: text-generation
+ library_name: transformers
  ---
 
  # Gazal-R1-32B-sft-merged-preview
@@ -82,6 +86,17 @@ response = tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special
  print(response)
  ```
 
- ## Benchmarks
+ ## Performance Results
 
- TBA
+ Gazal-R1 achieves exceptional performance across standard medical benchmarks:
+
+ | Model | Size | MMLU Pro (Medical) | MedMCQA | MedQA | PubMedQA |
+ |-------|------|-------------------|---------|-------|----------|
+ | [**Gazal-R1 (Final)**](https://huggingface.co/TachyHealth/Gazal-R1-32B-GRPO-preview) | **32B** | **81.6** | **71.9** | **87.1** | **79.6** |
+ | Gazal-R1 (SFT-only) | 32B | 79.3 | 72.3 | 86.9 | 77.6 |
+ | Llama 3.1 405B Instruct | 405B | 70.2 | 75.8 | 81.9 | 74.6 |
+ | Qwen 2.5 72B Instruct | 72B | 72.1 | 66.2 | 72.7 | 71.7 |
+ | Med42-Llama3.1-70B | 70B | 66.1 | 72.4 | 80.4 | 77.6 |
+ | Llama 3.1 70B Instruct | 70B | 74.5 | 72.5 | 78.4 | 78.5 |
+ | QwQ 32B | 32B | 70.1 | 65.6 | 72.3 | 73.7 |
+ | Qwen 3 32B | 32B | 78.4 | 71.6 | 84.4 | 76.7 |
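
The table added in this commit can be checked with a little arithmetic. The following is a minimal sketch, not part of the README: the scores are copied verbatim from the table, while the names `SCORES`, `mean_score`, `delta`, and `best_per_benchmark` are hypothetical helpers introduced only for illustration. The unweighted mean is an assumption made here for comparison; the model card does not report an aggregate score.

```python
# Benchmark order used for every score tuple below.
BENCHMARKS = ("MMLU Pro (Medical)", "MedMCQA", "MedQA", "PubMedQA")

# Scores copied verbatim from the table in this commit.
SCORES = {
    "Gazal-R1 (Final)": (81.6, 71.9, 87.1, 79.6),
    "Gazal-R1 (SFT-only)": (79.3, 72.3, 86.9, 77.6),
    "Llama 3.1 405B Instruct": (70.2, 75.8, 81.9, 74.6),
    "Qwen 2.5 72B Instruct": (72.1, 66.2, 72.7, 71.7),
    "Med42-Llama3.1-70B": (66.1, 72.4, 80.4, 77.6),
    "Llama 3.1 70B Instruct": (74.5, 72.5, 78.4, 78.5),
    "QwQ 32B": (70.1, 65.6, 72.3, 73.7),
    "Qwen 3 32B": (78.4, 71.6, 84.4, 76.7),
}


def mean_score(model: str) -> float:
    """Unweighted mean over the four benchmarks (an assumption, not a reported metric)."""
    values = SCORES[model]
    return round(sum(values) / len(values), 2)


def delta(model_a: str, model_b: str) -> dict:
    """Per-benchmark difference model_a minus model_b, rounded to one decimal."""
    return {
        bench: round(a - b, 1)
        for bench, a, b in zip(BENCHMARKS, SCORES[model_a], SCORES[model_b])
    }


def best_per_benchmark() -> dict:
    """Name of the top-scoring model for each benchmark column."""
    return {
        bench: max(SCORES, key=lambda model: SCORES[model][i])
        for i, bench in enumerate(BENCHMARKS)
    }


if __name__ == "__main__":
    print(delta("Gazal-R1 (Final)", "Gazal-R1 (SFT-only)"))
    print(best_per_benchmark())
    for name in SCORES:
        print(f"{name}: {mean_score(name)}")
```

On these numbers, Gazal-R1 (Final) has the highest score on three of the four benchmarks, Llama 3.1 405B Instruct has the highest MedMCQA score, and the SFT-only checkpoint is 0.4 points ahead of the final model on MedMCQA.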