Update README.md
README.md CHANGED
```diff
@@ -26,22 +26,7 @@ Jan-v1 leverages the newly released [Qwen3-4B-thinking](https://huggingface.co/Q
 ### Question Answering (SimpleQA)
 For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.2% accuracy.
 
-| Model | Accuracy |
-| :--- | :--- |
-| **Jan-v1 (Ours)** | **91.1%** |
-| Qwen3-4B-thinking-2507 | 86.5% |
-| Jan-nano-128k-MCP (YaRN 130k) | 83.2% |
-| Jan-nano-MCP | 80.7% |
-| Jan-nano-MCP (YaRN 130k) | 79.7% |
-| Lucy (YaRN 130k) | 78.3% |
-| DeepSeek-V3-MCP | 78.2% |
-| ChatGPT-4.5 | 62.5% |
-| Baseline-MCP | 59.2% |
-| Gemini-2.5-Pro | 52.9% |
-| Claude-3.7-Sonnet | 50% |
-| o3 | 49.4% |
-| Grok-3 | 44.6% |
-| o1 | 42.6% |
+[SimpleQA benchmark results figure]
 
 *The 91.2% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
```