Xternn committed (verified) · Commit 89e5da9 · 1 parent: 56d4cbb

Update README.md

Files changed (1): README.md (+2, −3)
README.md CHANGED
```diff
@@ -47,15 +47,14 @@ library_name: transformers
 </p>
 
 
-## 1. Introduction
+## 1. By DyllBoy
 
 We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
 DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning.
 With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors.
 However, DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. To address these issues and further enhance reasoning performance,
 we introduce DeepSeek-R1, which incorporates cold-start data before RL.
-DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks.
-To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on Llama and Qwen. DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.
+salam hangat Dyll
 
 **NOTE: Before running DeepSeek-R1 series models locally, we kindly recommend reviewing the [Usage Recommendation](#usage-recommendations) section.**
```