gr0010 commited on
Commit
a560ee4
·
verified ·
1 Parent(s): 8f6a41c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -2
README.md CHANGED
@@ -1,3 +1,39 @@
1
  ---
2
- license: unknown
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: other
3
+ ---
4
+
5
+ # Art v0 3B
6
+
7
+ Art v0 3B is our inaugural model in the Art series, fine-tuned from **Qwen/Qwen2.5-3B-Instruct** using a specialized dataset generated with **Gemini 2.0 Flash Thinking**.
8
+
9
+ ## Model Details
10
+ - **Base Model:** Qwen2.5-3B-Instruct
11
+ - **Architecture:** Transformer
12
+ - **Size:** 3B parameters
13
+
14
+ ## Usage
15
+
16
+ The model incorporates a reasoning mechanism using specific tags:
17
+ ```python
18
+ <|start_reasoning|> model's reasoning process <|end_reasoning|> model's response
19
+ ```
20
+
21
+ ### Recommendations
22
+ - Use the model without quantization
23
+ - Use the tokenizer chat template
24
+ - Use a low temperature 0.1-0.3 and repetition_penalty of 1.1
25
+
26
+ ## Training Details
27
+ This experimental model was trained on a curated dataset generated using Gemini 2.0 Flash Thinking. Detailed training methodology, dataset, and code are available exclusively to our community members.
28
+
29
+ ## About Us
30
+ We are a community-funded AI research lab focused on advancing open-source AGI development. Art v0 3B represents our first step in the Art series of models. Our community members support us through Patreon donations.
31
+
32
+ ## Community Access
33
+ Our supporters get exclusive access to:
34
+ - Training dataset
35
+ - Training code and methodology
36
+ - Behind-the-scenes development insights
37
+ - Future model previews
38
+
39
+ [Join Our Community](https://agi-0.com)