File size: 1,371 Bytes
8f6a41c
a560ee4
 
 
3ed7ea4
a560ee4
 
7a55f84
a560ee4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2ae8a42
a560ee4
 
 
 
 
 
 
 
7a55f84
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
---
license: other
---

# Auto Regressive Thinker (Art) v0 3B

Art v0 3B is our inaugural model in the Art series, fine-tuned from **Qwen/Qwen2.5-3B-Instruct** using a specialized dataset generated with **Gemini 2.0 Flash Thinking**.
[Read more about the Art series](https://blog.agi-0.com/posts/art-series)

## Model Details
- **Base Model:** Qwen2.5-3B-Instruct
- **Architecture:** Transformer
- **Size:** 3B parameters

## Usage

The model incorporates a reasoning mechanism using specific tags:
```python
<|start_reasoning|> model's reasoning process <|end_reasoning|> model's response
```

### Recommendations
- Use the model without quantization
- Use the tokenizer chat template
- Use a low temperature 0.1-0.3 and repetition_penalty of 1.1

## Training Details
This experimental model was trained on a curated dataset generated using Gemini 2.0 Flash Thinking. Detailed training methodology, dataset, and code are available exclusively to our community members.

## About Us
We are a community-funded AI research lab focused on advancing open-source AGI development. Our community members support us through Patreon donations.

## Community Access
Our supporters get exclusive access to:
- Training dataset
- Training code and methodology
- Behind-the-scenes development insights
- Future model previews

[Join Our Community](https://blog.agi-0.com/posts/join-us)