AXERA-TECH/DeepSeek-R1-Distill-Qwen-7B
Library: Transformers
arXiv: 2501.12948
License: bsd-3-clause
Branch: main
1 contributor
History: 15 commits
Latest commit: qqc1989, "Upload 2 files" (13273c7, verified), about 2 months ago
Name                                Size       Tags  Last commit         Updated
deepseek-r1-7b-ax650                -          -     Upload 2 files      2 months ago
deepseek-r1_tokenizer               -          -     Upload 5 files      2 months ago
.gitattributes                      4.29 kB    -     Upload 4 files      about 2 months ago
README.md                           19.5 kB    Safe  Update README.md    2 months ago
config.json                         23 Bytes   Safe  Create config.json  2 months ago
deepseek-r1_tokenizer.py            4.27 kB    Safe  Upload 5 files      2 months ago
main_axcl_aarch64                   999 kB     LFS   Upload 4 files      about 2 months ago
main_axcl_x86                       1.02 MB    LFS   Upload 4 files      about 2 months ago
main_prefill                        954 kB     LFS   Upload 4 files      about 2 months ago
post_config.json                    277 Bytes  -     Upload 2 files      2 months ago
run_deepseek-r1_7b_ax650.sh         497 Bytes  Safe  Upload 5 files      2 months ago
run_deepseek-r1_7b_axcl_aarch64.sh  502 Bytes  -     Upload 2 files      about 2 months ago
run_deepseek-r1_7b_axcl_x86.sh      498 Bytes  -     Upload 2 files      about 2 months ago