Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-1.5B
like
4
Follow
AXERA
22
Transformers
arxiv:
2501.12948
License:
bsd-3-clause
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
32ac4bb
DeepSeek-R1-Distill-Qwen-1.5B
Ctrl+K
Ctrl+K
1 contributor
History:
15 commits
qqc1989
Upload 2 files
32ac4bb
verified
3 months ago
deepseek-r1-1.5b-ax630c
Upload 30 files
4 months ago
deepseek-r1-1.5b-ax650
Upload 30 files
4 months ago
deepseek-r1_tokenizer
Upload 6 files
4 months ago
figures
Rename figures/figures_benchmark.jpg to figures/benchmark.jpg
3 months ago
.gitattributes
Safe
7.12 kB
Upload 2 files
3 months ago
README.md
Safe
19.4 kB
Update README.md
3 months ago
config.json
Safe
20 Bytes
Create config.json
3 months ago
deepseek-r1_tokenizer.py
Safe
4.27 kB
Upload 6 files
4 months ago
main_axcl_aarch64
926 kB
LFS
Upload 2 files
3 months ago
main_axcl_x86
946 kB
LFS
Upload 2 files
3 months ago
main_prefill
3.01 MB
LFS
Rename main_prefill_postprocess to main_prefill
3 months ago
run_deepseek-r1_1.5B_ax630c.sh
Safe
512 Bytes
Upload 6 files
4 months ago
run_deepseek-r1_1.5B_ax650.sh
Safe
509 Bytes
Upload 6 files
4 months ago