Text Generation
Transformers
Safetensors
English
Chinese

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

DeepTranslate-R1-1.5B-Preview 概述

DeepTranslate-R1-1.5B-Preview 是一个从 DeepSeek-R1-Distilled-Qwen-1.5B 微调而来的语言模型,专门用于英文和中文之间的高质量翻译。我们的模型使用监督式微调(SFT)技术,在仅有1.5B参数的计算效率下实现高质量翻译。

输出模板

TEMPLATE """{{- if .System }}{{ .System }}{{ end }}
{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice $.Messages $i)) 1}}
{{- if eq .Role "user" }}<|User|>{{ .Content }}
{{- else if eq .Role "assistant" }}<|Assistant|>{{ .Content }}{{- if not $last }}<|end▁of▁sentence|>{{- end }}
{{- end }}
{{- if and $last (ne .Role "assistant") }}<|Assistant|>{{- end }}
{{- end }}"""

作者

Downloads last month
0
Safetensors
Model size
1.78B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for FradSer/DeepTranslate-R1-1.5B-Preview

Finetuned
(254)
this model

Dataset used to train FradSer/DeepTranslate-R1-1.5B-Preview