Translating to Traditional Chinese?
Hi,
Great work. I'm wondering whether this model would work with input in Simplified Chinese to give output in Traditional Chinese? If not, are there any models you'd recommend for this step?
Cheers!
Just found out about OpenCC, would that be the mechanism to use, if the model prefers Traditional Chinese as input?
Thx
this model would work with input in Simplified Chinese to give output in Traditional Chinese?
No, if the input is in Simplified Chinese, then the output will typically also be in Simplified Chinese, unless the task explicitly involves translation from Simplified to Traditional Chinese. (幫我把 xxx 翻譯為繁體中文)
Just found out about OpenCC, would that be the mechanism to use, if the model prefers Traditional Chinese as input?
It depends. OpenCC is a solid choice for character-, word-, and phrase-level conversion between Simplified and Traditional Chinese. However, it doesn’t handle rule-based or culturally contextual differences. For example, you can convert 请问民法第一条是什么? to 請問民法第一條是什麼? using OpenCC. But when this converted prompt is send to a LLM, it might interpret it based on a Traditional Chinese legal context (e.g., 「民事,法律所未規定者,依習慣;無習慣者,依法理。」), rather than a Simplified Chinese one (e.g., 「为了保护民事主体的合法权益……制定本法。」).
Therefore, if your use case is not sensitive to cultural context, converting the input to Traditional Chinese may work. But in cases where context matters, the conversion may lead to unexpected responses. For these cases, a Simplified Chinese optimized LLM, such as Qwen or GLM, might work better.