valla2345 commited on
Commit
755053c
ยท
verified ยท
1 Parent(s): 04765d5

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - la
4
+ - en
5
+ tags:
6
+ - t5
7
+ - translation
8
+ - latin
9
+ - english
10
+ - hf-trained
11
+ - custom-model
12
+ license: cc-by-4.0
13
+ library_name: transformers
14
+ model_name: William_Tyndale
15
+ datasets:
16
+ - opus
17
+ - bible-uedin
18
+ - tatoeba
19
+ - xlent
20
+ ---
21
+
22
+ # William Tyndale ๐Ÿ•Š๏ธ
23
+
24
+ **William_Tyndale**๋Š” ๋ผํ‹ด์–ด(la)์—์„œ ์˜์–ด(en)๋กœ ๋ฒˆ์—ญํ•˜๊ธฐ ์œ„ํ•ด ํ•™์Šต๋œ `T5-small` ๊ธฐ๋ฐ˜ ์ปค์Šคํ…€ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. Hugging Face์˜ Transformers ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์™€ Seq2SeqTrainer๋ฅผ ์ด์šฉํ•˜์—ฌ ํ•™์Šต๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
25
+
26
+ ## ๐Ÿ“š ํ•™์Šต ๋ฐ์ดํ„ฐ ์ถœ์ฒ˜
27
+
28
+ ์ด ๋ชจ๋ธ์€ ๋‹ค์Œ ๊ณต๊ฐœ ๋ณ‘๋ ฌ ์ฝ”ํผ์Šค๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•™์Šต๋˜์—ˆ์Šต๋‹ˆ๋‹ค:
29
+
30
+ - [**bible-uedin** (CC0 1.0)](http://opus.nlpl.eu/bible-uedin.php): ๋‹ค์–‘ํ•œ ์–ธ์–ด๋กœ ๋œ ์„ฑ๊ฒฝ ๊ตฌ์ ˆ์„ ํฌํ•จํ•œ ๋ง๋ญ‰์น˜
31
+ - [**Tatoeba** (CC BY 2.0 FR)](https://tatoeba.org): ์‚ฌ์šฉ์ž๋“ค์ด ์ œ๊ณตํ•œ ์˜ˆ๋ฌธ ๊ธฐ๋ฐ˜ ๋‹ค๊ตญ์–ด ๋ณ‘๋ ฌ ๋ฌธ์žฅ
32
+ - [**XLENT** (์ธ์šฉ ํ•„์š”)](http://data.statmt.org/xlent/): WikiMatrix, CCAligned ๋“ฑ์—์„œ ์ถ”์ถœ๋œ ๋Œ€๊ทœ๋ชจ ์—”ํ„ฐํ‹ฐ ์ •๋ ฌ ๋ณ‘๋ ฌ ๋ฌธ์žฅ ๋ฐ์ดํ„ฐ์…‹
33
+ - [**OPUS** (CC BY 4.0)](http://opus.nlpl.eu): ๋‹ค์–‘ํ•œ ๊ณต๊ฐœ ๋ฒˆ์—ญ ๋ณ‘๋ ฌ ์ฝ”ํผ์Šค์˜ ๋ชจ์Œ
34
+
35
+ > โš ๏ธ ๊ฐ ๋ฐ์ดํ„ฐ๋Š” ์› ์ถœ์ฒ˜์˜ ๋ผ์ด์„ ์Šค๋ฅผ ๋”ฐ๋ฅด๋ฉฐ, ๋ณธ ๋ชจ๋ธ์€ ์—ฐ๊ตฌ ๋ฐ ํ•™์Šต ๋ชฉ์  ๋ฐฐํฌ๋ฅผ ์ „์ œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.
36
+
37
+ ## ๐Ÿง  ๋ชจ๋ธ ์ •๋ณด
38
+
39
+ - **๋ชจ๋ธ ๊ตฌ์กฐ**: T5-small (220M ํŒŒ๋ผ๋ฏธํ„ฐ)
40
+ - **์ง€์› ์–ธ์–ด์Œ**: Latin โ†’ English
41
+ - **ํ•™์Šต ํ™˜๊ฒฝ**: Kaggle GPU (T4 x2), Transformers 4.51.3
42
+ - **ํ† ํฐํ™”**: `T5Tokenizer` (max_length=128, padding="max_length")
43
+ - **์†์‹ค ํ•จ์ˆ˜**: CrossEntropyLoss
44
+ - **์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜**: AdamW (lr=2e-4, weight_decay=0.01)
45
+ - **ํ‰๊ฐ€์ง€ํ‘œ**: BLEU, ROUGE, METEOR
46
+
47
+ ## โœ๏ธ ๋ผ์ด์„ ์Šค
48
+
49
+ - ๋ชจ๋ธ ์ฝ”๋“œ ๋ฐ ํŒŒ์ƒ ๋ชจ๋ธ: **Creative Commons Attribution 4.0 International (CC BY 4.0)**
50
+ - ํ•™์Šต ๋ฐ์ดํ„ฐ: ๊ฐ ์ถœ์ฒ˜์˜ ๋ผ์ด์„ ์Šค๋ฅผ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค (CC0, CC-BY ๋“ฑ)
51
+
52
+ ## ๐Ÿ™ ์ธ์šฉ
53
+
54
+ ์ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์‹ ๋‹ค๋ฉด ์•„๋ž˜ ํ˜•์‹์œผ๋กœ ์ธ์šฉํ•ด์ฃผ์„ธ์š”.
55
+
56
+ > William_Tyndale, valla2345 (2025). Hugging Face Hub. https://huggingface.co/valla2345/William_Tyndale