Azion
/

bert-based-chinese

Model card Files Files and versions

EZlee commited on Aug 25, 2023

Commit

dd06ceb

·

1 Parent(s): 0e7f293

Update README.md

Files changed (1) hide show

README.md +31 -15

README.md CHANGED Viewed

@@ -6,12 +6,6 @@ language:
 pipeline_tag: fill-mask
 ---
-| Dataset\BERT Pretrain  | bert-based-chinese | ckiplab | GufoLab |
-| ------------- |:-------------:|:-------------:|:-------------:|
-| 5000 Tradition Chinese Dataset	|0.7183|	0.6989|	**0.8081**|
-| 10000 Sol-Idea Dataset	| 0.7874|	0.7913|	**0.8025**|
-| ALL DataSet	| 0.7694| 	0.7678| 	**0.8038**|
 ### Model Sources
 - **Paper:** [BERT](https://arxiv.org/abs/1810.04805)
@@ -22,13 +16,6 @@ pipeline_tag: fill-mask
 This model can be used for masked language modeling
-## Risks, Limitations and Biases
-**CONTENT WARNING: Readers should be aware this section contains content that is disturbing, offensive, and can propagate historical and current stereotypes.**
-Significant research has explored bias and fairness issues with language models (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)).
 ## Training
 #### Training Procedure
@@ -41,12 +28,41 @@ botp/yentinglin-zh_TW_c4
 ## Evaluation
-#### Results
-[More Information Needed]
 ## How to Get Started With the Model
 ```python
 from transformers import AutoTokenizer, AutoModelForMaskedLM

 pipeline_tag: fill-mask
 ---
 ### Model Sources
 - **Paper:** [BERT](https://arxiv.org/abs/1810.04805)
 This model can be used for masked language modeling
 ## Training
 #### Training Procedure
 ## Evaluation
+| Dataset\BERT Pretrain  | bert-based-chinese | ckiplab | GufoLab |
+| ------------- |:-------------:|:-------------:|:-------------:|
+| 5000 Tradition Chinese Dataset	|0.7183|	0.6989|	**0.8081**|
+| 10000 Sol-Idea Dataset	| 0.7874|	0.7913|	**0.8025**|
+| ALL DataSet	| 0.7694| 	0.7678| 	**0.8038**|
+#### Results
+| Test ID\Results  | [MASK] Input | Result Output |
+| -------------|-------------|-------------|
+| 1|今天禮拜[MASK]？我[MASK]是很想[MASK]班。|今天禮拜六？我不是很想上班。 |
+| 2|[MASK]灣並[MASK]是[MASK]國不可分割的一部分。|臺灣並不是中國不可分割的一部分。 |
+| 3|如果可以是韋[MASK]安的最新歌[MASK]。|如果可以是韋禮安的最新歌曲。 |
+| 4|[MASK]水老[MASK]有賣很多鐵蛋的攤販。|淡水老街有賣很多鐵蛋的攤販。 |
 ## How to Get Started With the Model
+#### Private Model Download
+**Installation**
+```
+$ curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
+$ sudo apt-get install git-lfs
+$ git lfs install
+$ pip install huggingface_hub
+```
+**Login HuggingFace**
+```
+$ huggingface-cli login
+Token:Your own 'write' token.
+```
+**Pyhon Code**
 ```python
 from transformers import AutoTokenizer, AutoModelForMaskedLM