Update README.md
README.md CHANGED
@@ -61,8 +61,9 @@ We tuned learning rate and training epochs for each model and task following [th
 
 For tasks other than MARC-ja, the maximum sequence length is short, so fine-tuning was performed with attention_type set to "original_full". For MARC-ja, both "block_sparse" and "original_full" were used.
 
-| Model |
-|---|
-| Waseda RoBERTa base |
-| Waseda RoBERTa large (seq512) |
-| BigBird base |
+| Model                         | MARC-ja(original_full)/acc | JSTS/pearson | JSTS/spearman | JNLI/acc | JSQuAD/EM | JSQuAD/F1 | JComQA/acc |
+|-------------------------------|----------------------------|--------------|---------------|----------|-----------|-----------|------------|
+| Waseda RoBERTa base           | 0.965                      | 0.913        | 0.876         | 0.905    | 0.853     | 0.916     | 0.853      |
+| Waseda RoBERTa large (seq512) | 0.969                      | 0.925        | 0.890         | 0.928    | 0.910     | 0.955     | 0.900      |
+| BigBird base (original_full)  | 0.959                      | 0.888        | 0.846         | 0.896    | 0.884     | 0.933     | 0.787      |
+| BigBird base (block_sparse)   | 0.959                      | -            | -             | -        | -         | -         | -          |
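
For context on the attention_type setting mentioned in the changed text: below is a minimal sketch (not part of the README diff) of how the two modes could be selected when loading BigBird with Hugging Face Transformers. The checkpoint name is a placeholder for illustration, not the Japanese BigBird model evaluated here.

```python
# Minimal sketch: selecting BigBird's attention_type at load time with
# Hugging Face Transformers. The checkpoint below is a placeholder, not
# the Japanese BigBird checkpoint used in this README.
from transformers import AutoModelForSequenceClassification

checkpoint = "google/bigbird-roberta-base"  # placeholder checkpoint

# Short-sequence tasks (everything except MARC-ja): full attention.
model_full = AutoModelForSequenceClassification.from_pretrained(
    checkpoint, attention_type="original_full"
)

# MARC-ja (long review texts): block-sparse attention can also be used.
model_sparse = AutoModelForSequenceClassification.from_pretrained(
    checkpoint, attention_type="block_sparse"
)
```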