Add evaluation results on the default config and test split of billsum
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [billsum](https://huggingface.co/datasets/billsum) dataset by
@pszemraj
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-billsum-default-6d3727-15406134).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=billsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=billsum).
README.md
CHANGED
@@ -243,6 +243,39 @@ model-index:
|
|
243 |
type: gen_len
|
244 |
value: 65.4005
|
245 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
246 |
---
|
247 |
|
248 |
# Longformer Encoder-Decoder (LED) fine-tuned on Booksum
|
|
|
243 |
type: gen_len
|
244 |
value: 65.4005
|
245 |
verified: true
|
246 |
+
- task:
|
247 |
+
type: summarization
|
248 |
+
name: Summarization
|
249 |
+
dataset:
|
250 |
+
name: billsum
|
251 |
+
type: billsum
|
252 |
+
config: default
|
253 |
+
split: test
|
254 |
+
metrics:
|
255 |
+
- name: ROUGE-1
|
256 |
+
type: rouge
|
257 |
+
value: 40.5843
|
258 |
+
verified: true
|
259 |
+
- name: ROUGE-2
|
260 |
+
type: rouge
|
261 |
+
value: 17.3401
|
262 |
+
verified: true
|
263 |
+
- name: ROUGE-L
|
264 |
+
type: rouge
|
265 |
+
value: 25.1256
|
266 |
+
verified: true
|
267 |
+
- name: ROUGE-LSUM
|
268 |
+
type: rouge
|
269 |
+
value: 34.6619
|
270 |
+
verified: true
|
271 |
+
- name: loss
|
272 |
+
type: loss
|
273 |
+
value: 4.792657375335693
|
274 |
+
verified: true
|
275 |
+
- name: gen_len
|
276 |
+
type: gen_len
|
277 |
+
value: 163.9394
|
278 |
+
verified: true
|
279 |
---
|
280 |
|
281 |
# Longformer Encoder-Decoder (LED) fine-tuned on Booksum
|