Duguce commited on
Commit
a888af5
·
verified ·
1 Parent(s): f178d10

docs: update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -6
README.md CHANGED
@@ -6,6 +6,9 @@ tags:
6
  - instruction-finetuning
7
  task_categories:
8
  - text-generation
 
 
 
9
  ---
10
 
11
 
@@ -20,7 +23,7 @@ xFinder-llama38it is a model specifically designed for key answer extraction in
20
 
21
  ## Model Sources
22
  - **Repository:** https://github.com/IAAR-Shanghai/xFinder
23
- - **Paper:** https://arxiv.org/abs/2405.11874
24
 
25
  ## Uses
26
  xFinder is primarily used to enhance the evaluation of LLMs by accurately extracting key answers from their outputs. It addresses the limitations of traditional regular expression (RegEx)-based extraction methods, which often fail to handle the diverse and complex outputs generated by LLMs. xFinder improves the reliability of model assessments across various tasks.
@@ -31,10 +34,12 @@ xFinder-llama38it is fine-tuned from Llama3-8B-Instruct. The training data consi
31
  xFinder is evaluated on the fully human-annotated test and generalization sets of the KAF dataset. The results demonstrate significant improvements in extraction accuracy and robustness compared to traditional methods. For more details, please refer to the paper and try it out using the provided code.
32
  ## Citation
33
  ```
34
- @article{xFinder,
35
- title={xFinder: Robust and Pinpoint Answer Extraction for Large Language Models},
36
- author={Qingchen Yu and Zifan Zheng and Shichao Song and Zhiyu Li and Feiyu Xiong and Bo Tang and Ding Chen},
37
- journal={arXiv preprint arXiv:2405.11874},
38
- year={2024},
 
 
39
  }
40
  ```
 
6
  - instruction-finetuning
7
  task_categories:
8
  - text-generation
9
+ license: cc-by-nc-nd-4.0
10
+ datasets:
11
+ - IAAR-Shanghai/KAF-Dataset
12
  ---
13
 
14
 
 
23
 
24
  ## Model Sources
25
  - **Repository:** https://github.com/IAAR-Shanghai/xFinder
26
+ - **Paper:** https://openreview.net/forum?id=7UqQJUKaLM
27
 
28
  ## Uses
29
  xFinder is primarily used to enhance the evaluation of LLMs by accurately extracting key answers from their outputs. It addresses the limitations of traditional regular expression (RegEx)-based extraction methods, which often fail to handle the diverse and complex outputs generated by LLMs. xFinder improves the reliability of model assessments across various tasks.
 
34
  xFinder is evaluated on the fully human-annotated test and generalization sets of the KAF dataset. The results demonstrate significant improvements in extraction accuracy and robustness compared to traditional methods. For more details, please refer to the paper and try it out using the provided code.
35
  ## Citation
36
  ```
37
+ @inproceedings{
38
+ xFinder,
39
+ title={xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation},
40
+ author={Qingchen Yu and Zifan Zheng and Shichao Song and Zhiyu li and Feiyu Xiong and Bo Tang and Ding Chen},
41
+ booktitle={The Thirteenth International Conference on Learning Representations},
42
+ year={2025},
43
+ url={https://openreview.net/forum?id=7UqQJUKaLM}
44
  }
45
  ```