Update src/about.py

src/about.py CHANGED (+10 -10)
@@ -12,9 +12,9 @@ TITLE = """<h1>🇹🇭 Thai LLM Leaderboard</h1>"""
 # <a href="url"></a>
 
 INTRODUCTION_TEXT = """
-The Thai
+The Thai LLM Leaderboard 🇹🇭 aims to standardize evaluation methods for large language models (LLMs) in the Thai language, building on <a href="https://github.com/SEACrowd">SEACrowd</a>.
 As part of an open community project, we welcome you to submit new evaluation tasks or models.
-This leaderboard is developed in collaboration with <a href="https://www.scb10x.com">SCB 10X</a>, <a href="https://www.vistec.ac.th/">
+This leaderboard is developed in collaboration with <a href="https://www.scb10x.com">SCB 10X</a>, <a href="https://www.vistec.ac.th/">VISTEC</a>, and <a href="https://github.com/SEACrowd">SEACrowd</a>.
 """
 
 LLM_BENCHMARKS_TEXT = f"""
@@ -35,25 +35,25 @@ The leaderboard currently consists of the following benchmarks:
 - <a href="https://huggingface.co/datasets/iapp/iapp_wiki_qa_squad">iapp Wiki QA Squad</a>: iapp Wiki QA Squad is an extractive question-answering dataset derived from Thai Wikipedia articles.
 
 
-Metric Implementation Details
+<b>Metric Implementation Details</b>:
 - BLEU is calculated using flores200's tokenizer using HuggingFace `evaluate` <a href="https://huggingface.co/spaces/evaluate-metric/sacrebleu">implementation</a>.
 - ROUGEL is calculated using PyThaiNLP newmm tokenizer and HuggingFace `evaluate` <a href="https://huggingface.co/spaces/evaluate-metric/rouge">implementation</a>.
 - LLM-as-a-judge rating is based on OpenAI's gpt-4o-2024-05-13 using the prompt defined in <a href="https://github.com/lm-sys/FastChat/blob/main/fastchat/llm_judge/data/judge_prompts.jsonl">lmsys MT-Bench</a>.
 
-Reproducibility
+<b>Reproducibility</b>:
 
-- For reproducibility of results, we have open-sourced the evaluation pipeline. Please check out the repository <a href="https://github.com/scb-10x/seacrowd-eval">seacrowd-experiments</a>.
+- For the reproducibility of results, we have open-sourced the evaluation pipeline. Please check out the repository <a href="https://github.com/scb-10x/seacrowd-eval">seacrowd-experiments</a>.
 
-Acknowledgements
+<b>Acknowledgements</b>:
 
 - We are grateful to previous open-source projects that released datasets, tools, and knowledge. We thank community members for tasks and model submissions. To contribute, please see the submit tab.
 """
 
 CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"
 CITATION_BUTTON_TEXT = r"""@misc{thaillm-leaderboard,
-author
-title
-year
-publisher
+    author={SCB 10X and VISTEC and SEACrowd},
+    title={Thai LLM Leaderboard},
+    year={2024},
+    publisher={Hugging Face},
     url={https://huggingface.co/spaces/ThaiLLM-Leaderboard/leaderboard}
 }"""
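The ROUGE-L bullet in the metric notes can be illustrated with a minimal, self-contained sketch. This is not the leaderboard's actual pipeline code: it computes LCS-based ROUGE-L F1 on token lists that are assumed to be pre-tokenized, standing in for the real flow where Thai text would first be split with PyThaiNLP's newmm tokenizer (e.g. `pythainlp.tokenize.word_tokenize(text, engine="newmm")`) and then scored with the HuggingFace `evaluate` rouge implementation.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, ta in enumerate(a, 1):
        for j, tb in enumerate(b, 1):
            # Extend the LCS on a match, otherwise carry the best so far.
            dp[i][j] = dp[i - 1][j - 1] + 1 if ta == tb else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]


def rouge_l(pred_tokens, ref_tokens):
    """ROUGE-L F1: harmonic mean of LCS-based precision and recall.

    Tokens are expected to come from a word tokenizer (for Thai, the
    pipeline described above uses PyThaiNLP's newmm engine).
    """
    lcs = lcs_length(pred_tokens, ref_tokens)
    if lcs == 0:
        return 0.0
    precision = lcs / len(pred_tokens)
    recall = lcs / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Toy example with placeholder tokens:
print(rouge_l(["a", "b", "c", "d"], ["a", "c", "d", "e"]))  # → 0.75
```

Because ROUGE-L is subsequence-based rather than n-gram-based, the choice of word tokenizer matters a great deal for an unsegmented script like Thai, which is why the pipeline fixes the tokenizer explicitly.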