--- inference: false language: - en - zh tags: - instruction-finetuning task_categories: - text-generation base_model: - THUDM/glm-4-9b-chat license: cc-by-nc-nd-4.0 ---

🔍 xVerify-9B-C

xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions. --- ## ✨ Key Features ### 📊 Broad Applicability Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions. ### ⛓️ Handles Long Reasoning Chains Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity. ### 🌐 Multilingual Support Primarily handles Chinese and English responses while remaining compatible with other languages. ### 🔄 Powerful Equivalence Judgment - ✓ Recognizes basic transformations like letter case changes and Greek letter conversions - ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation) - ✓ Determines semantic equivalence in natural language answers - ✓ Matches multiple-choice responses by content rather than just option identifiers --- ## 📚 Citation ```bibtex @misc{xverify_25_github, author = {Ding Chen and Qingchen Yu and Bo Tang and Feiyu Xiong and Zhiyu Li}, title = {xVerify: Efficient Answer Verifier for Large Language Model Evaluations}, url = {https://github.com/IAAR-Shanghai/xVerify}, year={2025} } ```