library_name: transformers
tags:
- mergekit
- merge
base_model:
- Qwen/Qwen2.5-14B-Instruct
- qingy2019/Qwen2.5-Math-14B-Instruct
- Qwen/Qwen2.5-14B
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
model-index:


Qwen2.5 Ultimate 14B Instruct

Merged with rombodawg's method, using the first iteration of my Qwen2.5 Math 14B Instruct as one of the source models.

Merge Details

Merge Method

This model was merged using the TIES merge method, with Qwen/Qwen2.5-14B as the base.
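To make the merge method more concrete, here is a minimal pure-Python sketch of the TIES idea on toy parameter vectors: compute each model's difference from the base, trim it by `density`, elect a sign per parameter, and combine only the entries that agree with that sign. The function name `ties_merge` and the toy vectors are hypothetical and exist only for illustration; mergekit works on full tensors and its implementation may differ in details.

```python
def ties_merge(base, finetuned, weights, density=1.0, normalize=True):
    """Toy TIES merge over flat lists of floats (illustrative only)."""
    n = len(base)

    # 1. Task vectors: difference between each fine-tuned model and the base.
    deltas = [[ft[i] - base[i] for i in range(n)] for ft in finetuned]

    # 2. Trim: keep only the top `density` fraction of entries by magnitude.
    #    With density=1 (as in this card's config) nothing is dropped.
    keep = max(1, round(density * n))
    trimmed = []
    for d in deltas:
        order = sorted(range(n), key=lambda i: abs(d[i]), reverse=True)
        top = set(order[:keep])
        trimmed.append([d[i] if i in top else 0.0 for i in range(n)])

    merged = []
    for i in range(n):
        vals = [w * t[i] for w, t in zip(weights, trimmed)]

        # 3. Elect a sign per parameter from the weighted sum of task vectors.
        sign = 1.0 if sum(vals) >= 0 else -1.0

        # 4. Disjoint merge: combine only entries that agree with that sign.
        agree = [(w, v) for w, v in zip(weights, vals) if v * sign > 0]
        update = sum(v for _, v in agree)
        if normalize and agree:
            update /= sum(w for w, _ in agree)

        merged.append(base[i] + update)
    return merged


# Hypothetical 3-parameter "models", standing in for real weight tensors.
base = [0.0, 1.0, -1.0]
math_ft = [0.5, 1.5, -2.0]
instruct_ft = [0.25, 0.75, -1.5]

print(ties_merge(base, [math_ft, instruct_ft], weights=[1, 1], density=1.0))
# → [0.375, 1.5, -1.75]
```

In the second parameter the two models pull in opposite directions, so only the update matching the elected sign is kept rather than averaging the two into a smaller value.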

Models Merged

The following models were included in the merge:

- qingy2019/Qwen2.5-Math-14B-Instruct
- Qwen/Qwen2.5-14B-Instruct

Configuration

The following YAML configuration was used to produce this model:


models:
  - model: qingy2019/Qwen2.5-Math-14B-Instruct
    parameters:
      weight: 1
      density: 1
  - model: Qwen/Qwen2.5-14B-Instruct
    parameters:
      weight: 1
      density: 1
merge_method: ties
base_model: Qwen/Qwen2.5-14B
parameters:
  weight: 1
  density: 1
  normalize: true
  int8_mask: true
tokenizer_source: qingy2019/Qwen2.5-Math-14B-Instruct
dtype: bfloat16


Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|------:|
| Avg.                | 29.29 |
| IFEval (0-Shot)     | 39.38 |
| BBH (3-Shot)        | 40.58 |
| MATH Lvl 5 (4-Shot) | 28.02 |
| GPQA (0-shot)       | 14.21 |
| MuSR (0-shot)       |  9.89 |
| MMLU-PRO (5-shot)   | 43.66 |
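
As a quick sanity check, the reported average appears to be the unweighted mean of the six benchmark scores. The short sketch below recomputes it from the table values; the equal-weight assumption is inferred from the numbers, not stated in this card.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 39.38,
    "BBH (3-Shot)": 40.58,
    "MATH Lvl 5 (4-Shot)": 28.02,
    "GPQA (0-shot)": 14.21,
    "MuSR (0-shot)": 9.89,
    "MMLU-PRO (5-shot)": 43.66,
}

# Assumption: every benchmark carries equal weight in the average.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # → 29.29
```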