AIM Paper Checkpoints Uploaded For Replication

This repository hosts one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". The details of this checkpoint are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging:
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
    • Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2
  • AIM: True
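
For readers unfamiliar with the merging method named above, the following is a minimal sketch of task arithmetic on toy weights. It is not the code used to produce this checkpoint. Plain Python lists stand in for tensors, the function name `task_arithmetic_merge` and the `scale` coefficient are hypothetical choices made for illustration, and the AIM step is not shown because this card does not describe it.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of weights.
StateDict = Dict[str, List[float]]


def task_arithmetic_merge(
    base: StateDict,
    finetuned: List[StateDict],
    scale: float = 1.0,  # assumed coefficient; the value used for this checkpoint is not stated
) -> StateDict:
    """Return base + scale * sum_i (finetuned_i - base), parameter by parameter."""
    merged: StateDict = {}
    for name, base_w in base.items():
        # Accumulate the task vectors (fine-tuned weights minus base weights).
        total = [0.0] * len(base_w)
        for ft in finetuned:
            for i, (w_ft, w_b) in enumerate(zip(ft[name], base_w)):
                total[i] += w_ft - w_b
        # Add the scaled sum of task vectors back onto the base weights.
        merged[name] = [w_b + scale * t for w_b, t in zip(base_w, total)]
    return merged


# Toy weights playing the roles of the base, code, math, and instruction-tuned models.
base = {"w": [1.0, 2.0]}
code = {"w": [1.5, 2.0]}
math_ = {"w": [1.0, 3.0]}
instruct = {"w": [0.5, 2.0]}

merged = task_arithmetic_merge(base, [code, math_, instruct], scale=1.0)
print(merged)
```

In this toy case the code and instruction-tuned task vectors cancel on the first weight, so only the math model's change to the second weight survives in the merged result.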

Benchmark results and paper details are available in the paper's official GitHub repository.

Model size: 13B params (Safetensors, tensor type F16)
Repository: ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM
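
For replication, the checkpoint can presumably be loaded with the Hugging Face `transformers` library. The sketch below assumes that `transformers` and `torch` are installed and that the repository ships tokenizer files alongside the weights, which this card does not confirm. The helper name `load_merged_model` is hypothetical. The imports sit inside the function so that the memory estimate can be run without downloading anything.

```python
REPO_ID = "ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM"

# Rough footprint of the weights alone: 13B parameters at 2 bytes each (F16).
APPROX_WEIGHT_GB = 13e9 * 2 / 1e9


def load_merged_model(repo_id: str = REPO_ID):
    """Load the merged checkpoint in half precision (hypothetical helper)."""
    # Imported lazily so this file can be inspected without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repository includes tokenizer files.
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        torch_dtype=torch.float16,  # matches the F16 tensor type listed for this checkpoint
    )
    return model, tokenizer


print(f"Expect roughly {APPROX_WEIGHT_GB:.0f} GB for the weights alone.")
```

Calling `load_merged_model()` downloads the full checkpoint, so it is best run on a machine with enough disk space and memory for a 13B-parameter model.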