Papers
arxiv:2505.11427

Mergenetic: a Simple Evolutionary Model Merging Library

Published on May 16
· Submitted by teelinsan on May 19

Abstract

Mergenetic is an open-source library that combines model merging with evolutionary algorithms for flexible experimentation in language models, achieving competitive results with limited computational resources.

AI-generated summary

Model merging allows combining the capabilities of existing models into a new one - post hoc, without additional training. This has made it increasingly popular thanks to its low cost and the availability of libraries that support merging on consumer GPUs. Recent work shows that pairing merging with evolutionary algorithms can boost performance, but no framework currently supports flexible experimentation with such strategies in language models. We introduce Mergenetic, an open-source library for evolutionary model merging. Mergenetic enables easy composition of merging methods and evolutionary algorithms while incorporating lightweight fitness estimators to reduce evaluation costs. We describe its design and demonstrate that Mergenetic produces competitive results across tasks and languages using modest hardware.

Community

Paper author Paper submitter

We propose a simple evolutionary model merging library.

You can merge existing checkpoints on Hugging Face hub to forge new models with superior performance compared to standard merging.

Github Repo: https://github.com/tommasomncttn/mergenetic

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2505.11427 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2505.11427 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2505.11427 in a Space README.md to link it from this page.

Collections including this paper 2