hannayukhymenko 
posted an update 1 day ago
🚀 We are delighted to announce MamayLM, a new state-of-the-art efficient Ukrainian LLM!

📈 MamayLM surpasses similar-sized models in both English and Ukrainian, while matching or overtaking models up to 10x larger.

📊 MamayLM is a 9B model that can run on a single GPU, enabling cost-efficient AI autonomy and adoption across sectors in Ukraine such as education, legal, healthcare, public services and others (e.g., by specializing it to particular use cases). MamayLM is also attractive for organizations wishing to preserve data privacy, as its efficiency allows it to run on a local machine.

🧠 MamayLM is trained on high-quality Ukrainian data and understands the Ukrainian language, culture, and history. It is built on top of Google’s Gemma 2 9B model, but uses a number of new advances stemming from INSAIT’s experience in creating BgGPT, a Bulgarian LLM we released last year, now adopted nationwide and profiled several times by Google as a worldwide success case.

🤝 MamayLM is developed in collaboration between researchers at INSAIT and ETH Zürich and is trained entirely via donations to INSAIT for AI compute resources.

📥 MamayLM is now freely available to download on INSAIT’s HuggingFace in both full and quantized versions. We also publicly release all Ukrainian benchmarks we evaluated on.

📝 Further, we release blog posts in both English and Ukrainian, sharing our approach to creating MamayLM, hoping to drive further improvements by the community.

🌎 The release of LLMs for various languages is part of INSAIT’s mission to ensure countries can achieve AI autonomy in a cost-efficient, controlled, safe and predictable manner.

MamayLM model and benchmarks: INSAIT-Institute
Blog (EN): https://huggingface.co/blog/INSAIT-Institute/mamaylm
Blog (UKR): https://huggingface.co/blog/INSAIT-Institute/mamaylm-ukr

There is nothing to be proud of: you have based it on a proprietary model, preventing people from using it as they wish and totally disregarding free software principles. Why don't you follow the good example of Microsoft, IBM, Mistral, Allen AI, Qwen, or DeepSeek, which distribute free software models?

Gemma License (danger) is not Free Software and is not Open Source
https://gnu.support/gnu-emacs/emacs-lisp/Gemma-License-danger-is-not-Free-Software-and-is-not-Open-Source.html

The Gemma Terms of Use and Prohibited Use Policy govern the use, modification, and distribution of Google's Gemma machine learning model and its derivatives. While Gemma is available for public use, it does not conform to Free Software or Open Source principles as defined by the Free Software Foundation (FSF) or Open Source Initiative (OSI). The terms impose significant restrictions, including prohibited use cases (e.g., illegal, harmful, or malicious activities), requirements to enforce Google's use restrictions on downstream users, and limitations on redistribution and derived works. Additionally, the terms do not guarantee access to source code or the freedom to use the software for any purpose, and they include broad disclaimers of warranty and liability. As a result, Gemma is a proprietary model with limited permissions, rather than a truly free or open-source software offering.

What is Free Software? - GNU Project - Free Software Foundation
https://www.gnu.org/philosophy/free-sw.html