# Model Card for GAIA (Gemma-3-Gaia-PT-BR-4b-it)

**GAIA** is an open, state-of-the-art language model for Brazilian Portuguese. It was developed by continuously pre-training the `google/gemma-3-4b-pt` model on an extensive, high-quality corpus of Portuguese data.

2. **Instruction-Following Capability Restoration:** To enable the model to follow instructions without traditional supervised fine-tuning (SFT), a weight merging operation was applied. This technique, described in the paper *“Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs”*, allows the model to integrate the knowledge acquired during continuous pre-training with the ability to interact in a chat format and follow instructions.
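
Although the merge script itself is not included here, the sketch below illustrates the general shape of such an operation as a simple linear interpolation between the continually pre-trained checkpoint and the original instruction-tuned model. The local checkpoint path, the `ALPHA` ratio, and the use of plain interpolation are illustrative assumptions; the actual procedure follows the merge methodology of the paper cited above.

```python
# Hedged sketch of a weight merge between a continually pre-trained checkpoint
# and the instruction-tuned reference model. The checkpoint path and ALPHA are
# placeholders, and plain linear interpolation is an assumption, not the exact
# recipe from the cited paper.
import torch
from transformers import AutoModelForCausalLM

IT_MODEL = "google/gemma-3-4b-it"                    # instruction-tuned reference
CONTINUED_PT = "path/to/continued-pretrained-gemma"  # hypothetical local checkpoint
ALPHA = 0.5                                          # interpolation weight (assumption)

# Depending on your transformers version, Gemma 3 checkpoints may need a
# different Auto class; AutoModelForCausalLM is used here for brevity.
it_model = AutoModelForCausalLM.from_pretrained(IT_MODEL, torch_dtype=torch.bfloat16)
pt_model = AutoModelForCausalLM.from_pretrained(CONTINUED_PT, torch_dtype=torch.bfloat16)

it_state = it_model.state_dict()
merged_state = {}
for name, pt_param in pt_model.state_dict().items():
    if name in it_state and it_state[name].shape == pt_param.shape:
        # Interpolate tensors shared by both checkpoints.
        merged_state[name] = ALPHA * pt_param + (1.0 - ALPHA) * it_state[name]
    else:
        # Keep parameters that exist only in the continually pre-trained model.
        merged_state[name] = pt_param

pt_model.load_state_dict(merged_state)
pt_model.save_pretrained("gemma-3-gaia-merged")
```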

- **Developed by:** The Brazilian Association of AI (ABRIA), the Center of Excellence in Artificial Intelligence (CEIA-UFG), Nama, Amadeus AI, and Google DeepMind.
- **Model:** GAIA
- **Model type:** Causal decoder-only Transformer-based language model.
- **Language(s):** Brazilian Portuguese (pt-BR)
- **License:** Gemma
- **Based on:** `google/gemma-3-4b-pt`

### Team

This project was made possible by the contributions of the following individuals:

- Dr. Celso Gonçalves Camilo-Junior
- Dr. Sávio Salvarino Teles de Oliveira
- Me. Lucas Araujo Pereira
- Marcellus Amadeus
- Daniel Fazzioni
- Artur Matos Andrade Novais
- Salatiel Abraão Avelar Jordão

### Model Sources

- **Repository:** [CEIA-UFG/Gemma-3-Gaia-PT-BR-4b-it](https://huggingface.co/CEIA-UFG/Gemma-3-Gaia-PT-BR-4b-it)
- **Paper (Merge Methodology):** [Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs](https://arxiv.org/pdf/2410.10739)

## Uses
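
For a quick start, here is a minimal sketch of loading the model for Portuguese chat with the Hugging Face `transformers` library. The prompt and generation settings are illustrative assumptions, and depending on your `transformers` version, Gemma 3 checkpoints may require a different pipeline task or model class.

```python
# Minimal, illustrative chat example; adjust max_new_tokens and the prompt
# for your use case.
import torch
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="CEIA-UFG/Gemma-3-Gaia-PT-BR-4b-it",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    # "Explain briefly what ENEM is."
    {"role": "user", "content": "Explique em poucas frases o que é o ENEM."},
]

output = generator(messages, max_new_tokens=200)
print(output[0]["generated_text"][-1]["content"])
```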

| Benchmark        | `google/gemma-3-4b-it` (Baseline) | GAIA (Our Model) |
|------------------|-----------------------------------|------------------|
| BlueX            | **0.6630**                        | 0.6575           |
| ENEM 2024        | 0.6556                            | **0.7000**       |
| ENEM (General)   | 0.7416                            | **0.7486**       |
| OAB (Bar Exam)   | **0.4502**                        | 0.4416           |

#### Summary

If you use this model in your research or application, please cite our work.

```bibtex
@misc{gaia-gemma-3-4b-2025,
  title={GAIA: An Open Language Model for Brazilian Portuguese},
  author={Camilo-Junior, C. G. and Oliveira, S. S. T. and Pereira, L. A. and Amadeus, M. and Fazzioni, D. and Novais, A. M. A. and Jordão, S. A. A.},
  year={2025},
  publisher={Hugging Face},
  journal={Hugging Face repository},
  howpublished={\url{https://huggingface.co/CEIA-UFG/Gemma-3-Gaia-PT-BR-4b-it}}
}
```