Uploaded Model
Overview
This model is a Gemma 3 12B variant specifically fine-tuned using reasoning data from Claude 3.7 Sonnet. The goal was to integrate Claude's acclaimed reasoning capabilities within a powerful, open-source architecture like Gemma.
Technical Details
- Developed by: reedmayhew
- Base Model: google/gemma-3-12b
- Finetuning Method: Supervised Fine-Tuning (SFT) using LoRA
- Training Speed Enhancement: Trained 2x faster with Unsloth and Huggingface's TRL library
Training Data
The model was fine-tuned on a dataset derived from:
- reedmayhew/claude-3.7-sonnet-reasoning
This enables the model to potentially demonstrate superior logical reasoning, complex problem-solving, and analytical thinking compared to the standard Gemma 3 model, while remaining accessible and open-source.
Usage Notes
While this model incorporates some of Claude's reasoning strengths, it remains a derivative built on Gemma architecture. Users should thoroughly evaluate its performance for specific tasks and applications.
This Gemma model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 2,044
Model tree for reedmayhew/claude-3.7-sonnet-reasoning-gemma3-12B
Base model
google/gemma-3-12b-pt