Uploaded Model

Overview

This model is a Gemma 3 12B variant specifically fine-tuned using reasoning data from Claude 3.7 Sonnet. The goal was to integrate Claude's acclaimed reasoning capabilities within a powerful, open-source architecture like Gemma.

Technical Details

  • Developed by: reedmayhew
  • Base Model: google/gemma-3-12b
  • Finetuning Method: Supervised Fine-Tuning (SFT) using LoRA
  • Training Speed Enhancement: Trained 2x faster with Unsloth and Huggingface's TRL library

Training Data

The model was fine-tuned on a dataset derived from:

  • reedmayhew/claude-3.7-sonnet-reasoning

This enables the model to potentially demonstrate superior logical reasoning, complex problem-solving, and analytical thinking compared to the standard Gemma 3 model, while remaining accessible and open-source.

Usage Notes

While this model incorporates some of Claude's reasoning strengths, it remains a derivative built on Gemma architecture. Users should thoroughly evaluate its performance for specific tasks and applications.

This Gemma model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
2,044
GGUF
Model size
11.8B params
Architecture
gemma3

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for reedmayhew/claude-3.7-sonnet-reasoning-gemma3-12B

Quantized
(7)
this model

Dataset used to train reedmayhew/claude-3.7-sonnet-reasoning-gemma3-12B