Uploaded Model

Overview

This model is a Gemma 3 4B variant specifically fine-tuned using reasoning data from Claude 3.7 Sonnet. The goal was to integrate Claude's acclaimed reasoning capabilities within a powerful, open-source architecture like Gemma.

Technical Details

  • Developed by: reedmayhew
  • Base Model: google/gemma-3-4b-it
  • Finetuning Method: Supervised Fine-Tuning (SFT) using LoRA
  • Training Speed Enhancement: Trained 2x faster with Unsloth and Huggingface's TRL library

Training Data

The model was fine-tuned on a dataset derived from:

  • reedmayhew/claude-3.7-sonnet-reasoning

This enables the model to potentially demonstrate superior logical reasoning, complex problem-solving, and analytical thinking compared to the standard Gemma 3 model, while remaining accessible and open-source.

Usage Notes

While this model incorporates some of Claude's reasoning strengths, it remains a derivative built on Gemma architecture. Users should thoroughly evaluate its performance for specific tasks and applications.

This Gemma model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
760
GGUF
Model size
3.88B params
Architecture
gemma3
Hardware compatibility
Log In to view the estimation

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for reedmayhew/gemma3-4B-claude-3.7-sonnet-reasoning-distilled

Quantized
(10)
this model

Dataset used to train reedmayhew/gemma3-4B-claude-3.7-sonnet-reasoning-distilled