Uploaded Model

Overview

This model is a Gemma 3 4B variant fine-tuned on reasoning data from Claude 3.7 Sonnet. The goal was to bring Claude's reasoning capabilities to a capable open-source architecture such as Gemma.

Technical Details

Developed by: reedmayhew
Base Model: google/gemma-3-4b-it
Finetuning Method: Supervised Fine-Tuning (SFT) using LoRA
Training Speed Enhancement: Trained 2x faster with Unsloth and Hugging Face's TRL library
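
To illustrate why LoRA keeps fine-tuning cheap, here is a minimal sketch of the parameter arithmetic. LoRA freezes a weight matrix of shape d_out x d_in and trains two low-rank factors, A (rank x d_in) and B (d_out x rank), in its place. The layer shape and rank below are assumptions chosen for illustration only; the actual LoRA settings used for this model are not stated in this card.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in the two LoRA factors: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


def full_params(d_in: int, d_out: int) -> int:
    """Parameters in the frozen dense weight matrix (d_out x d_in)."""
    return d_in * d_out


# Hypothetical layer shape and rank, not taken from this model's training config.
d_in, d_out, rank = 2560, 2560, 16

lora = lora_trainable_params(d_in, d_out, rank)
full = full_params(d_in, d_out)
print(f"LoRA trainable: {lora:,} | full matrix: {full:,} | ratio: {lora / full:.2%}")
```

Under these assumed values, the adapter trains a small fraction of the parameters that full fine-tuning of the same layer would update.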

Training Data

The model was fine-tuned on a dataset derived from:

reedmayhew/claude-3.7-sonnet-reasoning

This is intended to give the model stronger logical reasoning, complex problem-solving, and analytical thinking than the standard Gemma 3 model, while remaining accessible and open-source.

Usage Notes

While this model incorporates some of Claude's reasoning strengths, it remains a derivative built on the Gemma architecture. Users should thoroughly evaluate its performance for their specific tasks and applications.
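
One simple way to run such an evaluation is an exact-match check over a small set of task-specific prompts. The sketch below is an assumption-laden starting point, not a method from this card: `exact_match_accuracy`, `fake_generate`, and the example prompts are all hypothetical names and data. In practice you would replace `fake_generate` with a function that calls the model, and you would likely need to extract the final answer from the model's reasoning output before comparing.

```python
from typing import Callable, Iterable, Tuple


def exact_match_accuracy(
    generate: Callable[[str], str],
    examples: Iterable[Tuple[str, str]],
) -> float:
    """Fraction of prompts whose output equals the expected answer,
    ignoring case and surrounding whitespace."""
    pairs = list(examples)
    if not pairs:
        raise ValueError("need at least one example")
    hits = sum(
        generate(prompt).strip().lower() == expected.strip().lower()
        for prompt, expected in pairs
    )
    return hits / len(pairs)


def fake_generate(prompt: str) -> str:
    """Hypothetical stand-in for a real model call, so the sketch runs offline."""
    canned = {"What is 2 + 2?": "4", "Capital of France?": "Paris"}
    return canned.get(prompt, "")


# Hypothetical evaluation set: (prompt, expected answer) pairs.
examples = [
    ("What is 2 + 2?", "4"),
    ("Capital of France?", "paris"),
    ("Largest planet?", "Jupiter"),
]

accuracy = exact_match_accuracy(fake_generate, examples)
print(f"exact-match accuracy: {accuracy:.2f}")
```

Exact match is a deliberately crude metric; for open-ended reasoning tasks, a task-specific scorer is usually a better fit.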


reedmayhew/gemma3-4B-claude-3.7-sonnet-reasoning-distilled
