aylinakkus
/

qwen_2_5_math_epoch_4400

Model card Files Files and versions Community

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

checkpoint-4400

Checkpoint Information

Checkpoint Name: checkpoint-4400

Repository Name: aylinakkus/qwen_2_5_math_epoch_4400

Checkpoint Path: /home/mert/aylin/capability-erosion-sft/LLaMA-Factory/saves/qwen2.5-1.5b/full/sft/checkpoint-4400

Model Configuration

This checkpoint was extracted from a Qwen 2.5 1.5B model training run.

Base Model: Qwen 2.5 1.5B
Training Framework: LLaMA-Factory
Task: Math fine-tuning

Description

This repository contains the model state dict extracted from the training checkpoint.

Files

model_state_dict.pt: PyTorch state dictionary containing the model weights
README.md: This file

Usage

import torch

# Load the model state dict
state_dict = torch.load("model_state_dict.pt", map_location='cpu')

# Use with your model architecture
# model.load_state_dict(state_dict)

Notes

This checkpoint was automatically uploaded using the upload_checkpoints.py script
Checkpoint extracted from: checkpoint-4400
Original path: /home/mert/aylin/capability-erosion-sft/LLaMA-Factory/saves/qwen2.5-1.5b/full/sft/checkpoint-4400

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support