YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

checkpoint-4400

Checkpoint Information

Checkpoint Name: checkpoint-4400

Repository Name: aylinakkus/qwen_2_5_math_epoch_4400

Checkpoint Path: /home/mert/aylin/capability-erosion-sft/LLaMA-Factory/saves/qwen2.5-1.5b/full/sft/checkpoint-4400

Model Configuration

This checkpoint was extracted from a Qwen 2.5 1.5B model training run.

  • Base Model: Qwen 2.5 1.5B
  • Training Framework: LLaMA-Factory
  • Task: Math fine-tuning

Description

This repository contains the model state dict extracted from the training checkpoint.

Files

  • model_state_dict.pt: PyTorch state dictionary containing the model weights
  • README.md: This file

Usage

import torch

# Load the model state dict
state_dict = torch.load("model_state_dict.pt", map_location='cpu')

# Use with your model architecture
# model.load_state_dict(state_dict)

Notes

  • This checkpoint was automatically uploaded using the upload_checkpoints.py script
  • Checkpoint extracted from: checkpoint-4400
  • Original path: /home/mert/aylin/capability-erosion-sft/LLaMA-Factory/saves/qwen2.5-1.5b/full/sft/checkpoint-4400
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support