BenchHub-Cat-7b

BenchHub-Cat-7b is a category classification model based on Qwen2.5-7B, fine-tuned to assign natural language queries to structured category triplets: (subject, skill, target).

πŸ”§ Model Details

  • Base Model: Qwen2.5-7B-Instruct
  • Task: Structured multi-label classification (triple: subject, skill, target)
  • Prompting Style: Instruction-style with expected format output
  • Training Framework: Axolotl + DeepSpeed ZeRO-3

πŸ§ͺ Training Configuration

Hyperparameter Value
Sequence Length 8192
Learning Rate 2 Γ— 10⁻⁡
Batch Size (Effective) 256
Epochs 3
Scheduler Cosine Decay
Warmup Ratio 0.05
Optimizer Method from [19]
Trainer DeepSpeed ZeRO-3
Hardware 4Γ— A6000 48GB GPUs
Training Time ~5 hours per run

🧠 Intended Use

Input: Natural language question or instruction
Output: Triplet (subject, skill, target), such as:

{ "subject_type": "history",  
"task_type": "reasoning",  
"target_type": "korea"}

✨ Prompt Example

### Instruction:
Classify the following query into subject, skill, and target.

### Query:
How did Confucianism shape education in East Asia?

### Output:
{ "subject_type": "history",  
"task_type": "reasoning",  
"target_type": "korea"}

πŸ“œ License

Apache 2.0

Downloads last month
5
Safetensors
Model size
7.61B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for BenchHub/BenchHub-Cat-7b

Quantizations
2 models