Data_dimension
Collection
Models trained using data with different filtering strategies (difficulty, quality filtering)
•
12 items
•
Updated
Base Model: Qwen/Qwen2.5-7B
Training Epoches: 3
Training Objective: SFT + RL
Training Data: