--- license: mit datasets: - HuggingFaceFW/fineweb - nvidia/ChatQA2-Long-SFT-data language: - en base_model: - microsoft/phi-4 --- Pretraining checkpoints for HMT training for Phi-4 model