A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework.
Data Mining and Information Systems Lab
dmis-lab
AI & ML interests
None yet
Recent Activity
updated
a collection
about 12 hours ago
Outlier-Safe Pre-Training (OSP)
upvoted
a
paper
about 12 hours ago
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large
Language Models
updated
a model
1 day ago
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm-EmbProj
Organizations
None yet