GPT OS3 Alpha 8B A3B
- Developed by: qingy2024
- Base model: AmanPriyanshu/gpt-oss-8.4b-specialized-all-pruned-moe-only-11-experts
GPT OSS Small (OS3) is a project to create usable and intelligent language models based on pruned gpt-oss
variants by @AmanPriyanshu. These are post trained with LoRA on the qingy2024/GPT-OS3-Dataset-v1 dataset to revert some of the "brain damage" due to the expert pruning.
(This is the Alpha release so please don't use it unless you know what you're doing)
- Downloads last month
- 49