
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
•
528k
•
•
4.32k
Pruned experts from Mixtral-8x7B-Instruct-v0.1 with respect to the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs"