Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
aldigobbler 
posted an update 7 days ago
Post
238
no ai slop posted here today i just feel like posting what i did for today

wrote a little framework for turning multiple dense models (llama based) into Sparse MoEs.. i found it fun, spent the whole day and a half on it.

code @ https://gist.github.com/cappuch/6a454ec8d2d349a27f9fd84f6ac90554
In this post