no ai slop posted here today, i just feel like posting what i did today
wrote a little framework for turning multiple dense models (llama-based) into sparse MoEs. i found it fun, spent a whole day and a half on it.
code @ https://gist.github.com/cappuch/6a454ec8d2d349a27f9fd84f6ac90554
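
for anyone wondering what "dense models into a sparse MoE" roughly means, here is a minimal sketch of the general idea. it is not the code from the gist, just an illustration under assumptions: each dense model contributes its llama-style gated MLP as one expert, and a small linear router picks the top-k experts per token. the names `DenseMLP` and `SparseMoE` are hypothetical, the weights are random toy values, and the router here is untrained.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def silu(v):
    """SiLU activation, as used in Llama-style MLPs."""
    return v / (1.0 + math.exp(-v))


def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


class DenseMLP:
    """Hypothetical stand-in for one dense model's gated MLP block."""

    def __init__(self, dim, hidden, rng):
        self.gate = rand_matrix(hidden, dim, rng)
        self.up = rand_matrix(hidden, dim, rng)
        self.down = rand_matrix(dim, hidden, rng)

    def __call__(self, x):
        # down(silu(gate(x)) * up(x))
        g = matvec(self.gate, x)
        u = matvec(self.up, x)
        h = [silu(a) * b for a, b in zip(g, u)]
        return matvec(self.down, h)


class SparseMoE:
    """Hypothetical sparse MoE layer: route each token to its top-k experts."""

    def __init__(self, experts, dim, top_k, rng):
        self.experts = experts
        self.top_k = top_k
        # One router row per expert (assumption: untrained, random init).
        self.router = rand_matrix(len(experts), dim, rng)

    def route(self, x):
        """Return [(expert_index, weight)] for the top-k experts."""
        logits = matvec(self.router, x)
        top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
        top = top[: self.top_k]
        # Softmax over the selected logits only, so the weights sum to 1.
        m = max(logits[i] for i in top)
        exps = [math.exp(logits[i] - m) for i in top]
        z = sum(exps)
        return [(i, e / z) for i, e in zip(top, exps)]

    def __call__(self, x):
        out = [0.0] * len(x)
        # Only the selected experts run; the rest are skipped entirely.
        for i, w in self.route(x):
            y = self.experts[i](x)
            out = [o + w * yi for o, yi in zip(out, y)]
        return out


rng = random.Random(0)
experts = [DenseMLP(dim=4, hidden=8, rng=rng) for _ in range(3)]
moe = SparseMoE(experts, dim=4, top_k=2, rng=rng)

token = [0.5, -1.0, 0.25, 2.0]
routes = moe.route(token)
output = moe(token)
```

in a real merge the experts would be the MLP weights lifted from each source model, and the router would need training (or some smarter init) before the routing means anything.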