holy moly guacamole
Mikus PRO
aldigobbler
AI & ML interests
tiny fast models
Recent Activity
updated
a model
5 days ago
aldigobbler/smollm2-35m-pruned
published
a model
5 days ago
aldigobbler/smollm2-35m-pruned
Organizations
aldigobbler's activity

posted
an
update
7 days ago
Post
238
no ai slop posted here today i just feel like posting what i did for today
wrote a little framework for turning multiple dense models (llama based) into Sparse MoEs.. i found it fun, spent the whole day and a half on it.
code @ https://gist.github.com/cappuch/6a454ec8d2d349a27f9fd84f6ac90554
wrote a little framework for turning multiple dense models (llama based) into Sparse MoEs.. i found it fun, spent the whole day and a half on it.
code @ https://gist.github.com/cappuch/6a454ec8d2d349a27f9fd84f6ac90554
Update app.py
2
#2 opened 12 months ago
by
aldigobbler
