So, what are you going to do with these findings? Will you need to get Apple involved? Will you submit pull requests to the torch or mlx folks? Any chance of further improvements with more processing time? Can the regressions be mitigated with some kind of switch between the standard kernels and your optimized ones?
Aaron Reitz
Adreitz
·
AI & ML interests
None yet
Recent Activity
commented on
an
article
26 days ago
Automated Discovery of High-Performance GPU Kernels with OpenEvolve
new activity
4 months ago
HiDream-ai/HiDream-I1-Full:No way on earth to get "an albino woman with white skin and dark hair"
new activity
4 months ago
HiDream-ai/HiDream-I1-Full:Is it possible to use locally on Mac?
Organizations
None yet