how can i run this on my Galaxy S23 Ultra?
#31
by
Losanti123
- opened
It would be interesting to see models like this "Any-to-Any" running on Edge devices.
more likely never or waita 5-10 years
You can try with the Chatterui app on github. It uses CPU inference so your speed may not be great. Use q4_0 quants with ARM devices.
Theres also another app on the google playstore called layla that has experimental MLC inference, but its a little hit or miss.
You can try with the Chatterui app on github. It uses CPU inference so your speed may not be great. Use q4_0 quants with ARM devices.
Theres also another app on the google playstore called layla that has experimental MLC inference, but its a little hit or miss.
Where do you see quants?