CoreML LLMs optimized for Apple Neural Engine.
Stephen
smpanaro
AI & ML interests
Apple Neural Engine, Quantization
Recent Activity
updated
a model
30 days ago
smpanaro/Qwen2.5-0.5B-4bit-PerTensor
published
a model
about 1 month ago
smpanaro/Qwen2.5-0.5B-4bit-PerTensor
new activity
3 months ago
smpanaro/Llama-3.2-1B-Instruct-CoreML:Context length