
Generative AI
LLMs and SLMs verified as ready for Arm platforms, accompanied by Arm Learning Paths that guide you through development and deployment.
Text Generation • Updated • 6.07M • 3.88k
Note: Run a Large Language Model (LLM) chatbot with PyTorch using KleidiAI on Arm servers - https://learn.arm.com/learning-paths/servers-and-cloud-computing/pytorch-llama/
meta-llama/Llama-3.2-1B-Instruct
Text Generation • Updated • 2.36M • 892
Note: Build an Android chat app with Llama, KleidiAI, ExecuTorch, and XNNPACK - https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/build-llama3-chat-android-app-using-executorch-and-xnnpack/
cognitivecomputations/dolphin-2.9.4-llama3.1-8b
Updated • 5.33k • 95
Note: Deploy a Large Language Model (LLM) chatbot with llama.cpp using KleidiAI on Arm servers with this model - https://learn.arm.com/learning-paths/servers-and-cloud-computing/llama-cpu/
You can also build a RAG application using Zilliz Cloud on Arm servers with Dolphin 2.9.4-llama3.1-8b - https://learn.arm.com/learning-paths/servers-and-cloud-computing/milvus-rag/
chatpdflocal/llama3.1-8b-gguf
Updated • 328 • 26
Note: Deploy a RAG-based Chatbot with llama-cpp-python using KleidiAI on Google Axion processors - https://learn.arm.com/learning-paths/servers-and-cloud-computing/rag/
Aryanne/Orca-Mini-3B-gguf
Updated • 698 • 5
Note: Run a local LLM chatbot on a Raspberry Pi 5 - https://learn.arm.com/learning-paths/embedded-and-microcontrollers/llama-python-cpu/
microsoft/Phi-3-vision-128k-instruct
Text Generation • Updated • 25.9k • 958
Note: Build an Android chat application with ONNX Runtime API - https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/build-android-chat-app-using-onnxruntime/
Qwen/Qwen2-0.5B-Instruct
Text Generation • Updated • 216k • 186
Note: Run an LLM chatbot with rtp-llm on Arm-based servers - https://learn.arm.com/learning-paths/servers-and-cloud-computing/rtp-llm/
Qwen/Qwen2.5-0.5B-Instruct
Text Generation • Updated • 1.04M • 314
Note: Build and Run a Virtual Large Language Model (vLLM) on Arm Servers - https://learn.arm.com/learning-paths/servers-and-cloud-computing/vllm/
google/gemma-2-2b
Text Generation • Updated • 519k • 539
Note: LLM inference on Android with KleidiAI, MediaPipe, and XNNPACK - https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/kleidiai-on-android-with-mediapipe-and-xnnpack/
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity • Updated • 84.5M • 3.29k
Note: Build a RAG application using Zilliz Cloud on Arm servers with all-MiniLM - https://learn.arm.com/learning-paths/servers-and-cloud-computing/milvus-rag/