Zack Li's picture

Zack Li PRO

zackli4ai

·

AI & ML interests

LLM, on-device AI

Recent Activity

updated a model 2 days ago

zackli4ai/llama3-1B-Instruct-GPTQ-Int4-GGUF

updated a model 2 days ago

nexa-collaboration/gptqmodel-1024-c4-Llama-3.2-1B-Instruct-4bit

View all activity

Organizations

zackli4ai's activity

upvoted a paper 6 days ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published 10 days ago • 41

upvoted a collection 3 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224

upvoted a paper 4 months ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28 • 42

upvoted a paper 6 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26 • 47

upvoted a paper 8 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

upvoted an article 8 months ago

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 36

upvoted a paper 8 months ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 116

upvoted a paper 9 months ago

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 56

upvoted a paper 10 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126