Stephen (smpanaro)
AI & ML interests: Apple Neural Engine, Quantization
Recent Activity
- Updated a model 30 days ago: smpanaro/Qwen2.5-0.5B-4bit-PerTensor
- Published a model about 1 month ago: smpanaro/Qwen2.5-0.5B-4bit-PerTensor
- New activity 3 months ago: smpanaro/Llama-3.2-1B-Instruct-CoreML (Context length)
quant
- SqueezeLLM: Dense-and-Sparse Quantization (Paper • arXiv:2306.07629)
- Norm Tweaking: High-performance Low-bit Quantization of Large Language Models (Paper • arXiv:2309.02784)
- Extreme Compression of Large Language Models via Additive Quantization (Paper • arXiv:2401.06118)
- BiLLM: Pushing the Limit of Post-Training Quantization for LLMs (Paper • arXiv:2402.04291)
gpt-2 GPTQ: gpt-2 model family quantized using AutoGPTQ.
prune
text to speech
interesting
- LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning (Paper • arXiv:2401.01325)
- WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation (Paper • arXiv:2312.14187)
- Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data (Paper • arXiv:2401.10891)
- MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies (Paper • arXiv:2404.06395)
Pythia GPTQ: Pythia model family quantized using AutoGPTQ.
Apple Neural Engine LLMs: CoreML LLMs optimized for Apple Neural Engine.