Edit Models filters

Inference Providers

HF Inference API

Misc

torchao-my-repo

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

38

Full-text search

Active filters: torchao-my-repo

CJHauser/vibrance

Updated May 14 • 2

appy1234/Llama3.1-8B-Int8DynamicActivationInt8WeightQuantized

Text Generation • Updated Jun 4 • 17

appy1234/Llama-3.2-3B-Instruct-Int8DynamicActivationInt8WeightQuantized

Text Generation • Updated Jun 4 • 18

mikaylagawarecki/foo_bar

Feature Extraction • Updated Jun 11 • 2

RoadToNowhere/Qwen3-0.6B-ao-float8wo

Text Generation • Updated Jun 13 • 13

Vykyan/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-ao-int4wo-gs128

Updated Jun 20 • 2

mibrdeniz/turkish-gpt2-medium-350m-instruct-q8

Text Generation • Updated about 1 month ago • 1

mibrdeniz/turkish-gpt2-medium-350m-instruct-q4

Text Generation • Updated about 1 month ago • 1