Models: 189

Active filters: torchao

jerryzh168/llama3-int4wo-128

Updated Sep 13, 2024 • 6

medmekk/Meta-Llama-3-8B-quantized-int8_weight_only

Updated Oct 8, 2024 • 18

medmekk/Meta-Llama-3-8B-quantized-int8_dynamic_activation_int8_weight

Updated Oct 8, 2024 • 8

medmekk/Meta-Llama-3-8B-quantized-int4_weight_only

Updated Oct 8, 2024 • 48

medmekk/Meta-Llama-3-8B-quantized-int8_weight_only-2

Updated Oct 8, 2024 • 9

medmekk/Meta-Llama-3-8B-quantized-int4_weight_only-2

Updated Oct 8, 2024 • 10

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-64

Updated Oct 8, 2024 • 25

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-32

Updated Oct 8, 2024 • 19

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_256

Updated Oct 8, 2024 • 8

medmekk/Meta-Llama-3-8B-torchao-int8_weight_only

Updated Oct 9, 2024 • 5

medmekk/Meta-Llama-3-8B-torchao-int8_dynamic_activation_int8_weight

Updated Oct 8, 2024 • 6

medmekk/gpt2-torchao-int8_weight_only

Updated Oct 8, 2024 • 13

medmekk/Llama-3.1-70B-torchao-int8_weight_only

Updated Oct 8, 2024 • 7

medmekk/new_model

Updated Oct 17, 2024 • 3

medmekk/qsdf

Updated Oct 18, 2024 • 3

medmekk/new_gpt2

Updated Oct 18, 2024 • 4

medmekk/an_other_torchao

Updated Oct 18, 2024 • 13

medmekk/an_other_torchao_dynamic

Updated Oct 18, 2024 • 2

marcsun13/Meta-Llama-3-8B-torchao-int8_weight_only

Updated Oct 18, 2024 • 8

medmekk/new_tesing_model

Updated Oct 22, 2024 • 5

medmekk/testing_int4

Updated Oct 22, 2024 • 2

medmekk/quantized_int8_2

Updated Oct 22, 2024 • 7

medmekk/quantized_int4

Updated Oct 22, 2024 • 6

medmekk/quantized_70B

Updated Oct 22, 2024 • 12

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128

Updated Oct 22, 2024 • 9

medmekk/custom_name

Updated Oct 22, 2024 • 5

medmekk/custom_name_1

Updated Oct 22, 2024 • 3

medmekk/deepseek-coder-1.3b-base-torchao-int8_weight_only

Updated Oct 22, 2024 • 10

medmekk/testing_repo_name

Updated Oct 22, 2024 • 18

gurro/llama-3.1-8B-torchao-int4wo-128

Text Generation • Updated Dec 2, 2024 • 11