mobiuslabsgmbh/Llama-3.1-8B-Instruct_mxfp4_weights_calib_demo Text Generation • Updated 29 days ago • 15 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_nvfp4_weights_calib_demo Text Generation • Updated 29 days ago • 16 • 1
mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Image-to-Text • Updated Jun 4 • 7 • 1
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_16bit Text Generation • Updated Jun 4 • 6 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_16bit Text Generation • Updated Jun 4 • 1 • 1
mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 4 • 16 • 2
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 4 • 1 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 3 • 11 • 2
mobiuslabsgmbh/Meta-Llama-3-8B-Instruct_4bitgs64_hqq_hf Text Generation • 5B • Updated May 23 • 526 • 2
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1_4bitgs64_hqq_hf Text Generation • 25B • Updated Feb 10 • 10 • 1
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bitgs8-metaoffload-HQQ Text Generation • Updated Feb 5 • 12 • 20
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bit-metaoffload-HQQ Text Generation • Updated Feb 5 • 24 • 16
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-3bit-metaoffload-HQQ Text Generation • Updated Feb 5 • 4 • 13