MXFP4/NVFP4 models
AI & ML interests
Computer Vision, LLMs, Multimodal Models, Model Compression
Recent Activity
View all activity
Organization Card
Multimodal AI on a global scale. Advocates for Open Source and Open Intelligence. Currently investigating how to make Large Machine Learning Models smaller and democratize them for GPU-poor environments. Visit https://mobiusml.github.io/blog/ to see some of our recent work.
Quantized models in AO/GemLite format
-
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 11 • 1 -
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 1 • 1 -
mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 21 • 2 -
mobiuslabsgmbh/Qwen3-32B_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 3 • 1
MXFP4/NVFP4 models
Quantized models in AO/GemLite format
-
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 11 • 1 -
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 1 • 1 -
mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 21 • 2 -
mobiuslabsgmbh/Qwen3-32B_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation • Updated • 3 • 1
models
52

mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct-leftpad
Updated

mobiuslabsgmbh/Llama-3.1-8B-Instruct_mxfp4_weights_calib_demo
Text Generation
•
Updated
•
15
•
1

mobiuslabsgmbh/Llama-3.1-8B-Instruct_nvfp4_weights_calib_demo
Text Generation
•
Updated
•
16
•
1

mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct_gemlite-ao_a8w8
Image-to-Text
•
Updated
•
13
•
2

mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Image-to-Text
•
Updated
•
7
•
1

mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_16bit
Text Generation
•
Updated
•
6
•
1

mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_16bit
Text Generation
•
Updated
•
1
•
1

mobiuslabsgmbh/Qwen3-32B_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation
•
Updated
•
3
•
1

mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation
•
Updated
•
21
•
2

mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit
Text Generation
•
Updated
•
1
•
1