mobiuslabsgmbh/Meta-Llama-3-8B-Instruct_4bitgs64_hqq_hf Text Generation • 5B • Updated May 23 • 500 • 2
mobiuslabsgmbh/DeepSeek-R1-ReDistill-Llama3-8B-v1.1 Text Generation • 8B • Updated Jan 30 • 11 • 11
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30 • 14
mobiuslabsgmbh/DeepSeek-R1-ReDistill-Qwen-7B-v1.1 Text Generation • 8B • Updated Jan 30 • 10 • 15
mobiuslabsgmbh/DeepSeek-R1-ReDistill-Qwen-1.5B-v1.0 Text Generation • 2B • Updated Jan 29 • 26 • 44
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 3.13M • 2.51k
Article Unlocking Longer Generation with Key-Value Cache Quantization By RaushanTurganbay • May 16, 2024 • 49
mobiuslabsgmbh/Hermes-3-Llama-3.1-70B_4bitgs64_hqq Text Generation • Updated Aug 16, 2024 • 3 • 4
Post 2102 Releasing HQQ Llama-3.1-70b 4-bit quantized version! Check it out at mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq. Achieves 99% of the base model performance across various benchmarks! Details in the model card.