fp8 llm - a roshniramesh Collection

roshniramesh 's Collections

fp8 llm

updated Jan 16, 2025

nvidia/Llama-3.1-8B-Instruct-FP8

Text Generation • 8B • Updated Aug 22, 2025 • 395k • • 36
amd/Llama-3.1-8B-Instruct-FP8-KV

8B • Updated Dec 19, 2024 • 39.4k • 6
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV

3B • Updated Dec 19, 2024 • 11.9k • 3
amd/Meta-Llama-3-8B_fp8_quark

Text Generation • 8B • Updated Jul 12, 2024 • 10 •
ibm-ai-platform/Bamba-9B-2T-fp8

Text Generation • 10B • Updated Dec 19, 2024 • 1 • 2
ibm-ai-platform/Bamba-9B-fp8

Text Generation • 10B • Updated Dec 19, 2024 • 249 • 2
ibm-ai-platform/Bamba-9B-1.8T-fp8

Text Generation • 10B • Updated Dec 19, 2024 • 1 • 2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8

Text Generation • 8B • Updated Jul 18, 2024 • 3.22k • • 24
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV

Text Generation • 8B • Updated Sep 15, 2025 • 29.7k • • 10
RedHatAI/Qwen2-7B-Instruct-FP8

Text Generation • 8B • Updated Jul 18, 2024 • 3.81k • • 2
RedHatAI/Qwen2-1.5B-Instruct-FP8

Text Generation • 2B • Updated Jul 18, 2024 • 55.2k •
RedHatAI/Mistral-7B-Instruct-v0.3-FP8

Text Generation • 7B • Updated Jul 18, 2024 • 1.52k • 3
RedHatAI/Llama-2-7b-chat-hf-FP8

Text Generation • 7B • Updated Jul 18, 2024 • 72
RedHatAI/gemma-2-9b-it-FP8

Text Generation • 9B • Updated Sep 22, 2025 • 1.14k • • 5
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8

Text Generation • 16B • Updated Jul 18, 2024 • 153k • 13
FriendliAI/Llama-2-13b-chat-hf-fp8

Text Generation • 13B • Updated Apr 19, 2024 • 9 • 8
FriendliAI/Meta-Llama-3-8B-Instruct-fp8

Text Generation • 8B • Updated Nov 3, 2024 • 13 • 2
FriendliAI/Meta-Llama-3-8B-fp8

Text Generation • 8B • Updated Aug 1, 2024 • 11 • 3
FriendliAI/Meta-Llama-3.1-8B-Instruct-fp8

Text Generation • 8B • Updated Nov 3, 2024 • 4.65k
amd/Llama-3.2-3B-Instruct-FP8-KV

3B • Updated Dec 19, 2024 • 189
amd/Llama-3.2-1B-Instruct-FP8-KV

1B • Updated Dec 19, 2024 • 9.61k • 1
amd/Llama-3.2-3B-FP8-KV

3B • Updated Dec 19, 2024 • 15
amd/Llama-3.2-1B-FP8-KV

1B • Updated Dec 19, 2024 • 114
amd/Meta-Llama-3.1-8B-Instruct-fp8-quark-vllm

Updated Aug 14, 2024 • 1