roshniramesh 's Collections fp8 llm
updated
nvidia/Llama-3.1-8B-Instruct-FP8
Text Generation
• 8B • Updated • 395k
• • 36
amd/Llama-3.1-8B-Instruct-FP8-KV
8B • Updated • 39.4k
• 6
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV
3B • Updated • 11.9k
• 3
amd/Meta-Llama-3-8B_fp8_quark
Text Generation
• 8B • Updated • 10
•
ibm-ai-platform/Bamba-9B-2T-fp8
Text Generation
• 10B • Updated • 1
• 2
ibm-ai-platform/Bamba-9B-fp8
Text Generation
• 10B • Updated • 249
• 2
ibm-ai-platform/Bamba-9B-1.8T-fp8
Text Generation
• 10B • Updated • 1
• 2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
• 8B • Updated • 3.22k
• • 24
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
• 8B • Updated • 29.7k
• • 10
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
• 8B • Updated • 3.81k
• • 2
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
• 2B • Updated • 55.2k
•
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
• 7B • Updated • 1.52k
• 3
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
• 7B • Updated • 72
RedHatAI/gemma-2-9b-it-FP8
Text Generation
• 9B • Updated • 1.14k
• • 5
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
• 16B • Updated • 153k
• 13
FriendliAI/Llama-2-13b-chat-hf-fp8
Text Generation
• 13B • Updated • 9
• 8
FriendliAI/Meta-Llama-3-8B-Instruct-fp8
Text Generation
• 8B • Updated • 13
• 2
FriendliAI/Meta-Llama-3-8B-fp8
Text Generation
• 8B • Updated • 11
• 3
FriendliAI/Meta-Llama-3.1-8B-Instruct-fp8
Text Generation
• 8B • Updated • 4.65k
amd/Llama-3.2-3B-Instruct-FP8-KV
3B • Updated • 189
amd/Llama-3.2-1B-Instruct-FP8-KV
1B • Updated • 9.61k
• 1
3B • Updated • 15
1B • Updated • 114
amd/Meta-Llama-3.1-8B-Instruct-fp8-quark-vllm