- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (Paper • 2402.17764 • Published • 618)
- Beyond Language Models: Byte Models are Digital World Simulators (Paper • 2402.19155 • Published • 54)
- BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs (Paper • 2504.18415 • Published • 43)
- Kijai/PrecompiledWheels (Updated • 15)
shing3232
AI & ML interests: None yet
Recent Activity
- updated a collection 22 days ago: sakura
- new activity 22 days ago on Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4: "Why is Int4 slower than unquantized float32 and float16?"
- upvoted a paper about 1 month ago: TransMLA: Multi-head Latent Attention Is All You Need
Organizations: None yet
Collections: 1 • Spaces: 1 • Models: 9
- shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX (Updated • 50 • 1)
- shing3232/sakura-14b-qwen2beta-v0.9.2-IMX (Updated • 7 • 3)
- shing3232/Sakura13B-LNovel-v0.9-qwen1.5-GGUF-IMX (Updated • 67 • 7)
- shing3232/Sakura1.8B-LNovel-v0.9pre2-qwen1_GGUF-IMX (Updated • 43)
- shing3232/Sakura13B-LNovel-v0.9b-GGUF-IMX-2.33_re (Updated • 5)
- shing3232/Sakura1.8B-LNovel-v0.9-qwen1.5_GGUF-IMX_re (Updated • 4 • 1)
- shing3232/Sakura13B-LNovel-v0.9b-GGUF-IMX-2.33 (Updated • 47 • 3)
- shing3232/Sakura-LNovel-v0.9b-GGUF-IMX-JPZH (Updated • 241)
- shing3232/Sakura-13B-LNovel-v0.9b-GGUF-IMX-wikitest (Updated • 6)