kas

shing3232

AI & ML interests

None yet

Recent Activity

Organizations

None yet

shing3232's activity

updated a collection about 1 month ago
upvoted an article about 2 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

By medmekk and 5 others
246
New activity in SakuraLLM/Sakura-14B-Qwen2beta-v0.9.2-GGUF about 1 year ago

CUDA运行不了BF16模型?

2
#1 opened about 1 year ago by
NeuronAstate
New activity in Qwen/Qwen1.5-7B-Chat-GGUF about 1 year ago
New activity in Qwen/CodeQwen1.5-7B-Chat about 1 year ago