2 2 7

Kang

kibitzing

kibitzing

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

naver-hyperclovax/HyperCLOVAX-SEED-Think-14B:vllm을 이용해서 openai api로 서빙한 후 응답이 무한 루프되는 문제

liked a model 5 days ago

Qwen/Qwen3-235B-A22B-Instruct-2507

upvoted a collection 5 days ago

Qwen3

View all activity

Organizations

New activity in naver-hyperclovax/HyperCLOVAX-SEED-Think-14B 3 days ago

vllm을 이용해서 openai api로 서빙한 후 응답이 무한 루프되는 문제

#3 opened 4 days ago by

lovedownload

liked a model 5 days ago

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated 6 days ago • 14.3k • • 517

upvoted a collection 5 days ago

Qwen3

Collection

76 items • Updated 2 days ago • 925

liked a model 5 days ago

naver-hyperclovax/HyperCLOVAX-SEED-Think-14B

Text Generation • 15B • Updated 5 days ago • 30.3k • 65

upvoted an article 18 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 191

New activity in naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B about 2 months ago

Model keeps repeating the prompt – how can I avoid this?

#9 opened about 2 months ago by

sunnyanna

liked a model 3 months ago

Qwen/Qwen3-4B

Text Generation • 4B • Updated 1 day ago • 1.05M • • 322

liked a model 4 months ago

Qwen/Qwen2.5-1.5B

Text Generation • 2B • Updated Oct 8, 2024 • 422k • • 110

liked 2 models 5 months ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated Jun 18 • 1.37M • • 4.09k

meta-llama/Llama-3.1-8B

Text Generation • 8B • Updated Oct 16, 2024 • 892k • • 1.71k

liked a Space 5 months ago

2.84k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

Kang

AI & ML interests

Recent Activity

Organizations

kibitzing's activity

vllm을 이용해서 openai api로 서빙한 후 응답이 무한 루프되는 문제

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Model keeps repeating the prompt – how can I avoid this?

The Ultra-Scale Playbook