Steve Chen

stev236

AI & ML interests

Running local models on different projects.

Recent Activity

new activity 16 days ago

Qwen/Qwen3-8B:New 8B model much slower than old 7B model when running on vLLM.

new activity 17 days ago

Qwen/Qwen3-4B:Why are the new 4B and 8B models slower than the previous 7B-1M model??

new activity 18 days ago

Qwen/Qwen3-14B:Long context: YaRN max_position_embeddings 32K or 40k?

Organizations

None yet

models 0

None public yet

datasets 0

None public yet