Steve Chen
stev236
AI & ML interests
Running local models on different projects.
Recent Activity
new activity
16 days ago
Qwen/Qwen3-8B:New 8B model much slower than old 7B model when running on vLLM.
new activity
17 days ago
Qwen/Qwen3-4B:Why are the new 4B and 8B models slower than the previous 7B-1M model??
new activity
18 days ago
Qwen/Qwen3-14B:Long context: YaRN max_position_embeddings 32K or 40k?
Organizations
None yet
models
0
None public yet
datasets
0
None public yet