Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
1
Tu Jianhong
tuhahaha
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
new
activity
about 1 month ago
Qwen/Qwen3-235B-A22B:
Qwen3 not Using Tools in Complex Prompts Unlike QwQ-32B
new
activity
4 months ago
Qwen/Qwen2.5-14B-Instruct-1M:
About function calling failures using the huggingface transformer
upvoted
a
paper
5 months ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
View all activity
Organizations
Papers
1
arxiv:
2309.16609
models
0
None public yet
datasets
0
None public yet