Hugging Face
Hafez Mousavi
hafezmg48
0 followers · 1 following
AI & ML interests
LLMs, Transformers
Organizations
None yet
hafezmg48's activity
New activity in Qwen/Qwen1.5-72B-Chat (over 1 year ago)
Why 72B model has different vocab size comparing with other models?
7
#1 opened over 1 year ago by Mikasaka
New activity in Qwen/Qwen-1_8B (over 1 year ago)
Intermediate_size is doubled in config.json
1
#3 opened over 1 year ago by hafezmg48