Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7
1
6
Vadim Karpenko
jrell
Follow
0 followers
Ā·
6 following
AI & ML interests
None yet
Recent Activity
reacted
to
schuler
's
post
with š„
about 12 hours ago
š¢ New Research Alert: Making Language Models Smaller & Smarter! Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena. š Key Findings: ā¢ 77% parameter reduction. ā¢ Maintained model capabilities. ā¢ Improved generalization. Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT Code: https://github.com/joaopauloschuler/less-parameters-llm
upvoted
a
collection
about 1 month ago
Dolphin 3.0
new
activity
3 months ago
ArliAI/Qwen2.5-32B-ArliAI-RPMax-v1.3-GGUF:
32k context bug
View all activity
Organizations
None yet
jrell
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
3 months ago
PramaLLC/BEN
Image Segmentation
ā¢
Updated
17 days ago
ā¢
145
ā¢
84
liked
a model
5 months ago
mistralai/Mistral-Small-Instruct-2409
Updated
Oct 16, 2024
ā¢
54.2k
ā¢
380
liked
2 models
about 1 year ago
FPHam/Karen_TheEditor_V2_CREATIVE_Mistral_7B
Text Generation
ā¢
Updated
Nov 21, 2023
ā¢
142
ā¢
28
FPHam/Karen_TheEditor_V2_STRICT_Mistral_7B
Text Generation
ā¢
Updated
Apr 21, 2024
ā¢
1.51k
ā¢
16
liked
2 models
over 1 year ago
elinas/chronos-13b
Text Generation
ā¢
Updated
Jun 23, 2023
ā¢
12
ā¢
33
TheBloke/chronos-13B-GGML
Updated
Jun 9, 2023
ā¢
20