Vadim Karpenko

jrell

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

MrDragonFox/mOrpheus_3B-1Base_early_preview

upvoted a paper 3 months ago

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

reacted to schuler's post with 🔥 3 months ago

📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.

🔑 Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm

Organizations

None yet

jrell's activity

upvoted a paper 3 months ago

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Paper • 2502.12574 • Published Feb 18 • 11

upvoted a collection 4 months ago

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7 • 145