Ankit Sharma

nezubn

https://nezubn.com

AI & ML interests

engineering • systems • ml

Recent Activity

liked a model 13 days ago

openai/gpt-oss-20b

upvoted an article about 1 month ago

SmolLM3: smol, multilingual, long-context reasoner

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-0528

upvoted an article about 1 month ago

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • Jul 8 • 629

upvoted an article 4 months ago

Cohere on Hugging Face Inference Providers 🔥

Article • and 6 others • Apr 16 • 131

upvoted an article 5 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 208

upvoted a paper 9 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 50

upvoted a paper 10 months ago

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Paper • 2410.19008 • Published Oct 21, 2024 • 24

upvoted 3 articles about 1 year ago

Optimizing your LLM in production

Article • Sep 15, 2023 • 19

Getting Started With Embeddings

Article • Jun 23, 2022 • 86

quanto: a pytorch quantization toolkit

Article • and 2 others • Mar 18, 2024 • 42

upvoted 2 papers about 1 year ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 122

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted a paper over 1 year ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 55

upvoted an article over 1 year ago

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Article • and 4 others • May 24, 2023 • 162

upvoted 8 papers over 1 year ago

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4, 2024 • 66

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 47

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29, 2024 • 35
