4 29 22

Mukul Ranjan PRO

mukul54

https://www.mukul54.github.io

AI & ML interests

Efficiency, Optimization

Recent Activity

liked a dataset 12 days ago

Salesforce/wikitext

new activity 12 days ago

Salesforce/wikitext:Where is the "download" button

liked a model 20 days ago

GSAI-ML/LLaDA-8B-Base

View all activity

Organizations

upvoted a paper 28 days ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 70

upvoted a paper 30 days ago

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching

Paper • 2506.20480 • Published Jun 25 • 7

upvoted a paper about 1 month ago

Parallelizing Linear Transformers with the Delta Rule over Sequence Length

Paper • 2406.06484 • Published Jun 10, 2024 • 4

upvoted 2 articles about 1 month ago

Article

Why Maybe We're Measuring LLM Compression Wrong

•

Jun 21

• 9

Article

Stable Diffusion with 🧨 Diffusers

and 3 others •

Aug 22, 2022

• 66

upvoted 2 papers about 1 month ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 41

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 59

upvoted an article about 2 months ago

Article

KV Cache from scratch in nanoVLM

and 4 others •

Jun 4

• 88

upvoted a paper about 2 months ago

CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Paper • 2505.16968 • Published May 22 • 41

upvoted a collection about 2 months ago

VisionLM

Collection

1363 items • Updated about 15 hours ago • 88

upvoted 6 papers about 2 months ago

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

Paper • 2505.24878 • Published May 30 • 23

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Paper • 2505.24862 • Published May 30 • 31

CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects

Paper • 2505.21437 • Published May 27 • 22

Vision Language Models are Biased

Paper • 2505.23941 • Published May 29 • 20

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published May 30 • 80

upvoted a paper 2 months ago

SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem

Paper • 2505.21887 • Published May 28 • 14

upvoted a collection 3 months ago

CASS

Collection

Large-scale dataset and model suite for cross-architecture GPU code transpilation between CUDA and HIP at both source and assembly levels • 2 items • Updated May 15 • 5

upvoted an article 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 491

upvoted a collection 3 months ago

Qwen3

Collection

21 items • Updated Apr 29 • 30

Mukul Ranjan PRO

AI & ML interests

Recent Activity

Organizations

mukul54's activity

Why Maybe We're Measuring LLM Compression Wrong

Stable Diffusion with 🧨 Diffusers

KV Cache from scratch in nanoVLM

Vision Language Models (Better, Faster, Stronger)