Anirudh Thatipelli

Anirudh25

https://anirudh257.github.io/

Anirudh257

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

microsoft/resnet-152

liked a dataset 5 days ago

zh-plus/tiny-imagenet

upvoted an article 15 days ago

Introduction to State Space Models (SSM)

Organizations

None yet

upvoted an article 15 days ago

Introduction to State Space Models (SSM)

Article • Jul 19, 2024 • 150

upvoted a collection about 1 month ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 626

upvoted 3 papers about 1 month ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 169

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 95

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published May 30 • 79

upvoted a collection about 2 months ago

LaViDa-1.0

LArge VIsion-language Diffusion moDel with mAsking • 11 items • Updated May 26 • 7

upvoted a paper about 2 months ago

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13 • 42

upvoted an article 3 months ago

What is test-time compute and how to scale it?

Article • 2 authors • Feb 6 • 96

upvoted a paper 3 months ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8 • 83

upvoted an article 3 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 312

upvoted a collection 4 months ago

Meta's Llama 3.3 models & evals

2 items • Updated Dec 13, 2024 • 73

upvoted an article 4 months ago

SmolLM - blazingly fast and remarkably powerful

Article • 3 authors • Jul 16, 2024 • 393

upvoted 3 papers 4 months ago

GAEA: A Geolocation Aware Conversational Model

Paper • 2503.16423 • Published Mar 20 • 6

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9 • 30

Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

Paper • 2503.10632 • Published Mar 13 • 14

upvoted an article 4 months ago

SmolVLM - small yet mighty Vision Language Model

Article • 5 authors • Nov 26, 2024 • 336

upvoted a paper 5 months ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 25

upvoted a collection 5 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Apr 28 • 220

upvoted 2 papers 7 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 64

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 36