Manish Kumar Pandey

Manish-GenAI

AI & ML interests

#GraphML, #GeometricDL, #3DComputerVision, #DiffusionModels, #GANs, #Generative AI, #ComputerVision, #ML, #RL, #LLM, #MultiModal Fusion, #GenerativeFlow Networks

Recent Activity

upvoted an article 16 days ago

nanoVLM: The simplest repository to train your VLM in pure PyTorch

liked a model 18 days ago

eagle0504/finetuned-warren-buffett-letter-model-llama-3.2-1B-Instruct-2024

upvoted an article 22 days ago

The Transformers Library: standardizing model definitions

Manish-GenAI's activity

upvoted an article 16 days ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others • 17 days ago • 140

upvoted 2 articles 22 days ago

Article

The Transformers Library: standardizing model definitions

and 3 others • 23 days ago • 112

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others • 26 days ago • 417

upvoted a paper 27 days ago

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published 30 days ago • 24

upvoted a paper about 1 month ago

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published Apr 24 • 39

upvoted 2 collections about 1 month ago

Perception LM

Collection

7 items • Updated Apr 17 • 52

Perception Encoder

Collection

9 items • Updated Apr 17 • 60

upvoted a collection about 2 months ago

InternVL3

Collection

34 items • Updated Apr 20 • 70

upvoted a collection 2 months ago

🌙 March 2025 - Open releases from the Chinese community

Collection

32 items • Updated 22 days ago • 13

upvoted 2 papers 3 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 27

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

upvoted an article 3 months ago

Article

🌁#89: AI in Action: How AI Engineers, Self-Optimizing Models, and Humanoid Robots Are Reshaping 2025

Feb 25 • 4

upvoted an article 4 months ago

Article

What is test-time compute and how to scale it?

and 1 other • Feb 6 • 89

upvoted a paper 4 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 232

upvoted 3 articles 4 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others • Feb 4 • 1.25k

Article

Open-R1: Update #1

and 7 others • Feb 2 • 305

Article

How to deploy and fine-tune DeepSeek models on AWS

and 2 others • Jan 30 • 52

upvoted 2 articles 5 months ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

and 1 other • Jan 16 • 74

Article

Topic 23: What is LLM Inference, it's challenges and solutions for it

Jan 17 • 6

upvoted a collection 5 months ago

InternLM3

Collection

6 items • Updated Feb 11 • 26