Vikramjeet Singh

VikramSingh178

5 69 91

https://vikramxd.github.io

AI & ML interests

Computer Vision | Transformers| Diffusion Models | ML Systems

Organizations

upvoted 2 papers 11 months ago

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published Aug 13, 2025 • 70

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 276

upvoted a paper about 1 year ago

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Paper • 2410.17891 • Published Oct 23, 2024 • 18

upvoted an article about 1 year ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 262

upvoted 2 papers about 1 year ago

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

Paper • 2504.12364 • Published Apr 16, 2025 • 22

Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

Paper • 2504.11447 • Published Apr 15, 2025 • 4

upvoted 7 papers over 1 year ago

Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 18

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 289

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Paper • 2402.00769 • Published Feb 1, 2024 • 22

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

upvoted a collection over 1 year ago

Daily Papers

Collection

1 item • Updated Oct 26, 2023 • 84

upvoted 6 papers over 1 year ago

VEnhancer: Generative Space-Time Enhancement for Video Generation

Paper • 2407.07667 • Published Jul 10, 2024 • 17

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 34

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published Nov 11, 2024 • 30

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 39

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published Nov 4, 2024 • 34

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25, 2024 • 24

Vikramjeet Singh

AI & ML interests

Organizations

VikramSingh178's activity

nanoVLM: The simplest repository to train your VLM in pure PyTorch