Oğuzhan Ercan's picture

Oğuzhan Ercan

oguzhanercan

·

AI & ML interests

Computer Vision, Generative Vision, first trajectory bender

Recent Activity

updated a collection 1 day ago

updated a collection 2 days ago

Video Generation

updated a collection 3 days ago

Image-Video MultiModal Understanding

View all activity

Organizations

None yet

oguzhanercan's activity

upvoted a paper 11 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 12 days ago • 241

upvoted a paper 15 days ago

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published 16 days ago • 46

upvoted a paper about 1 month ago

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Paper • 2503.06698 • Published Mar 9 • 4

upvoted a paper about 2 months ago

How far can we go with ImageNet for Text-to-Image generation?

Paper • 2502.21318 • Published Feb 28 • 26

upvoted 10 papers 3 months ago

Improved Training Technique for Latent Consistency Models

Paper • 2502.01441 • Published Feb 3 • 8

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 19

Relightable Full-Body Gaussian Codec Avatars

Paper • 2501.14726 • Published Jan 24 • 10

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published Jan 21 • 48

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 70

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 386

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published Jan 14 • 20

Diffusion Adversarial Post-Training for One-Step Video Generation

Paper • 2501.08316 • Published Jan 14 • 34

Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence

Paper • 2410.17161 • Published Oct 22, 2024 • 1

Infecting Generative AI With Viruses

Paper • 2501.05542 • Published Jan 9 • 13

upvoted 6 papers 4 months ago

Nested Attention: Semantic-aware Attention Values for Concept Personalization

Paper • 2501.01407 • Published Jan 2 • 11

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 55

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 43

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Paper • 2412.18597 • Published Dec 24, 2024 • 19

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published Dec 20, 2024 • 23