alfredplpl (Yasunori Ozaki)

upvoted a paper 4 days ago

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published 5 days ago • 17

upvoted an article 18 days ago

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Article • 19 days ago • 43

upvoted 2 articles 19 days ago

Allegro: Advanced Video Generation Model

Article • 19 days ago • 55

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

Article • 20 days ago • 27

upvoted an article 24 days ago

Fixing Gradient Accumulation

Article • 25 days ago • 39

upvoted a collection about 1 month ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 16 days ago • 451

upvoted an article about 2 months ago

FineVideo: behind the scenes

Article • Sep 23 • 23

upvoted a collection about 2 months ago

CommonCanvas

Collection

Collection of models trained on the CommonCatalogue datasets • 8 items • Updated May 16 • 9

upvoted 2 papers about 2 months ago

LVCD: Reference-based Lineart Video Colorization with Diffusion Models

Paper • 2409.12960 • Published Sep 19 • 22

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 73

upvoted 2 papers 2 months ago

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Paper • 2409.01199 • Published Sep 2 • 12

VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers

Paper • 2408.17131 • Published Aug 30 • 11

upvoted a collection 2 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 10 days ago • 489

upvoted 2 papers 2 months ago

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27 • 27

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121

upvoted a paper 3 months ago

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22 • 33

upvoted an article 3 months ago

Understanding InstaFlow/Rectified Flow

Article • Oct 6, 2023 • 16

upvoted 3 papers 3 months ago

Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization

Paper • 2408.08019 • Published Aug 15 • 9

Imagen 3

Paper • 2408.07009 • Published Aug 13 • 61

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12 • 35
