Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Paper • 2507.08128 • Published 6 days ago • 7
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4 • 22
Edit-A-Video: Single Video Editing with Object-Aware Consistency Paper • 2303.07945 • Published Mar 14, 2023
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4 • 22
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4 • 22 • 2
When Preferences Diverge: Aligning Diffusion Models with Minority-Aware Adaptive DPO Paper • 2503.16921 • Published Mar 21 • 6
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39 • 8
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39 • 8
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Paper • 2412.15484 • Published Dec 20, 2024 • 15
Unified Speech-Text Pretraining for Spoken Dialog Modeling Paper • 2402.05706 • Published Feb 8, 2024 • 6
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39 • 8
ControlDreamer: Stylized 3D Generation with Multi-View ControlNet Paper • 2312.01129 • Published Dec 2, 2023 • 5
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Paper • 2106.06406 • Published Jun 11, 2021
Disentangled Motion Modeling for Video Frame Interpolation Paper • 2406.17256 • Published Jun 25, 2024
Style-Friendly SNR Sampler for Style-Driven Generation Paper • 2411.14793 • Published Nov 22, 2024 • 40
Style-Friendly SNR Sampler for Style-Driven Generation Paper • 2411.14793 • Published Nov 22, 2024 • 40