RL for Consistency Models: Faster Reward Guided Text-to-Image Generation
Paper • 2404.03673
a multi-step Markov Decision Process, allowing one to fine-tune consistency models toward a downstream task using just a reward function.
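To make the idea concrete, here is a minimal toy sketch of treating a multi-step sampler as a Markov Decision Process and tuning it with nothing but a reward function. It is not the paper's method or code: the "model" is a single learnable shift per step applied to a scalar state, the update is a plain REINFORCE policy gradient with a mean baseline, and every name (`rollout`, `reward`, `finetune`, `target`, `sigma`) is a hypothetical choice made for illustration.

```python
import numpy as np

def rollout(theta, rng, sigma=0.5):
    """Run one episode: each sampling step is one action of a Gaussian policy."""
    x = rng.normal()                      # initial state: pure noise
    grads = np.zeros_like(theta)
    for t in range(len(theta)):
        mean = x + theta[t]               # policy mean at step t (toy stand-in for a model call)
        x_next = mean + sigma * rng.normal()
        # Gradient of log N(x_next; mean, sigma^2) with respect to theta[t]
        grads[t] = (x_next - mean) / sigma**2
        x = x_next
    return x, grads

def reward(x, target=2.0):
    """Assumed downstream reward: highest when the final sample hits the target."""
    return -(x - target) ** 2

def finetune(steps=300, horizon=4, batch=64, lr=0.01, seed=0):
    """REINFORCE over the multi-step chain, using only final-sample rewards."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(horizon)
    history = []
    for _ in range(steps):
        samples = [rollout(theta, rng) for _ in range(batch)]
        rewards = np.array([reward(x) for x, _ in samples])
        grads = np.stack([g for _, g in samples])
        advantages = rewards - rewards.mean()          # mean baseline reduces variance
        theta += lr * (advantages[:, None] * grads).mean(axis=0)  # gradient ascent
        history.append(rewards.mean())
    return theta, history

if __name__ == "__main__":
    theta, history = finetune()
    print("sum of learned shifts:", theta.sum())
    print("mean reward, first vs. last iterations:", history[0], np.mean(history[-50:]))
```

In this toy setup the only training signal is the scalar reward on the final sample; the per-step log-probability gradients are what let that signal reach every step of the chain.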