Text-to-Image

The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation

This model is trained on ImageNet-32x32 text-to-image with conditional minibatch optimal transport data coupling.

Paper: The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation Project Page: https://hkchengrex.github.io/C2OT Code: https://github.com/hkchengrex/C2OT

8GtoMoons

High-Level Summary

C2OT is an algorithm for computing prior-to-data couplings for flow-matching-based generative models during training. Our goal is to achieve straighter flows, enabled by optimal transport (OT) couplings, while mitigating the test-time degradation that OT encounters in the conditional setting. The key idea is that OT samples from a condition-skewed prior distribution at test time, whereas C2OT unskews the prior by incorporating a condition-dependent term into the OT cost.

Usage

See the GitHub repo: https://github.com/hkchengrex/C2OT

Citation

If you use C$^2$OT in your research, please cite the original paper:

@inproceedings{cheng2025curse,
  title={The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation},
  author={Cheng, Ho Kei and Schwing, Alexander},
  booktitle={ICCV},
  year={2025}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Datasets used to train hkchengrex/c2ot-imagenet32-c2ot

Collection including hkchengrex/c2ot-imagenet32-c2ot