Unified Multimodal Discrete Diffusion

Alexander Swerdlow1*  Mihir Prabhudesai1*  Siddharth Gandhi1  Deepak Pathak1  Katerina Fragkiadaki1 

1 Carnegie Mellon University 

ArXiv Webpage

Hugging Face models

The UniDisc checkpoints are available on Hugging Face:

Getting Started

Code can be found here: https://github.com/alexanderswerdlow/unidisc/tree/main

To install the dependencies, run:

git submodule update --init --recursive
uv sync --no-group dev
uv sync

For a more detailed installation guide, please refer to INSTALL.md.

Data

See DATA.md for details on how to download and preprocess the datasets. We provide processing scripts and instructions for all of the used datasets. Additionally, we release a synthetic dataset available here and the corresponding generation scripts as well as the raw data.

Training

See TRAIN.md for training commands.

Inference

Interactive demo:

python demo/server.py
python demo/client_simple_fasthtml.py

Training

See TRAINING.md for details.

Evaluation

See EVAL.md for details.

Citation

To cite our work, please use the following:

@article{TODO,
  title={TODO},
  author={TODO},
  journal={arXiv preprint arXiv:TODO},
  year={TODO}
}

Credits

This repository is built on top of the following repositories:

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support