cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

cadrille is trained to transform point clouds, images, and text into 3D CAD models represented as valid Python code. cadrille basically is CAD-Recode with 2 additional modalities (images and text). Here, you can access the three-modal model without RL fine-tuning. Training and testing code is available on our github. And if you like it, give us a github ๐ŸŒŸ.

Citation

If you find this work useful for your research, please cite our paper:

@article{kolodiazhnyi2025cadrille,
  title={cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning},
  author={Maksim Kolodiazhnyi, Denis Tarasov, Dmitrii Zhemchuzhnikov, Alexander Nikulin, Ilya Zisman, Anna Vorontsova, Anton Konushin, Vladislav Kurenkov, Danila Rukhovich},
  journal={arXiv preprint arXiv:2505.22914},
  year={2025}
}
Downloads last month
29
Safetensors
Model size
2.21B params
Tensor type
F32
ยท
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support