--- license: mit pipeline_tag: image-to-3d tags: - triposg - 3d-generation - rectified-flow --- # TripoSG - High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images. ## Model Description ### Model Architecture TripoSG utilizes a novel architecture combining: - Rectified Flow (RF) based Transformer for stable, linear trajectory modeling - Advanced VAE with SDF-based representation and hybrid geometric supervision - Cross-attention mechanism for image feature condition - 1.5B parameters operating on 2048 latent tokens ## Intended Uses This model is designed for: - Converting single images to high-quality 3D meshes - Creative and design applications - Gaming and VFX asset creation - Prototyping and visualization ## Requirements - CUDA-capable GPU (>8GB VRAM) ## Usage For detailed usage instructions, please visit our [GitHub repository](https://github.com/VAST-AI-Research/TripoSG). ## About TripoSG is developed by [Tripo](https://www.tripo3d.ai), [VAST AI Research](https://github.com/orgs/VAST-AI-Research), pushing the boundaries of 3D Generative AI. For more information: - [GitHub Repository](https://github.com/VAST-AI-Research/TripoSG) - [Paper](https://arxiv.org/abs/2502.06608) - [Gradio Demo](https://huggingface.co/spaces/VAST-AI/TripoSG)