3D - a GEONTT Collection

3D

Updated Jul 23, 2024

Seamless Human Motion Composition with Blended Positional Encodings
Paper • 2402.15509 • Published Feb 23, 2024 • 15

TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper • 2403.02151 • Published Mar 4, 2024 • 14

3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published Mar 14, 2024 • 10

Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Paper • 2403.09981 • Published Mar 15, 2024 • 8

Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
Paper • 2403.10395 • Published Mar 15, 2024 • 9

Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Paper • 2403.12032 • Published Mar 18, 2024 • 15

GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
Paper • 2403.12365 • Published Mar 19, 2024 • 11

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Paper • 2404.09833 • Published Apr 15, 2024 • 31

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation
Paper • 2404.13026 • Published Apr 19, 2024 • 25

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Paper • 2404.13013 • Published Apr 19, 2024 • 32

SAGS: Structure-Aware 3D Gaussian Splatting
Paper • 2404.19149 • Published Apr 29, 2024 • 14

STT: Stateful Tracking with Transformers for Autonomous Driving
Paper • 2405.00236 • Published Apr 30, 2024 • 9

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
Paper • 2405.08054 • Published May 13, 2024 • 26

Toon3D: Seeing Cartoons from a New Perspective
Paper • 2405.10320 • Published May 16, 2024 • 23

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner
Paper • 2405.14979 • Published May 23, 2024 • 20

Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
Paper • 2405.16822 • Published May 27, 2024 • 12

Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer
Paper • 2405.17405 • Published May 27, 2024 • 17

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians
Paper • 2406.16815 • Published Jun 24, 2024 • 7

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Paper • 2406.19389 • Published Jun 27, 2024 • 55

Shape of Motion: 4D Reconstruction from a Single Video
Paper • 2407.13764 • Published Jul 18, 2024 • 20