LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks Paper • 2506.00411 • Published May 31
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions Paper • 2505.19949 • Published May 26
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference Paper • 2310.11142 • Published Oct 17, 2023
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads Paper • 2412.00127 • Published Nov 28, 2024