ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning Paper • 2507.16815 • Published 6 days ago • 31
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published 6 days ago • 38
Elevating 3D Models: High-Quality Texture and Geometry Refinement from a Low-Quality Model Paper • 2507.11465 • Published 13 days ago • 11
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Paper • 2507.11061 • Published 14 days ago • 37
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers Paper • 2507.12956 • Published 12 days ago • 21
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models Paper • 2507.13344 • Published 11 days ago • 50
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs Paper • 2507.11097 • Published 14 days ago • 56
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published 11 days ago • 211
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs Paper • 2507.09477 • Published 16 days ago • 74
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes Paper • 2507.11407 • Published 13 days ago • 49
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published 14 days ago • 60
Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Article • By thomwolf and 1 other • 20 days ago • 613
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second Paper • 2507.10065 • Published 15 days ago • 23
CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering Paper • 2507.08776 • Published 17 days ago • 51