Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published 25 days ago • 42
REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents Paper • 2411.13552 • Published Nov 20, 2024
DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models Paper • 2312.14216 • Published Dec 21, 2023 • 12
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation Paper • 2404.19394 • Published Apr 30, 2024