Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published 25 days ago • 42
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL Paper • 2505.24875 • Published May 30 • 10
ReasonGen-R1 Collection Model and Datasets for the paper "ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL" • 7 items • Updated Jun 2 • 6
microsoft/LLM2CLIP-Llama3.1-8B-siglip2-so400m-patch14-224 Zero-Shot Classification • Updated Mar 24 • 8
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 11 items • Updated May 1 • 60
REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents Paper • 2411.13552 • Published Nov 20, 2024
microsoft/LLM2CLIP-Llama3.2-1B-EVA02-L-14-336 Zero-Shot Image Classification • Updated Dec 13, 2024 • 10