S.F.

search-facility · ipv6

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a collection 3 days ago

Helios

Helios: 14B Real-Time Long Video Generation Model can be Cheaper, Faster but Keep Stronger than 1.3B ones • 7 items • Updated 1 day ago • 14

upvoted a paper 4 days ago

Mode Seeking meets Mean Seeking for Fast Long Video Generation

Paper • 2602.24289 • Published 8 days ago • 37

upvoted 5 papers 8 days ago

Image Generation with a Sphere Encoder

Paper • 2602.15030 • Published 19 days ago • 15

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

Paper • 2602.19163 • Published 13 days ago • 14

Solaris: Building a Multiplayer Video World Model in Minecraft

Paper • 2602.22208 • Published 10 days ago • 27

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Paper • 2602.12160 • Published 23 days ago • 38

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published 10 days ago • 15

upvoted 4 papers 16 days ago

COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression

Paper • 2602.15200 • Published 19 days ago • 7

Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Paper • 2602.14486 • Published 19 days ago • 11

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 18 days ago • 105

Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?

Paper • 2602.14111 • Published 20 days ago • 55

liked a model 18 days ago

shallowdream204/BitDance-14B-16x

Text-to-Image • 15B • Updated 17 days ago • 272 • 88

upvoted 3 papers 18 days ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published 23 days ago • 59

SemanticMoments: Training-Free Motion Similarity via Third Moment Features

Paper • 2602.09146 • Published 26 days ago • 21

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 24 days ago • 240

upvoted 2 papers 22 days ago

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Paper • 2602.08711 • Published 26 days ago • 28

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 24 days ago • 29

upvoted a paper 23 days ago

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published 26 days ago • 36

upvoted a paper 25 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 29 days ago • 71

upvoted a paper 29 days ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50