Gao Sensen's picture

23 18

Gao Sensen PRO

Sensen02

·

AI & ML interests

None yet

Recent Activity

liked a Space 3 days ago

OAOA/InvSR

liked a Space 3 days ago

finegrain/finegrain-image-enhancer

liked a Space 3 days ago

yangheng/Super-Resolution-Anime-Diffusion

View all activity

Organizations

upvoted 2 papers 3 days ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published Oct 15 • 72

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published 10 days ago • 190

upvoted a paper 18 days ago

OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation

Paper • 2512.06589 • Published 21 days ago • 17

upvoted 2 papers about 1 month ago

VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator

Paper • 2510.13454 • Published Oct 15 • 8

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12 • 117

upvoted 3 papers about 2 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 127

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published Nov 6 • 7

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6 • 37

upvoted 2 papers 2 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 165

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 49

upvoted 6 papers 3 months ago

dParallel: Learnable Parallel Decoding for dLLMs

Paper • 2509.26488 • Published Sep 30 • 19

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 54

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 54

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 118

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

Oyster-I: Beyond Refusal -- Constructive Safety Alignment for Responsible Language Models

Paper • 2509.01909 • Published Sep 2 • 6

upvoted 4 papers 4 months ago

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1 • 37

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 85

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Paper • 2508.17472 • Published Aug 24 • 26

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 266