Zikui Cai

Zikui

https://zikuicai.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 months ago

Qwen3

upvoted a paper 5 months ago

DynaGuard: A Dynamic Guardrail Model With User-Defined Policies

upvoted a paper 5 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

View all activity

Organizations

upvoted a collection 2 months ago

Qwen3

Collection

84 items • Updated 29 days ago • 1.61k

upvoted 2 papers 5 months ago

DynaGuard: A Dynamic Guardrail Model With User-Defined Policies

Paper • 2509.02563 • Published Sep 2, 2025 • 21

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

updated a dataset 6 months ago

video-reasoning/physical-commonsense

Updated Jul 28, 2025 • 122

published a dataset 6 months ago

video-reasoning/physical-commonsense

Updated Jul 28, 2025 • 122

upvoted a paper 6 months ago

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Paper • 2507.16746 • Published Jul 22, 2025 • 35

upvoted 5 collections 7 months ago

upvoted a collection 8 months ago

Recurrent Models

Collection

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21, 2025 • 11

upvoted a paper 8 months ago

ARGUS: Hallucination and Omission Evaluation in Video-LLMs

Paper • 2506.07371 • Published Jun 9, 2025 • 8

authored 5 papers 8 months ago

Cross-Modal Safety Alignment: Is textual unlearning all you need?

Paper • 2406.02575 • Published May 27, 2024 • 1

Single Layer Single Gradient Unlearning

Paper • 2407.11867 • Published Jul 16, 2024 • 1

Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities

Paper • 2502.05209 • Published Feb 3, 2025 • 1

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28, 2025 • 7

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5, 2025 • 34

upvoted 2 papers 8 months ago