Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
zkj
JokerJan
Follow
GaryStack
AI & ML interests
None yet
Recent Activity
authored
a paper
19 days ago
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
authored
a paper
19 days ago
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
upvoted
a
paper
25 days ago
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
View all activity
Organizations
None yet
Papers
2
arxiv:
2506.04142
arxiv:
2506.04141
models
0
None public yet
datasets
2
Sort: Recently updated
JokerJan/MMR-VBench
Viewer
•
Updated
25 days ago
•
1.26k
•
2.63k
•
17
JokerJan/rl_think
Updated
26 days ago
•
11