TimeScope: How Long Can Your Video Large Multimodal Model Go? Article • By orrzohar and 3 others • 13 days ago • 31
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 122
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning Paper • 2505.14231 • Published May 20 • 53
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30 • 48
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published Apr 24 • 56
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22 • 63
ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering Paper • 2504.05506 • Published Apr 7 • 23
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published Apr 17 • 52
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17 • 92
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Paper • 2504.13122 • Published Apr 17 • 21
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published Apr 8 • 86
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving Paper • 2504.02605 • Published Apr 3 • 48