Ge Zhang

zhangysk

AI & ML interests

None yet

Recent Activity

upvoted a collection about 17 hours ago

Hybrid Linear Attention Research

Collection

All 1.3B & 340M hybrid linear-attention experiments. • 60 items • Updated 1 day ago • 2

upvoted a paper 2 days ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published 6 days ago • 40

upvoted a paper 6 days ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published 6 days ago • 113

upvoted a paper 15 days ago

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published 21 days ago • 36

upvoted a paper 21 days ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 23 days ago • 61

upvoted a paper 22 days ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published 27 days ago • 32

upvoted 4 papers about 2 months ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 41

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 51

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20 • 130

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 26

upvoted an article 2 months ago

Reasoning Datasets Competition

Article • and 6 others • Apr 9 • 37

upvoted 2 papers 2 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5 • 32

QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining

Paper • 2504.16511 • Published Apr 23 • 20

upvoted 7 papers 3 months ago

Efficient Pretraining Length Scaling

Paper • 2504.14992 • Published Apr 21 • 20

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21 • 22

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44
