Maxwell Yao

MaxwellJryao

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published 3 days ago • 25

upvoted a paper 9 days ago

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published 10 days ago • 42

upvoted a paper 27 days ago

ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind

Paper • 2505.22961 • Published 29 days ago • 8

upvoted a paper about 1 month ago

Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

Paper • 2505.13508 • Published May 16 • 14

upvoted a paper about 2 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 18

authored 2 papers about 2 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 38

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 18

upvoted 2 papers about 2 months ago

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5 • 24

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 77

upvoted a paper 4 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 38

updated 2 models 10 months ago

MaxwellJryao/sft_loraMoE_-lora-sft_Qwen2-1.5B_lr-1e-3

Updated Sep 6, 2024

MaxwellJryao/sft_loraMoE_wiki_hop_original_choose_best_object_affirmative_1-lora-sft_Qwen2-1.5B_lr-1e-3

Updated Sep 5, 2024

updated 8 datasets 10 months ago
