Article: Transformers backend integration in SGLang (by marcsun13 and 4 others, Jun 23)
Paper: PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models (arXiv:2506.16054, Jun 19)
Collection: R2R, collections for the paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing" (4 items)
Paper: Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching (arXiv:2412.17153, Dec 22, 2024)
Paper: FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models (arXiv:2501.01986, Dec 30, 2024)
Paper: Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization (arXiv:2502.04686, Feb 7)
Paper: VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments (arXiv:2506.02387, Jun 3)
Paper: DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research (arXiv:2505.19253, May 25)
Paper: SageAttention2++: A More Efficient Implementation of SageAttention2 (arXiv:2505.21136, May 27)
Paper: R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing (arXiv:2505.21600, May 27)
Paper: MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression (arXiv:2406.14909, Jun 21, 2024)