Yingfa Chen

chen-yingfa

https://chen-yingfa.github.io

AI & ML interests

Long-context modeling, continual learning, architectures

Recent Activity

authored a paper 24 days ago

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Organizations

None yet

upvoted a paper 26 days ago

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Paper • 2507.08771 • Published Jul 11 • 9

upvoted an article 2 months ago

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

Article • and 6 others • Jun 12 • 124

upvoted a paper 5 months ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Paper • 2503.09579 • Published Mar 12 • 5

upvoted a paper 9 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

upvoted a paper 10 months ago

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Paper • 2410.07145 • Published Oct 9, 2024 • 2

upvoted an article 12 months ago

A failed experiment: Infini-Attention, and why we should keep trying?

Article • and 2 others • Aug 14, 2024 • 69

upvoted a paper about 1 year ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22, 2024 • 14
