Simeng Sun's picture

3 1

Simeng Sun

simsun131

https://people.cs.umass.edu/~simengsun/

AI & ML interests

Language Modeling, Machine Translation

Recent Activity

upvoted a paper 29 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

upvoted a paper 29 days ago

Star Attention: Efficient LLM Inference over Long Sequences

View all activity

Organizations

simsun131's activity

upvoted 2 papers 29 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20 • 39

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published about 1 month ago • 47

updated a dataset 7 months ago

simsun131/chapterbreak

Viewer • Updated Jun 5 • 8 • 62

liked a Space 7 months ago

FineWeb: decanting the web for the finest text data at scale

authored 4 papers 9 months ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9 • 34

Do Long-Range Language Models Actually Use Long-Range Context?

Paper • 2109.09115 • Published Sep 19, 2021

TopicGPT: A Prompt-based Topic Modeling Framework

Paper • 2311.01449 • Published Nov 2, 2023 • 1

IGA : An Intent-Guided Authoring Assistant

Paper • 2104.07000 • Published Apr 14, 2021 • 1

upvoted a paper 9 months ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9 • 34

authored a paper about 1 year ago

Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages

Paper • 2302.03528 • Published Feb 7, 2023

updated a model over 1 year ago

simsun131/alpacafarm_ppo_lora

Updated Sep 15, 2023

authored a paper over 1 year ago

PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents

Paper • 2305.14564 • Published May 23, 2023 • 1