-
Addition is All You Need for Energy-efficient Language Models
Paper • 2410.00907 • Published • 144 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 93 -
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paper • 2407.05528 • Published • 3 -
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Paper • 2407.00402 • Published • 22
meng shao
meng-shao
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 18 hours ago
AI PERSONA: Towards Life-long Personalization of LLMs
upvoted
a
paper
1 day ago
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
commented
a paper
5 days ago
Qwen2.5 Technical Report
Organizations
Collections
2
-
DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation
Paper • 2410.00201 • Published -
Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems
Paper • 2409.19804 • Published -
Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling
Paper • 2409.15156 • Published -
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue
Paper • 2409.04927 • Published
models
None public yet
datasets
None public yet