arxiv:2412.18619
Yizhe Xiong
Bostoncake
AI & ML interests
None yet
Recent Activity
authored
a paper
9 days ago
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient
Task Adaptation
authored
a paper
9 days ago
Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective
Scaffold Token Removal
authored
a paper
9 days ago
MaskMoE: Boosting Token-Level Learning via Routing Mask in
Mixture-of-Experts
Organizations
None yet
spaces
1
models
None public yet
datasets
None public yet