arxiv:2501.16975
xunzhou
xunzhou
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
authored
a paper
4 months ago
Hyper-Connections
authored
a paper
8 months ago
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning
Enhancement in RLHF and Effective-Merged LLMs
Organizations
models
None public yet
datasets
None public yet