-
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Paper • 2403.13257 • Published • 20 -
Model Stock: All we need is just a few fine-tuned models
Paper • 2403.19522 • Published • 12 -
Mergenetic: a Simple Evolutionary Model Merging Library
Paper • 2505.11427 • Published • 12 -
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
Paper • 2410.01335 • Published • 5
Yamata Zen
yamatazen
AI & ML interests
None yet
Recent Activity
new activity
about 12 hours ago
mradermacher/model_requests:Model request
updated
a model
about 12 hours ago
yamatazen/FusionEngine-12B-Lorablated
published
a model
about 12 hours ago
yamatazen/FusionEngine-12B-Lorablated
Organizations
None yet
Collections
6
-
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Paper • 2306.06688 • Published -
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs
Paper • 2412.14471 • Published -
Language Models' Factuality Depends on the Language of Inquiry
Paper • 2502.17955 • Published • 34 -
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
Paper • 2410.01335 • Published • 5
models
114

yamatazen/FusionEngine-12B-Lorablated
Text Generation
•
Updated
•
1

yamatazen/HMS-Fusion-12B-Lorablated
Text Generation
•
Updated
•
5

yamatazen/ForgottenMaid-12B-Lorablated
Text Generation
•
Updated
•
8

yamatazen/Shisa-v2-Mistral-Nemo-12B-Lorablated
Text Generation
•
Updated
•
7
•
1

yamatazen/ForgottenMaid-12B-LoRA-Rank128
Updated
•
3
•
1

yamatazen/Gemma2-Ataraxy-Psycho-9B
Text Generation
•
Updated
•
15
•
1

yamatazen/FusionEngine-12B
Text Generation
•
Updated
•
25
•
2

yamatazen/HMS-Fusion-12B
Text Generation
•
Updated
•
24
•
3

yamatazen/Shisa-DellaTest-12B
Text Generation
•
Updated
•
6
•
1

yamatazen/Shirayukihime-12B
Text Generation
•
Updated
•
18
•
2
datasets
0
None public yet