zerozeyi
's Collections
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper
•
2402.01739
•
Published
•
27
Rethinking Interpretability in the Era of Large Language Models
Paper
•
2402.01761
•
Published
•
23
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
•
2402.03620
•
Published
•
115
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
Model
Paper
•
2402.07827
•
Published
•
47
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
105
Generative Representational Instruction Tuning
Paper
•
2402.09906
•
Published
•
54
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper
•
2402.10193
•
Published
•
20
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
57
SPAR: Personalized Content-Based Recommendation via Long Engagement
Attention
Paper
•
2402.10555
•
Published
•
35
Learning to Learn Faster from Human Feedback with Language Model
Predictive Control
Paper
•
2402.11450
•
Published
•
22
Paper
•
2402.12219
•
Published
•
17
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Paper
•
2402.16840
•
Published
•
24
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
608
Can large language models explore in-context?
Paper
•
2403.15371
•
Published
•
32
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
•
2404.14619
•
Published
•
127
FLAME: Factuality-Aware Alignment for Large Language Models
Paper
•
2405.01525
•
Published
•
26
Octopus v4: Graph of language models
Paper
•
2404.19296
•
Published
•
117
KAN: Kolmogorov-Arnold Networks
Paper
•
2404.19756
•
Published
•
109
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper
•
2405.00732
•
Published
•
120
From Loops to Oops: Fallback Behaviors of Language Models Under
Uncertainty
Paper
•
2407.06071
•
Published
•
7
Human-like Episodic Memory for Infinite Context LLMs
Paper
•
2407.09450
•
Published
•
60
LLMs + Persona-Plug = Personalized LLMs
Paper
•
2409.11901
•
Published
•
32