Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12, 2024 • 41
Efficient Tool Use with Chain-of-Abstraction Reasoning Paper • 2401.17464 • Published Jan 30, 2024 • 21
PathFinder: Guided Search over Multi-Step Reasoning Paths Paper • 2312.05180 • Published Dec 8, 2023 • 10
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning Paper • 2309.02591 • Published Sep 5, 2023 • 15