SuperBPE Collection SuperBPE tokenizers and models trained with them • 7 items • Updated 14 days ago • 13
DataComp: In search of the next generation of multimodal datasets Paper • 2304.14108 • Published Apr 27, 2023 • 2
Scalable Extraction of Training Data from (Production) Language Models Paper • 2311.17035 • Published Nov 28, 2023 • 3
Git Re-Basin: Merging Models modulo Permutation Symmetries Paper • 2209.04836 • Published Sep 11, 2022 • 1
PLeaS -- Merging Models with Permutations and Least Squares Paper • 2407.02447 • Published Jul 2, 2024