RLXF
The best collection of RLXF models, including RLHF, RLAIF, etc.
Amu/dpo-phi2 • Text Generation • 3B • Updated Mar 4, 2024 • 69 • 2
Amu/spin-phi2 • Text Generation • 3B • Updated Mar 16, 2024 • 6 • 10
Amu/t1-3B-grpo • Text Generation • 3B • Updated Apr 7 • 3 • 1
BABYLLM
The baby LLM is the future of LLMs.
Amu/supertiny-llama3-0.25B-v0.1 • Text Generation • 0.3B • Updated Jul 8, 2024 • 3 • 6
Amu/t1-3B • Text Generation • 3B • Updated Mar 11 • 7 • 1
RAG
The best collection of RAG models, such as embedding models and rankers.
Amu/tao-8k • Sentence Similarity • Updated Dec 3, 2023 • 686 • 46