Kan-LLaMA [ಕನ್-LLama] Tensoic's suite of Kannada Llama Tensoic/Kan-LLaMA-7B-base Text Generation • Updated Jan 24, 2024 • 4 • 11 Tensoic/Kan-Llama-7B-SFT-v0.5 Text Generation • 7B • Updated Jan 24, 2024 • 88 • 1 Tensoic/Kan-LLaMA-7B-SFT-v0.1 Text Generation • Updated Jan 12, 2024 • 3 Tensoic/Kan-LLaMa-SFT-v0.1-GGUF 7B • Updated Apr 6, 2024 • 7
Synthetic Image Data Gen Models, datasets and assets that aid in Synthetic data generation Tensoic/Synth-DataGen-Assets Updated May 31, 2024 • 4 • 1 MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21
Indic Instructions Just Indic Datasets Tensoic/GPTeacher-Assamese Viewer • Updated Mar 8, 2024 • 43.7k • 10 Tensoic/GPTeacher-Bangla Viewer • Updated Mar 8, 2024 • 43.6k • 8 Tensoic/GPTeacher-Gujarati Viewer • Updated Mar 8, 2024 • 43.8k • 4 • 1 Tensoic/GPTeacher-Hindi Viewer • Updated Mar 8, 2024 • 43.6k • 9
Kan-LLaMA [ಕನ್-LLama] Tensoic's suite of Kannada Llama Tensoic/Kan-LLaMA-7B-base Text Generation • Updated Jan 24, 2024 • 4 • 11 Tensoic/Kan-Llama-7B-SFT-v0.5 Text Generation • 7B • Updated Jan 24, 2024 • 88 • 1 Tensoic/Kan-LLaMA-7B-SFT-v0.1 Text Generation • Updated Jan 12, 2024 • 3 Tensoic/Kan-LLaMa-SFT-v0.1-GGUF 7B • Updated Apr 6, 2024 • 7
Indic Instructions Just Indic Datasets Tensoic/GPTeacher-Assamese Viewer • Updated Mar 8, 2024 • 43.7k • 10 Tensoic/GPTeacher-Bangla Viewer • Updated Mar 8, 2024 • 43.6k • 8 Tensoic/GPTeacher-Gujarati Viewer • Updated Mar 8, 2024 • 43.8k • 4 • 1 Tensoic/GPTeacher-Hindi Viewer • Updated Mar 8, 2024 • 43.6k • 9
Synthetic Image Data Gen Models, datasets and assets that aid in Synthetic data generation Tensoic/Synth-DataGen-Assets Updated May 31, 2024 • 4 • 1 MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21