A family of bilingual JA/EN LLMs
AI & ML interests
None defined yet.
Recent Activity
View all activity
Comparing Efficiency and Quality of various formats
-
cyberagent/Mistral-Nemo-Japanese-Instruct-2408
Text Generation • 12B • Updated • 999 • 41 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-FP8-Dynamic
12B • Updated • 3 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-SQ-GPTQ-W8A8-INT8
12B • Updated • 7 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-GPTQ-W4A16-gs32
3B • Updated • 3
-
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 70 -
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • Published • 102 -
argilla/magpie-ultra-v1.0
Viewer • Updated • 3.22M • 2.24k • 47 -
simplescaling/s1K-1.1
Viewer • Updated • 1k • 9.13k • 125
JA/EN Bilingual LLMs
A family of bilingual JA/EN LLMs
-
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 70 -
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • Published • 102 -
argilla/magpie-ultra-v1.0
Viewer • Updated • 3.22M • 2.24k • 47 -
simplescaling/s1K-1.1
Viewer • Updated • 1k • 9.13k • 125
Comparing Efficiency and Quality of various formats
-
cyberagent/Mistral-Nemo-Japanese-Instruct-2408
Text Generation • 12B • Updated • 999 • 41 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-FP8-Dynamic
12B • Updated • 3 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-SQ-GPTQ-W8A8-INT8
12B • Updated • 7 -
shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-GPTQ-W4A16-gs32
3B • Updated • 3
JA/EN Bilingual LLMs