📋 Eval Logs Collection Benchmark log generated with Twinkle Eval, recording the model's outputs for each prompt. • 1 item • Updated May 14 • 2
🏎️ Formosa-1 Series Collection A collection of Formosa-1 (F1) reasoning models and datasets focused on Traditional Chinese instruction-following and logic. • 4 items • Updated May 14 • 3
🧠 Traditional Chinese Reasoning Datasets Collection A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated May 14 • 8
chienweichang/Llama-3-Taiwan-70B-Instruct-GGUF Text Generation • 71B • Updated Jul 12, 2024 • 174 • 4