Tiny Datasets - a vincentkoc Collection

vincentkoc 's Collections

LLM Agent and Prompt Optimizers

Evaluation for Generative AI

Tiny Datasets

updated 5 days ago

vincentkoc/tiny_qa_benchmark

Viewer • Updated 5 days ago • 52 • 89 • 1
vincentkoc/tiny_qa_benchmark_pp

Viewer • Updated 5 days ago • 662 • 254 • 1
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation

Paper • 2505.12058 • Published 7 days ago • 6
roneneldan/TinyStories

Viewer • Updated Aug 12, 2024 • 2.14M • 30k • 663
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 36
tinyBenchmarks: evaluating LLMs with fewer examples

Paper • 2402.14992 • Published Feb 22, 2024 • 14
Microscaling Data Formats for Deep Learning

Paper • 2310.10537 • Published Oct 16, 2023 • 8