TINY MODELS WITH BIG INTELLIGENCE Collection Tiny (<30B) models that tend to outperform their same-parameter counterparts. • 16 items • Updated about 6 hours ago • 3
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. • 162 items • Updated about 13 hours ago • 2
CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • Updated about 15 hours ago • 58.7k • 662
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 81 items • Updated 7 days ago • 11
A Neuroscience-Inspired Dual-Process Model of Compositional Generalization Paper • 2507.18868 • Published Jul 25, 2025 • 1
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers Paper • 2310.15164 • Published Oct 23, 2023 • 4
JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation Paper • 2512.19171 • Published Dec 22, 2025 • 1
Disentangling Reasoning Capabilities from Language Models with Compositional Reasoning Transformers Paper • 2210.11265 • Published Oct 20, 2022 • 1
The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling Paper • 2603.07461 • Published 24 days ago • 1
Interpretable-by-Design Transformers via Architectural Stream Independence Paper • 2603.07482 • Published 24 days ago • 1