Theory - a yicui Collection

Theory

updated Nov 14, 2024

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 109

The Platonic Representation Hypothesis

Paper • 2405.07987 • Published May 13, 2024 • 2

The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning

Paper • 2304.05366 • Published Apr 11, 2023 • 1

Explaining NonLinear Classification Decisions with Deep Taylor Decomposition

Paper • 1512.02479 • Published Dec 8, 2015 • 1

Large Language Models as Markov Chains

Paper • 2410.02724 • Published Oct 3, 2024 • 31

Neural Machine Translation by Jointly Learning to Align and Translate

Paper • 1409.0473 • Published Sep 1, 2014 • 5

Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models

Paper • 2310.17086 • Published Oct 26, 2023 • 1

Cross-Entropy Loss Functions: Theoretical Analysis and Applications

Paper • 2304.07288 • Published Apr 14, 2023 • 1

The Geometry of Concepts: Sparse Autoencoder Feature Structure

Paper • 2410.19750 • Published Oct 10, 2024 • 2

Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7, 2024 • 7