Asankhaya Sharma PRO

codelion

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

liked a model 5 minutes ago

google/gemma-3-1b-it

reacted to their post with ❤️ 3 days ago

New Research: Theoretical Foundations for In-Context Learning in Transformers I'm excited to share our latest theoretical work that formally proves an interesting property of large language models: base transformer models can approximate fine-tuned capabilities using only inference-time techniques like in-context learning. The core question we investigated: Can specialized behaviors typically acquired through expensive supervised fine-tuning be elicited from base models without any parameter updates? Our theoretical contribution: We provide a formal proof, grounded in the Turing completeness of transformers, showing that this is indeed possible under certain assumptions. The work establishes mathematical bounds on the minimal dataset sizes needed for approximation. Key theoretical results: - For text generation tasks: O(mV/ε²) examples suffice (where m = number of contexts, V = vocabulary size, ε = error tolerance) - For linear classification: O(d/ε) examples (where d = input dimension) - Extensions to finite context scenarios with practical bounds This work helps explain why techniques like few-shot prompting, retrieval-augmented generation, and in-context learning work so effectively in practice. It bridges formal computer science theory with empirical observations about modern language models. While the assumptions are idealized (unbounded computational resources, full dataset access), the results provide mathematical foundations for understanding inference-time adaptation strategies that are increasingly important in AI deployment. Paper: https://huggingface.co/papers/2506.08060

reacted to their post with ➕ 3 days ago

View all activity

Organizations

codelion's activity

upvoted a paper 3 days ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

upvoted 3 papers 5 days ago

upvoted 2 papers 10 days ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 53

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published 17 days ago • 23

upvoted a paper 11 days ago

Thinker: Learning to Think Fast and Slow

Paper • 2505.21097 • Published 19 days ago • 10

upvoted a paper 12 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 13 days ago • 151

upvoted a paper 13 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 16 days ago • 91

upvoted an article 13 days ago

Article

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

13 days ago

• 12

upvoted a paper 19 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published 23 days ago • 77

upvoted an article 19 days ago

Article

AutoThink: Adaptive Reasoning for Large Language Models

•

19 days ago

• 4

upvoted a paper 26 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 29 days ago • 119

upvoted an article 26 days ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

26 days ago

• 22

upvoted an article 29 days ago

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

29 days ago

• 5

upvoted a paper about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 119

upvoted a collection about 1 month ago

Pivotal Token Search

Collection

Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated May 14 • 3

upvoted 3 papers about 1 month ago

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12 • 45

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 25

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 64