Asankhaya Sharma's picture

Asankhaya Sharma PRO

codelion

·

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

updated a dataset about 8 hours ago

codelion/Qwen3-0.6B-icm

liked a dataset 1 day ago

sumuks/essential-web-v1.0-sample-10M

new activity 1 day ago

EssentialAI/essential-web-v1.0:sample

View all activity

Organizations

Posts 19

Post

2235

🚀 Just published: "OpenEvolve: Open-Source Evolutionary Code Optimization with Real-World GPU Kernel Discovery"

We built the first open-source implementation of Google's AlphaEvolve system and used it to automatically discover GPU kernel optimizations that outperform human engineers!

Key results:

- 21.8% average decode speed improvement on Apple Silicon
- 36.7% improvement on long-context transformer attention
- Discovered novel vectorization patterns and 2-pass softmax algorithm

The system evolved a Metal kernel for Qwen3's Grouped Query Attention from a basic 3-pass implementation into something with sophisticated Apple Silicon optimizations that would take experts months to discover manually. The evolved kernel automatically found the optimal vec<T,8> operations for 128-dim attention heads and fused softmax computation with value accumulation.

Really excited about the potential here - imagine evolutionary algorithms automatically discovering optimizations across all our AI infrastructure. What would you want to optimize with this approach?

Full write-up: https://huggingface.co/blog/codelion/openevolve-gpu-kernel-discovery

GitHub: https://github.com/codelion/openevolve

#AI #MachineLearning #GPU #OpenSource #Evolution #CodeOptimization #TransformerOptimization

Articles 6

Article

16

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

View all Articles

Collections 2

Papers 5

arxiv:2506.08060

arxiv:2501.14249

arxiv:2407.18521

arxiv:2407.16557

spaces 13

InvestmentAnalysis

Simulate investment outcomes using stock data and market probabilities

LLMFeed

Generate TikTok ideas and content from text input

Safety Copilot

Ask about any health & safety related queries

Tablut

Play Tablut against AI

Videoanalysis

Upload and analyze MP4 video to extract key frames and summary

Svg2png

Convert SVG to PNG with specified dimensions

models 18

codelion/Qwen3-0.6B-icm-rm

Text Classification • 0.6B • Updated 3 days ago • 6

codelion/gemma-3-1b-it-icm-sft-lora

Updated 6 days ago

codelion/gemma-3-1b-it-icm-sft

Text Generation • 1.0B • Updated 6 days ago • 19

codelion/gemma-3-1b-it-icm-sft-mlx-fp16

Text Generation • 1B • Updated 6 days ago • 229

codelion/DeepSeek-R1-Distill-Qwen-1.5B-PTS-DPO

Text Generation • 2B • Updated May 13 • 4 • 2

codelion/Qwen3-0.6B-PTS-DPO

Text Generation • 0.6B • Updated May 12 • 18 • 1

codelion/Qwen3-0.6B-PTS-DPO-LoRA

Updated May 7 • 1

codelion/optillm-bert-uncased

0.3B • Updated Feb 16 • 31 • 5

codelion/optillm-modernbert-large

0.4B • Updated Feb 16 • 45 • 8

codelion/Llama-3.3-70B-o1

Text Generation • 71B • Updated Jan 21 • 83 • • 2

datasets 13

codelion/Qwen3-0.6B-icm

Viewer • Updated about 8 hours ago • 100 • 1

codelion/Qwen3-0.6B-pts-dpo-pairs

Viewer • Updated May 19 • 681 • 35 • 2

codelion/Qwen3-0.6B-pts-steering-vectors

Viewer • Updated May 19 • 1.38k • 140 • 4

codelion/Qwen3-0.6B-pts

Viewer • Updated May 19 • 1.38k • 33 • 2

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-steering-vectors

Preview • Updated May 13 • 301 • 1

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts

Preview • Updated May 13 • 13 • 1

codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts-dpo-pairs

Preview • Updated May 13 • 23 • 1

codelion/math500-cot-experiment

Viewer • Updated Apr 30 • 1.5k • 52 • 5

codelion/optillmbench

Viewer • Updated Apr 15 • 500 • 103 • 5

codelion/distilled-QwQ-32B-fineweb-edu

Preview • Updated Apr 13 • 19 • 1

View 13 datasets