# KernelLLM
On KernelBench-Triton Level 1, our 8B-parameter model matches GPT-4o in single-shot performance, and with multiple inferences KernelLLM matches DeepSeek R1 — all from a model with two orders of magnitude fewer parameters than its competitors.
## Making Kernel Development more accessible with KernelLLM
We introduce KernelLLM, a large language model based on Llama 3.1, which has been trained specifically for the task of authoring GPU kernels using Triton. KernelLLM translates PyTorch modules into Triton kernels and was evaluated on KernelBench-Triton (see [here](https://github.com/ScalingIntelligence/KernelBench/pull/35)).
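To make the translation task concrete, here is a toy example of the kind of PyTorch module KernelLLM takes as input. This module is purely illustrative — the name, operation, and shapes are assumptions, not drawn from KernelBench-Triton — and KernelLLM's job is to emit a Triton kernel with equivalent behavior:

```python
import torch


class AddModel(torch.nn.Module):
    # Toy stand-in for the PyTorch modules KernelLLM consumes; real
    # KernelBench-Triton problems involve heavier ops (matmuls, convolutions).
    def forward(self, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        return x + y


model = AddModel()
out = model(torch.ones(4), torch.ones(4))
print(out)  # tensor([2., 2., 2., 2.])
```

Given source like this, the model generates a Triton implementation that produces the same outputs for the same inputs, which is what the KernelBench-Triton evaluation checks.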