Update README.md
Browse files
README.md
CHANGED
@@ -33,9 +33,9 @@ We finetuned Llama3.1-8B-Instruct on the created dataset using supervised instru
 | Model | Parameters (B) | Score | Pass@k |
 |-------|---------------|-------|--------|
-| KernelLLM | 8 |
-| KernelLLM | 8 |
-| KernelLLM | 8 |
 | DeepSeek V3 | 671 | 16 | 1 |
 | GPT-4o | ~200 | 15 | 1 |
 | Qwen2.5 | 32 | 15 | 1 |
@@ -127,7 +127,7 @@ model = KernelLLM()
 model.stream_raw("Your prompt here", max_new_tokens=2048)

 # Generate raw text without the Triton-specific prompt template
-raw_output = model.generate_raw("Your prompt here", temperature=0
 ```

 ## Current Limitations and Future Work
|
|
 | Model | Parameters (B) | Score | Pass@k |
 |-------|---------------|-------|--------|
+| KernelLLM | 8 | 20.2 | 1 |
+| KernelLLM | 8 | 51.8 | 10 |
+| KernelLLM | 8 | 57.1 | 20 |
 | DeepSeek V3 | 671 | 16 | 1 |
 | GPT-4o | ~200 | 15 | 1 |
 | Qwen2.5 | 32 | 15 | 1 |
|
|
 model.stream_raw("Your prompt here", max_new_tokens=2048)

 # Generate raw text without the Triton-specific prompt template
+raw_output = model.generate_raw("Your prompt here", temperature=1.0, max_new_tokens=2048)
 ```

 ## Current Limitations and Future Work