dranger003 committed on
Commit
ad1fa81
·
verified ·
1 Parent(s): 1c24412

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -3,6 +3,8 @@ license: bigcode-openrail-m
  pipeline_tag: text-generation
  library_name: gguf
  ---
+ **NOTE**: This is currently an unsupported model, for testing [PR#5795](https://github.com/ggerganov/llama.cpp/pull/5795)
+
  GGUF quants for https://huggingface.co/bigcode/starcoder2-15b
 
  > StarCoder2-15B model is a 15B parameter model trained on 600+ programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 4+ trillion tokens.