DavidAU
/

Llama-3.1-1million-ctx-Dark-Planet-v1.01-8B

Text Generation

text-generation-inference

Model card Files Files and versions Community

DavidAU commited on 4 days ago

Commit

7fdbb4a

·

verified ·

1 Parent(s): b494318

Update README.md

Files changed (1) hide show

README.md +6 -0

README.md CHANGED Viewed

@@ -15,6 +15,12 @@ base_model:
 This repo contains the full precision source code, in "safe tensors" format to generate GGUFs, GPTQ, EXL2, AWQ, HQQ and other formats.
 The source code can also be used directly.
 NOTE: If you intend to make GGUF quants, it is suggested to make the master file in float32 ("f32") then quant from this file due
 to float 32 components / models in this merge.

 This repo contains the full precision source code, in "safe tensors" format to generate GGUFs, GPTQ, EXL2, AWQ, HQQ and other formats.
 The source code can also be used directly.
+"V1.01" has modifications to address some issues related to non-stop/overly long gen and/or repeat "end paragraph" issues. I am keeping the org quants too, because of the difference in
+creative generation between the two versions is very strong. I am not saying "reg" is better than "v1.01", they are
+just different, and you should have the choice between both in my opinion.
+The "GGUF" link at the bottom of the page links to repo with both V1.01 and "reg" quants in the repo.
 NOTE: If you intend to make GGUF quants, it is suggested to make the master file in float32 ("f32") then quant from this file due
 to float 32 components / models in this merge.