todo

make a model card and put a cute girl on it

some info

Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card.

Short list of things to know:

  • This is a bunch of RP, story writing, etc. creative data applied to ToastyPigeon/ms3-roselily-instruct.
  • Instruct format: ChatML or Alpaca preferred, Tekken v7 possible
  • ChatML tokens were assigned to unused tokens 20 and 21, this leaves all the tekken tokens intact so merges w/ tekken models are feasible
  • Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order)
  • Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers

chat templates

You may need to set <|im_end|> and/or </s> as stopping strings depending on which format you're using, the model generates both properly but tokenizers can be finicky about what they stop on by default

Alpaca w/ System

### System:
{system prompt}

### Instruction:
{user message}

### Response:
{model answer}</s>

ChatML

<|im_start|>system
{system prompt}<|im_end|>
<|im_start|>user
{user message}<|im_end|>
<|im_start|>assistant
{model answer}<|im_end|>

Also saw some completion training in chat mode and adventure mode.

Downloads last month
87
Safetensors
Model size
23.6B params
Tensor type
FP16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for allura-org/MS3-24B-Roselily-Creative

Finetuned
(2)
this model
Merges
2 models
Quantizations
5 models