Delta-Vector committed
Commit 1990840 · verified · 1 Parent(s): 44ca624

Update README.md

Files changed (1)
README.md +48 -26
README.md CHANGED
@@ -1,34 +1,56 @@
- ---
- base_model: []
- library_name: transformers
- tags:
- - mergekit
- - merge
- ---

- # 24b

- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

- ## Merge Details
- ### Merge Method

- This model was merged using the Passthrough merge method using /home/quixi/storage/models/Gryphe-Codex-24B-Small-3.2 + /home/quixi/storage/models/NewEden-Austral-24B-Codex-Lora as a base.

- ### Models Merged

- The following models were included in the merge:

- ### Configuration

- The following YAML configuration was used to produce this model:

- ```yaml
- base_model: /home/quixi/storage/models/Gryphe-Codex-24B-Small-3.2+/home/quixi/storage/models/NewEden-Austral-24B-Codex-Lora
- dtype: bfloat16
- merge_method: passthrough
- models:
-   - model: /home/quixi/storage/models/Gryphe-Codex-24B-Small-3.2+/home/quixi/storage/models/NewEden-Austral-24B-Codex-Lora
- ```
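As an aside on the merge recipe being removed above: here is a minimal, untested sketch of how a passthrough merge like this could be re-run through mergekit's Python entry point (mergekit also ships a `mergekit-yaml` CLI). The config filename, output path, and option values are placeholder assumptions, not part of this commit.

```python
# Hedged sketch: reproduce a mergekit passthrough merge from Python.
# "config.yml" stands in for the YAML shown above; paths are assumptions.
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("config.yml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    out_path="./merged-24b",      # where the merged weights land (assumed name)
    options=MergeOptions(
        copy_tokenizer=True,      # carry the base model's tokenizer over
        lazy_unpickle=False,
        low_cpu_memory=False,
    ),
)
```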
 
+ # Wot is this

+ Just another checkpoint; it's better to use the -Winton model, but this one is released to keep in line with being an actual OSS finetuner. (Unlike some others who don't release datasets or checkpoints!)

+ This is the SFT part of the MS3.2 train on top of Codex.

+ Wandb: https://wandb.ai/gum1h0x/austral/artifacts/axolotl-config/config-4hspge7d/v0/files/axolotl_config_ept225f_.yml
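If you want the full training config rather than just the link, a small hedged sketch of fetching that artifact with the wandb API; the artifact path ("gum1h0x/austral/config-4hspge7d:v0") is transcribed from the URL above and is an assumption.

```python
# Hedged sketch: download the axolotl config artifact linked above.
# Entity/project/artifact name and version are read off the Wandb URL;
# adjust if the actual artifact path differs.
import wandb

api = wandb.Api()
artifact = api.artifact("gum1h0x/austral/config-4hspge7d:v0", type="axolotl-config")
local_dir = artifact.download()  # folder containing axolotl_config_ept225f_.yml
print(local_dir)
```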
 
+ Datasets:

+ ```yaml
+ datasets:
+   - path: Delta-Vector/Hydrus-Claude-Instruct-2.7K
+     type: dan-chat-advanced
+   - path: Delta-Vector/Hydrus-Claude-Instruct-5K
+     type: dan-chat-advanced
+   - path: Delta-Vector/Orion-Shoujo-AI-Filtered-ShareGPT
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Personamaxx-VN
+     type: dan-chat-advanced
+   - path: NewEden/LIMARP-Complexity
+     type: dan-chat-advanced
+   - path: NewEden/PIPPA-Mega-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/OpenCAI-ShareGPT
+     type: dan-chat-advanced
+   - path: NewEden/Creative_Writing-Complexity
+     type: dan-chat-advanced
+   - path: NewEden/Light-Novels-Roleplay-Logs-Books-Oh-My-duplicate-turns-removed
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Failuremaxx-Adventure-3
+     type: dan-chat-advanced
+   - path: NewEden/Books-V2-ShareGPT
+     type: dan-chat-advanced
+   - path: NewEden/Deepseek-V3-RP-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/Final-Alpindale-LNs-ShareGPT
+     type: dan-chat-advanced
+   - path: NewEden/DeepseekRP-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/RP-logs-V2-Experimental
+     type: dan-chat-advanced
+   - path: anthracite-org/kalo_opus_misc_240827
+     type: dan-chat-advanced
+   - path: anthracite-org/kalo_misc_part2
+     type: dan-chat-advanced
+   - path: NewEden/Storium-Prefixed-Clean
+     type: dan-chat-advanced
+   - path: Delta-Vector/Hydrus-AM-Thinking-IF
+     type: dan-chat-advanced
+ ```
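All of these are public Hugging Face datasets, so here is a quick hedged sketch of inspecting one of them with the `datasets` library. The split name and column layout are assumptions; ShareGPT-style sets usually keep turns in a `conversations` column.

```python
# Hedged sketch: inspect one of the SFT datasets listed above.
from datasets import load_dataset

ds = load_dataset("NewEden/OpenCAI-ShareGPT", split="train")
print(ds)     # row count and column names
print(ds[0])  # first example; exact fields depend on the dataset
```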
+ TYSM to gum1h0x for sponsoring the run. Trained on 1xB200 for 30 hours.

+ https://x.com/gum1h0x