Delta-Vector's picture
Update README.md
96348f8 verified
---
license: apache-2.0
base_model:
- Delta-Vector/MS3.2-Austral-24B-KTO
language:
- en
library_name: transformers
tags:
- roleplay
- finetune
- axolotl
- adventure
- creative-writing
- Mistral
- 24B
datasets:
- PocketDoc/Dans-Prosemaxx-RepRemover-1
---
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Austral 24B Winton</title>
<link href="" rel="stylesheet">
<style>
body {
font-family: 'Roboto Slab', serif;
background: linear-gradient(135deg, #8B4513 0%, #A0522D 25%, #CD853F 50%, #D2691E 75%, #8B4513 100%);
background-size: 400% 400%;
animation: prehistoricShift 20s ease-in-out infinite;
color: #2F1B14;
margin: 0;
padding: 0;
font-size: 16px;
min-height: 100vh;
}
@keyframes prehistoricShift {
0%, 100% { background-position: 0% 50%; }
50% { background-position: 100% 50%; }
}
.container {
margin: 20px;
background: linear-gradient(145deg, #F4E4BC 0%, #DEB887 100%);
padding: 20px;
border-radius: 15px;
box-shadow: 0 8px 25px rgba(0, 0, 0, 0.4), inset 0 2px 5px rgba(255, 255, 255, 0.3);
border: 4px solid #8B4513;
position: relative;
overflow: hidden;
}
.container::before {
content: '';
position: absolute;
top: 0;
left: 0;
right: 0;
bottom: 0;
background-image:
radial-gradient(circle at 20% 80%, rgba(139, 69, 19, 0.1) 0%, transparent 50%),
radial-gradient(circle at 80% 20%, rgba(160, 82, 45, 0.1) 0%, transparent 50%);
pointer-events: none;
}
.header h1 {
font-family: 'Cinzel', serif;
font-size: 32px;
color: #5D2E0C;
margin: 0 0 20px 0;
text-align: center;
text-shadow: 2px 2px 4px rgba(0, 0, 0, 0.3);
letter-spacing: 2px;
position: relative;
}
.section {
margin-top: 30px;
position: relative;
}
.section h2 {
font-family: 'Cinzel', serif;
font-size: 26px;
color: #5D2E0C;
text-align: center;
margin-bottom: 20px;
text-shadow: 1px 1px 2px rgba(0, 0, 0, 0.2);
letter-spacing: 1px;
}
.info p {
color: #2F1B14;
line-height: 1.7;
font-size: 16px;
text-shadow: 0 1px 1px rgba(255, 255, 255, 0.5);
}
.info img {
width: 85%;
border-radius: 12px;
margin: 0 auto 15px;
display: block;
box-shadow: 0 0 25px rgba(0, 0, 0, 0.4);
border: 3px solid #8B4513;
filter: sepia(20%) contrast(110%);
}
a {
color: #5D2E0C;
text-decoration: none;
transition: all 0.3s ease;
font-weight: 500;
}
a:hover {
color: #8B4513;
text-shadow: 1px 1px 2px rgba(0, 0, 0, 0.2);
}
.button {
display: inline-block;
background: linear-gradient(145deg, #CD853F, #D2691E);
color: #2F1B14;
padding: 12px 24px;
border-radius: 8px;
cursor: pointer;
text-decoration: none;
transition: all 0.3s ease;
box-shadow: 0 4px 8px rgba(0, 0, 0, 0.2);
border: 2px solid #8B4513;
}
.button:hover {
background: linear-gradient(145deg, #D2691E, #CD853F);
box-shadow: 0 6px 15px rgba(139, 69, 19, 0.4);
transform: translateY(-2px);
}
pre {
background: linear-gradient(145deg, #F5DEB3, #DEB887);
padding: 20px;
border-radius: 8px;
overflow-x: auto;
border: 2px solid #8B4513;
box-shadow: inset 0 2px 5px rgba(0, 0, 0, 0.1);
}
code {
font-family: 'Courier New', monospace;
color: #2F1B14;
}
.info-card {
background: linear-gradient(145deg, #F5DEB3, #DEB887);
border: 3px solid #8B4513;
border-radius: 12px;
overflow: hidden;
box-shadow: 0 6px 15px rgba(0, 0, 0, 0.2);
}
.info-header {
background: linear-gradient(145deg, #CD853F, #D2691E);
padding: 25px;
border-bottom: 2px solid #8B4513;
}
.info-header h3 {
font-family: 'Cinzel', serif;
color: #2F1B14;
margin: 0 0 15px 0;
font-size: 22px;
text-align: center;
text-shadow: 1px 1px 2px rgba(0, 0, 0, 0.2);
letter-spacing: 1px;
}
.model-tags {
display: flex;
gap: 10px;
flex-wrap: wrap;
justify-content: center;
}
.model-tag {
background: linear-gradient(145deg, #DEB887, #CD853F);
color: #2F1B14;
padding: 6px 12px;
border-radius: 6px;
font-size: 12px;
border: 2px solid #8B4513;
font-weight: 500;
box-shadow: 0 2px 4px rgba(0, 0, 0, 0.1);
}
.model-composition {
padding: 25px;
border-bottom: 2px solid #8B4513;
}
.model-composition h4 {
font-family: 'Cinzel', serif;
color: #5D2E0C;
margin: 0 0 20px 0;
font-size: 18px;
text-align: center;
letter-spacing: 1px;
}
.composition-list {
list-style: none;
padding: 0;
margin: 0;
display: grid;
gap: 15px;
}
.composition-list li {
color: #2F1B14;
display: flex;
align-items: baseline;
gap: 12px;
padding: 10px;
background: rgba(245, 222, 179, 0.5);
border-radius: 6px;
border-left: 4px solid #8B4513;
}
.model-component {
font-weight: 600;
min-width: 120px;
}
.model-description {
padding: 25px;
background: linear-gradient(145deg, #F5DEB3, #F4E4BC);
}
.metrics-section {
margin-bottom: 30px;
}
.metrics-section details {
background: linear-gradient(145deg, #F5DEB3, #DEB887);
border: 3px solid #8B4513;
border-radius: 10px;
padding: 20px;
margin-bottom: 20px;
box-shadow: 0 4px 10px rgba(0, 0, 0, 0.2);
}
.metrics-section summary {
font-family: 'Cinzel', serif;
color: #5D2E0C;
font-size: 18px;
cursor: pointer;
outline: none;
padding: 10px 0;
text-align: center;
font-weight: 500;
letter-spacing: 1px;
}
.creator-section {
margin: 25px 0;
text-align: center;
}
.creator-badge {
display: inline-flex;
align-items: center;
background: linear-gradient(145deg, #CD853F, #D2691E);
border: 3px solid #8B4513;
border-radius: 10px;
padding: 15px 20px;
box-shadow: 0 4px 10px rgba(0, 0, 0, 0.2);
}
.creator-label {
color: #2F1B14;
font-size: 14px;
margin-right: 10px;
font-weight: 500;
}
.creator-link {
display: flex;
align-items: center;
gap: 8px;
color: #2F1B14;
text-decoration: none;
transition: all 0.3s ease;
}
.creator-name {
font-weight: 600;
}
.creator-arrow {
font-size: 16px;
transition: transform 0.3s ease;
}
.creator-link:hover .creator-arrow {
transform: translateX(5px);
}
.link-arrow {
display: inline-block;
transition: transform 0.3s ease;
}
a:hover .link-arrow {
transform: translateX(3px);
}
.axolotl-container {
text-align: center;
margin: 35px 0;
}
.axolotl-container img {
max-width: 300px;
border-radius: 10px;
box-shadow: 0 6px 15px rgba(0, 0, 0, 0.3);
border: 3px solid #8B4513;
filter: sepia(30%) contrast(110%);
}
.fossil-texture {
position: relative;
}
.fossil-texture::after {
content: '';
position: absolute;
top: 0;
left: 0;
right: 0;
bottom: 0;
background-image:
radial-gradient(circle at 25% 25%, rgba(139, 69, 19, 0.05) 2px, transparent 2px),
radial-gradient(circle at 75% 75%, rgba(160, 82, 45, 0.05) 1px, transparent 1px);
background-size: 50px 50px, 30px 30px;
pointer-events: none;
}
</style>
</head>
<body>
<div class="container fossil-texture">
<div class="header">
<h1>Austral 24B Winton</h1>
</p>
</div>
<div class="info">
<img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jxUvuFK1bdOdAPiYIcBW5.jpeg" alt="Model banner">
<div style="text-align: center;">
<div class="creator-section">
<div class="creator-badge">
<span class="creator-label">Trained by</span>
<a href="https://huggingface.co/Delta-Vector" target="_blank" class="creator-link">
<span class="creator-name">Delta-Vector</span>
</a>
</div>
</div>
<div class="model-info">
<h2>Overview</h2>
<div class="info-card">
<div class="info-header">
<h3>Austral 24B - Winton</h3>
<div class="model-tags">
<span class="model-tag">Codex Finetune</span>
<span class="model-tag">Mistral Based</span>
<span class ="model-tag">KTO enhanced</span>
<span class ="model-tag">Adventure/Roleplay generalist</span>
<span class="model-tag">24B Sized model</span>
</div>
</div>
<div class="model-description">
<p style="font-weight: bold; font-style: italic;">More than 1.5-metres tall, about six-metres long and up to 1000-kilograms heavy, Australovenator Wintonensis was a fast and agile hunter. The largest known Australian theropod.</p>
<p>This is a finetune of Codex 24B to be a generalist Roleplay/Adventure model. I've removed some of the "slops" that i noticed in an otherwise great model aswell as improving the general writing of the model, This was a multi-stage finetune, all previous checkpoints are released aswell. In testing it has shown to be a great model for Adventure cards & Roleplay, Often pushing the plot forward better then other models, While avoiding some of the slops you'd find in models from Drummer and Co.</p>
<p>Support my finetunes / Me on Kofi: https://Ko-fi.com/deltavector | Thank you to Auri for helping/Testing ♥</p>
</div>
</div>
</div>
<div class="section">
<h2>Quants</h2>
<div class="info-card">
<div class="model-composition">
<h4>Quants Formats</h4>
<ul class="composition-list">
<li><span class="model-component"><a href="" target="_blank">GGUF</a></span>For use with LLama.cpp & Forks (coming soon!))</li>
<li><span class="model-component"><a href="" target="_blank">EXL3</a></span>For use with TabbyAPI (Coming soon!)</li>
</ul>
</div>
</div>
</div>
<div class="section">
<h2>Chat Format</h2>
<p>This model utilizes ChatML.</p>
<pre><code><|im_start|>user
Hi there!<|im_end|>
<|im_start|>assistant
Nice to meet you!<|im_end|>
<|im_start|>user
Can I ask a question?<|im_end|>
<|im_start|>assistant</code></pre>
</div>
<div class="section">
<h2>Training</h2>
<p>As the the Austral/Francois tradition, I built off another great finetune Codex-24B, I did 4 epochs ontop with roughly the same datamix as Francois-Huali/Austral 70B as a R128 Lora, then KTO alignment with a mix of Instruct/Small writing datasets and then finally another 4 epoch SFT with Rep_remover (Thanks Pocket!)</p>
<div class="metrics-section">
<details>
<summary>Config(Post-KTO SFT)</summary>
<pre><code>https://wandb.ai/new-eden/austral/runs/i85da0c6?nw=nwuserdeltavector</code></pre>
</details>
</div>
</div>
<div class="section">
<h2Fossil Formation Process</h2>
<p>This model was trained over 4 epochs using 8 x A100s (Ty to my work, Cognitive Computations) for the base SFT, Then i used KTO to clean up some coherency issues for 1 epoch, then finally training for another 4 epochs on Rep_Remover to delete slops. Total was roughly 80 hours total.</p>
<p style="text-align: center; margin-top: 20px;">
<div class="axolotl-container">
<a href="https://github.com/OpenAccess-AI-Collective/axolotl" target="_blank">
<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl">
</a>
</div>
<div class="section">
<h2>Credits</h2>
<p>TYSM to my friends: Auri, Lucy, Trappu, Alicat, Kubernetes Bad, Intervitens, NyxKrage & Kalomaze</p>
</p>
</div>
</div>
</div>
</div>
</div>
</body>
</html>