SuperbEmphasis committed 11a0d90 (verified) · 1 parent: d94ce11

Update README.md
![image/png](https://cdn-uploads.huggingface.co/production/uploads/65bc2496ce846f8aa90aacbe/rf8TNQYh39MuwA72gcHcl.png)

2x12B version of Velvet Eclipse 4x12B

You may notice the name change... this may or may not have happened because I was SSHed into my GPU virtual machine from my cell phone's SSH client. Whoops. But I decided to keep it, since I kinda like it.

| Model | Description |
| ----- | ----------- |
| [Velvet Eclipse 2x12B](https://huggingface.co/SuperbEmphasis/Viloet-Eclipse-2x12B-v0.2-MINI) | A slimmer model with the ERP and RP experts. |
| [Velvet Eclipse 2x12B Reasoning](https://huggingface.co/SuperbEmphasis/Viloet-Eclipse-2x12B-v0.2-MINI-Reasoning) | A slimmer model with the ERP and Reasoning experts. |
| [Velvet Eclipse 4x12B Reasoning](https://huggingface.co/SuperbEmphasis/Velvet-Eclipse-4x12B-v0.2) | The full 4x12B-parameter Velvet Eclipse. |

I have been wanting a better RP model for a 24GB Nvidia card. There are some great models out there, but I wanted something I knew I could quantize to Q4 while keeping a large context size, fast responses, and dynamic content. The total is around 30B parameters, but since only 2–3 of the models are active at a time, responses are quite fast!
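A quick back-of-the-envelope sketch of why a ~30B-parameter model fits on a 24 GB card at Q4 (the ~4.5 bits-per-weight figure is my assumption for a typical Q4 quant, not something stated in this card, and real usage also needs room for the KV cache):

```python
# Rough VRAM estimate for quantized weights (a sketch, not a measurement).
def q4_weight_gb(n_params_billion: float, bits_per_weight: float = 4.5) -> float:
    """Approximate memory for quantized weights, in GB.

    1e9 params * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    simplifies to: billions-of-params * bits / 8.
    """
    return n_params_billion * bits_per_weight / 8

# ~30B parameters at ~4.5 bits/weight -> ~16.9 GB of weights,
# leaving headroom for context (KV cache) on a 24 GB card.
print(f"~{q4_weight_gb(30):.1f} GB of weights")
```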

This uses two Mistral Nemo finetunes, each with a separate purpose.