xxx777xxxASD
/

ChaoticSoliloquy-4x8B

Text Generation

Mixture of Experts

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ChaoticSoliloquy-4x8B / README.md

432653dfg's picture

Update README.md

1cdc5b9 verified 7 months ago

|

1.07 kB

metadata

license: llama3
language:
  - en
tags:
  - moe

(Maybe i'll change the waifu picture later)

Experimental RP-oriented MoE, the idea was to get a model that would be equal to or better than the Mixtral 8x7B and it's finetunes in RP/ERP tasks.

ChaoticSoliloquy-4x8B

base_model: jeiku_Chaos_RP_l3_8B
gate_mode: random
dtype: bfloat16
experts_per_token: 2
experts:
  - source_model: ChaoticNeutrals_Poppy_Porpoise-v0.6-L3-8B
  - source_model: jeiku_Chaos_RP_l3_8B
  - source_model: openlynn_Llama-3-Soliloquy-8B
  - source_model: Sao10K_L3-Solana-8B-v1

Models used

Prompt format: Llama 3