xxx777xxxASD
/

L3-ChaoticSoliloquy-v1.5-4x8B

Text Generation

Mixture of Experts

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

L3-ChaoticSoliloquy-v1.5-4x8B / README.md

xxx777xxxASD's picture

Update README.md

d83b3b8 verified 7 months ago

|

1.44 kB

	---
	license: llama3
	language:
	- en
	tags:
	- moe
	---

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/lbtLGEaqHvJt5nSz4zfHP.png)

	Experimental RP-oriented MoE, the idea was to get a model that would be equal to or better than the Mixtral 8x7B and it's finetunes in RP/ERP tasks.
	Im not sure but it should be better than the [first version](https://huggingface.co/xxx777xxxASD/ChaoticSoliloquy-4x8B)

	### Llama 3 ChaoticSoliloquy-v1.5-4x8B
	```
	base_model: NeverSleep_Llama-3-Lumimaid-8B-v0.1
	gate_mode: random
	dtype: bfloat16
	experts_per_token: 2
	experts:
	- source_model: ChaoticNeutrals_Poppy_Porpoise-v0.7-L3-8B
	- source_model: NeverSleep_Llama-3-Lumimaid-8B-v0.1
	- source_model: openlynn_Llama-3-Soliloquy-8B
	- source_model: Sao10K_L3-Solana-8B-v1
	```


	## Models used

	- [ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B](https://huggingface.co/ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B)
	- [NeverSleep/Llama-3-Lumimaid-8B-v0.1](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1)
	- [openlynn/Llama-3-Soliloquy-8B](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B)
	- [Sao10K/L3-Solana-8B-v1](https://huggingface.co/Sao10K/L3-Solana-8B-v1)


	## Vision

	[llama3_mmproj](https://huggingface.co/ChaoticNeutrals/LLaVA-Llama-3-8B-mmproj)

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/yv4C6NalqORLjvY3KKZk8.png)


	## Prompt format: Llama 3