
cloudyu/Mixtral_34Bx2_MoE_60B

Tags: Text Generation · Transformers · Safetensors · mixtral · yi · Mixture of Experts · Eval Results · text-generation-inference
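
The tags above mark this repo as a Transformers-compatible text-generation model distributed as Safetensors. A minimal loading sketch under that assumption follows; the repo id comes from this page, while the dtype, device placement, and prompt are illustrative choices rather than a documented recipe.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "cloudyu/Mixtral_34Bx2_MoE_60B"

# Load tokenizer and model; bf16 and device_map="auto" are assumptions made to fit
# the ~60B-parameter MoE across available GPUs, not settings taken from the model card.
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Simple generation call as a smoke test.
prompt = "Question: what is a mixture-of-experts model?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))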
Community: 16 discussions
• #14 "From your work, I find a new way to do model ensemble" (1 comment), opened about 1 year ago by xxx1

• #12 "Adding Evaluation Results", opened about 1 year ago by leaderboard-pr-bot

• #11 "The function_calling and translation abilities are weaker than Mixtral 8x7B" (1 comment), opened over 1 year ago by bingw5

• #10 "Add mixture of experts tag", opened over 1 year ago by davanstrien

• #9 "How does this model work? Can you share your idea or training process? Thanks", opened over 1 year ago by zachzhou

• #8 "Add merge tag" (👍 2, 2 comments), opened over 1 year ago by osanseviero

• #7 "VRAM" (2 comments), opened over 1 year ago by DKRacingFan

• #6 "Source code and paper?" (👍 2, 8 comments), opened over 1 year ago by josephykwang

• #5 "How does the MoE work?" (👍 1, 3 comments), opened over 1 year ago by PacmanIncarnate

• #4 "Quant pls?" (6 comments), opened over 1 year ago by Yhyu13

• #3 "What is your config?" (👍 1, 1 comment), opened over 1 year ago by Weyaxi

• #2 "Should not be called Mixtral; the models merged into the MoE are Yi-based" (👍 🤝 18 reactions, 9 comments), opened over 1 year ago by teknium

• #1 "Add merge tags" (👍 3), opened over 1 year ago by JusticeDike