Is this an MoE?
#5 opened by AlgorithmicKing
If it is, how many active parameters does it have?
It is not. It is Llama 3.3 70B that has been fine-tuned on data generated by DeepSeek-R1.
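For anyone else wondering: a quick way to check this yourself is to inspect the model's config for expert-routing fields, since MoE architectures declare their expert counts there while dense models do not. A minimal sketch, assuming the `transformers` library is installed; the thread doesn't name the repo, so the repo ID below is a guess at the model under discussion and should be swapped for the actual one.

```python
from transformers import AutoConfig

# Assumed repo ID -- replace with the actual model this discussion is about.
repo_id = "deepseek-ai/DeepSeek-R1-Distill-Llama-70B"

config = AutoConfig.from_pretrained(repo_id)

# Field names vary by family: Mixtral uses `num_local_experts`,
# DeepSeek-V2/V3 use `n_routed_experts`, Qwen-MoE uses `num_experts`.
expert_fields = ["num_local_experts", "n_routed_experts", "num_experts"]
found = {f: getattr(config, f) for f in expert_fields if hasattr(config, f)}

if found:
    print(f"Looks like an MoE: {found}")
else:
    print(f"No expert fields found; {config.model_type} appears to be dense.")
```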
Oh, thanks for pointing that out.
AlgorithmicKing changed discussion status to closed