Nemotron 253B?
I understand there are supposed to be three models: the 8B, 49B and a 253B model. The other two were released over two weeks ago with the Ultra model listed as 'coming soon'. Is the model still cookin'? Nearly baked? When's it coming out of the oven and sliding it's way onto the bakery track?
I've never been able to run the Llama 405B model, even quantised down all the way I can't get it to run on my 96GB system. I've had to make do with 70B and 123B models, but would love to see how a ~250B model performs. I thought Nemotron was a superb enhancement to base Llama 3, so I'm particulated excited to see what 'Ultra' is like. I've been checking daily in the hopes it might've dropped. Is there an ETA?
Absolutely fantastic! Thank you, I've been waiting with bated breath for this one. :D
My machine (Mac Studio M2 Max 96GB) can support up to about 300B parameters on an IQ2 quant, but there's been nothing interesting over 123B and under under 300B. I know that your enhancements to Llama 3.1 70B had a massive impact on that model. The fact you're enhancing Llama 3.1 400B, whilst shrinking it down to 250B in size so I can run it is quite something.
I like a powerful all-rounder model that is also excellent for role-playing and creative writing, and I know that your enhancements to L3.1 70B were well received by the RP crowd and one of the few models to actually boost overall intelligence over the original to boot.
Just need for folks to GGUF this so I can try it now!