The development of SnowflakeCore-G1-7B-MoE is getting delayed. In the meantime, I am working on SnowflakeCore-G1-1B-MoE, which will be a pre-trained chatbot.
Hello! Important announcement: I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2 because it will have the same parameter count as the Tiny version, but this one is trained on more data.
SnowflakeCore-G1 Update: Got it running and training! Context window is currently set to 2048 tokens. Training is active and stable. Will share results once I have some metrics to report.
SnowflakeCore-G1 development update: We're building a 24-layer transformer with a 32K context window and 1024 embedding dimensions - pretty ambitious! Even running at batch_size=1 with heavy gradient accumulation, we're hitting memory walls at 300GB of RAM. Scaling up to ~1TB will take some time, but the architecture is looking promising. Thanks for following along with the journey!
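For a rough sense of scale, here is a back-of-the-envelope parameter count for the architecture described above. The vocab size of 50,000 and the 4x FFN multiplier are my own assumptions, not confirmed specs; the 300GB memory wall likely comes from long-context activations and optimizer state rather than the weights themselves, which are comparatively small:

```python
# Rough parameter-count sketch for a 24-layer, d_model=1024 transformer.
# Vocab size (50,000) and FFN multiplier (4x) are assumptions.

def transformer_params(n_layers, d_model, vocab_size, ffn_mult=4):
    """Approximate parameter count, ignoring biases and layer norms."""
    attn = 4 * d_model ** 2            # Q, K, V, and output projections
    ffn = 2 * ffn_mult * d_model ** 2  # up- and down-projections
    embed = vocab_size * d_model       # token embedding table
    return n_layers * (attn + ffn) + embed

total = transformer_params(24, 1024, 50_000)
print(f"{total / 1e6:.0f}M parameters")  # ~353M
```

At 32K context, the attention score matrices alone grow with the square of the sequence length per layer, which is where long-context training memory tends to explode even at batch size 1.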
Hello there! I just found out that all the SnowflakeCore-G0 series models are Masked Language Models instead of LLMs. The development of SnowflakeCore-G0-Release-3 will be delayed even further.
Edit: I am officially ending development of SnowflakeCore-G0 and starting development of SnowflakeCore-G1, which SHOULD be a text generator.
Edit-2: After some evaluation of the code, the models actually are Text Generators after all. So the development of G0 will continue.
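For anyone wanting to run the same check on their own checkpoints: the `architectures` field in a Hugging Face `config.json` usually tells the two model families apart (class names containing `ForMaskedLM` are BERT-style masked models; `ForCausalLM` or `LMHeadModel` are autoregressive text generators). A minimal sketch of that heuristic (the function name is my own, not part of any library):

```python
# Heuristic: classify a checkpoint by the class names listed in the
# "architectures" field of its config.json. Function name is illustrative.

def classify_lm(architectures):
    for arch in architectures:
        if "ForMaskedLM" in arch:
            return "masked-lm"   # BERT-style, fills in [MASK] tokens
        if "ForCausalLM" in arch or "LMHeadModel" in arch:
            return "causal-lm"   # autoregressive text generator
    return "unknown"

print(classify_lm(["GPT2LMHeadModel"]))  # causal-lm
print(classify_lm(["BertForMaskedLM"]))  # masked-lm
```

This is only a naming convention check, but it would have caught the G0 mix-up without needing to load any weights.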
Hi everyone! The release of https://huggingface.co/FlameF0X/SnowflakeCore-G0-Release-3-1B is currently delayed due to hardware limitations: I'm currently lacking the compute resources needed to complete training. I'm exploring options and will keep you updated on any progress. Thank you for your patience and support!