AI & ML interests

Smol

Recent Activity

FlameF0X new activity 12 days ago
2F-AI/Titan-Medium: how to do this
FlameF0X updated a model 27 days ago
2F-AI/Titan-Medium
FlameF0X published a model 27 days ago
2F-AI/Titan-Medium

FlameF0X posted an update 2 days ago
The training for SnowflakeCore-G1-1B and 7B will be restarted because I have now implemented DeepSpeed and managed to use two GPUs.
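
For readers wondering what a two-GPU DeepSpeed setup can look like, here is a minimal sketch of a config builder. It is not the SnowflakeCore training code: the helper name `make_deepspeed_config`, the ZeRO stage, the fp16 setting, and all numeric values are assumptions chosen for illustration. The one relationship it relies on is that DeepSpeed expects `train_batch_size` to equal the per-GPU micro batch size times the gradient accumulation steps times the number of GPUs.

```python
import json


def make_deepspeed_config(micro_batch_per_gpu: int, grad_accum_steps: int, num_gpus: int) -> dict:
    """Build a minimal DeepSpeed-style config dict (illustrative values only)."""
    return {
        # Must equal micro batch * accumulation steps * number of GPUs.
        "train_batch_size": micro_batch_per_gpu * grad_accum_steps * num_gpus,
        "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
        "gradient_accumulation_steps": grad_accum_steps,
        # ASSUMPTION: ZeRO stage 2 and fp16 are picked only as an example.
        "zero_optimization": {"stage": 2},
        "fp16": {"enabled": True},
    }


# ASSUMPTION: batch settings are hypothetical, not taken from the post.
config = make_deepspeed_config(micro_batch_per_gpu=1, grad_accum_steps=32, num_gpus=2)
print(json.dumps(config, indent=2))
```

A config like this is usually saved as JSON and passed to a training script started with the `deepspeed` launcher, for example `deepspeed --num_gpus=2 train.py`, where `train.py` is a hypothetical script name.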
FlameF0X posted an update 9 days ago
The development of SnowflakeCore-G1-7B-MoE is getting delayed. In the meantime, I am working on SnowflakeCore-G1-1B-MoE, which will be a pre-trained chatbot.
FlameF0X in 2F-AI/Titan-Medium 12 days ago

how to do this

4
#1 opened 14 days ago by Blazgo
FlameF0X posted an update 13 days ago
An update on the development of SnowflakeCore-G1-7B-MoE: I can't say yet when it will be published, because it's big and requires a lot of computational power.
FlameF0X posted an update 18 days ago
FlameF0X posted an update 20 days ago
Hello! Important announcement: I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2 because it is going to have the same number of parameters as the Tiny version, but it is trained on more data.
FlameF0X posted an update 22 days ago
Currently working on SnowflakeCore-G1-Medium. [Updated loss curve]
FlameF0X posted an update 25 days ago
Hello there world! I am happy to announce that you can now fine-tune FlameF0X/SnowflakeCore-G1-Tiny; the code for that is in the model card.

I also lost the training log 😐
FlameF0X posted an update 29 days ago
Hello! I am sad to say that fine-tuning FlameF0X/SnowflakeCore-G1-Tiny is complicated, so the instruct version will have to wait some time.
FlameF0X updated a Space about 1 month ago
FlameF0X posted an update about 1 month ago
FlameF0X posted an update about 1 month ago
SnowflakeCore-G1 Update:
Got it running and training! Context window is currently set to 2048 tokens.
Training is active and stable. Will share results once I have some metrics to report.
FlameF0X posted an update about 1 month ago
SnowflakeCore-G1 development update: We're building a 24-layer transformer with 32K context and 1024 embedding dimensions - pretty ambitious! Even running at batch_size=1 with heavy gradient accumulation, we're hitting memory walls at 300GB RAM. Scaling up to ~1TB will take some time, but the architecture is looking promising. Thanks for following along with the journey! 😅
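
A rough back-of-the-envelope sketch of why a 32K context can hit memory walls even at batch_size=1. It is only an estimate under stated assumptions, not a measurement of the actual model: it assumes naive attention that materializes the full score matrix, 16 attention heads (hypothetical, the post does not give a head count), float32 values, a 4x-wide MLP, and no activation checkpointing. The layer count, context length, and embedding size come from the post.

```python
def attention_scores_bytes(seq_len: int, num_heads: int, bytes_per_value: int) -> int:
    """Bytes to hold one layer's full (seq_len x seq_len) attention scores for all heads."""
    return seq_len * seq_len * num_heads * bytes_per_value


def approx_block_params(num_layers: int, d_model: int) -> int:
    """Rough block parameter count: ~12 * d_model^2 per layer
    (4*d^2 attention projections + 8*d^2 for a 4x-wide MLP),
    ignoring biases, layer norms, and embeddings."""
    return 12 * num_layers * d_model * d_model


SEQ_LEN = 32 * 1024  # "32K context" from the post
NUM_LAYERS = 24      # from the post
D_MODEL = 1024       # from the post
NUM_HEADS = 16       # ASSUMPTION: head count is not stated in the post
BYTES_FP32 = 4       # ASSUMPTION: float32 activations
GIB = 1024 ** 3

per_layer = attention_scores_bytes(SEQ_LEN, NUM_HEADS, BYTES_FP32)
all_layers = per_layer * NUM_LAYERS  # scores kept for the backward pass

print(f"block parameters: ~{approx_block_params(NUM_LAYERS, D_MODEL) / 1e6:.0f}M")
print(f"attention scores per layer: {per_layer / GIB:.0f} GiB")
print(f"attention scores across all layers: {all_layers / GIB:.0f} GiB")
```

Under these assumptions the weights are small (about 302M block parameters), while the attention scores alone come to 64 GiB per layer. Gradient accumulation does not reduce this, because the cost comes from sequence length and not from batch size.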
FlameF0X posted an update about 2 months ago
Hello there!
I just found out that all the SnowflakeCore-G0 series models are masked language models instead of LLMs.
The development of SnowflakeCore-G0-Release-3 will be delayed even more.

Edit: I am officially ending the development of SnowflakeCore-G0 and starting the development of SnowflakeCore-G1, which SHOULD be the text generator.

Edit-2: After some evaluation of the code, the models are actually text generators, so the development of G0 will continue.
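
For context, one common way the two model types differ in code is the attention mask: a text generator (causal language model) lets each position attend only to itself and earlier positions, while a masked language model typically attends in both directions. The sketch below is a generic illustration of that difference, not taken from the SnowflakeCore code; the function name `attention_mask` is hypothetical.

```python
def attention_mask(seq_len: int, causal: bool) -> list[list[int]]:
    """Return a seq_len x seq_len mask; mask[i][j] == 1 means position i may attend to j."""
    return [
        [1 if (not causal or j <= i) else 0 for j in range(seq_len)]
        for i in range(seq_len)
    ]


causal_mask = attention_mask(4, causal=True)          # text generator style
bidirectional_mask = attention_mask(4, causal=False)  # masked language model style

for row in causal_mask:
    print(row)
```

Checking which kind of mask a model's attention layers apply is one way to tell whether it was built as a generator or as a masked language model.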
FlameF0X posted an update about 2 months ago
Hi everyone!
The release of https://huggingface.co/FlameF0X/SnowflakeCore-G0-Release-3-1B is currently delayed due to hardware limitations: I'm currently lacking the compute resources needed to complete training. I'm exploring options and will keep you updated on any progress.
Thank you for your patience and support!