Jason233King's picture

Jason233King

Jason233

AI & ML interests

ai for video game assets.

Recent Activity

liked a model 2 days ago
ali-vilab/VACE-LTX-Video-0.9
liked a model 9 days ago
Clybius/Chroma-GGUF
liked a model 9 days ago
hanzla/Falcon3-Mamba-R1-v0
View all activity

Organizations

None yet

Jason233's activity

reacted to hanzla's post with 👍 9 days ago
view post
Post
1913
Hi community,

Few days back, I posted about my ongoing research on making reasoning mamba models and I found great insights from the community.

Today, I am announcing an update to the model weights. With newer checkpoints, the Falcon3 Mamba R1 model now outperforms very large transformer based LLMs (including Gemini) for Formal Logic questions of MMLU. It scores 60% on formal logic which is considered a tough subset of questions in MMLU.

I would highly appreciate your insights and suggestions on this new checkpoint.

Model Repo: hanzla/Falcon3-Mamba-R1-v0

Chat space: hanzla/Falcon3MambaReasoner
reacted to onekq's post with 🔥 13 days ago
view post
Post
3742
Folks, let's get ready.🥳 We will be busy soon. 😅🤗https://github.com/huggingface/transformers/pull/36878