alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

Recent Activity

updated a dataset 23 days ago

SubliminalMisalignment/abliterated-distill-30k

published a dataset 23 days ago

SubliminalMisalignment/abliterated-distill-30k

updated a dataset 24 days ago

SubliminalMisalignment/safe-distill-30k

Organizations

reacted to ovi054's post with 🔥🚀 about 1 month ago

Post

2502

Z-Image Turbo + LoRA ⚡

ovi054/Z-Image-LORA

Z-Image Turbo is the No. 1 trending Text-to-Image model right now. You can add a custom LoRA and generate images with this Space.

👉 Try it now: ovi054/Z-Image-LORA

3 replies

reacted to mitkox's post with 👍🚀 about 1 month ago

Post

2344

Got to 1199.8 tokens/sec with Devstral Small -2 on my desktop GPU workstation. vLLM nightly.
Works out of the box with Mistral Vibe. Next, it's time to test the big one.

3 replies

reacted to branikita's post with 🚀 about 2 months ago

Post

3276

Proud to share the results of our engineering team’s recent work at Robonine:

• Together, we applied advanced topology optimization to redesign critical brackets of the manipulator, achieving a 57–76% reduction in structural deflection.

• Our updated model also demonstrated a major stress decrease — from 93 MPa down to 25 MPa — all while staying within the allowed weight increase.

• Although we didn’t fully reach the target tip deviation of 0.3 mm (best achieved: 0.41 mm), the project gave us valuable insights and a solid foundation for the next design iteration.

replied to mrfakename's post 3 months ago

Whaaaaa damn, that's really good!

reacted to mrfakename's post with 🔥 3 months ago

Post

6152

Trained a model for emotion-controllable TTS based on MiMo audio on LAION's dataset.

Still very early and does have an issue with hallucinating but results seem pretty good so far, given that it is very early into the training run.

Will probably kick off a new run later with some settings tweaked.

Put up a demo here: https://huggingface.co/spaces/mrfakename/EmoAct-MiMo

(Turn 🔊 on to hear audio samples)

5 replies

reacted to sourceoftruthdata's post with ❤️🤗 3 months ago

Post

3426

What a fantastic community!

1 reply

reacted to AdinaY's post with 🔥 3 months ago

Post

1892

Glyph 🔥 a framework that scales context length by compressing text into images and processing them with vision–language models, released by Z.ai.

Paper: https://huggingface.co/papers/2510.17800
Model: https://huggingface.co/zai-org/Glyph

✨ Compresses long sequences visually to bypass token limits
✨ Reduces computational and memory costs
✨ Preserves meaning through multimodal encoding
✨ Built on GLM-4.1V-9B-Base

reacted to appvoid's post with 👍 3 months ago

Post

4099

today is going to be a great day for small models, are you ready?

3 replies

reacted to s3nh's post with 🔥 3 months ago

Post

604

EduHelp with more empathy, based on a model fine-tuned on
psychotherapeutic preferences, just landed on

Beck-8B as a base model, 13000 steps on educational dataset.
Time to go further and build more 🥰
s3nh/EduHelp_Beck_8B
Thanks to @basilic_ai for computations <3

reacted to merve's post with 👍❤️ 5 months ago

Post

6080

first vision language model built off openai/gpt-oss-20b just dropped! 🔥

InternVL3.5 comes with 32 models 🤯 pre-trained, fine-tuned, aligned in various sizes OpenGVLab/internvl35-68ac87bd52ebe953485927fb
comes with gpt-oss or Qwen3 for LLM part ⤵️

1 reply

reacted to AdinaY's post with 🔥 5 months ago

Post

5556

✨ DeepSeek V3.1 just dropped on the hub.
deepseek-ai/DeepSeek-V3.1-Base

replied to appvoid's post 5 months ago

Also, I do not think anyone will achieve AGI, as we don't know what AGI is. I think we will just see incremental performance increases, not an "unlock" that creates AGI.

replied to appvoid's post 5 months ago

In my POV, it should be open: if I can achieve AGI, someday someone else will too. So there's no need to slow things down like the EU. Just let things happen, accelerate and decentralize.

reacted to appvoid's post with 🔥 5 months ago

Post

3606

suppose someone is working on a reasoning model, which ends up unlocking achievements that lead to agi, should it be open source?

keep in mind everybody will have access to it: scientists, governments, terrorists, average people, etc...

11 replies
