alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

reacted to Jofthomas's post with 👍 and 🔥 1 day ago

Meet our new agentic model: 𝗗𝗲𝘃𝘀𝘁𝗿𝗮𝗹. Devstral is an open-source LLM built for software engineering tasks under a collaboration between Mistral AI and All Hands AI 🙌. 𝗞𝗲𝘆 𝗳𝗲𝗮𝘁𝘂𝗿𝗲𝘀: • 🤖 𝗔𝗴𝗲𝗻𝘁𝘀: perfect for agentic coding • 🍃 𝗹𝗶𝗴𝗵𝘁𝘄𝗲𝗶𝗴𝗵𝘁: Devstral is a 𝟮𝟰𝗕-parameter model based on Mistral Small. • ©️ 𝗔𝗽𝗮𝗰𝗵𝗲 𝟮.𝟬, meaning fully open-source! • 📄 A 𝟭𝟮𝟴𝗸 context window. 📚 Blog: https://mistral.ai/news/devstral ⚡ API: The model is also available on our API under the name 𝗱𝗲𝘃𝘀𝘁𝗿𝗮𝗹-𝘀𝗺𝗮𝗹𝗹-𝟮𝟱𝟬𝟱 🤗 Repo: https://huggingface.co/mistralai/Devstral-Small-2505 Can't wait to see what you will build with it!

AtAndDev's activity

liked a model 1 day ago

facebook/KernelLLM

Text Generation • Updated 4 days ago • 1.58k • 97

liked a model 3 days ago

mistralai/Devstral-Small-2505

Text2Text Generation • Updated 1 day ago • 45.9k • 480

liked a model 6 days ago

agentica-org/DeepCoder-14B-Preview

Text Generation • Updated 13 days ago • 17.3k • 643

liked a model 7 days ago

Qwen/Qwen3-1.7B

Text Generation • Updated 3 days ago • 467k • 124

liked 3 datasets 9 days ago

nvidia/OpenCodeReasoning

Viewer • Updated 20 days ago • 753k • 13.3k • 439

Parveshiiii/opencode_reasoning_filtered

Viewer • Updated 16 days ago • 568k • 228 • 2

nvidia/OpenCodeInstruct

Viewer • Updated 26 days ago • 4.97M • 1.93k • 11

liked a model 9 days ago

ByteDance-Seed/Seed-Coder-8B-Reasoning

Text Generation • Updated 9 days ago • 9.28k • 114

liked a dataset 9 days ago

rombodawg/code_bagel

Viewer • Updated Oct 8, 2024 • 2.22M • 43 • 5

liked a model 11 days ago

google/gemma-3-1b-it

Text Generation • Updated Apr 4 • 2.17M • 422

liked a dataset 11 days ago

Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B

Viewer • Updated Jan 27 • 250k • 407 • 96

liked 2 models 13 days ago

Qwen/Qwen3-4B-FP8

Text Generation • Updated 3 days ago • 16.9k • 22

Qwen/Qwen3-4B

Text Generation • Updated 3 days ago • 416k • 212

liked a dataset 13 days ago

livecodebench/code_generation_lite

Updated Apr 21 • 60.2k • 42

liked a model about 1 month ago

OpenGVLab/InternVL3-14B

Image-Text-to-Text • Updated 29 days ago • 140k • 57

liked a dataset about 1 month ago

madrylab/gsm8k-platinum

Viewer • Updated Mar 11 • 1.21k • 3.24k • 35

liked a model about 1 month ago

Qwen/Qwen2.5-0.5B-Instruct

Text Generation • Updated Sep 25, 2024 • 1.08M • 318

liked 3 models about 2 months ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated Apr 6 • 2.79M • 375

unsloth/Qwen2.5-VL-3B-Instruct-bnb-4bit

Image-Text-to-Text • Updated 12 days ago • 6.14k • 3

unsloth/gemma-3-4b-it-bnb-4bit

Image-Text-to-Text • Updated 12 days ago • 14k • 9