alkinun's picture

alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

liked a model 1 day ago
facebook/KernelLLM
View all activity

Organizations

ESPnet's profile picture CVPR Demo Track's profile picture BigScience Biomedical Datasets's profile picture ONNXConfig for all's profile picture video-p2p-library's profile picture Gradio-Themes-Party's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture Open-Source AI Meetup's profile picture lora concepts library's profile picture OpenBuddy Community's profile picture ECCV 2022's profile picture Kornia AI's profile picture Tune a video concepts library's profile picture SIGGRAPH 2022's profile picture Interspeech2022's profile picture Stable Diffusion concepts library's profile picture SIGGRAPH Asia 2022 Demos's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Musika's profile picture Blog-explorers's profile picture OpenSky's profile picture ICCV2023's profile picture ICML2023's profile picture huggingPartyParis's profile picture Multi๐Ÿค–Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Pirates Party for all software open source's profile picture MLX Community's profile picture recipe research's profile picture Narra's profile picture Social Post Explorers's profile picture Cognitive Computations's profile picture M4-ai's profile picture Spinner-GPT-4's profile picture Dev Mode Explorers's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face Discord Community's profile picture Nerdy Face's profile picture OpenEndedLM's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture None yet's profile picture

AtAndDev's activity

reacted to Jofthomas's post with ๐Ÿ‘๐Ÿ”ฅ 1 day ago
view post
Post
2222
Meet our new agentic model : ๐——๐—ฒ๐˜ƒ๐˜€๐˜๐—ฟ๐—ฎ๐—น

Devstral is an open-source LLM built software engineering tasks built under a collaboration between Mistral AI and All Hands AI ๐Ÿ™Œ.

๐—ž๐—ฒ๐˜† ๐—ณ๐—ฒ๐—ฎ๐˜๐˜‚๐—ฟ๐—ฒ๐˜€ :
โ€ข ๐Ÿค– ๐—”๐—ด๐—ฒ๐—ป๐˜๐˜€ : perfect for Agentic coding
โ€ข ๐Ÿƒ ๐—น๐—ถ๐—ด๐—ต๐˜๐˜„๐—ฒ๐—ถ๐—ด๐—ต๐˜: Devstral is a ๐Ÿฎ๐Ÿฐ๐—• parameter based on Mistral small.
โ€ข ยฉ๏ธ ๐—”๐—ฝ๐—ฎ๐—ฐ๐—ต๐—ฒ ๐Ÿฎ.๐Ÿฌ, meaning fully open-source !
โ€ข ๐Ÿ“„ A ๐Ÿญ๐Ÿฎ๐Ÿด๐—ธ context window.

๐Ÿ“šBlog : https://mistral.ai/news/devstral
โšกAPI : The model is also available on our API under the name ๐—ฑ๐—ฒ๐˜ƒ๐˜€๐˜๐—ฟ๐—ฎ๐—น-๐˜€๐—บ๐—ฎ๐—น๐—น-๐Ÿฎ๐Ÿฑ๐Ÿฌ๐Ÿฑ
๐Ÿค— repo : mistralai/Devstral-Small-2505

Can't wait to see what you will build with it !
  • 1 reply
ยท
replied to merve's post 1 day ago
view reply

Such a cool implementation of a model
Btw ben de tรผrkรผm :)

reacted to merve's post with ๐Ÿš€ 1 day ago
view post
Post
2676
Bu post'u รงevirebilirsiniz ๐Ÿค—๐Ÿ’—
ยท
reacted to YerbaPage's post with ๐Ÿค—๐Ÿ‘€๐Ÿ”ฅ 5 days ago
view post
Post
2971
Curated list of **Next Gen Code Generation** papers & benchmarks! ๐Ÿ”ฅ with 50+ โญ๏ธ now!

Stay ahead with the latest in:
โœ… Repo-level Issue Resolution (SWE-bench, Agents)
โœ… Repo-level Code Completion (Repo understanding)
โœ… Datasets & Benchmarks

๐Ÿ‘‰ Check it out: https://github.com/YerbaPage/Awesome-Repo-Level-Code-Generation ๐Ÿ”ฅ
๐Ÿ’กPRs are welcomed!
reacted to regisss's post with ๐Ÿš€ 7 days ago
replied to ProCreations's post 9 days ago
view reply

Well, a gpu (like 10k) has MUCH MUCH more cores than a cpu (like 12) and cpus have much more capable cores. GPUs just do simple matrix mul. so we would have the same num of cores with gpus but the cores will become much more capable. Which is a big W. Sorry for the nerdieness but I just had to hop in. :)

reacted to hesamation's post with ๐Ÿ‘๐Ÿ”ฅโค๏ธ about 1 month ago
replied to Steven10429's post about 1 month ago
view reply

It seems like they randomly filter out some ppl for some reason...

reacted to AdinaY's post with ๐Ÿ‘ about 1 month ago
view post
Post
1839
MAYE๐ŸŽˆa from-scratch RL framework for Vision Language Models, released by GAIR - an active research group from the Chinese community.

โœจMinimal & transparent pipeline with standard tools
โœจStandardized eval to track training & reflection
โœจOpen Code & Dataset

Code:
https://github.com/GAIR-NLP/MAYE?tab=readme-ov-file
Dataset:
ManTle/MAYE
Paper:
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme (2504.02587)
  • 1 reply
ยท
reacted to merterbak's post with โค๏ธ๐Ÿ‘€๐Ÿ”ฅ about 1 month ago
replied to their post about 2 months ago
posted an update about 2 months ago
view post
Post
2987
Llama 4 is out...
ยท
reacted to BestWishYsh's post with ๐Ÿ‘€ about 2 months ago