LLaVA-o1: Let Vision Language Models Reason Step-by-Step • Paper • 2411.10440 • Published Nov 15, 2024
Llama 3.2 Collection • Meta's Llama 3.2 vision and text models in 1B, 3B, 11B, and 90B sizes, including GGUF, 4-bit bnb, and original versions. • 27 items
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding • Paper • 2503.12797 • Published Mar 2025
Rewards Are Enough for Fast Photo-Realistic Text-to-Image Generation • Paper • 2503.13070 • Published Mar 2025
Gemma 3 Collection • All versions of Google's multimodal Gemma 3 models in 1B, 4B, 12B, and 27B sizes, in GGUF, dynamic 4-bit, and 16-bit formats. • 29 items
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • Paper • 2502.04328 • Published Feb 6, 2025
Article • LeRobot goes to driving school: World's largest open-source self-driving dataset
Article • Welcome Gemma 3: Google's all-new multimodal, multilingual, long-context open LLM
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models • Paper • 2503.09669 • Published Mar 2025
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training • Paper • 2501.11425 • Published Jan 20, 2025
Qwen2.5 Collection • Qwen2.5 language models, with pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26