Ahmet

atasoglu

AI & ML interests

NLP, LLMs.

Recent Activity

liked a model 2 days ago

mistralai/Voxtral-Small-24B-2507

reacted to danielhanchen's post with 🔥 3 days ago

Made some 245GB (80% size reduction) 1.8bit quants for Kimi K2! https://huggingface.co/unsloth/Kimi-K2-Instruct-GGUF

liked a model 3 days ago

ysdede/whisper-khanacademy-large-v3-turbo-tr

upvoted 2 collections 5 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 17 days ago • 72

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 2 items • Updated 6 days ago • 94

upvoted an article 6 days ago

Article

The Complete Guide to AI Architectures: From Neural Networks to Foundation Models

6 days ago • 1

upvoted a collection 8 days ago

T5Gemma

32 items • Updated 8 days ago • 55

upvoted an article 9 days ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

2 authors • 9 days ago • 572

upvoted 3 collections 9 days ago

VBART Base Models

Pre-trained base models. • 4 items • Updated May 15, 2024 • 1

VBART Finetuned Models

VBART models fine-tuned for specific use cases. • 10 items • Updated May 15, 2024 • 2

Orpheus-TTS-Turkish

4 items • Updated Apr 22 • 1

upvoted an article 10 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

23 authors • 10 days ago • 548

upvoted 4 collections 10 days ago

SmolLM3 pretraining datasets

Datasets used in SmolLM3 pretraining • 14 items • Updated 10 days ago • 20

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 10 items • Updated 7 days ago • 59

Tar

Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated 16 days ago • 14

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 37 items • Updated Jun 13 • 47

upvoted 2 articles 10 days ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

7 authors • May 21 • 188

Article

Vision Language Models Explained

2 authors • Apr 11, 2024 • 413

upvoted 2 collections 10 days ago

Simple

A series of simple datasets and soon models by me! • 4 items • Updated 11 days ago • 3

Releases July 4

25 items • Updated 11 days ago • 7

upvoted an article 11 days ago

Article

Transformers Are Getting Old: Variants and Alternatives Exist!

13 days ago • 40

upvoted a collection 17 days ago

ERNIE 4.5

Collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated 7 days ago • 151

upvoted a collection 19 days ago

GLiNER-X

A multilingual Named Entity Recognition (NER) model capable of identifying any entity type. • 6 items • Updated 24 days ago • 19