Koty KD

kotyKD

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago

JeanKaddour/minipile

liked a model 4 days ago

stepfun-ai/Step-3.5-Flash

upvoted an article 4 days ago

We Got Claude to Build CUDA Kernels and teach open models!

Organizations

None yet

upvoted 2 articles 4 days ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

21 days ago • 138

Article

Custom Kernels for All from Codex and Claude

5 days ago • 56

upvoted a paper 20 days ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 31

upvoted a collection about 1 month ago

Falcon-H1-Tiny

A series of extremely small, yet powerful language models redefining capabilities at small scale • 22 items • Updated Jan 15 • 35

upvoted an article about 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024 • 367

upvoted 2 articles 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025 • 58

Article

What makes good reasoning data

Oct 30, 2025 • 44

upvoted 3 collections 4 months ago

Pre-training Dataset Samples

A collection of pre-training dataset samples of sizes 10M, 100M and 1B tokens. Ideal for use in quick experimentation and ablations. • 19 items • Updated Dec 25, 2025 • 18

Essential-Web v1.0

10 items • Updated Jun 18, 2025 • 10

GPT-OSS General (4.2B to 20B)

Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13, 2025 • 10

upvoted a collection 5 months ago

Granite 4.0 Language Models

13 items • Updated Nov 17, 2025 • 207

upvoted 2 articles 7 months ago

Article

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

Aug 3, 2025 • 7

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 756

upvoted a paper 9 months ago

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1, 2024 • 23

upvoted an article 9 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025 • 61

upvoted a collection 10 months ago

RADLADS

7 items • Updated May 7, 2025 • 7

upvoted a paper 10 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98

upvoted a collection 10 months ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 73 items • Updated 2 days ago • 360

upvoted a collection 11 months ago

ArgonneAI

Pretrained LLMs from scratch. • 5 items • Updated 24 days ago • 1

upvoted an article 11 months ago

Article

Open R1: Update #3

Mar 11, 2025 • 296