We extended the ICM (Internal Coherence Maximization) paper to show cross-model capability transfer - using Qwen3's mathematical reasoning to improve Gemma3 without any human supervision.
Key results:
- Qwen3-0.6B: 63.2 → 66.0 on MATH-500 (+4% relative)
- Gemma3-1B: 41.0 → 45.6 on MATH-500 (+11% relative)
The method extracts coherent reasoning traces from one model via ICM, converts them into DPO preference pairs, and uses those pairs to train a completely different model architecture. This goes beyond the original ICM paper, which only improved a model using its own labels; we show you can transfer capabilities across model families - imagine extracting capabilities from strong models to improve your local ones. A sketch of the pipeline follows.
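To make the ICM-to-DPO step concrete, here is a minimal sketch of converting ICM-labeled teacher outputs into preference pairs and training the student with Hugging Face `datasets` and TRL's `DPOTrainer`. The `icm_labeled` records, the example problem, and the model checkpoint name are illustrative assumptions, not our actual data or config, and TRL argument names vary across releases.

```python
# Hypothetical transfer pipeline sketch: teacher (Qwen3) solutions scored by
# ICM become DPO preference pairs for the student (Gemma3).
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Assumed input format: each record pairs a solution ICM judged coherent
# with one it judged incoherent, for the same problem.
icm_labeled = [
    {
        "prompt": "Find the remainder when 7^100 is divided by 5.",
        "coherent": "7 ≡ 2 (mod 5), and 2^100 = (2^4)^25 ≡ 1 (mod 5). Remainder: 1.",
        "incoherent": "7^100 ends in 9, so the remainder is 4.",
    },
    # ... more ICM-labeled pairs extracted from the teacher model
]

# Convert ICM labels into the prompt/chosen/rejected format DPO expects.
dpo_data = Dataset.from_list(
    [
        {"prompt": r["prompt"], "chosen": r["coherent"], "rejected": r["incoherent"]}
        for r in icm_labeled
    ]
)

# Train the student on preferences derived from a different model family.
student_name = "google/gemma-3-1b-it"  # assumed student checkpoint
model = AutoModelForCausalLM.from_pretrained(student_name)
tokenizer = AutoTokenizer.from_pretrained(student_name)

trainer = DPOTrainer(
    model=model,                      # ref model defaults to a frozen copy
    args=DPOConfig(output_dir="gemma3-icm-dpo", beta=0.1),
    train_dataset=dpo_data,
    processing_class=tokenizer,
)
trainer.train()
```

The key design point is that nothing in the DPO stage depends on the teacher's architecture or tokenizer: once ICM has produced the preference pairs, any student model can consume them.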
Next, we plan to extend this to code generation. The approach could enable community-driven capability sharing between model families without expensive human annotation.