John Smith PRO
John6666
AI & ML interests
None yet
Recent Activity
updated a collection · 8 minutes ago · Spaces for LLM / VLM / NLP
liked a Space · 8 minutes ago · macadeliccc/liquid_ai_chatbot
upvoted a collection · 8 minutes ago · Coder and Programming Models
Organizations

reacted to Smooke's post · about 2 hours ago

reacted to sequelbox's post · about 2 hours ago
Post
86
Some new releases:
- brought the new Shining Valiant 3 series (science-reasoning, AI-reasoning, general chat) to Qwen 3 4B: ValiantLabs/Qwen3-4B-ShiningValiant3
- merged models for Shining Valiant 3 and Esper 3, combining their technical expertise and reasoning skills:
4b: sequelbox/Qwen3-4B-PlumEsper
8b: sequelbox/Qwen3-8B-PlumEsper
coming up we'll have some experimental reasoning releases - datasets and models will be out soon!
also will be bringing SV3 and Esper 3 to more models.
let's keep working for open source :)
love,
allegra

reacted to Quazim0t0's post · about 2 hours ago
Post
88
Used YOLOv8n ONNX + FastVLM to provide real-time object detection and annotation. It works well with videos that do not have many changes. In the Space I used stock security-camera footage. It annotates using FastVLM while doing object detection with YOLOv8n.
Quazim0t0/FastVLM-YoloV8n-v2
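The "works well with low-change videos" behavior suggests re-running the expensive VLM captioner only when the detections change between frames. A minimal sketch of that idea (all function names here are hypothetical illustrations, not the Space's actual code):

```python
# Sketch: run cheap YOLO detection every frame, but only re-run the slow
# VLM captioner when the detected boxes change. Names are illustrative.

def boxes_changed(prev, curr, tol=0.1):
    """Return True if detections differ enough to warrant re-captioning."""
    if prev is None or len(prev) != len(curr):
        return True
    for (x1, y1, x2, y2, c1), (a1, b1, a2, b2, c2) in zip(prev, curr):
        if c1 != c2:
            return True
        # compare box corners against a small tolerance
        if max(abs(x1 - a1), abs(y1 - b1), abs(x2 - a2), abs(y2 - b2)) > tol:
            return True
    return False

def annotate_stream(frames, detect, caption):
    """Detect on every frame; reuse the cached caption when nothing moved."""
    prev_boxes, cached_caption, out = None, "", []
    for frame in frames:
        boxes = detect(frame)              # e.g. YOLOv8n via onnxruntime
        if boxes_changed(prev_boxes, boxes):
            cached_caption = caption(frame)  # e.g. FastVLM, the slow step
            prev_boxes = boxes
        out.append((boxes, cached_caption))
    return out
```

On mostly static footage like a security camera, the captioner runs only a handful of times, which is why this pairing stays close to real time.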

reacted to DualityAI-RebekahBogdanoff's post · about 2 hours ago
Post
845
Generate your own data in simulation using two new free and customizable data-generating Scenarios on Duality's FalconCloud service.
These multi-class Scenarios are designed to target model weaknesses for our recent Kaggle competition, but they are free to anyone for non-commercial use!
- Control object and camera posing
- Select random variable ranges
- Set post-processing effects
- and more, to create a robust dataset for strong model training.
Access the 2 Scenarios here:
https://falcon.duality.ai/secure/scenarios/edit/9e90e036-8af9-41e4-8af0-1343b8e8f467?utm_source=Kaggle&utm_medium=post&utm_campaign=competition_4
https://falcon.duality.ai/secure/scenarios/edit/e3294c19-49d4-4f64-9ca8-8373876c2c94?utm_source=Kaggle&utm_medium=post&utm_campaign=competition_4

reacted to Severian's post · about 2 hours ago
Post
75
I couldn't watch innocent people get their rights trampled anymore. So I built something to help.
Stories of families torn apart, U.S. citizens detained for hours, people arrested just for speaking Spanish. This isn't the America I believe in.
Instead of doom-scrolling, I spent a few days building FIREWATCH - a free civil rights protection app.
What it does:
- Real-time ICE raid alerts
- Know Your Rights education in 10+ languages
- Secure evidence recording
- Emergency panic button
- Legal hotlines and resources
- 100% private, no tracking
The catch? There isn't one. You just need a free Google API key that stays on your device. Works completely offline.
https://firewatch-ice.vercel.app/
I built this because everyone deserves constitutional protection. The 4th Amendment doesn't have an asterisk.
If this helps one family stay safe, every sleepless night was worth it.
Please share with anyone who needs it.
Stay safe.

reacted to sergiopaniego's post · about 2 hours ago
Post
124
Loved this paper!
Authors benchmark multimodal models on vision tasks (detection, segmentation...) using clever prompting tricks.
Results: VLMs are solid generalists but still lag behind SOTA task-specific models, especially on geometric tasks vs. semantic ones.
paper: How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks (2507.01955)

reacted to macadeliccc's post · about 2 hours ago
Post
85
I was messing around with the HF API trying to get some stats on all-time downloads for my models, and then I made it into a Space so that anyone can use it.
macadeliccc/hf_downloads_dashboard
Let me know if you think it needs any changes or if you find it useful.
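The core of such a dashboard is roughly: list an author's models and sum their all-time downloads. A minimal sketch with `huggingface_hub` (the `expand=["downloadsAllTime"]` parameter exists in recent library versions, but check yours; the aggregation helper and its names are mine, not the Space's code):

```python
# Sketch: aggregate all-time downloads for one author's models.
# The huggingface_hub calls reflect recent versions of the library;
# total_downloads is a hypothetical helper for illustration.

def total_downloads(stats):
    """Sum per-model download counts; return (total, sorted breakdown)."""
    breakdown = sorted(stats.items(), key=lambda kv: kv[1], reverse=True)
    return sum(stats.values()), breakdown

def fetch_author_downloads(author):
    """Fetch all-time downloads per model for an author (network call)."""
    from huggingface_hub import HfApi
    stats = {}
    for m in HfApi().list_models(author=author, expand=["downloadsAllTime"]):
        stats[m.id] = m.downloads_all_time or 0
    return stats

# Usage (requires network access):
#   total, top = total_downloads(fetch_author_downloads("macadeliccc"))
```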

reacted to nicolay-r's post · about 2 hours ago
Post
81
For those interested in multilingual clinical case report summarization, delighted to share a video update to the earlier post on the Qwen2.5 model family adaptation:
Video: https://www.youtube.com/watch?v=uOAiUvLghuE
This is a 15-minute skim of the study (+5 minutes for code), in which we overview the application of the Qwen model family (72B as a teacher and 0.5B as a student) to summarization of clinical reports, including a detailed overview of how the experiments were organized. In particular, it attempts to cover:
1. Background on previous Seq2Seq models and their limitations
2. Exploiting ChatML roles for distillation tuning in clinical report summarization
3. Known limitations of the work and unleashing its full capabilities
As in the previous post, there is a model card, which is also covered in the video.
Hugging Face: https://huggingface.co/nicolay-r/qwen25-05b-multiclinsum-standar
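The teacher-student setup described above can be sketched as building ChatML-style training messages in which the 72B teacher's summary becomes the 0.5B student's assistant target. A minimal illustration (the system prompt and field names are mine, not the paper's exact schema):

```python
# Sketch: assemble ChatML-style examples for distillation tuning,
# where the teacher model's summary is the assistant target that the
# student learns to reproduce. Schema details are illustrative.

SYSTEM = "You are a medical assistant that summarizes clinical case reports."

def to_chatml(report, teacher_summary):
    """One distillation example in role/content message form."""
    return [
        {"role": "system", "content": SYSTEM},
        {"role": "user", "content": f"Summarize the following case report:\n{report}"},
        {"role": "assistant", "content": teacher_summary},
    ]

def build_dataset(pairs):
    """pairs: iterable of (report, teacher_summary) produced by the teacher."""
    return [to_chatml(report, summary) for report, summary in pairs]
```

Message lists in this shape can be fed directly to a chat template for supervised fine-tuning of the student.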

reacted to sergiopaniego's post · about 2 hours ago
Post
90
You can already play with two of the latest most impressive models on HF via @novita-ai as Inference Provider:
- Kimi K2: a 1T-parameter MoE beast for coding, reasoning, and agentic tasks
- GLM-4.1V-9B-Thinking: a VLM + deep reasoning model
Kimi K2: moonshotai/Kimi-K2-Instruct
GLM-4.1V-9B-Thinking: THUDM/GLM-4.1V-9B-Thinking

reacted to danielhanchen's post · about 2 hours ago
Post
172

reacted to FlameF0X's post · about 2 hours ago
Post
68
Hello there, world! I am happy to announce that you can now fine-tune FlameF0X/SnowflakeCore-G1-Tiny; the code for that is in the model card.
I also lost the training log.

reacted to pagezyhf's post · about 2 hours ago
Post
75
New in Azure Model Catalog: NVIDIA Parakeet TDT 0.6B V2
We're excited to welcome Parakeet TDT 0.6B V2, a state-of-the-art English speech-to-text model, to the Azure Foundry Model Catalog.
What is it?
A powerful ASR model built on the FastConformer-TDT architecture, offering:
- Word-level timestamps
- Automatic punctuation & capitalization
- Strong performance across noisy and real-world audio
It runs with NeMo, NVIDIA's optimized inference engine.
Want to give it a try? You can test it with your own audio (up to 3 hours) on Hugging Face Spaces before deploying. If it fits your needs, deploy easily from the Hugging Face Hub or Azure ML Studio with secure, scalable infrastructure!
Learn more by following this guide written by @alvarobartt
https://huggingface.co/docs/microsoft-azure/azure-ai/examples/deploy-nvidia-parakeet-asr

reacted to fdaudens's post · about 2 hours ago
Post
91
AI is reshaping everything: how we work, how we feel, even how nations compete.
Today's reads cut across power, emotion, and disruption.
Here's what stood out and why it matters:
AI might "solve" loneliness, but this could be a problem, as the discomfort of loneliness shapes us in important ways. https://t.co/k2Q9le6G0P
A new study warns of significant risks in using AI therapy chatbots, highlighting issues like stigmatization and inappropriate responses. https://t.co/EFyW0RbYVl
AI is already showing signs of slashing job openings in the UK, particularly in roles exposed to the technology, suggesting a labor market slowdown. https://t.co/hhs0BbqIMa
AI firms like OpenAI are poaching Wall Street quants with massive paydays, shifting the talent landscape for building artificial general intelligence. https://www.businessinsider.com/ai-talent-openai-wall-street-quant-trading-firms-2025-7
Speaking of which: Nvidia CEO Jensen Huang disagrees with Anthropic CEO Dario Amodei on whether AI will create more jobs or trigger a "white-collar apocalypse." Huang believes AI will create vastly more, and better, jobs. https://t.co/YHWhY7qvSq
Can Nvidia convince governments to pay for "sovereign AI"? Politicians are warming to the idea of national AI systems, but it might not reduce dependence on US tech. https://t.co/htQDzJAIDu

reacted to hba123's post · about 2 hours ago
Post
114
Ark is now pip-installable and supports the following robots! If you want to do robotics in Python, check it out here: https://robotics-ark.github.io/ark_robotics.github.io/
Now you can pip-install robotics and work completely in Python. Why Ark, you ask? Well, we love Python :D

reacted to MonsterMMORPG's post · about 2 hours ago
Post
81
MultiTalk Levelled Up - Way Better Animation Compared to Before with New Workflows - Image to Video > https://youtu.be/wgCtUeog41g
MultiTalk is greatly upgraded. After more than a day of further research on MultiTalk using 8x A6000 48 GB GPUs, I have significantly improved the MultiTalk workflows, and now I am sharing 4 different category workflows with you. VRAM usage and speeds are the same, just with better quality and animation. Moreover, I am introducing a new app with image and video comparison sliders. It is ultra fast and lightweight, runs as an HTML app, and requires no GPU.
https://youtu.be/wgCtUeog41g
MultiTalk Full Tutorial With 1-Click Installer - Make Talking and Singing Videos From Static Images > https://youtu.be/8cMIwS9qo4M
By using MeiGen MultiTalk you can generate amazing, fully animated, realistic videos from a given audio input. Not only talking but also animating body movements is possible. In this video I will show you how to install ComfyUI on Windows along with the MultiTalk bundle and the workflows we prepared, with 1 click. Then I will show how to very easily generate amazing videos from these installed workflows. Moreover, I will show our favorite cloud private GPU provider, Massed Compute: how to do the same installation there and use it properly. Finally, I will show everything on RunPod as well. So whether you are GPU-poor or have a good GPU, this tutorial covers everything.

reacted to merve's post · about 2 hours ago
Post
223
past week had huuuge releases!
here are our picks; find more models, datasets, and demos here: merve/releases-july-11-68750452c358c98b0fa663f7
> moonshotai/Kimi-K2-Instruct is the new SOTA LLM with 1T total / 32B active parameters
> HuggingFaceTB/SmolLM3-3B is the new best LM for its size, and offers a thinking mode, as well as the dataset HuggingFaceTB/smoltalk2
> Alibaba-NLP/WebSailor-3B is the new agentic LLM for complex browsing
> Google DeepMind released medical vision LMs with an agentic doctor-patient app google/medgemma-release-680aade845f90bec6a3f60c4
> fal released a LoRA to improve details on face images fal/Realism-Detailer-Kontext-Dev-LoRA

reacted to kanaria007's post · about 2 hours ago
Post
53
New Article on Hugging Face: Seeding Cognitive Structure - Teaching AI to Think Structurally from the Start
Title:
Understanding the AGI Seed Prompt: Multi-Layered Cognitive Initialization for Advanced AI Systems
Read it here: https://huggingface.co/blog/kanaria007/understanding-the-agi-seed-prompt
Summary:
While many focus on optimizing prompt outputs, this article takes a step back to ask: what happens when we teach an AI not what to say, but how to think structurally from the beginning?
This article outlines a methodology for initializing AGI systems with persistent cognitive scaffolding, rather than surface-level behaviors. The approach uses a four-layer seed prompt framework that orients memory, sensory mapping, self-correction, and ethical alignment directly at the structural level.
The result is a protocol-oriented AI that:
- Forms a sense of cognitive continuity
- Recognizes and resolves contradictions
- Develops traceable internal reasoning
- Aligns behavior through layered integrity
Key Features:
- Four-layer prompt architecture: Memory, Sensor, Reflection, Social
- Structure-first cognition, not outcome-first
- Works across GPT-4o, Claude, and Gemini
- Seed prompts act as epistemic initialization, not mere instruction
This is not behavioral engineering.
It's structural cognitive orientation.
Protocol Dataset: kanaria007/agi-structural-intelligence-protocols
Useful for:
- Developers designing self-corrective reasoning agents
- Researchers experimenting with agent continuity and coherence
- Anyone interested in how AGI can begin with thoughtful internal structure, not static outputs
This isn't prompting.
It's planting the seed of cognition.

reacted to etemiz's post · about 2 hours ago
Post
52
Benchmarked 4 new models. Deepseek R1 score improved. All these are below average, so p(doom) probably increased!
Coming soon: Kimi K2
Full leaderboard https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08
More info https://huggingface.co/blog/etemiz/aha-leaderboard

reacted to sondhiArm's post · about 18 hours ago
Post
89
Join us this week for an AI Camp monthly meet up event in Austin happening on July 16!
Zach Lasiuk and Geremy Cohen will present a tech talk "From Model to Product: Right-Sizing Infrastructure for Real-World Use Cases"
https://www.aicamp.ai/event/eventdetails/W2025071616