Aurélien-Morgan CLAUDON's picture

Open to Work

Aurélien-Morgan CLAUDON

Aurelien-Morgan

·

https://huggingface.co/retrain-pipelines

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago

One Year Since the “DeepSeek Moment”

liked a Space 21 days ago

HuggingFaceFW/FinePDFsBlog

liked a Space 21 days ago

Lightricks/ltx-2-distilled

View all activity

Organizations

upvoted an article 8 days ago

Article

One Year Since the “DeepSeek Moment”

8 days ago

•

35

liked 2 Spaces 21 days ago

FinePDFs: Liberating 3T of the finest tokens from PDFs

LTX-2 Video Fast

Fast high quality video with audio generation

upvoted a paper 23 days ago

CWM: An Open-Weights LLM for Research on Code Generation with World Models

Paper • 2510.02387 • Published Sep 30, 2025 • 9

upvoted a paper 27 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 28 days ago • 287

reacted to danielhanchen's post with ❤️ about 1 month ago

Post

5513

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥

Has 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM.

GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF
💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3

1 reply

·

replied to their post about 2 months ago

Thanks Victor !
And, that's actually a QR-code to an article I published here but, yeah, QR-code for profile would be useful. QR-code for model / dataset / Space / Paper, when ? 😀

posted an update about 2 months ago

Post

340

Hey, I went to Hangzhou to talk about retrain-pipelines at the GOSIM Foundation's conference last september.
The recording just got released. Go check it out !
https://www.youtube.com/watch?v=nmrMachM5aM
Slides are there :
https://docs.google.com/presentation/d/1hnAzHJ0SbeAOtGJir-iH84RBtXT1OxVT/

2 replies

·

liked a model 2 months ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 21 days ago • 538k • • 3.97k

updated a Space 2 months ago

README

liked a Space 2 months ago

The Eiffel Tower Llama

Explore the Eiffel Tower Llama experiment with open-source models

upvoted an article 2 months ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

313

replied to sergiopaniego's post 2 months ago

🙋

reacted to sergiopaniego's post with 🔥 3 months ago

Post

5403

fine-tuning a 14B model with TRL + SFT on a free Colab (T4 GPU)?
thanks to the latest TRL optimizations, you actually can!
sharing a new notebook showing how to do it 😎

colab: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_trl_lora_qlora.ipynb

notebooks in TRL: https://github.com/huggingface/trl/tree/main/examples/notebooks

2 replies

·

upvoted an article 3 months ago

Article

Streaming datasets: 100x More Efficient

+3

Oct 27, 2025

•

80

liked a Space 3 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

upvoted 4 articles 3 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

75

Article

What makes good reasoning data

Oct 30, 2025

•

44

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

42

Article

Hugging Face and VirusTotal collaborate to strengthen AI security

Oct 22, 2025

•

43