Cédric
PRX Part 3 — Training a Text-to-Image Model in 24h!
Text-to-image Architectural Experiments
Training Design for Text-to-Image Models: Lessons from Ablations
Finegrain Light Switcher (Lite Version)
Instantly turn lamps on in your images
Finegrain Object Eraser (Lite Version)
Erase any object from an image with just a prompt
Finegrain Object Cutter
Create HD cutouts from any image with just a prompt
zeroing and reshaping the text-related cross-attentions into self-attentions
It's actually narrowing, not zeroing (even though strategy="zeros" is passed to StateDictAdapter).
For instance, the logs show:
Adapting down_blocks.0.attentions.0.transformer_blocks.0.attn2.to_k.weight by narrowing from shape torch.Size([320, 768]) to torch.Size([320, 320])
So the extra weights are just discarded in this case. Zero-filling is only used when expanding tensors to larger shapes.
Corresponding code: link.
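The narrow-or-zero-fill behavior described above can be sketched in a few lines of PyTorch. Note that adapt_tensor is a hypothetical helper written for illustration, not the actual StateDictAdapter API:

```python
import torch

def adapt_tensor(weight: torch.Tensor, target_shape: tuple) -> torch.Tensor:
    """Adapt a checkpoint tensor to a target shape, per dimension:
    narrow (slice off the extra weights) when the target is smaller,
    zero-fill when it is larger."""
    out = weight
    for dim, (src, tgt) in enumerate(zip(weight.shape, target_shape)):
        if tgt < src:
            out = out.narrow(dim, 0, tgt)  # extra weights are discarded
        elif tgt > src:
            pad = torch.zeros(*out.shape[:dim], tgt - src, *out.shape[dim + 1:],
                              dtype=out.dtype)
            out = torch.cat([out, pad], dim=dim)  # expand with zeros
    return out

# attn2.to_k.weight: text dim 768 -> image latent dim 320, as in the log above
w = torch.randn(320, 768)
print(adapt_tensor(w, (320, 320)).shape)  # torch.Size([320, 320])
```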
This is made with a one-step SD1.5 LBM [1] eraser!
Data is open. Data pipeline is open. Training code is open.
On our LBM fork: https://github.com/finegrain-ai/LBM
[1] LBM: Latent Bridge Matching for Fast Image-to-Image Translation (2503.07535)
Today we have trained an LBM [2] promptless inpainter using
Re-LAION-Caption19M [3]. We use a subset of 1.25M images with
aesthetic_score > 5.6 and pwatermark < 0.2, with LaMa [4] mask generation. Two takeaways:
🖼 Inpainting is better than in our RORD experiments [5]
🦶 "4 steps" outperforms single-step
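The subset selection described above boils down to a predicate over the caption metadata. A minimal sketch (the record layout here is an assumption; with 🤗 Datasets the same predicate can be passed to Dataset.filter):

```python
# Hypothetical sketch: select the training subset from caption metadata.
# Field names follow the post; the record structure is an assumption.
def keep(sample: dict) -> bool:
    return sample["aesthetic_score"] > 5.6 and sample["pwatermark"] < 0.2

records = [
    {"url": "a.jpg", "aesthetic_score": 6.1, "pwatermark": 0.05},
    {"url": "b.jpg", "aesthetic_score": 5.9, "pwatermark": 0.40},  # too likely watermarked
    {"url": "c.jpg", "aesthetic_score": 4.8, "pwatermark": 0.01},  # aesthetic score too low
]
subset = [r for r in records if keep(r)]
print([r["url"] for r in subset])  # ['a.jpg']
```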
[1] Finegrain LBM Fork: https://github.com/finegrain-ai/LBM
[2] LBM: Latent Bridge Matching for Fast Image-to-Image Translation (2503.07535)
[3] supermodelresearch/Re-LAION-Caption19M
[4] Resolution-robust Large Mask Inpainting with Fourier Convolutions (2109.07161)
[5] https://huggingface.co/posts/piercus/778833977889788
cc @supermodelresearch @presencesw
When repurposing a T2I model into a pure I2I model, there’s always that orphaned text path — what do we do with it? 🤔
You can reuse it as learnable embeddings in multi-task setups [2], freeze an empty text prompt, or distill or prune the corresponding part.
In LBM, they take a clever route — zeroing [3] and reshaping [4] the text-related cross-attentions into self-attentions.
This gives you fresh weights for I2I computation, nicely integrated into your SD architecture.
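That route can be sketched in plain PyTorch, assuming SD1.5's usual dimensions (inner dim 320 in the first block, text dim 768): the cross-attention key/value projections are narrowed to the image dim, then fed the image hidden states instead of text embeddings, which makes the block self-attention:

```python
import torch
import torch.nn as nn

# Hypothetical sketch: re-wire an SD-style cross-attention block (attn2)
# so key/value read the image hidden states instead of text embeddings.
dim, text_dim = 320, 768

to_q = nn.Linear(dim, dim, bias=False)
to_k = nn.Linear(text_dim, dim, bias=False)  # cross-attn: keys from text
to_v = nn.Linear(text_dim, dim, bias=False)  # cross-attn: values from text

# Adapt to self-attention: narrow the text-dim weights down to the image dim.
to_k_self = nn.Linear(dim, dim, bias=False)
to_v_self = nn.Linear(dim, dim, bias=False)
with torch.no_grad():
    to_k_self.weight.copy_(to_k.weight[:, :dim])
    to_v_self.weight.copy_(to_v.weight[:, :dim])

x = torch.randn(1, 64, dim)  # image tokens only, no text conditioning
q, k, v = to_q(x), to_k_self(x), to_v_self(x)
out = torch.softmax(q @ k.transpose(-2, -1) / dim**0.5, dim=-1) @ v
print(out.shape)  # torch.Size([1, 64, 320])
```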
📎 References
[1] Our LBM Fork: https://github.com/finegrain-ai/LBM
[2] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting (2503.08677)
[3] LBM Zeroing: https://github.com/gojasper/LBM/blob/cafebc46a9ac16dcc61691d289cc4676b5c75380/examples/training/train_lbm_surface.py#L147-L148
[4] LBM Reshaping: https://github.com/gojasper/LBM/blob/cafebc46a9ac16dcc61691d289cc4676b5c75380/examples/training/train_lbm_surface.py#L100
SOTA OCR with Core ML and dots.ocr
🚀 1-step only inference, no distillation
🪶 Light backbone: SD1.5
🧠 Light training: converges in 6k steps
Now let's improve this, especially the inpainting capabilities. Stay tuned for more :-)
LBM paper: LBM: Latent Bridge Matching for Fast Image-to-Image Translation (2503.07535)
Our LBM fork: https://github.com/finegrain-ai/LBM
Swift 🧨Diffusers - Fast Stable Diffusion for Mac
Our fork: https://github.com/finegrain-ai/LBM
LBM paper: LBM: Latent Bridge Matching for Fast Image-to-Image Translation (2503.07535)
LBM relighting demo: jasperai/LBM_relighting
Finegrain Product Placement LoRA
Flux Kontext extended with product placement capabilities
Finegrain Image Enhancer
Clarity AI Upscaler Reproduction