CPP-Net Model for Cervical Intraepithelial Neoplasia 2 (CIN2) Nuclei Segmentation
Model
- cellseg_models.pytorch implementation of CPP-Net: https://arxiv.org/abs/2102.06867
- Backbone encoder: pre-trained efficientnet_b5 from pytorch-image-models https://github.com/huggingface/pytorch-image-models
USAGE
1. Install cellseg_models.pytorch and albumentations
pip install cellseg-models-pytorch
pip install albumentations
2. Load trained model
from cellseg_models_pytorch.models.cppnet import CPPNet
model = CPPNet.from_pretrained("hgsc_v1_efficientnet_b5")
3. Run inference for one image
from albumentations import Resize, Compose
from cellseg_models_pytorch.utils import FileHandler
from cellseg_models_pytorch.transforms.albu_transforms import MinMaxNormalization
model.set_inference_mode()
# Resize to multiple of 32 of your own choosing
transform = Compose([Resize(1024, 1024), MinMaxNormalization()])
im = FileHandler.read_img(IMG_PATH)
im = transform(image=im)["image"]
prob = model.predict(im)
out = model.post_process(prob)
# out = {"nuc": [(nuc instances (H, W), nuc types (H, W))], "cyto": None, "tissue": None}
3.1 Run inference for image batch
import torch
from cellseg_models_pytorch.utils import FileHandler
model.set_inference_mode()
# dont use random matrices IRL
batch = torch.rand(8, 3, 1024, 1024)
prob = model.predict(im)
out = model.post_process(prob)
# out = {
# "nuc": [
# (nuc instances (H, W), nuc types (H, W)),
# (nuc instances (H, W), nuc types (H, W)),
# .
# .
# .
# (nuc instances (H, W), nuc types (H, W))
# ],
# "cyto": None,
# "tissue": None
#}
4. Visualize output
from matplotlib import pyplot as plt
from skimage.color import label2rgb
fig, ax = plt.subplots(1, 3, figsize=(18, 6))
ax[0].imshow(im)
ax[1].imshow(label2rgb(out["nuc"][0][0], bg_label=0)) # inst_map
ax[2].imshow(label2rgb(out["nuc"][0][1], bg_label=0)) # type_map
Dataset Details
Semi-manually annotated CIN2 samples from a (private) cohort of Helsinki University Hospital
Contains:
- 370 varying sized image crops at 20x magnification.
- 168 640 annotated nuclei
Dataset classes
nuc_classes = {
0: "background",
1: "neoplastic",
2: "inflammatory",
3: "connective",
4: "dead",
5: "glandular_epithelial",
6: "squamous_epithelial",
}
Dataset Class Distribution
- connective nuclei: 46 222 (~27.3%)
- neoplastic nuclei: 49 493 (~29.4%)
- inflammatory nuclei 27 226 (~16.1%)
- dead nuclei 195 (~0.11%)
- glandular epithelial 14 310 (~8.5%)
- squamous epithelial 31194 (~18.5%)
Model Training Details:
First, the image crops in the training data were tiled into 224x224px patches with a sliding window (stride=32px).
Rest of the training procedures follow this notebook: [link]
Citation
cellseg_models.pytorch:
@misc{https://doi.org/10.5281/zenodo.12666959,
doi = {10.5281/ZENODO.12666959},
url = {https://zenodo.org/doi/10.5281/zenodo.12666959},
author = {Okunator, },
title = {okunator/cellseg_models.pytorch: v0.2.0},
publisher = {Zenodo},
year = {2024},
copyright = {Creative Commons Attribution 4.0 International}
}
CPP-Net original paper:
@article{https://doi.org/10.48550/arxiv.2102.06867,
doi = {10.48550/ARXIV.2102.06867},
url = {https://arxiv.org/abs/2102.06867},
author = {Chen, Shengcong and Ding, Changxing and Liu, Minfeng and Cheng, Jun and Tao, Dacheng},
keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {CPP-Net: Context-aware Polygon Proposal Network for Nucleus Segmentation},
publisher = {arXiv},
year = {2021},
copyright = {arXiv.org perpetual, non-exclusive license}
}
Licence
These model weights are released under the Apache License, Version 2.0 (the "License"). You may obtain a copy of the License at:
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Additional Terms
While the Apache 2.0 License grants broad permissions, we kindly request that users adhere to the following guidelines: Medical or Clinical Use: This model is not intended for use in medical diagnosis, treatment, or prevention of disease of real patients. It should not be used as a substitute for professional medical advice.