<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Semi-Supervised Segmentation of Functional Tissue Units at the Cellular Level</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Volodymyr Sydorskyi</string-name>
          <email>volodymyr.sydorskyi@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Igor Krashenyi</string-name>
          <email>igor.krashenyi@ucu.edu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Denis Savka</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleksandr Zarichkovyi</string-name>
          <email>alexander.zarichkovyi@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kyiv</institution>
          ,
          <addr-line>03056</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”</institution>
          ,
          <addr-line>37, Prosp. Peremohy</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Ukrainian Catholic University</institution>
          ,
          <addr-line>Ilariona Svjentsits'koho St, 17, Lviv, 79000</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present a new method for functional tissue unit segmentation at the cellular level, which utilizes the latest deep learning semantic segmentation approaches together with domain adaptation and semi-supervised learning techniques. This approach minimizes the domain gap, class imbalance, and the influence of capture settings between the HPA and HuBMAP datasets. The presented approach achieves results comparable with the state of the art in functional tissue unit segmentation at the cellular level.</p>
      </abstract>
      <kwd-group>
        <kwd>semantic segmentation</kwd>
        <kwd>functional tissue unit</kwd>
        <kwd>semi-supervised learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        It is estimated that the human body contains approximately 37 trillion cells, and comprehending the
complex relationships and functions among them poses a significant challenge for researchers, requiring
a colossal effort [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. One of the research directions aims to map the human body at a cellular level to detect
functional tissue units (FTUs). An FTU is defined as a three-dimensional block of cells
centered around a capillary, such that each cell in this block is within diffusion distance from any other
cell in the same block [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. These cellular compositions, or cell population neighborhoods, are responsible
for performing an organ’s main physiologic functions. Functional tissue units, such as colonic crypts,
renal glomeruli, alveoli, etc. (examples can be observed in Figure 1), have pathobiological relevance
that is essential for modeling and comprehending the development of a disease. However, manually
annotating FTUs is time-consuming and costly. At the same time, current algorithms suffer from poor
generalizability and low accuracy [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The task of the competition was therefore to segment FTUs on stained
microscope slides in a way that is invariant to different staining protocols. In this paper, a new method
is proposed, which utilizes the latest deep learning semantic segmentation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] approaches together with
domain adaptation and semi-supervised learning techniques.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        One of the most common approaches to functional tissue unit segmentation, specifically kidney
glomerulus and colon crypt [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] segmentation, is based on supervised learning techniques and
was introduced in the previous Kaggle competition [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In these methods, the training data consists of
annotated images, where each pixel is labeled as belonging to a particular cell or background. These
techniques typically require a large amount of labeled data to achieve high accuracy, which can be
time-consuming and expensive to obtain.
      </p>
      <p>2022 Copyright for this paper by its authors.</p>
      <p>
        Most of these models are heavily inspired by the U-Net [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], UnetPlusPlus [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], FPN architectures [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ],
and DeepLabV3+ [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] in combination with ImageNet pre-trained backbones such as resnext50_32x4d,
resnext101_32x4d, and RegNet [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Models used a combination of general data augmentation
techniques such as flipping, rotation, scale shifting, artificial blurring, CutMix [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and MixUp [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to
improve model performance. Models were trained using binary cross-entropy and Lovász Hinge loss
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] functions, RAdam [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], Lookahead [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], AdamW [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], SGD [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and Adam [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] optimizers.
These models used a dynamic sampling approach to sample tiles of size 512x512, 768x768 and
1024x1024 pixels from regions with visible glomeruli based on the annotations.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>
        The dataset includes biopsy slides from several organs, namely kidney, prostate, large intestine,
spleen, and lung. The key feature of the proposed dataset is that it consists of images from two data
sources: HPA [
        <xref ref-type="bibr" rid="ref19 ref20 ref21 ref22 ref23 ref24 ref25 ref26">19-26</xref>
        ] and HuBMAP [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. Furthermore, the training data includes only HPA samples,
while the test data comprises a mixture of HPA and HuBMAP samples [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Additionally, only HuBMAP
data was used for the final score (private dataset). The images from the HPA and HuBMAP data sources
differ in staining protocol, pixel sizes, and sample thicknesses [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Figure 2 provides an example that
illustrates the visual differences between HPA and HuBMAP images. The whole slide images in the
HPA and HuBMAP data sources were stained using three distinct protocols. HPA samples were stained
with antibodies visualized with 3,3'-diaminobenzidine (DAB [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]), counterstained with hematoxylin,
whereas HuBMAP samples were stained using either Periodic acid-Schiff (PAS [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]) or hematoxylin and
eosin (H&amp;E [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]). Each of the staining protocols highlights different cellular structures using
colored dyes, and the final stained slide images vary greatly in color, contrast, and overall image
structure, making direct matching of cellular structures between images less straightforward (see Figure
4). Another crucial feature of the proposed dataset is that HuBMAP images have different pixel sizes
for different organs, while for HPA the pixel size is constant (see Table 1) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Finally, the images also differ in tissue section thickness. While all HPA images were sliced with a
fixed thickness of 4 µm, the HuBMAP samples have tissue slice thicknesses ranging from 4 µm for the
spleen up to 10 µm for the kidney [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], adding another layer of complexity. The training dataset
contained 352 samples along with additional metadata, including the dataset label (HPA or HuBMAP),
organ, image height, image width, pixel size, tissue thickness, age (patient age), and sex (patient sex)
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. During the testing stage, we had access to all meta information listed in the train dataset except for
age and sex [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The test data comprised 550 images, of which 45% were used for the public dataset and 55%
for the private dataset [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The count plot in Figure 3 also illustrates the class imbalance across organs,
as presented in the training data.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Metric and Evaluation</title>
      <p>
        For model evaluation, the Dice coefficient [
        <xref ref-type="bibr" rid="ref31 ref32">31, 32</xref>
        ] was used, which was simply averaged across all
segmentation masks. The metric was evaluated on three different datasets:
1. Out-of-Fold predictions, using 5 cross-validation folds [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ]. In order to preserve class
imbalance and make the metric more robust, stratification by organ was used.
      </p>
      <p>
        2. Results from the public Kaggle test set, only on the HuBMAP part. While the public Kaggle
test set score was computed using both HPA and HuBMAP images, the final private dataset score was
calculated using only HuBMAP data. We thus decided to focus solely on the HuBMAP score by not
predicting masks for HPA images and adjusting the Kaggle public dataset score by the proportion of
HuBMAP images (roughly 72%) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>3. Results from the private Kaggle test set.</p>
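      <p>
        As a minimal sketch, the averaged Dice metric above can be computed as follows (the function names and the epsilon handling of empty masks are illustrative assumptions, not the exact competition implementation):
      </p>
      <p>
```python
import numpy as np

def dice_coefficient(pred, target, eps=1e-7):
    """Dice = 2 * |A intersect B| / (|A| + |B|) for a pair of binary masks."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

def mean_dice(preds, targets):
    """Simple average of Dice across all segmentation masks."""
    return float(np.mean([dice_coefficient(p, t) for p, t in zip(preds, targets)]))
```
      </p>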
    </sec>
    <sec id="sec-5">
      <title>5. Methods</title>
    </sec>
    <sec id="sec-6">
      <title>5.1. Model Architecture</title>
      <p>
        We have used Unet [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] and Unet++ [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] architectures with pre-trained EfficientNet B7 [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] and
Mix Vision Transformer [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] encoders. In our experiments, Unet++ showed comparable or better
results compared to the pure Unet decoder, and Mix Vision Transformer outperformed EfficientNet B7
encoders on both cross-validation and the private Kaggle dataset. For our final solution, we used a
simple average of predictions from 15 models using EfficientNet B7 and Mix Vision Transformer
encoders along with Unet and Unet++ style decoders, which outperformed any of the single models
(Table 5).
      </p>
    </sec>
    <sec id="sec-7">
      <title>5.2. Data Preparation</title>
      <p>In this challenge, competitors were asked to build a solution that can segment FTUs in a way
that is invariant to the staining protocol (HPA or HuBMAP). To achieve this goal, the organizers provided
competitors with image data for microscope slides stained using the HPA protocol and evaluated solutions
on the mixed HPA+HuBMAP dataset for the Public Leaderboard and on the HuBMAP dataset only for the
Private Leaderboard. Therefore, the biggest challenge of this competition was domain adaptation from
the HPA dataset to HuBMAP. In order to solve it, we had to adapt our training data in three ways:
● Pixel size
● Color space difference
● Tissue thickness difference</p>
      <sec id="sec-7-1">
        <title>Adapting pixel size.</title>
        <p>One of the key points was adapting to wildly varying pixel sizes. The image scales ranged from
6.3 µm/pixel for the prostate to 0.2 µm/pixel for the large intestine. We tackled this issue by rescaling
our train dataset to the target HuBMAP resolution. However, to increase the model’s receptive field we
applied additional downscalers for larger images and upscalers for smaller images (prostate). It is
important to note that additional downscalers were also used at the inference stage to avoid changing
the train/test pixel size. We used two datasets: one rescaled to HuBMAP scales and another with the
original HPA scales. The latter one was not only important for HPA predictions (absent in the private
LB) but also to provide some additional scaling information to the model. Therefore, we scaled down
images of each organ by N times in order to match HubMAP pixel size and then by M times to upscale
too small images of organs. Values of N and M can be found in Table 2.</p>
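        <p>
          As a sketch of this two-step rescaling (nearest-neighbor resizing with integer factors is a simplifying assumption for illustration; the real per-organ factors N and M are those in Table 2):
        </p>
        <p>
```python
import numpy as np

def downscale(img, n):
    """Downscale an HxWxC image by an integer factor n (nearest-neighbor)."""
    return img[::n, ::n]

def upscale(img, m):
    """Upscale an HxWxC image by an integer factor m (nearest-neighbor)."""
    return np.repeat(np.repeat(img, m, axis=0), m, axis=1)

def rescale_to_hubmap(img, n, m):
    """Scale down by N to match the HuBMAP pixel size, then up by M for
    organs whose images become too small (e.g. prostate)."""
    out = downscale(img, n) if n > 1 else img
    return upscale(out, m) if m > 1 else out
```
        </p>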
      </sec>
      <sec id="sec-7-2">
        <title>Adapting color space.</title>
        <p>
          The color spaces between HPA and HubMAP datasets were also different due to different stain
methods - DAB [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] for HPA, PAS [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ], and H&amp;E [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] for HubMAP (see Figure 4). As the competition
required segmentation of FTUs on slides stained using different staining protocols, we decided to make
the neural network invariant to color variations by applying heavy color augmentations such as
histogram matching [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ] to match the color distribution of the training images to that of HuBMAP
dataset (Figure 5).
We also applied hue-saturation-value, contrast, and gamma augmentations. To provide additional
robustness to scale and geometrical differences in FTU shapes, we also applied a range of geometric
augmentations, which included random flips, rotations, scales, shifts, elastic transforms, and more.
Some competition participants chose to apply stain normalization [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ] to cycle color between different
staining protocols. However, in our experiments, we didn’t see any improvement from stain
normalization, probably because regular stain normalization techniques are specialized for one
particular type of stain and don’t work well when applied to images stained with different protocols.
        </p>
        <p>
          We have gathered additional data from the GTEX portal [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ] and a few images from HuBMAP, to
which we applied histogram matching [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ] of all train data to GTEX and HuBMAP images. The results
of histogram matching may be observed in Figure 5. Besides this, we have used heavy geometric,
color, distortion, and scale augmentations. The main idea behind the color augmentations was to suggest
to the model that color is not important and that it had to look for other features.
        </p>
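        <p>
          A minimal per-channel histogram matching sketch (equivalent in spirit to the cited technique; the implementation below is our own illustration, not the exact one used in the pipeline):
        </p>
        <p>
```python
import numpy as np

def match_channel(src, ref):
    """Map src intensities so their CDF matches ref's (one channel)."""
    src_vals, src_idx, src_counts = np.unique(
        src.ravel(), return_inverse=True, return_counts=True)
    ref_vals, ref_counts = np.unique(ref.ravel(), return_counts=True)
    src_cdf = np.cumsum(src_counts) / src.size
    ref_cdf = np.cumsum(ref_counts) / ref.size
    mapped = np.interp(src_cdf, ref_cdf, ref_vals)
    return mapped[src_idx].reshape(src.shape)

def match_histograms(src, ref):
    """Per-channel histogram matching of an HxWxC training image (e.g. HPA)
    to a reference image (e.g. GTEX or HuBMAP)."""
    return np.stack([match_channel(src[..., c], ref[..., c])
                     for c in range(src.shape[-1])], axis=-1)
```
        </p>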
      </sec>
      <sec id="sec-7-3">
        <title>External data.</title>
        <p>
          We have not tried to solve the problem of tissue thickness explicitly but we have decided to
download additional data from different data sources and apply pseudo-labeling. We used data from
GTEX [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ] and HPA [
          <xref ref-type="bibr" rid="ref19 ref20 ref21 ref22 ref23 ref24 ref25 ref26">19-26</xref>
          ] portals to complement the initial training data. The GTEX data was
especially important here because it was stained similarly to HuBMAP [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] slides with H&amp;E [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ]. From
GTEX we downloaded prostate, large intestine, kidneys, and spleen data for patients with no apparent
pathologies. We ignored lungs from GTEX as we couldn’t figure out how to segment them and neither
manually nor using pseudo labeling. We were progressively adding GTEX images to our pipeline
ending up with around 140 at the end of the competition, though it is worth mentioning that each image
was quite large measuring tens of thousands of pixels in width and height. From the HPA site, we used
a plethora of DAB [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] stained slides very similar to those provided by the organizers. Overall, we have
added between 57K and 61K additional HPA images for each organ.
        </p>
        <p>
          We pseudo-labeled both HPA and HuBMAP images with the best ensemble (according to the Cross
Validation Score) available at the time of labeling. We did not select the most confident pseudo labels
but rather sampled the HPA and GTEX datasets at random at training time. The selection process was
inspired by the pseudo-labeling technique proposed in a semi-supervised paper [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ]. We have repeated
the pseudo-labeling procedure twice. Examples of pseudo-labeled images can be observed in Figure 6.
        </p>
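        <p>
          The random sampling of pseudo-labeled data at training time can be sketched as follows (the pool structure and the 0.5 mixing probability are illustrative assumptions):
        </p>
        <p>
```python
import random

def training_sampler(labeled, pseudo_labeled, pseudo_prob=0.5, seed=42):
    """Yield training samples, drawing from the pseudo-labeled pool
    (HPA/GTEX images labeled by the current best ensemble) at random
    rather than keeping only the most confident pseudo labels."""
    rng = random.Random(seed)
    while True:
        pool = labeled if rng.random() >= pseudo_prob else pseudo_labeled
        yield rng.choice(pool)
```
        </p>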
      </sec>
      <sec id="sec-7-4">
        <title>CutMix.</title>
        <p>
          CutMix [
          <xref ref-type="bibr" rid="ref42">42</xref>
          ] augmentation was among the top contributors to our score. We applied it with a
probability of 0.5 and used a uniform distribution to sample which part of the original image to replace
with a patch from a different image. The key trick, though, was to apply CutMix augmentation within a
single class. Examples of CutMixed images can be seen in Figure 7.
        </p>
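        <p>
          A sketch of CutMix restricted to a single class (uniform patch sampling as described above; pairing each image with a donor image of the same organ is assumed to happen in the data loader):
        </p>
        <p>
```python
import numpy as np

def same_class_cutmix(img, mask, donor_img, donor_mask, rng):
    """Replace a uniformly sampled rectangle of (img, mask) with the same
    rectangle from a donor image of the SAME organ class."""
    h, w = img.shape[:2]
    ph = rng.integers(1, h)          # patch height, sampled uniformly
    pw = rng.integers(1, w)          # patch width, sampled uniformly
    y = rng.integers(0, h - ph + 1)  # patch position
    x = rng.integers(0, w - pw + 1)
    out_img, out_mask = img.copy(), mask.copy()
    out_img[y:y + ph, x:x + pw] = donor_img[y:y + ph, x:x + pw]
    out_mask[y:y + ph, x:x + pw] = donor_mask[y:y + ph, x:x + pw]
    return out_img, out_mask
```
        </p>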
      </sec>
      <sec id="sec-7-5">
        <title>Filtering Lung samples.</title>
        <p>
          FTUs on lungs were by far the most problematic part of the dataset, with our baseline model scoring
a mere 0.05 Dice on cross-validation vs. 0.69 for the next hardest organ to segment, the spleen.
The baseline model's (Unet with EfficientNet B5 [
          <xref ref-type="bibr" rid="ref34 ref36">34, 36</xref>
          ]) Dice on different organs can be observed in Table
3.
        </p>
        <p>There were two major problems with lung FTUs (alveoli): first is inconsistent segmentation of the
FTUs between images (Figure 8), and the second is the shortage of well-segmented samples. Alveoli
on lung images were present in a collapsed and inflated form as well as horizontally and vertically
sectioned. The horizontally sectioned inflated alveoli were the most abundant group, while collapsed
and vertically sectioned alveoli amounted to only 15 samples by our estimate. When used as part of the
train set, they generated too much noise, and we decided to remove these samples from our training
pipeline.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>5.3. Training Process</title>
      <p>
        We have used 512x512 training crops to train CNN models and 1024x1024 crops for Mix Vision
Transformer [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] models. Non-empty masks were sampled with a probability of 0.5. For parameter optimization,
we have used Adam optimizer [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] with an initial learning rate of 0.001. We reduced it in the training
process with the help of the ReduceLROnPlateau [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ] algorithm with patience 3 and a factor of 0.5,
monitoring validation Dice loss. Initially, the constructed pipeline was a multiclass model with 5
channels, one for each organ. However, as only one class of organ FTUs was present in any given
image, we reformulated the task as binary semantic segmentation with a single channel containing all
the masks, no matter what organ was present in an image. Such an approach allowed for improved
generalization and better scores across all organs. To improve model robustness we have used a mixture
of four losses: binary Cross-Entropy, Dice Loss, Focal Loss [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ], and Jaccard Loss [
        <xref ref-type="bibr" rid="ref46">46</xref>
        ]. We have also
trained an EfficientNet model with PointRend head [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ] and scaled loss with a factor of 2. While we
didn’t notice a meaningful performance boost from the PointRend alone we think that its main
contribution was in adding diversity to our model ensemble as well as some regularization. We have
used PyTorch [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ] built-in mixed precision training in order to reduce GPU memory consumption
which allowed us to use a batch size of 32 samples on A100 GPUs.
      </p>
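      <p>
        The four-loss mixture can be sketched as follows (equal weighting of the terms is our assumption; the exact weights are not stated above):
      </p>
      <p>
```python
import numpy as np

def bce(p, t, eps=1e-7):
    """Binary cross-entropy between predicted probabilities p and targets t."""
    p = np.clip(p, eps, 1 - eps)
    return -np.mean(t * np.log(p) + (1 - t) * np.log(1 - p))

def dice_loss(p, t, smooth=1.0):
    inter = np.sum(p * t)
    return 1 - (2 * inter + smooth) / (np.sum(p) + np.sum(t) + smooth)

def focal_loss(p, t, gamma=2.0, eps=1e-7):
    p = np.clip(p, eps, 1 - eps)
    pt = np.where(t == 1, p, 1 - p)  # probability of the true class
    return -np.mean((1 - pt) ** gamma * np.log(pt))

def jaccard_loss(p, t, smooth=1.0):
    inter = np.sum(p * t)
    union = np.sum(p) + np.sum(t) - inter
    return 1 - (inter + smooth) / (union + smooth)

def combined_loss(p, t):
    """Mixture of binary cross-entropy, Dice, Focal, and Jaccard losses."""
    return bce(p, t) + dice_loss(p, t) + focal_loss(p, t) + jaccard_loss(p, t)
```
      </p>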
    </sec>
    <sec id="sec-9">
      <title>5.4. Inference Process</title>
      <p>
        For each fold of each of our final models, we have averaged the model parameters of the 3 best checkpoints
by validation Dice [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ]. For ensembling, we have simply averaged probability masks from each model.
We have also used Test Time Augmentations [
        <xref ref-type="bibr" rid="ref50">50</xref>
        ] with original images and three flips. We have
removed small regions after thresholding to reduce noisy masks. To do so, we have used the following
heuristic: a connected region was removed if

RegionArea / ImageArea &lt; OrganThresh (1)
      </p>
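      <p>
        The Test Time Augmentation with the original image and three flips can be sketched as follows (here model stands for any callable returning a probability mask of the same height and width):
      </p>
      <p>
```python
import numpy as np

def tta_predict(model, img):
    """Average predictions over the original image and three flips,
    un-flipping each prediction before averaging."""
    flips = [
        (lambda x: x, lambda x: x),                        # identity
        (lambda x: x[:, ::-1], lambda x: x[:, ::-1]),      # horizontal flip
        (lambda x: x[::-1, :], lambda x: x[::-1, :]),      # vertical flip
        (lambda x: x[::-1, ::-1], lambda x: x[::-1, ::-1]) # both flips
    ]
    preds = [inv(model(fwd(img))) for fwd, inv in flips]
    return np.mean(preds, axis=0)
```
      </p>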
      <p>OrganThresh for different organs was found empirically by testing its effect on Cross Validation
Dice and can be found in Table 4.</p>
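      <p>
        The small-region removal heuristic can be sketched with connected-component labeling (normalizing the region area by the total image area is our reading of heuristic (1)):
      </p>
      <p>
```python
import numpy as np
from scipy import ndimage

def remove_small_regions(mask, organ_thresh):
    """Zero out connected regions whose area fraction falls below the
    empirically chosen per-organ threshold."""
    labeled, num_regions = ndimage.label(mask)
    out = mask.copy()
    total = mask.size
    for region in range(1, num_regions + 1):
        region_pixels = labeled == region
        if region_pixels.sum() / total >= organ_thresh:
            continue  # region is large enough, keep it
        out[region_pixels] = 0
    return out
```
      </p>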
      <p>The results of training 5 models using 5 folds on out-of-fold data, public and private datasets are
outlined in Table 5. Experiment ensembles include 5 models from each experiment and metrics from
them are outlined in Table 6. Results of our approach compared to other top 5 best solutions can be
found in Table 7.</p>
    </sec>
    <sec id="sec-10">
      <title>6. Results</title>
    </sec>
    <sec id="sec-11">
      <title>6.1. Final results</title>
      <p>
        ● Mixed Vision models [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] outperformed CNN models both on cross-validation and private
test data, which suggests that these models perform better in terms of segmentation quality and
domain adaptation.
      </p>
      <p>
        ● Mean ensemble of CNNs and Mixed Vision models [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] slightly improved results compared
to the solo CNN or Mixed Vision Transformer [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] approach.
      </p>
    </sec>
    <sec id="sec-12">
      <title>6.2. Ablation Study</title>
      <p>
        In this section we will outline model performance improvements in terms of Dice score [
        <xref ref-type="bibr" rid="ref31 ref32">31, 32</xref>
        ]
as we introduced the changes described in the previous sections (Table 8).
From this table we can clearly see that:
      </p>
    </sec>
    <sec id="sec-13">
      <title>7. Conclusion</title>
      <p>● Introduced changes improved Out of Fold Dice and Private Dice, which means that overall
model performance increased on both the HPA and HuBMAP datasets.</p>
      <p>● Introduced changes decreased, and mostly eliminated, the gap between Out of Fold Dice and Private
Dice, which means that they have accomplished the domain adaptation task between the HPA and HuBMAP
datasets.</p>
      <p>
        Also, each organ's Dice improved; in particular, the lung Dice improved by more than 10 times.
This paper introduced the FTU segmentation training pipeline, which showed near state-of-the-art
performance both on HPA [
        <xref ref-type="bibr" rid="ref19 ref20 ref21 ref22 ref23 ref24 ref25 ref26">19-26</xref>
        ] and HubMAP [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] datasets, minimizing the domain gap between
them. The proposed methods allowed the adaptation of models from the HPA domain to HuBMAP, reducing
the difference in the Dice score between test sets on the HPA and HuBMAP domains. Also, we have
considerably increased our score on the HPA test set. We believe that the proposed methods can be
used both for increasing the performance of semantic segmentation models on one domain and for
adapting these models from one domain to another.
      </p>
    </sec>
    <sec id="sec-14">
      <title>8. Acknowledgements</title>
      <p>First, we would like to thank the Armed Forces of Ukraine, the Security Service of Ukraine, the Defence
Intelligence of Ukraine, and the State Emergency Service of Ukraine for providing the safety and security needed
to participate in this great competition, complete this work, and help science and technology keep moving
forward. Also, we want to thank the Kaggle team, the Google team, Genentech, and Indiana
University for hosting the HuBMAP + HPA - Hacking the Human Body competition, which gave us all the
needed data and materials to build models, test hypotheses, and write this paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1] Kaggle: HuBMAP + HPA - Hacking the Human Body,
          <year>2022</year>
          . URL: https://www.kaggle.com/competitions/hubmap-organ-segmentation
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name><surname>de Bono</surname> <given-names>B</given-names></string-name>,
          <string-name><surname>Grenon</surname> <given-names>P</given-names></string-name>,
          <string-name><surname>Baldock</surname> <given-names>R</given-names></string-name>,
          <string-name><surname>Hunter</surname> <given-names>P</given-names></string-name>:
          <article-title>“Functional tissue units and their primary tissue motifs in multi-scale physiology.”</article-title>
          <source>J Biomed Semantics</source>
          .
          <year>2013</year>
          Oct 8;
          <volume>4</volume>
          (
          <issue>1</issue>
          ):
          <fpage>22</fpage>
          . doi: 10.1186/2041-1480-4-22.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3] Leah L. Godwin, Yingnan Ju, Naveksha Sood, Yashvardhan Jain, Ellen M. Quardokus, Andreas Bueckle, Teri Longacre, Aaron Horning, Yiing Lin, Edward D. Esplin, John W. Hickey, Michael P. Snyder, N. Heath Patterson, Jeffrey M. Spraggins, Katy Börner. “
          <article-title>Robust and generalizable segmentation of human functional tissue units</article-title>
          .” bioRxiv
          <year>2021</year>
          .11.09.467810; doi: https://doi.org/10.1101/2021.11.09.467810
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4] Jonathan Long, Evan Shelhamer, Trevor Darrell. “
          <article-title>Fully Convolutional Networks for Semantic Segmentation</article-title>
          .” IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (
          <year>2015</year>
          ). doi: 10.1109/CVPR.2015.7298965
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] Kaggle: HuBMAP - Hacking the Kidney. URL: https://www.kaggle.com/c/hubmap-kidney-segmentation
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] Dice, Lee R.
          <article-title>"Measures of the Amount of Ecologic Association Between Species"</article-title>
          .
          <source>Ecology</source>
          .
          <volume>26</volume>
          (
          <issue>3</issue>
          ):
          <fpage>297</fpage>
          -
          <lpage>302</lpage>
          (
          <year>1945</year>
          ). doi: 10.2307/1932409. JSTOR 1932409
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7] Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, Jianming Liang. “
          <article-title>UNet++: A Nested U-Net Architecture for Medical Image Segmentation</article-title>
          ” arXiv:1807.10165 (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Tsung-Yi</given-names>
            <surname>Lin</surname>
          </string-name>
          , Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie: “
          <article-title>Feature Pyramid Networks for Object Detection</article-title>
          ” arXiv:1612.03144 (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Liang-Chieh</given-names>
            <surname>Chen</surname>
          </string-name>
          , George Papandreou, Florian Schroff, Hartwig Adam: “
          <article-title>Rethinking Atrous Convolution for Semantic Image Segmentation</article-title>
          ” arXiv:1706.05587 (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Jing</given-names>
            <surname>Xu</surname>
          </string-name>
          , Yu Pan, Xinglin Pan, Steven Hoi, Zhang Yi, Zenglin Xu: “
          <article-title>RegNet: Self-Regulated Network for Image Classification</article-title>
          ” arXiv:2101.00590 (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Sangdoo</given-names>
            <surname>Yun</surname>
          </string-name>
          , Dongyoon Han, Seong Joon Oh,
          <string-name>
            <given-names>Sanghyuk</given-names>
            <surname>Chun</surname>
          </string-name>
          , Junsuk Choe, Youngjoon Yoo: “
          <article-title>CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features</article-title>
          ” arXiv:1905.04899 (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Hongyi</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Moustapha Cisse,
          <string-name>
            <given-names>Yann N.</given-names>
            <surname>Dauphin</surname>
          </string-name>
          , David Lopez-Paz: “
          <article-title>mixup: Beyond Empirical Risk Minimization</article-title>
          ” arXiv:1710.09412 (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Jiaqian</given-names>
            <surname>Yu</surname>
          </string-name>
          , Matthew Blaschko: “
          <article-title>The Lovász Hinge: A Novel Convex Surrogate for Submodular Losses</article-title>
          ” arXiv:1512.07797 (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Liyuan</given-names>
            <surname>Liu</surname>
          </string-name>
          , Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han: “
          <article-title>On the Variance of the Adaptive Learning Rate and Beyond</article-title>
          ” arXiv:1908.03265 (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Michael R.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , James Lucas, Geoffrey Hinton, Jimmy Ba: “
          <article-title>Lookahead Optimizer: k steps forward, 1 step back</article-title>
          ” arXiv:1907.08610 (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Ilya</given-names>
            <surname>Loshchilov</surname>
          </string-name>
          , Frank Hutter: “
          <article-title>Decoupled Weight Decay Regularization</article-title>
          ” arXiv:1711.05101 (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Herbert</given-names>
            <surname>Robbins</surname>
          </string-name>
          , Sutton Monro: “
          <article-title>A Stochastic Approximation Method</article-title>
          ”
          <source>Ann. Math. Statist.</source>
          <volume>22</volume>
          (
          <issue>3</issue>
          ):
          <fpage>400</fpage>
          -
          <lpage>407</lpage>
          (
          <year>1951</year>
          ). doi: 10.1214/aoms/1177729586
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Diederik P.</given-names>
            <surname>Kingma</surname>
          </string-name>
          , Jimmy Ba: “
          <article-title>Adam: A Method for Stochastic Optimization</article-title>
          ” arXiv:1412.6980 (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Uhlén</surname>
            <given-names>M</given-names>
          </string-name>
          et al., “
          <article-title>Tissue-based map of the human proteome</article-title>
          .”
          <source>Science</source>
          (
          <year>2015</year>
          ) PubMed: 25613900 doi: 10.1126/science.1260419
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Thul</surname>
            <given-names>PJ</given-names>
          </string-name>
          et al., “
          <article-title>A subcellular map of the human proteome</article-title>
          .”
          <source>Science</source>
          (
          <year>2017</year>
          ) PubMed: 28495876 doi: 10.1126/science.aal3321
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Sjöstedt</surname>
            <given-names>E</given-names>
          </string-name>
          et al.,
          <article-title>“An atlas of the protein-coding genes in the human, pig, and mouse brain</article-title>
          .”
          <source>Science</source>
          . (
          <year>2020</year>
          ) PubMed: 32139519 doi: 10.1126/science.aay5947
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Karlsson</surname>
            <given-names>M</given-names>
          </string-name>
          et al., “
          <article-title>A single-cell type transcriptomics map of human tissues</article-title>
          .”
          <source>Sci Adv.</source>
          (
          <year>2021</year>
          ) PubMed: 34321199 doi: 10.1126/sciadv.abh2169
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Uhlen</surname>
            <given-names>M</given-names>
          </string-name>
          et al., “
          <article-title>A pathology atlas of the human cancer transcriptome</article-title>
          .”
          <source>Science</source>
          (
          <year>2017</year>
          ) PubMed: 28818916 doi: 10.1126/science.aan2507
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Uhlen</surname>
            <given-names>M</given-names>
          </string-name>
          et al.,
          <article-title>“A genome-wide transcriptomic analysis of protein-coding genes in human blood cells</article-title>
          .”
          <source>Science</source>
          . (
          <year>2019</year>
          ) PubMed: 31857451 doi: 10.1126/science.aax9198
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Sjöstedt</surname>
            <given-names>E</given-names>
          </string-name>
          et al.,
          <article-title>“An atlas of the protein-coding genes in the human, pig, and mouse brain</article-title>
          .”
          <source>Science</source>
          . (
          <year>2020</year>
          ) PubMed: 32139519 doi: 10.1126/science.aay5947
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Uhlén</surname>
            <given-names>M</given-names>
          </string-name>
          et al., “
          <article-title>The human secretome</article-title>
          .”
          <source>Sci Signal.</source>
          (
          <year>2019</year>
          ) PubMed: 31772123 doi: 10.1126/scisignal.aaz0274
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Snyder</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Posgai</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          et al. “
          <article-title>The human body at cellular resolution: the NIH Human Biomolecular Atlas Program</article-title>
          .”
          <source>Nature</source>
          <volume>574</volume>
          ,
          <fpage>187</fpage>
          -
          <lpage>192</lpage>
          (
          <year>2019</year>
          ). doi: 10.1038/s41586-019-1629-x
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Litwin</surname>
            <given-names>JA.</given-names>
          </string-name>
          <article-title>Histochemistry and cytochemistry of 3,3'-diaminobenzidine. A review</article-title>
          .
          <source>Folia Histochem Cytochem (Krakow)</source>
          .
          <year>1979</year>
          ;
          <volume>17</volume>
          (
          <issue>1</issue>
          ):
          <fpage>3</fpage>
          -
          <lpage>28</lpage>
          . PMID: 220157.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Gomori</surname>
            <given-names>G.</given-names>
          </string-name>
          <article-title>The periodic-acid Schiff stain</article-title>
          .
          <source>Am J Clin Pathol</source>
          .
          <year>1952</year>
          Mar;
          <volume>22</volume>
          (
          <issue>3</issue>
          ):
          <fpage>277</fpage>
          -
          <lpage>81</lpage>
          . doi: 10.1093/ajcp/22.3_ts.277. PMID: 14902736.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>Feldman</surname>
            <given-names>AT</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolfe</surname>
            <given-names>D.</given-names>
          </string-name>
          <article-title>Tissue processing and hematoxylin and eosin staining</article-title>
          .
          <source>Methods Mol Biol</source>
          .
          <year>2014</year>
          ;
          <volume>1180</volume>
          :
          <fpage>31</fpage>
          -
          <lpage>43</lpage>
          . doi: 10.1007/978-1-4939-1050-2_3. PMID: 25015141.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Sørensen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          "
          <article-title>A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons"</article-title>
          .
          <source>Kongelige Danske Videnskabernes Selskab</source>
          .
          <volume>5</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>34</lpage>
          . (
          <year>1948</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Dice</surname>
            ,
            <given-names>Lee R.</given-names>
          </string-name>
          .
          <article-title>"Measures of the Amount of Ecologic Association Between Species"</article-title>
          .
          <source>Ecology</source>
          .
          <volume>26</volume>
          (
          <issue>3</issue>
          ):
          <fpage>297</fpage>
          -
          <lpage>302</lpage>
          . (
          <year>1945</year>
          ). doi: 10.2307/1932409. JSTOR 1932409
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>Allen</surname>
            ,
            <given-names>David M.</given-names>
          </string-name>
          "
          <article-title>The Relationship between Variable Selection and Data Agumentation and a Method for Prediction"</article-title>
          .
          <source>Technometrics</source>
          .
          <volume>16</volume>
          (
          <issue>1</issue>
          ):
          <fpage>125</fpage>
          -
          <lpage>127</lpage>
          . (
          <year>1974</year>
          ). doi: 10.2307/1267500. JSTOR 1267500
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>Olaf</given-names>
            <surname>Ronneberger</surname>
          </string-name>
          , Philipp Fischer and
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Brox</surname>
          </string-name>
          . “
          <article-title>U-Net: Convolutional Networks for Biomedical Image Segmentation</article-title>
          .”
          <source>Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015. Lecture Notes in Computer Science</source>
          , vol
          <volume>9351</volume>
          . Springer, Cham. doi: 10.1007/978-3-319-24574-4_28
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>Zongwei</given-names>
            <surname>Zhou</surname>
          </string-name>
          , Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, Jianming Liang. “
          <article-title>UNet++: A Nested U-Net Architecture for Medical Image Segmentation</article-title>
          .”
          <source>Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. DLMIA ML-CDS 2018. Lecture Notes in Computer Science</source>
          , vol
          <volume>11045</volume>
          . Springer, Cham. doi: 10.1007/978-3-030-00889-5_1
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>Mingxing</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Quoc V.</given-names>
            <surname>Le</surname>
          </string-name>
          .
          <article-title>"EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks"</article-title>
          . arXiv:1905.11946 (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>Jie-Neng</given-names>
            <surname>Chen</surname>
          </string-name>
          , Shuyang Sun, Ju He, Philip Torr, Alan Yuille and
          <string-name>
            <given-names>Song</given-names>
            <surname>Bai</surname>
          </string-name>
          . “
          <article-title>TransMix: Attend to Mix for Vision Transformers</article-title>
          .” arXiv:2111.09833 (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          [38] Scikit-image Histogram matching. URL: https://scikit-image.org/docs/dev/auto_examples/color_exposure/plot_histogram_matching.html
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>Marc</given-names>
            <surname>Macenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Marc</given-names>
            <surname>Niethammer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Marron</surname>
          </string-name>
          , David Borland, John T. Woosley, Xiaojun Guan, Charles Schmitt, and Nancy E. Thomas. “
          <article-title>A method for normalizing histology slides for quantitative analysis</article-title>
          ” https://www.cs.unc.edu/~mn/sites/default/files/macenko2009.pdf
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40] GTEx Histology Viewer. URL: https://www.gtexportal.org/home/histologyPage
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>Yauhen</given-names>
            <surname>Babakhin</surname>
          </string-name>
          , Artsiom Sanakoyeu, Hirotoshi Kitamura. “
          <article-title>Semi-Supervised Segmentation of Salt Bodies in Seismic Images using an Ensemble of Convolutional Neural Networks</article-title>
          .” arXiv:1904.04445 (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>Sangdoo</given-names>
            <surname>Yun</surname>
          </string-name>
          , Dongyoon Han, Seong Joon Oh,
          <string-name>
            <given-names>Sanghyuk</given-names>
            <surname>Chun</surname>
          </string-name>
          , Junsuk Choe and
          <string-name>
            <given-names>Youngjoon</given-names>
            <surname>Yoo</surname>
          </string-name>
          . “
          <article-title>CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features</article-title>
          .” in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South),
          <year>2019</year>
          , pp.
          <fpage>6022</fpage>
          -
          <lpage>6031</lpage>
          . doi: 10.1109/ICCV.2019.00612
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <surname>Kingma</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ba</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          “
          <article-title>Adam: A Method for Stochastic Optimization</article-title>
          .”
          <source>Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015)</source>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44] PyTorch: ReduceLROnPlateau,
          <year>2022</year>
          . URL: https://pytorch.org/docs/stable/generated/torch.optim.lr_scheduler.ReduceLROnPlateau.html
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>T.-Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Girshick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>He</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Dollár</surname>
          </string-name>
          ,
          <article-title>"Focal Loss for Dense Object Detection"</article-title>
          .
          <source>IEEE International Conference on Computer Vision (ICCV)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>2999</fpage>
          -
          <lpage>3007</lpage>
          . doi: 10.1109/ICCV.2017.324.
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <surname>Jaccard</surname>
            ,
            <given-names>Paul</given-names>
          </string-name>
          . "
          <article-title>The Distribution of the Flora in the Alpine Zone.1"</article-title>
          .
          <source>New Phytologist</source>
          .
          <volume>11</volume>
          (
          <issue>2</issue>
          ):
          <fpage>37</fpage>
          -
          <lpage>50</lpage>
          . (
          <year>1912</year>
          ). doi: 10.1111/j.1469-8137.1912.tb05611.x. ISSN 0028-646X
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Kirillov</surname>
          </string-name>
          , Yuxin Wu, Kaiming He and
          <string-name>
            <given-names>Ross</given-names>
            <surname>Girshick</surname>
          </string-name>
          . “
          <article-title>PointRend: Image Segmentation as Rendering</article-title>
          ”
          .
          <source>CVPR</source>
          <year>2020</year>
          :
          <fpage>9796</fpage>
          -
          <lpage>9805</lpage>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <surname>Paszke</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gross</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Massa</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lerer</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bradbury</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chanan</surname>
            <given-names>G</given-names>
          </string-name>
          , et al. “
          <article-title>PyTorch: An Imperative Style, High-Performance Deep Learning Library</article-title>
          .” In:
          <source>Advances in Neural Information Processing Systems</source>
          <volume>32</volume>
          . Curran Associates, Inc.; pp.
          <fpage>8024</fpage>
          -
          <lpage>35</lpage>
          . (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <surname>Izmailov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Podoprikhin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garipov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vetrov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Wilson</surname>
            ,
            <given-names>A. G.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Averaging weights leads to wider optima and better generalization</article-title>
          . In R. Silva &amp; A. Globerson (Eds.),
          <source>34th Conference on Uncertainty in Artificial Intelligence 2018, UAI 2018</source>
          , Vol.
          <volume>2</volume>
          (pp.
          <fpage>876</fpage>
          -
          <lpage>885</lpage>
          ). Association For Uncertainty in Artificial Intelligence (AUAI).
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [50]
          <string-name>
            <surname>Shorten</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khoshgoftaar</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          “
          <article-title>A survey on Image Data Augmentation for Deep Learning</article-title>
          .”
          <source>J Big Data</source>
          <volume>6</volume>
          ,
          <issue>60</issue>
          (
          <year>2019</year>
          ). doi: 10.1186/s40537-019-0197-0
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>