<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Evaluation of Expectation Maximization for the Segmentation of Cervical Cell Nuclei</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Alexander</forename><surname>Ihlow</surname></persName>
							<email>alexander.ihlow@tu-ilmenau.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Ilmenau University of Technology</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Christian</forename><surname>Held</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Fraunhofer Institute for Integrated Circuits IIS</orgName>
								<address>
									<settlement>Erlangen</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Christoph</forename><surname>Rothaug</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Ilmenau University of Technology</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Claudia</forename><surname>Dach</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Fraunhofer Institute for Integrated Circuits IIS</orgName>
								<address>
									<settlement>Erlangen</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Thomas</forename><surname>Wittenberg</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Fraunhofer Institute for Integrated Circuits IIS</orgName>
								<address>
									<settlement>Erlangen</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dirk</forename><surname>Steckhan</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Fraunhofer Institute for Integrated Circuits IIS</orgName>
								<address>
									<settlement>Erlangen</settlement>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Evaluation of Expectation Maximization for the Segmentation of Cervical Cell Nuclei</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">60C632675196D14491A93CAA8844D9EF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T04:32+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>As cervical cancer is one of the most common cancers worldwide, screening programs have been established. For that task stained slides of cervical cells are visually assessed under a microscope for dysplastic or malignant cells. To support this challenge, image processing methods offer advantages for objective classification. As the cell nuclei carry a high extent visual information, all depicted cell nuclei need to be delineated. Within this work, the expectation maximization (EM) algorithm is evaluated as a yet unused method for this task. The EM was trained on 33 micrographs, where nuclei were manually annotated as reference. The EM was evaluated with varying parameter for the number of classes and with four different color spaces (RGB, Lab, HSV, polar HSV). Segmentation results for all images and parameters were compared to the ground truth, yielding average accuracy and standard deviation for all cells. The best color spaces were RGB and Lab. The best number of classes to be used in the color space was found to be K = 3. It can be concluded that the EM is an appropriate and useful approach for cell nuclei segmentation, but needs some image post-processing for the elimination of false positives.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Since cervical cancer is one of the most common cancers amongst women worldwide, screening programs have been established and are carried out in many countries. For the screening, cervical cells are obtained from the portio vaginalis using a brush during routine examination. The cells are then prepared directly on a slide or are prepared applying a monolayer-preparation (Fig. <ref type="figure">1a</ref>). To make the cells visible for microscopic examinations, the slide is stained by the method of Papanicolaou, also known as PAP-stain <ref type="bibr" target="#b0">[1]</ref>.</p><p>For visual assessment the stained slides are put under a microscope and screened for dysplastic or malignant cells by a trained specialist. Since this work is tedious and tiring, the viewing and screening process depends on the professional competence as well as the personal and subjective comprehension of the specialist. This comprehension may change during a working day depending on stress, fatigue, and personal issues and may also differ between two people. Under these conditions methods of digital image processing can offer advantages for a more objective classification of these highly complex images. Under the assumption of standardized staining and smearing techniques, machines tend to be neutral and immune to inter-and intra-observer changes and influences.</p><p>As the nuclei of cervical cells carry a high extent of morphological and textural information, which can be used to diagnose pre-cancerous stages (CIN I-III) as well cancer itself, in a primary step all depicted cell nuclei need to be detected and delineated against the surrounding cell plasma and image back ground.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1">State of the Art</head><p>Various approaches have been suggested to solve the problem of cervical cell nuclei detection and segmentation within automated micrograph analysis. Morphological image processing <ref type="bibr" target="#b1">[2]</ref> is well understood and very suitable for objects with well-defined size and form <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref> and nuclei can be detected using a nucleusshaped structuring element. <ref type="bibr" target="#b3">[4]</ref> have suggested the application of a linear tophat-operator <ref type="bibr" target="#b4">[5]</ref>. <ref type="bibr" target="#b5">[6]</ref> make use of thresholding and morphology, where in a postprocessing step candidates for cell nuclei are selected or rejected using a fuzzy clustering approach. An alternative is the approximation of the nuclei borders by a LoG edge detector <ref type="bibr" target="#b6">[7]</ref>, which are post-processed using morphological operators for the detection of cell nuclei. Another approach is the circular Hough transform <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b7">8]</ref>, which is applied to detect circular structures in gradient images. A technique intended to provide further information about cell nuclei is the use of multi-modal image pairs <ref type="bibr" target="#b8">[9]</ref>, where an additional fluorescence (FL) staining is applied high lightening only the cell nuclei. On FL image a thresholding operation can be applied to detect the cell nuclei. Disadvantage is that the FL image has to be registered to the PAP brightfield image, which is a difficult and expensive task Within this work the well-known expectation maximization (EM) approach <ref type="bibr" target="#b9">[10]</ref> is evaluated as a yet unused method for automated detection and segmentation of cervical cells.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Materials and Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Methods</head><p>Using the EM algorithm, usually a Gaussian mixture model (GMM) is applied to describe the data at hand. The distribution of the annotated color samples x is modeled by a mixture of</p><formula xml:id="formula_0">K classes f (x) = ∑ K k=1 π k f (x | λ k ) with ∑ K k=1 π k = 1 and ∫ x f (x | λ k ) dx = 1</formula><p>, where π k is the a-priori probability of class k and λ k denotes the set of parameters which describes the distribution of class k. As a description for the distribution of the mixture components the multivariate Gaussian is chosen due to its convenience. Its equal-probability surfaces describe (hyper)ellipsoids in the d-dimensional space. Here, d = 3 corresponds to the tristimulus color spaces RGB, Lab, HSV, and polar HSV. The model parameters λ k consist of the mean vector µ k ∈ R d×1 describing the center of the ellipsoid, and the covariance matrix Σ k ∈ R d×d determining its shape and orientation. This yields a GMM of</p><formula xml:id="formula_1">f (x | λ k = {µ k , Σ k }) = ( 1/ √ (2π) d |Σ k | ) exp [ − 1 2 (x − µ k ) T Σ −1 k (x − µ k )</formula><p>]</p><p>(1) The EM algorithm <ref type="bibr" target="#b9">[10]</ref> is an iterative technique for finding the maximum likelihood parameter estimates when fitting a distribution onto a given data set. During the iterations, the probability p of the N data samples x n belonging to class k is calculated by Bayes' theorem, known as the expectation step</p><formula xml:id="formula_2">p(k | x n ) = π k f (x n | λ k )/ ∑ K j=1 π j f (x n | λ j )</formula><p>In the subsequent maximization step, updated prior probabilities, mean vectors, and covariance matrices for each class are calculated, using</p><formula xml:id="formula_3">π new k = X/N , µ new k = 1 X ∑ N n=1 Y x n , Σ new k = 1 X ∑ N n=1 Y (x n − µ new k )(x n − µ new k ) T with X = ∑ N n=1 p(k | x n , λ k ) and Y = p(k | x n , λ k ).</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Experiments</head><p>For the training and evaluation of the proposed methods an image data set of 33 cervical micrographs with a spatial resolution of 1000 × 700 pixels has been used where all nuclei were manually annotated as reference or ground truth by an expert. The cells in these probes are typically stained depicting colors from basophile (blue) to eosinophile (red). The images used are ranging from healthy to a dysplastic CIN III state (nearly tumorous), and thus cover the complete range of cervical cells. Fig. <ref type="figure">1a</ref> shows a typical example of an micrograph with cervical cells, while in Fig. <ref type="figure">1b</ref> some representative regions of the classes nuclei and rest (including cell plasma and background) were manually marked. Fig. <ref type="figure">1c</ref> depicts the ground truth of cell nuclei used for later performance evaluation. The EM approach was evaluated with various parameters (K = 3, 4, 5, 6 classes) to describe the number of clouds in the color spaces as well as with different color spaces, including RGB, Lab, HSV, and polar HSV. For all experiments, the EM algorithm was terminated after four iterations. In Fig. <ref type="figure">2</ref>, the segmentation results for the example image from Fig. <ref type="figure">1a</ref> with the above described parameters is depicted. It can be seen that the depending on the two parameters investigated, being the number of classes K and the color spaces, the resulting images show more or less over-and under-segmentation artifacts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Results</head><p>For evaluation, each image was subdivided into Voronoi regions based on the ground truth nuclei (Fig. <ref type="figure">2</ref>). The segmentation accuracy A was determined for each Voronoi region by A = (N TP + N TN )/(N TP + N FP +N FN + N TN ), where N T P , N F P , N T N , and N F N denote the number of true and false positive, and true and false negative pixels, respectively. Based on these measurements, the average accuracy and standard deviation of all cells was evaluated as depicted in Fig. <ref type="figure" target="#fig_1">3</ref>. It is visible that the RGB and Lab color spaces outperform the HSV and polar HSV color spaces for all K. Furthermore, results obtained by EM with K = 3 in the RGB color space show the best average accuracy values while exhibiting the lowest standard deviation. This is consistent with the example segmentations shown in Fig. <ref type="figure">2</ref>. Therefore, this parameter set seems to be best suited for the presented task. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Discussion</head><p>We have presented and evaluated an EM approach using different color spaces (RGB, Lab, HSV, polar HSV) for the segmentation of cervical cell nuclei. Our results indicate that the RGB and LAB color spaces are most suitable for this task. Using these color spaces the EM is able to perform a segmentation with high sensitivity. A drawback of this method is caused by its strong dependency on the initialization. Future research could focus on the improvement of the specificity of the proposed methods by appropriate post processing steps. Furthermore, some post processing is needed to eliminate false positive pixels.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .Fig. 2 .</head><label>12</label><figDesc>Fig. 1. Micrograph with cervical cells (a), manual marking of representative areas (K = 6 classes) for EM initialization (b), and ground truth of cell nuclei (c).</figDesc><graphic coords="3,138.61,541.50,110.60,77.42" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Average classification accuracy A for the different numbers of classes K and varying color spaces. Error bars indicate standard deviation for the parameterization.</figDesc><graphic coords="5,202.40,145.71,207.50,132.39" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A new procedure for staining vaginal smears</title>
		<author>
			<persName><forename type="first">G</forename><surname>Papanicolaou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<imprint>
			<biblScope unit="volume">95</biblScope>
			<biblScope unit="page" from="438" to="439" />
			<date type="published" when="1942">1942</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Morphological Image Analysis: Principles &amp; Applications</title>
		<author>
			<persName><forename type="first">P</forename><surname>Soille</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Cell segmentation with adaptive region growing</title>
		<author>
			<persName><forename type="first">D</forename><surname>Anoraganingrum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kröner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Gottfried</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc Int Conf Image Anal Process</title>
				<meeting>Int Conf Image Anal ess</meeting>
		<imprint>
			<date type="published" when="1999">1999</date>
			<biblScope unit="page" from="27" to="29" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Detection and segmentation of cervical cell nuclei</title>
		<author>
			<persName><forename type="first">H</forename><surname>Köhler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Wittenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Paulus</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proc ICMP &amp; BMT, Biomed Tech</title>
		<imprint>
			<biblScope unit="volume">50</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="288" to="289" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Einführung in die digitale Bildverarbeitung</title>
		<author>
			<persName><forename type="first">W</forename><surname>Abmayer</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1994">1994</date>
			<publisher>Teubner</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Automated detection of cell nuclei in PAP stained cervical smear images using fuzzy clustering</title>
		<author>
			<persName><forename type="first">M</forename><surname>Plissiti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc EMBEC</title>
				<meeting>EMBEC</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="637" to="641" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">An image analysis-based approach for automated counting of cancer cell nuclei in tissue sections</title>
		<author>
			<persName><forename type="first">C</forename><surname>Loukas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cytometry A</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="page" from="30" to="42" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Hough transform for locating cell nuclei</title>
		<author>
			<persName><forename type="first">A</forename><surname>Thomas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Davies</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Luxmoore</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEE Colloq Appl Image Proc in Mass Health Screening</title>
				<imprint>
			<date type="published" when="1992">1992</date>
			<biblScope unit="page" from="8" to="9" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Robust cell image segmentation. Pattern Recogn Image Anal</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lindblad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Bengsston</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wählby</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="157" to="167" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Maximum Likelihood from Incomplete Data via the EM algorithm</title>
		<author>
			<persName><forename type="first">A</forename><surname>Dempster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Laird</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rubin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Royal Stat Soc B</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="38" />
			<date type="published" when="1977">1977</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
