<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Investigating Time Series Classification Techniques for Rapid Pathogen Identification with Single-Cell MALDI-TOF Mass Spectrum Data</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Christina</forename><surname>Papagiannopoulou</surname></persName>
							<email>christina.papagiannopoulou@ugent.be</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Data Analysis and Mathematical Modelling</orgName>
								<orgName type="institution">Ghent University</orgName>
								<address>
									<settlement>Ghent</settlement>
									<country key="BE">Belgium</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">René</forename><surname>Parchen</surname></persName>
							<email>parchen@biosparq.nl</email>
							<affiliation key="aff1">
								<orgName type="institution">BiosparQ B.V</orgName>
								<address>
									<settlement>Leiden</settlement>
									<country key="NL">the Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Willem</forename><surname>Waegeman</surname></persName>
							<email>willem.waegeman@ugent.be</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Data Analysis and Mathematical Modelling</orgName>
								<orgName type="institution">Ghent University</orgName>
								<address>
									<settlement>Ghent</settlement>
									<country key="BE">Belgium</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Investigating Time Series Classification Techniques for Rapid Pathogen Identification with Single-Cell MALDI-TOF Mass Spectrum Data</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">C60B11D40F830F7559CCF5F565667F2C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T11:03+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>MALDI-TOF MS</term>
					<term>single-cell spectrum</term>
					<term>single-ionizationevent</term>
					<term>classification</term>
					<term>bacterial species identification</term>
					<term>time series</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF-MS) is a well-known technology, widely used in species identification. Specifically, MALDI-TOF-MS is applied on samples that usually include bacterial cells, generating representative signals for the various bacterial species. However, for a reliable identification result, a significant amount of biomass is required. For most samples used for diagnostics of infectious diseases, the sample volume is extremely low to obtain the required amount of biomass. Therefore, amplification of the bacterial load is performed by a culturing phase. If the MALDI process could be applied to individual bacteria, it would be possible to circumvent the need for culturing and isolation, accelerating the whole process. In this work, we briefly describe an implementation of a MALDI-TOF MS procedure in a setting of individual cells and we demonstrate the use of the produced data for the application of pathogen identification. The identification of pathogens (bacterial species) is performed by using machine learning algorithms on the generated single-cell signals. The high predictive performance of the machine learning models indicates that the produced bacterial signatures constitute an informative representation, helpful in distinguishing the different bacterial species. In addition, we reformulate the bacterial species identification problem as a time series classification task by considering the intensity sequences of a given spectrum as time series values. Experimental results show that algorithms originally introduced for time series analysis are beneficial in modelling observations of single-cell MALDI-TOF MS.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In the diagnostics of infectious diseases, matrix-assisted laser desorption/ ionizationtime-of-flight mass spectrometry (MALDI-TOF-MS) is used to identify the causative organism of an infection as a first step in establishing an antibiotic therapy. Owing to its ease of use, its reliability and the low cost of ownership, the introduction of MALDI-TOF-MS revolutionized the diagnostics of infectious diseases during the last decade <ref type="bibr" target="#b0">[1]</ref>. BiosparQ in the Netherlands developed an instrument, called Cirrus D20, together with the appropriate protocols that is able to produce an information-rich signature of bacteria based on MALDI-TOF-MS technology. This abstract is based on <ref type="bibr" target="#b2">[3]</ref>, in which we evaluate the single-cell MALDI-TOF MS methodology developed by BiosparQ, demonstrating the use of single-particle spectra for the application of pathogen (bacterial species) identification.</p><p>The classification of single-cell bacterial fingerprints is not a trivial process even for human annotators. Thus, MALDI-TOF single cell spectrum analysis should be combined with statistical and machine learning methods. In <ref type="bibr" target="#b2">[3]</ref>, we focus on the analysis of MALDI-TOF single-cell spectra for rapid species identification using machine learning techniques. Instead of only applying general purpose machine learning techniques <ref type="bibr" target="#b1">[2]</ref>, we also experimented by framing the problem as a time series classification task. In particular, by mapping mass-overcharge (M/Z) ratios to the time axis, we consider the sequences of the different intensities in a spectrum as time series values. This way, standard time series classification methods can be applied. To the best of our knowledge, this is the first time that machine learning approaches and time series classification methods are being applied on single-cell MALDI-TOF data.</p><p>The contribution of our work is two-fold. Based on the implementation of the MALDI-TOF-MS methodology in a single-cell setting, we (i) experimentally prove that the single-cell signatures, produced by this MALDI-TOF-MS implementation, are informative in distinguishing different bacterial species by using machine learning data analysis, and (ii) find that algorithms originally introduced for time series analysis are beneficial in modelling observations of singlecell MALDI. As such, we believe that the use of single-cell MALDI-TOF-MS data combined with an accurate modelling approach comprises a solid framework that strives to solve the problem of fast pathogen identification (in terms of minutes or seconds), revolutionizing current state-of-the-art approaches.</p></div>		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Progress in proteomics for clinical microbiology: MALDI-TOF MS for microbial species identification and more</title>
		<author>
			<persName><forename type="first">A</forename><surname>Van Belkum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chatellier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Girard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Pincus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Deol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M D</forename><surname>Jr</surname></persName>
		</author>
		<idno>14789450.2015.1091731</idno>
		<ptr target="https://doi.org/10.1586/" />
	</analytic>
	<monogr>
		<title level="j">Expert Review of Proteomics</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="595" to="605" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Bacterial species identification from maldi-tof mass spectra through data analysis and machine learning</title>
		<author>
			<persName><forename type="first">K</forename><surname>De Bruyne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Slabbinck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Waegeman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vauterin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>De Baets</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vandamme</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.syapm.2010.11.003</idno>
		<ptr target="http://dx.doi.org/10.1016/j.syapm.2010.11.003" />
	</analytic>
	<monogr>
		<title level="j">SYSTEMATIC AND APPLIED MICROBI-OLOGY</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="20" to="29" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Investigating time series classification techniques for rapid pathogen identification with single-cell maldi-tof mass spectrum data</title>
		<author>
			<persName><forename type="first">C</forename><surname>Papagiannopoulou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Parchen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Waegeman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD&apos;19)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note>accepted</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
