<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">The Medico-Task 2018: Disease Detection in the Gastrointestinal Tract using Global Features and Deep Learning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Vajira</forename><surname>Thambawita</surname></persName>
							<email>vajira@simula.no</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Simula Research Laboratory</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="institution">Simula Metropolitan</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Debesh</forename><surname>Jha</surname></persName>
							<email>debesh@simula.no</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Simula Research Laboratory</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff3">
								<orgName type="institution">University of Tromsø</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">Simula Research Laboratory</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="institution">Simula Metropolitan</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff4">
								<orgName type="institution">University of Oslo</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">Simula Research Laboratory</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="institution">Simula Metropolitan</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
							<affiliation key="aff4">
								<orgName type="institution">University of Oslo</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hugo</forename><surname>Lewi Hammer</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Oslo Metropolitan University</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Håvard</forename><forename type="middle">D</forename><surname>Johansen</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">University of Tromsø</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dag</forename><surname>Johansen</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">University of Tromsø</orgName>
								<address>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">The Medico-Task 2018: Disease Detection in the Gastrointestinal Tract using Global Features and Deep Learning</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6E7F166362ADD926CCBB706CC8C8C86B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T02:17+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we present our approach for the 2018 Medico Task classifying diseases in the gastrointestinal tract. We have proposed a system based on global features and deep neural networks. The best approach combines two neural networks, and the reproducible experimental results signify the efficiency of the proposed model with an accuracy rate of 95.80%, a precision of 95.87%, and an F1score of 95.80%.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Our main goal for the Medico Task <ref type="bibr" target="#b14">[15]</ref> is to classify findings in images from the Gastrointestinal (GI) tract. This task provides two types of input data: Global Features (GFs) and original images. The 2017 Medico Task consisted of a balanced dataset with only 8 classes <ref type="bibr" target="#b11">[12]</ref> whereas the current task consists of a highly imbalanced dataset with 16 classes <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>, i.e., making this years task more complicated. Different approaches have been used in the last year medico task <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b16">17]</ref> based on GFs extractions and Convolutional Neural Networks (CNN) methods. We extend upon these solutions and present our solutions based on both GFs and transfer learning mechanisms using CNN. We achieve best results combining two CNNs and using an extra multilayer perceptron to combine the outputs of the two networks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">APPROACHES</head><p>We approach the problem of GI tract disease detection with small training datasets using five different methods: two based on GF extractions, and three based on CNN with transfer learning described below.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Global-feature-based approaches</head><p>Method 1 and Method 2 use the concept of GFs. For the extraction of GFs, we use Lucence Image Retrieveal (LIRE) <ref type="bibr" target="#b5">[6]</ref>. GFs are easy and fast to calculate, and can also be used for image comparison, image collection search and distance computing <ref type="bibr" target="#b13">[14]</ref>. Based on <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b15">16]</ref>, we use Joint Composite feature (JCD), Tamura, Color layout, Edge Histogram, Auto Color Correlogram and Pyramid Histogram of Oriented Gradients (PHOG). These features represent the overall properties of the images. Adding more GFs is possible, but it may increase the redundant information which can reduce the overall classification performance.</p><p>The extracted features are sent to the different machine learning classifier for the multi-class classification. Method 1 makes the use of extracted GFs that are sent to SimpleLogistic (SL) classifier. We input the same selected set of features to the logistic model tree (LMT) classifier in Method 2.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Transfer learning based approaches</head><p>Our CNN approaches use transfer learning mechanism with pretrained models using the ImageNet dataset <ref type="bibr" target="#b17">[18]</ref>. Resnet-152 <ref type="bibr" target="#b2">[3]</ref> and Densenet-161 <ref type="bibr" target="#b3">[4]</ref> have been selected, and this selection is based on top 1-error and top-5-errors rate of pre-trained networks in the Pytorch <ref type="bibr" target="#b7">[8]</ref> deep learning framework.</p><p>One of the main problems of the given dataset is the "out of patient"-category which has only four images while other classes have a considerable number. The colour distribution of this class shows a completely different colour domain compared to the other categories. We identified this difference via manual investigations of the dataset and moved all four images of this category into the corresponding validation set folder. Then, the training set folder is filled with random Google images which are not related to the GI tract. To overcome the problems of stopping training in a local minima, we use the stochastic gradient descent <ref type="bibr" target="#b0">[1]</ref> method with dynamic learning rate scheduling. The losses (loss 1 and loss 2 in Figure <ref type="figure">1</ref>) of CNN methods were calculated for each network separately. Additionally, horizontal flips, vertical flips, rotations and re-sizing data augmentations have been applied to overcome the problem of over-fitting.</p><p>Method 3 uses transfer learning with Resnet-152 which has the top-1-error and top-5-error rates. The last fully connected layer of Resnet-152, which is originally designed to classify 1000 classes of the ImageNet dataset, has been changed to classify the 16 classes in the MEdico task. Usually, the transfer learning freezes pre-trained layers to avoid back propagation of large errors. This is because of newly added layers with random weights. However, we did not freeze the pre-trained layers, because modifying only the last layer cannot propagate huge errors backwards in transfer learning. The network was trained until it reached to the maximum validation accuracy of the validation dataset.</p><p>Method 4 extends Method 3 by using two parallel pre-trained models, Resnet-152 and Densenet-161, to get a cumulative decision at the end as depicted in Figure <ref type="figure">1</ref>. The classification is based on an average of the two output probability vectors. Finally, one loss value was calculated and propagated for updating weights. However, this yields a restriction of updating weights of networks Resnet-152 and Densenet-161 separately as they required. Therefore, we calculated two different loss values (loss 1 and loss 2 in Figure <ref type="figure">1</ref>) from each network to update their weights separately. Both Method 5 was constructed to overcome the limitation of calculating the average of the probabilistic output of the two networks used in Method 4. Instead of calculating the average using the simple mathematical formula, another multilayer perceptron (MLP) has been merged with the above network to identify complex mathematical formula to get the cumulative decision as illustrated in Figure <ref type="figure">1</ref>. Therefore, we passed the probability output of two networks (16 probabilities from each network) to a new MLP with 32 inputs, 16 outputs (via sigmoid layer) and one hidden layer with 32 units. In this, we used pre-trained Resnet-152 and Densenet-161 using the dataset and froze them before training the MLP. Then, we trained only the MLP to identify the best mathematical formula to get the cumulative decision.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">RESULTS AND ANALYSIS</head><p>We have divided the development dataset into a training set (70%) and a validation set (30%). For the GFs based approach, ensembles of six extracted GFs were fetched to all the available machine learning classifiers (with different parameters) using WEKA <ref type="bibr" target="#b1">[2]</ref> library. The SL and LMT classifiers outperform all other available classifiers for the dataset. The other promising classifier were Sequential minimal optimization (RBF kernel), and a combination of PCA with LibSVM (RBF) classifier.</p><p>On validation set, all the CNN methods <ref type="bibr" target="#b2">(3)</ref><ref type="bibr" target="#b3">(4)</ref><ref type="bibr" target="#b4">(5)</ref> show accuracies of around 95% and specificities of around 99%. These are always better than the GFs based extraction methods (1,2) which have accuracies of around 82% and specificities of around 98%. According to the task organizers' evaluation results of the test dataset, Methods 3 to 5 show accuracies and specificities of around 99% again,which demonstrates our CNN methods are not overfitted with validation dataset.</p><p>Method 5 and 4 with Resnet-152 and Densenet-161 performs better compared to the Method 3 which has only Resnet-152 because of the capability of deciding the final answer based on two answers generated from two deep learning networks. However, getting a cumulative decision based on simple averaging function (Method 4) shows poor performance than the decision taken from a MLP (Method 5). As a result, Method 5 shows better results than method 4 by increasing the accuracy from 0.955 to 0.958. Therefore, Method 5 has been selected as our best method and confusion matrix represented in Table <ref type="table">1</ref> was generated. An overview of the individual results obtained from five different experiments along with their performance metrics is presented in Table <ref type="table" target="#tab_0">2</ref>. Results obtained from the organizers for the test dataset is presented in the Table <ref type="table" target="#tab_1">3</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1: The Confusion Matrix of Method 5 in our study</head><p>A:blurry-nothing, B:colon-clear, C:dyed-lifted-polyps, D:dyed-resection-margins, E:esophagitis,F:instruments, G:normal-cecum, H:normal-pylorus, I:normal-z-line, J:out-of-patient, K:polyps, L:retroflex-rectum, M:retroflex-stomach, N:stool-inclusions, O:stool-plenty, P:ulcerative-colitis The main considerable point in the confusion matrix in Table <ref type="table">1</ref> is misclassification between categories E: esophagitis and I: normalz-line. A large number of misclassifications like 30 images from the validation set occurred and a manual investigation was done to identify the reason. We notice that the images of these two categories were very similar to each other because of the close location in the GI tract, and identifying these is also a challeng for physicians.</p><formula xml:id="formula_0">Predicted class A B C D E F G H I J K L M N O P Actual class A 53 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ B _ 81 _ _ _ _ _ _ _ _ _ _ _ _ _ _ C _ _ 130 7 _ _ _ _ _ _ _ _ _ _ _ 1 D _ _ 3 122 _ _ _ _ _ _ _ _ _ _ _ _ E _ _ _ _ 115 _ _ _ 19 _ _ _ _ _ _ _ F _ _ _ _ _ 10 _ _ _ _ 1 _ _ _ _ _ G _ _ _ _ _ _ 125 _ _ _ _ _ _ _ _ _ H _ _ _ _ _ _ _ 132 _ _ _ _ _ _ _ _ I _ _ _ _ 11 _ _ _ 121 _ _ _ _ _ _ _ J _ _ _ _ _ 1 _ _ _ 3 _ _ _ _ _ _ K _ 1 _ _ _ _ 6 2 _ _ 172 _ _ _ _ _ L _ _ _ _ _ _ 1 _ _ _ _ 71 _ _ _ _ M _ _ _ _ _ _ _ _ _ _ _ 2 118 _ _ _ N _ _ _ _ _ _ _ _ _ _ _ _ _ 39 _ _ O _ _ _ _ _ _ _ _ _ _ _ _ _ _ 110 _ P _ _ _ _ 1 1 2 _ _ _ 4 1 _ _ _ 129</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">CONCLUSION</head><p>In this paper, we presented five different methods for the multi-class classification of GI tract diseases. The proposed approach are based on the GFs, and pre-trained CNN with transfer learning mechanism. The combination of Resnet-152 and Densenet-161 with an additional MLP achieved the highest performance with both the validation dataset and the test dataset provided by the task organizers. We show that a combination of pre-trained deep neural models on ImageNet has better capabilities to classify images into the correct classes because of cumulative decision-making capabilities. For future work, we will combine deeper CNNs parallelly to add more cumulative decision taking capabilities for classifying multi-class objects. In addition to that, Generative Adversarial Network (GAN) methods can be utilized to handle imbalance dataset by generating more data to train deep neural networks.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>MediaEval' 18 ,Figure 1 :</head><label>181</label><figDesc>Figure 1: Block diagram of the CNN methods networks were trained simultaneously until it reached to the best validation accuracy by changing hyper-parameters manually.Method 5 was constructed to overcome the limitation of calculating the average of the probabilistic output of the two networks used in Method 4. Instead of calculating the average using the simple mathematical formula, another multilayer perceptron (MLP) has been merged with the above network to identify complex mathematical formula to get the cumulative decision as illustrated in Figure1. Therefore, we passed the probability output of two networks (16 probabilities from each network) to a new MLP with 32 inputs, 16 outputs (via sigmoid layer) and one hidden layer with 32 units. In this, we used pre-trained Resnet-152 and Densenet-161 using the dataset and froze them before training the MLP. Then, we trained only the MLP to identify the best mathematical formula to get the cumulative decision.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2 :</head><label>2</label><figDesc>Validation results</figDesc><table><row><cell>Method</cell><cell>REC</cell><cell>PREC</cell><cell>SPEC</cell><cell>ACC</cell><cell>MCC</cell><cell>F1</cell><cell>FPS</cell></row><row><cell>1</cell><cell>0.855</cell><cell>0.793</cell><cell>0.989</cell><cell>0.816</cell><cell>0.814</cell><cell>0.823</cell><cell>79</cell></row><row><cell>2</cell><cell>0.816</cell><cell>0.817</cell><cell>0.984</cell><cell>0.816</cell><cell>0.800</cell><cell>0.815</cell><cell>12</cell></row><row><cell>3</cell><cell cols="6">0.9536 0.9543 0.9968 0.9536 0.9498 0.9535</cell><cell>64</cell></row><row><cell>4</cell><cell cols="6">0.9555 0.9563 0.9969 0.9555 0.9519 0.9554</cell><cell>29</cell></row><row><cell>5</cell><cell cols="6">0.9580 0.9587 0.9971 0.9580 0.9546 0.9580</cell><cell>29</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3 :</head><label>3</label><figDesc>Official results</figDesc><table><row><cell>Method</cell><cell>REC</cell><cell>PREC</cell><cell>SPEC</cell><cell>ACC</cell><cell>MCC</cell><cell>F1</cell></row><row><cell>1</cell><cell cols="6">0.8457 0.8457 0.9897 0.9807 0.8353 0.8456</cell></row><row><cell>2</cell><cell cols="6">0.8457 0.8457 0.9897 0.9807 0.8350 0.8457</cell></row><row><cell>3</cell><cell cols="6">0.9376 0.9376 0.9958 0.9922 0.9335 0.9376</cell></row><row><cell>4</cell><cell cols="6">0.9400 0.9400 0.9960 0.9925 0.9360 0.9400</cell></row><row><cell>5</cell><cell cols="6">0.9458 0.9458 0.9964 0.9932 0.9421 0.9458</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Ian</forename><surname>Goodfellow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yoshua</forename><surname>Bengio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aaron</forename><surname>Courville</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yoshua</forename><surname>Bengio</surname></persName>
		</author>
		<title level="m">Deep learning</title>
				<imprint>
			<publisher>MIT press Cambridge</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The WEKA data mining software: an update</title>
		<author>
			<persName><forename type="first">Mark</forename><surname>Hall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eibe</forename><surname>Frank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Geoffrey</forename><surname>Holmes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bernhard</forename><surname>Pfahringer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><surname>Reutemann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ian</forename><forename type="middle">H</forename><surname>Witten</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM SIGKDD explorations newsletter (SIGKDD Explor. Newsl.)</title>
				<imprint>
			<date type="published" when="2009">2009. 2009</date>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="10" to="18" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Deep residual learning for image recognition</title>
		<author>
			<persName><forename type="first">Kaiming</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xiangyu</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Shaoqing</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jian</forename><surname>Sun</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)</title>
				<meeting>the IEEE conference on computer vision and pattern recognition (CVPR)</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="770" to="778" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Densely Connected Convolutional Networks</title>
		<author>
			<persName><forename type="first">Gao</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhuang</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laurens</forename><surname>Van Der Maaten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kilian</forename><forename type="middle">Q</forename><surname>Weinberger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</title>
				<meeting>the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="2261" to="2269" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">HKBU at Media-Eval 2017 Medico: Medical multimedia task</title>
		<author>
			<persName><forename type="first">Yang</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhonglei</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">William</forename><forename type="middle">K</forename><surname>Cheung</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2017 Workshop</title>
				<meeting><address><addrLine>MediaEval</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">LIRE: open source visual information retrieval</title>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nektarios</forename><surname>Anagnostopoulos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 7th International Conference on Multimedia Systems (MMSys)</title>
				<meeting>the 7th International Conference on Multimedia Systems (MMSys)</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">30</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Ensemble of Texture Features for Finding Abnormalities in the Gastro-Intestinal Tract</title>
		<author>
			<persName><forename type="first">Syed</forename><surname>Sadiq</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ali</forename><surname>Naqvi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Shees</forename><surname>Nadeem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Muhammad</forename><surname>Zaid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Muhammad</forename><forename type="middle">Atif</forename><surname>Tahir</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2017 Workshop</title>
				<meeting><address><addrLine>MediaEval</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Automatic differentiation in PyTorch</title>
		<author>
			<persName><forename type="first">Adam</forename><surname>Paszke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sam</forename><surname>Gross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Soumith</forename><surname>Chintala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gregory</forename><surname>Chanan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Edward</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zachary</forename><surname>Devito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zeming</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alban</forename><surname>Desmaison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Luca</forename><surname>Antiga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Adam</forename><surname>Lerer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of 31st Conference on Neural Information Processing Systems (NIPS)</title>
				<meeting>31st Conference on Neural Information Processing Systems (NIPS)</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Learning laparoscopic video shot classification for gynecological surgery</title>
		<author>
			<persName><forename type="first">Stefan</forename><surname>Petscharnig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Klaus</forename><surname>Schöffmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">An International Journal of Multimedia Tools and Applications</title>
		<imprint>
			<biblScope unit="volume">77</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page" from="8061" to="8079" />
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">An Inception-like CNN Architecture for GI Disease and Anatomical Landmark Classification</title>
		<author>
			<persName><forename type="first">Stefan</forename><surname>Petscharnig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Klaus</forename><surname>Schöffmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2017 Workshop</title>
				<meeting><address><addrLine>MediaEval</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Nerthus: A Bowel Preparation Quality Video Dataset</title>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristin</forename><forename type="middle">Ranheim</forename><surname>Randel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><surname>De Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Losada Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dag</forename><surname>Johansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Concetto</forename><surname>Spampinato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mario</forename><surname>Taschwer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><forename type="middle">Thelin</forename><surname>Schmidt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th ACM on Multimedia Systems Conference (MMSYS)</title>
				<meeting>the 8th ACM on Multimedia Systems Conference (MMSYS)</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="170" to="174" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Kvasir: A multi-class image dataset for computer aided gastrointestinal disease detection</title>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristin</forename><forename type="middle">Ranheim</forename><surname>Randel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Losada Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><surname>De Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dag</forename><surname>Johansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Concetto</forename><surname>Spampinato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Duc-Tien</forename><surname>Dang-Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><forename type="middle">Thelin</forename><surname>Schmidt</surname></persName>
		</author>
		<author>
			<persName><surname>Others</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th ACM on Multimedia Systems Conference (MMSYS)</title>
				<meeting>the 8th ACM on Multimedia Systems Conference (MMSYS)</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="164" to="169" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Efficient disease detection in gastrointestinal videos-global features versus neural networks</title>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Losada Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><surname>De Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dag</forename><surname>Johansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><forename type="middle">Thelin</forename><surname>Schmidt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">An International Journal Multimedia Tools and Applications</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="issue">21</biblScope>
			<biblScope unit="page" from="22493" to="22525" />
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A comparison of deep learning with global features for gastrointestinal disease detection</title>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristin</forename><forename type="middle">Ranheim</forename><surname>Thomas De Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Randel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dang</forename><surname>Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Duc</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Olga</forename><surname>Tien</surname></persName>
		</author>
		<author>
			<persName><surname>Ostroukhova</surname></persName>
		</author>
		<author>
			<persName><surname>Others</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2017 Workshop</title>
				<meeting><address><addrLine>MediaEval</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Medico Multimedia Task at MediaEval</title>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><forename type="middle">De</forename><surname>Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristin</forename><forename type="middle">Ranheim</forename><surname>Randel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Duc-Tien</forename><surname>Dang-Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Olga</forename><surname>Ostroukhova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2018 Workshop</title>
				<imprint>
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">From annotation to computeraided diagnosis: Detailed evaluation of a medical multimedia system</title>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Losada Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><forename type="middle">Thelin</forename><surname>Schmidt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zeno</forename><surname>Albisser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dag</forename><surname>Johansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><forename type="middle">De</forename><surname>Lange</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page">26</biblScope>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Multimedia for medicine: the medico Task at mediaEval</title>
		<author>
			<persName><forename type="first">Michael</forename><surname>Riegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Konstantin</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pål</forename><surname>Halvorsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carsten</forename><surname>Griwodz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><surname>Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristin</forename><forename type="middle">Ranheim</forename><surname>Randel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sigrun</forename><surname>Eskeland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dang</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Duc</forename><surname>Tien</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</author>
		<author>
			<persName><surname>Others</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2017 Workshop</title>
				<meeting><address><addrLine>MediaEval</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">ImageNet Large Scale Visual Recognition Challenge</title>
		<author>
			<persName><forename type="first">Olga</forename><surname>Russakovsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jia</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hao</forename><surname>Su</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jonathan</forename><surname>Krause</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sanjeev</forename><surname>Satheesh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sean</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhiheng</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrej</forename><surname>Karpathy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aditya</forename><surname>Khosla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Bernstein</surname></persName>
		</author>
		<author>
			<persName><surname>Others</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Computer Vision (IJCV</title>
		<imprint>
			<date type="published" when="2015">2015. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
