<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Extracting information from radiology reports by Natural Language Processing and Deep Learning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Miguel</forename><surname>Ángel</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Universidad Carlos III de Madrid</orgName>
								<address>
									<postCode>28911</postCode>
									<settlement>Madrid</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Martín-Caro</forename><surname>García-Largo</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Universidad Carlos III de Madrid</orgName>
								<address>
									<postCode>28911</postCode>
									<settlement>Madrid</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Isabel</forename><surname>Segura-Bedmar</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Universidad Carlos III de Madrid</orgName>
								<address>
									<postCode>28911</postCode>
									<settlement>Madrid</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Extracting information from radiology reports by Natural Language Processing and Deep Learning</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">16894E5878AE13CC8DBD4F96B81054A2</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:44+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Natural Language Processing</term>
					<term>Named-entity recognition</term>
					<term>Radiology reports</term>
					<term>Information Extraction</term>
					<term>CRF</term>
					<term>BERT</term>
					<term>BiLSTM-CRF</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Radiology reports are texts that include the description and interpretation of ultrasound images. The automatic processing of these texts, if well performed, can help healthcare professionals and the diagnosis. This work is part of the Information Extraction from Spanish radiology reports task (SpRadIE) of CLEF eHealth 2021. Regarding the case of study of this work, it is remarkable the correct detection of unusual findings because they can affect the patient's health. Furthermore, it can help health professionals and researchers to be focused on problematic cases. Three different models have been proposed to face that task, evaluating and comparing their performance. Conditional Random Field (CRF), Bidirectional Long Short-Term Memory-Conditional Random Field (BiLSTM-CRF) and Bidirectional Encoders Representation from Transformers(BERT). With BiLSTM-CRF, two different approaches have been used: the use of randomized initialized vectors and the use of a pre-trained word embedding in Spanish. Both will appear more detailed in the paper. The task is complex and some of the reasons are: reports are written in Spanish, the extensive number of types of entities and the ambiguity in the language used by the doctors in the reports. The best results have been obtained by the CRF model, which has obtained a Lenient F1 score of 77% for a dataset that contained most of the words in the training dataset and a 67% of Lenient score for the dataset with words that are not present in the dataset used to train the model.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction 1.Motivation</head><p>Artificial Intelligence (AI) aims to create algorithms to make computers think smarter but a computer is still being a machine of computing with a memory. Natural Language Processing (NLP), a field of a AI, is a set of techniques to automatically 'understand' and create human language. Nonetheless, it has to be clear that today computers can not think as humans do. NLP it is a multidisciplinary field that involves many areas including Linguistics, Computer Science and Psychology among others. NLP is an ambitious area of research and it includes many tasks like: Information Retrieval, Information Extraction, Question-Answering, summarization, Machine Translation, dialogue systems, among others . <ref type="bibr" target="#b0">[1]</ref> Information Extraction consists of structuring information from the texts. IE comprises thre main subtasks: Named Entity Recognition (NER), relation extraction and co-reference resolution. NER consists of identifying the type of word or expressions that appear within a text. It is a crucial task for many NLP applications such as relation extraction, information retrieval, machine translation, text simplification, text summarization. SpRadIE task <ref type="bibr" target="#b1">[2]</ref> (included in the CLEF eHealth 2021) <ref type="bibr" target="#b2">[3]</ref> targets the detection of seven different entities as well as hedge cues. <ref type="bibr" target="#b1">[2]</ref> This paper describes the models that we have presented for the task. The goal of this work is to explore different NER approaches for detecting findings from radiology reports written in Spanish. Reports about patients in health services are essential as they collect the necessary information to understand the diseases or abnormalities of a patient. The correct detection of unusual findings from radiology reports could help health professionals and researchers to easily interpret these reports and accelerate the diagnostic of possible conditions. <ref type="bibr" target="#b3">[4]</ref> Most existing NLP for extracting information are based on supervised machine learning, which require annotated corpora for training and evaluating their models. Information extraction techniques allows to effectively transform unstructured text to structured data, which could bring benefits to the the radiology area. Nevertheless, it remains a complex task so that collaboration between radiologists, data scientists and engineers will be a key point to achieve optimal results. <ref type="bibr" target="#b4">[5]</ref> Unfortunately, there is a lack of these resources for other languages than English. There have already been some research initiatives to promote research on information extraction from clinical texts written in Spanish, such as the Cantemist Track for Cancer Text Mining in Spanish. <ref type="bibr" target="#b5">[6]</ref> or the ehealth knowledge discovery challenge <ref type="bibr" target="#b6">[7]</ref>. SpRadIE 2021 aims to promote NLP research applied to the extraction of information from radiology reports. Several challenges must be addressed in the task: the complexity of the entities, the language in which reports are written (Spanish) and the fact that some reports do not contain the same words than reports in the training dataset.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.2.">Objectives</head><p>The general objective of this project appears next:</p><p>To build a Named Entity Recognition (NER) system able to recognise entities in the domain of radiology (ultrasound images) clinical reports written in Spanish.</p><p>That objective is decomposed into four more specific objectives:</p><p>• Review the main approaches used to extract information from clinical text, particularly, from radiology reports. • Explore different NER approaches for detecting the entities of the dataset provided by the organisers of SpRadIE 2021. • Evaluate and compare the performance of the proposed models to determine the weak and strong points for each of them. • Participate in the SpRadiE 2021 competition and submit the results of our three best models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related work</head><p>Recently, the use of NLP in biomedical texts has increased <ref type="bibr" target="#b7">[8]</ref>. It presents challenging problems to deal with like discontinuous entities, misspellings, abbreviations, the low presence of some entity types and the existence of ambiguity.</p><p>In the last years, many competitions of NLP applied to biomedical texts have tried to look for good approaches to overcome these problems. These competitions are described in in <ref type="bibr" target="#b8">[9]</ref>. The task of NER has been used to solve different problems in the biomedical domain, like the anonymization of clinical reports <ref type="bibr" target="#b9">[10]</ref> or the extraction of information in various subfields such as pharmacovigilance <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref> or oncology <ref type="bibr" target="#b12">[13]</ref>. This chapter is focused in the carried out works performed in texts about clinical reports or more specifically in radiology reports.</p><p>Clinic report anonymization is necessary to ensure the protection of data from the patients and during last years that task has been developed using NER systems. In <ref type="bibr" target="#b13">[14]</ref>, a NER system was developed for anonymizing private data in radiology reports written in Spanish. The used approach is focused on neural networks with different architectures (LSTM-CRF, BiLSTM-CRF, Convolutional-BiLSTM-CRF). The alternative was to use pattern matching but that implies some negative aspects. For example: as it only considers exact patterns the context is not taken into account and thus, plenty of information would be lost. Furthermore, mispelled words could have a bad influence on results. For example: An example which is very explanatory is the Spanish surname 'Cabeza' (head). Approaches based on pattern matching would not distinguish between the surname and this part of the body. The best results were obtained by an BiLSTM-CRF: Bi-LSTM initialized with character embeddings and embeddings pre-trained with Glove vectors and a CRF in its last layer. <ref type="bibr" target="#b13">[14]</ref> This model obtained an F1 score of 92.63% on the test data using their own radiology report dataset. It consists of 7,848 brain radiology reports taken from the Medical Imaging Databank of the Valencia Region (BIMCV). To ensure that retrieved reports contained personal information, only texts with more than two name tags were selected. Only a third part was annotated. This model was also tested on the MED-DOCAN challenge dataset, achieving an F1 score of 81%.</p><p>Another recent research <ref type="bibr" target="#b14">[15]</ref> has tried to use NER for radiological reports in Japanese using deep learning. The goal was to recognise mentions of the following entity types: observations, clinical findings, anatomical location modifiers, certainty modifiers, change modifiers, characteristic modifiers and size modifiers. Different neural networks were taken into account, among them: BiLSTM-CRF, BERT and BERT-CRF. BiLSTM was the network that achieved the highest F1 score with an F1 score of 95.36% using an in-house dataset. The in-house dataset was built using 118,078 reports from the radiology information system at Osaka University Hospital. The model was evaluated on external reports, providing an F1 score of 94.62% using. These external reports were 77 chest CT reports from the Osaka International Cancer Institute (OICI) <ref type="bibr" target="#b14">[15]</ref>. For the optimization of the networks hyper-parameters, Optuna <ref type="bibr" target="#b15">[16]</ref> which is a recent software for optimization was used.</p><p>The main goal of the paper <ref type="bibr" target="#b16">[17]</ref> was to develop a NER system to extract information from clin-ical reports written in Chinese. The authors explored a deep learning approach and compared it with a more traditional algorithm such as Condition Random Field (CRF). Two corpus were used: one for training and evaluating the models and another for creating a word embedding used later in the deep learning developed model. The dataset used for the training and evaluation of the models consists of 400 reports from the EHR database of Peking Union Medical Collegue Hospital. The dataset contains four types of entities: medications, procedures, problems and lab tests. For the creation of the embedding, 36,828 reports were selected from the same institute of China. The deep learning network consisted of a convolutional layer, followed by a non-linear layer and several linear layers. Two different approaches were taken: one where the input vectors were randomly initializated and another where the input vectors are taken from the word embedding model trained on the second corpus. The results were: 91.9% F1-score for the CRF, 90.7% for the CNN with random initialization and 92.8% for the CNN with word embeddings.</p><p>BERT is a general purpose language representation based on deep learning which takes into account the context of a word by looking to the left and the right of the word at the same time.</p><p>It only needs a fine-tuning to adapt to a new particular dataset. BERT has widely used in NER taks with very successful results. We now describe some of the most important works on the clinical domain.</p><p>Kim and Lee proposed a slightly better modified version of BERT. This version consists of modifying the labelling strategy. The proposed labeling strategy for BERT was carried out due to some peculiarities of the Korean language so it is not guarantee that in other languages the use of that labelling system would improve the results. Furthermore, results are not much different from the default BERT labels. The corpus was created by extracting texts from the biggest questions and answers platform in Korean (Kin Naver) and a total of 536 answers were retrieved. These texts were annotated with the following three entity types:diseases, symptoms and body parts. The authors also used another external dataset, the Exo Brain Korean dataset. It consists of 10,000 sentences with five different entity types: person, location, time, date and organization. Tokens were represented by using the IOB standard format. BERT and BiLSTM-CRF methods were compared. Macro-averages for the created dataset were 83% of F1 score for BERT and 79% for BiLSTM-CRF. For the Exo brain dataset, the macro-averages of F1 score were 94% for BERT and 89% for BiLSTM-CRF.</p><p>The scope of the NER tasks comprises different approaches such as rule-based and deep learning. In Gorinski et al. addressed the NER task by appliying thre different approaches: a rulebased method, a transfer learning approach and a deep learning model. Transfer learning consists of training a model for a specific task and then using that model with some fine hyper-parameter tuning to another problem. The authors used a Scottish radiology reports, the TayExt dataset. This dataset contains brain images reports. Some of the types of entities which appeared were: ischaemic strokes, hemorrhagic strokes, strokes, tumours, etc. The best results were obtained by the hand-written rule-based approach with a 93% of total F1 score.</p><p>The second place was for the transfer learning approach using the SemEHR tool <ref type="bibr" target="#b19">[20]</ref>, with an F1 score of 89. The third place was for the deep learning approach (LSTM-CRF) with an 80% of F1-score. The rule-based approach achieved the best results, however this kind of approaches needs from domain experts and takes longer to developed the rules. Moreover, these rules cannot be used to recognize entity types of other subfields. On the other hand, one of the main advantages of deep learning is that it does not requires domain experts.</p><p>SpRadIE proposes a challenging NER task since its dataset contains up to ten named entity types. As it has been explained in the beginning of this section, it presents a challenging scenario for developing a NER system. It is due to discontinuities, embedded entities, ambiguities, abbreviations and imbalanced datasets among others. Moreover, radiology reports are written in Spanish that although it is a language with presence in the domain of this task, it has been less studied in NLP than English <ref type="bibr" target="#b20">[21]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Data preprocessing</head><p>Texts were represented as vectors to feed the models with data. First, texts are split into sentences and each of those sentences is divided into tokens( words). Then, those tokens were assigned with a part of speech tag (PoS). We use Spacy, a python library for NLP. Moreover, each token is represented following the BIOES-V token annotation format, where 'B' is used for the beginning token of an entity, 'I' is used for tokens in the middle of entities, 'E' is a tag used for indicating a token is the end of an entity, 'S' is used when the entity is composed only by one token, 'V' is used when the token is part of a nested entity, and 'O' is used for those tokens that do not belong to an entity. Moreover, the position of the token in the sentence is also considered. These BIOES-V labels are obtained by using the information from the annotation files, which were provided in BRAT format.</p><p>In the case of the first approach for BiLSTM which uses random initialization of vectors, a vocabulary is created using the words of the texts. It is done by assigning to each of the words present in the texts a random vector. The counter part of random initialization, is that no relation between words is captured. Furthermore, the sentences for this model must have the same length. In the case of longer sentences than the fixed one, truncation is applied to remove the rest of tokens. In the case of sentences shorter than the fixed length, padding is applied.</p><p>The preprocessing of the BiLSTM network with the second approach (using a clinical Spanish embedding) is similar to the BiLSTM approach 1. The difference is that the vocabulary is created by assigning to each word a vector from the word embedding model. This model can capture the semantic relationships of the words in a corpus. As in the case of the BiLSTM approach 1, the length of the sentences are fixed but no much values were checked.</p><p>BERT is a model that does not need the PoS tag, so it is not necessary to take it into account.</p><p>The way data are preprocessed to prepare them for the model is by tokenizing them with the BERT tokenizer. That tokenizer divides the tokens of the input texts into smaller ones. In that way, the length of wordpieces is larger than the list of tokens. BERT assigns a vector to each token, similar to what was done for the BiLSTM networks and all the sentences are set to a fixed length.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">CRF</head><p>Our first approach is based on CRF <ref type="bibr" target="#b21">[22]</ref>. This model was state-of-art for NER <ref type="bibr" target="#b22">[23]</ref> before the appearance of deep learning. It fits perfectly for the NER task and it is a good baseline model to compare its results with more new approaches. Technically, it is a discriminative probabilistic non directed graph model based on the maximum entropy and Hidden Markov models. <ref type="bibr" target="#b23">[24]</ref> The difference between discriminative and generative models is that generative models learn the joint probability distribution 𝑝(𝑥, 𝑦), while discriminative models learn the conditional probability distribution 𝑝(𝑦|𝑥) (where x is the sequence of observations and y the sequence of output labels). In other words, generative models model the distribution of each class and discriminative models the frontiers between them. <ref type="bibr" target="#b24">[25]</ref> Before exposing the formula of the model, some simplified notation is presented to facilitate the labour of representation of the formula:</p><formula xml:id="formula_0">𝐹 𝑗 (𝑦, 𝑥) = 𝑛 ∑ 𝑖=1 𝑓 𝑗 (𝑦 𝑖−1 , 𝑦 𝑖 , 𝑥, 𝑖)<label>(1)</label></formula><p>The probability of a label sequence y given an observation sequence x, can be written as:</p><formula xml:id="formula_1">𝑝(𝑦|𝑥, 𝜆) = 1 𝑍 (𝑥) 𝑒𝑥𝑝(∑ 𝑗 𝜆 𝑗 𝐹 𝑗 (𝑦, 𝑥))<label>(2)</label></formula><p>where each 𝑓 𝑗 (𝑦 𝑖−1 , 𝑦 𝑖 , 𝑥, 𝑖) is a state function or a transition function and Z(x) is a normalization factor. <ref type="bibr" target="#b25">[26]</ref> 𝜆 parameters are estimated by the model by using an optimization algorithm. <ref type="bibr" target="#b23">[24]</ref> The features set used for the CRF classifier takes into account the following features: word, the previous, the next one tokens, lemmas of the word, as well as their PoS tags. As it was mentioned previously, these elements were obtained by using Spacy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">BiLSTM-CRF</head><p>Our second approach is a deep learning architecture, which is composed of two parts: a BiL-STM network and a CRF, as the last layer (BiLSTM-CRF) <ref type="bibr" target="#b26">[27]</ref>. BiLSTM is a kind of recurrent neural network (RNN) which are able to consider past observations. Moreover, BiLSTM takes into account the tokens around the current token to predict its label. Therefore, this is a effective approach for NER. <ref type="bibr" target="#b27">[28]</ref> Moreover, among recurrent neural networks, BiLSTM are a good candidate because they deal with both the gradient vanishing problem and the explosion problem. The vanishing gradient problem appears after multiplying the gradient many times by a number lower than one. On the other hand, the explosion gradient problem appears after multiplying many times the gradient by a number greater than one. This kind of networks deal with those problems. To deal with these problems RNN propose the definition of cell. A cell is an operation with two inputs (the sequence values and a given past state) and two outputs (the current state and the sequence of values computed for the current state).</p><p>The solution is to set cells that link the past state with the current output. <ref type="bibr" target="#b28">[29]</ref> The characteristics which allow BiLSTM networks to consider past observations and control the importance that is given to them, resides in the interior of the cells. Figure ?? shows the structure that a cell usually has. Forget, input and output gates play a key role in that task of extracting the insights from data that are of interest. Oblivion gate controls when some part of the information is forgotten, input gate controls when new information should enter the cell and the output gate controls when the information stored in the cell is used in the result of that cell. <ref type="bibr" target="#b28">[29]</ref> It has been said that as BiLSTM considers the past, it is appropriate for the NER task but not only because of that. As it is bidirectional, it considers more information making it even more appropriate. Bidirectionality was introduced in <ref type="bibr" target="#b29">[30]</ref> and it can be considered as if two independent agents were extracting information from the same problem (each one in one direction) to finish concatenating the information and passing it to another layer. <ref type="bibr" target="#b28">[29]</ref> The output of the BiLSTM are two vectors (one per direction) and they are received by the CRF. Typically the output function in many neural network tasks is the softmax function but as CRF considers context, it is preferred. <ref type="bibr" target="#b22">[23]</ref> Two approaches have been used with the BiLSTM model: the use of randomized initializated vectors and the use of vectors from a pre-trained word embedding model which was trained on a collection of texts in Spanish <ref type="bibr" target="#b30">[31]</ref>. In the first case, each sentence is divided into tokens and a random vector is assigned to each token. The problem with that is that words that do not appear in the training set and the relationships with the words in the training set, could be not correctly identified. Word embeddings can capture syntactic and semantic relationships between words. <ref type="bibr" target="#b31">[32]</ref> The texts <ref type="bibr" target="#b30">[31]</ref> that were used to create the word embedding model contain information from many different biomedical areas and although radiology reports are not the main content, they are present in the corpus. The method which was applied to create the embedding was Word2Vec with an Skip-gram architecture. <ref type="bibr" target="#b30">[31]</ref> The SkipGram architecture receives a vector representing a word and returns another vector representing the probabilities of other words in the vocabulary to be near to the given word.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">BERT</head><p>BERT is a bidirectional self-attention (which is the ability to understand the context) transformer composed of two unsupervised steps: the masked language model (MLM) and the next sentence prediction (NSP). The first is essential to achieve bidirectional training avoiding the model to 'see' the target and the second, to understand sentence relationships. <ref type="bibr" target="#b32">[33]</ref> BERT was developed in 2019. The power of that model was the wide range of applications to which it can be used. The employed model in this work uses a pre-trained BERT and then performs fine-tuning over it to find the optimal combination of parameters for the task of NER.</p><p>The pre-trained BERT model uses the following parameters: L=12, H=768, A=12 and Total Pa-rameters=110M, where L denotes the number of layers (or transformed blocks), H is the hidden size and A is the number of self-attention heads. This fixed establishment of the parameters in the pre-training BERT, allows focusing on the parameters of fine-tuning (maximum length, batch size, learning rate and the number of epochs). <ref type="bibr" target="#b32">[33]</ref> In the work of this master thesis the chosen values for those parameters have been: maximum sequence length=75, batch size=32, learning rate=3e-5 and number of epochs=3. They were chosen based on the values which were used in <ref type="bibr" target="#b22">[23]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Experiments</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Dataset description</head><p>From the website of the competition the dataset is presented: 'The data consists of ultrasonography reports provided by a pediatric hospital in Argentina. Reports are unstructured, have an abundance of orthographical grammatical errors and have been anonymized in order to remove patient IDs, and names and the enrollment numbers of the physicians. Reports were annotated by clinical experts and then revised by linguists. Annotation guidelines and training were provided for both rounds of annotations. Automatic classifiers will be expected to perform well in those cases where human annotators have a strong agreement, and worse in cases that are difficult for human annotators to identify consistently. Annotations are provided in brat format'. <ref type="bibr" target="#b1">[2]</ref> The proposed task is ambitious and not at all easy. It can be confirmed by checking the numerous entities which have been considered in the clinical reports and the strict annotation process which has had to be taken in order to get a set of annotations of quality. All of that supports the idea of being a complex task. The entities which appear in the dataset are: findings, anatomical entities, location, measure, type of measure, texture, negation, uncertainty terms, abbreviations and temporal terms. The meaning for each of them is described in <ref type="bibr" target="#b33">[34]</ref>. An interesting fact is the huge quantity of negations that this dataset contains. It would be useful in order to evaluate the performance of methods which pretend to detect negations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Global results and confusion matrices for the best model in development and validation datasets</head><p>In this subsection the results are divided for each of the data partitions described in the previous section: development and held-out datasets. The vocabulary of the entities in the development set, is the same with which the algorithms have been trained. In contrast, the held-out dataset, contains vocabulary which has not been present during the training phase of the algorithms. In order to not extend a lot this paper, only the confusion matrices for the best model (CRF) have been included.</p><p>The obtained global results for the development dataset have been:</p><p>It can be seen that in the case of the development dataset, the best method has been CRF with a significant difference. In the second position, can be found BERT and the BILSTM2 (BERT has obtained slightly better results but the difference is not significant). The method with the worst results has been the BILSTM1.</p><p>Although the dataset is not exactly the same with which the methods have been trained, it can be seen that CRF is able of identifying the entitites in a correct manner in most of the occasions.</p><p>The obtained global results for the held-out dataset have been:</p><p>The results are worse than in the case of the development set, which was something expected. Furthermore, the performance of the models follows the same order: the best model is CRF followed by BERT, BILSTM2 and BILSTM1.</p><p>The confusion matrix of Table <ref type="table">4</ref> despite being less accurate than the matrix in Table <ref type="table">2</ref>, contains more interesting insights.</p><p>In the description of the task <ref type="bibr" target="#b1">[2]</ref>, it is stated that there are challenging situations like the fact that there exists regular polysemy between anatomical entities and locations. The irregular use of abbreviations also increases the difficulty of the task. That a priori description of problems related to the task, has been confirmed after looking at the results of Table <ref type="table">4</ref>. The anatomical entities, locations and findings are easily confused between them. Apart from this, the irregular abbreviations also cause many confusions with the anatomical entities. In the case of the held-out dataset confusion matrix, it is also remarkable the fact that the number of entities not classified as any type of the present entities, is high (see last column in Table <ref type="table">4</ref>). In both of the cases, by looking at both confusion matrices, it is confirmed the high presence of negations in the reports. This reinforces the idea of using the datasets as examples to evaluate entity recognition models build to detect negations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Results for test dataset in the competition</head><p>The global obtained Lenient F1 score in the competition has been of 75.64% whereas the winner result has been 85.51%. Dividing the result into the different entities, the following results have   </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions and future work</head><p>Best results were obtained by CRF despite the implementation of other models like BiLSTM-CRF and BERT, which are state-of-the-art in the named entity recognition area. This means that not always an approach based on neural networks is the best. It is true that they can get good results like the shown in Section 2 of this paper but if the selection of hyperparameters is not good, it is going to be difficult to obtain the best results.</p><p>A great improvement (18% of F1 score) has been seen in the BiLSTM-CRF network when using a Spanish biomedical word embedding versus when not using it. This suggests, that as future work, maybe the use of BETO <ref type="bibr" target="#b34">[35]</ref> (Spanish BERT) could improve the performance of BERT until the point of surpassing the CRF model.</p><p>This task has used Spanish data from Argentina. It could also be interesting to compare the behaviour of the models in radiological texts in Spanish but from different areas like Spain and South America.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Global results for development dataset</figDesc><table><row><cell></cell><cell>CRF</cell><cell cols="2">BILSTM1 BILSTM2</cell><cell>BERT</cell></row><row><cell cols="2">Lenient F1 0.77658</cell><cell>0.46019</cell><cell>0.68015</cell><cell>0.68155</cell></row><row><cell>Exact F1</cell><cell>0.69430</cell><cell>0.38515</cell><cell>0.56708</cell><cell>0.58847</cell></row><row><cell>Table 2</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="3">CRF confusion matrix for entities in development dataset</cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell cols="2">PREDICTED</cell><cell></cell></row><row><cell cols="5">Ana Fin Unc Neg Loc Con Typ Mea Abb Deg Oth</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3</head><label>3</label><figDesc>been obtained: 83.72% for abbreviations, 70.07% for anatomical entities, 61.54% for conditional temporal, 53.44% for degree, 69.18% for findings, 68.35% for locations, 62.50% for measures, 93.78% for negations, 86.28% for types of measures and 73.26% for uncertainties.</figDesc><table><row><cell cols="4">Global results for held-out dataset</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>CRF</cell><cell cols="4">BILSTM1 BILSTM2</cell><cell>BERT</cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell cols="4">Lenient F1 0.67134</cell><cell>0.20137</cell><cell></cell><cell>0.62514</cell><cell cols="2">0.63036</cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>Exact F1</cell><cell></cell><cell>0.60776</cell><cell>0.13658</cell><cell></cell><cell>0.56126</cell><cell cols="2">0.56308</cell><cell></cell><cell></cell></row><row><cell>Table 4</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="7">CRF confusion matrix for entities in held-out dataset</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="3">PREDICTED</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell cols="11">Ana Fin Unc Neg Loc Con Typ Mea Abb Deg Oth</cell></row><row><cell></cell><cell cols="2">Ana 507</cell><cell>17</cell><cell>0</cell><cell>0</cell><cell>25</cell><cell>0</cell><cell>1</cell><cell>0</cell><cell>8</cell><cell>0</cell><cell>44</cell></row><row><cell></cell><cell>Fin</cell><cell>46</cell><cell>451</cell><cell>1</cell><cell>0</cell><cell>37</cell><cell>0</cell><cell>7</cell><cell>13</cell><cell>5</cell><cell>0</cell><cell>268</cell></row><row><cell></cell><cell>Unc</cell><cell>0</cell><cell>0</cell><cell>6</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>1</cell></row><row><cell>R E A L</cell><cell>Neg Loc Con Typ Mea</cell><cell>0 19 0 0 1</cell><cell>9 16 0 0 23</cell><cell>1 0 0 0 0</cell><cell>234 0 0 0 0</cell><cell>1 107 0 5 1</cell><cell>0 0 0 0 0</cell><cell>0 2 0 106 0</cell><cell>0 0 0 0 208</cell><cell>0 0 0 0 0</cell><cell>0 0 0 0 0</cell><cell>26 57 2 0 17</cell></row><row><cell></cell><cell>Abb</cell><cell>12</cell><cell>2</cell><cell>0</cell><cell>0</cell><cell>2</cell><cell>0</cell><cell>1</cell><cell>1</cell><cell>196</cell><cell>0</cell><cell>16</cell></row><row><cell></cell><cell>Deg</cell><cell>0</cell><cell>1</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>0</cell><cell>12</cell><cell>8</cell></row><row><cell></cell><cell>Oth</cell><cell>34</cell><cell>46</cell><cell>1</cell><cell>0</cell><cell>19</cell><cell>0</cell><cell>1</cell><cell>11</cell><cell>1</cell><cell>0</cell><cell>2295</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Acknowledgments</head><p>This work was supported by the NLP4RARE-CM-UC3M, which was developed under the Interdisciplinary Projects Program for Young Researchers at University Carlos III of Madrid. The work was also supported by the Multiannual Agreement with UC3M in the line of Excellence of University Professors (EPUC3M17), and in the context of the V PRICIT (Regional Programme of Research and Technological Innovation).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Natural language processing</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">D</forename><surname>Liddy</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">V</forename><surname>Cotik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Alonso Alemany</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Roller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Luque</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Vivaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Filippo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ayach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Carranza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>De Francesca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dellanzo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Fernández</forename><surname>Urquiza</surname></persName>
		</author>
		<title level="m">Overview of clef ehealth task 1 -spradie: A challenge on information extraction from spanish radiology reports</title>
				<imprint>
			<publisher>ceur-ws</publisher>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note>Clef 2021 evaluation labs and workshop: Online working notes</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Overview of the clef ehealth evaluation lab</title>
		<author>
			<persName><forename type="first">H</forename><surname>Suominen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kelly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Alonso Alemany</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Bassani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Brew-Sam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Cotik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Filippo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>González-Sáez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Luque</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mulhem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pasi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Roller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Seneviratne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Upadhyay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Vivaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Viviani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Xu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Clef 2021 -12th conference and labs of the evaluation forum</title>
		<title level="s">lecture notes in computer science</title>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021">2021. 2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Detection of unexpected findings in radiology reports: A comparative study of machine learning approaches</title>
		<author>
			<persName><forename type="first">P</forename><surname>López-Úbeda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Díaz-Galiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Martín-Noguerol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ureña-López</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-T</forename><surname>Martín-Valdivia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Luna</surname></persName>
		</author>
		<ptr target="https://www.sciencedirect.com/science/article/pii/S0957417420304711" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Martín-Noguerol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Paulano-Godino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>López-Ortega</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Górriz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Riascos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Luna</surname></persName>
		</author>
		<ptr target="https://n9.cl/gdxlc" />
		<title level="m">Artificial intelligence in radiology: relevance of collaborative work between radiologists and engineers for building a multidisciplinary team</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Named entity recognition, concept normalization and clinical coding: Overview of the cantemist track for cancer text mining in spanish, corpus, guidelines, methods and results</title>
		<author>
			<persName><forename type="first">A</forename><surname>Miranda-Escalada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Farré</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Krallinger</surname></persName>
		</author>
		<ptr target="https://www.bsc.es/research-and-development/publications/named-entity-recognition-concept-normalization-and-clinical" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020</title>
				<meeting>the Iberian Languages Evaluation Forum (IberLEF 2020</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note>CEUR Workshop Proceedings</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Overview of the ehealth knowledge discovery challenge at iberlef</title>
		<author>
			<persName><forename type="first">A</forename><surname>Piad-Morffis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Gutiérrez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Canizares-Diaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Estevez-Velarde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Muñoz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Montoyo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Almeida-Cruz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<meeting>the Iberian Languages Evaluation Forum (IberLEF 2020)</meeting>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">A survey on literature based discovery approaches in biomedical domain</title>
		<author>
			<persName><forename type="first">V</forename><surname>Gopalakrishnan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Jha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of biomedical informatics</title>
		<imprint>
			<biblScope unit="volume">93</biblScope>
			<biblScope unit="page">103141</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Community challenges in biomedical text mining over 10 years: success, failure and the future</title>
		<author>
			<persName><forename type="first">C.-C</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Lu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Briefings in bioinformatics</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="132" to="144" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Colón-Ruiz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Segura-Bedmar</surname></persName>
		</author>
		<title level="m">Protected health information recognition bybilstm-crf</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Using a shallow linguistic kernel for drug-drug interaction extraction</title>
		<author>
			<persName><forename type="first">I</forename><surname>Segura-Bedmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Martinez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>De Pablo-Sánchez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of biomedical informatics</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="page" from="789" to="804" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Extracting drug indications and adverse drug reactions from spanish health social media</title>
		<author>
			<persName><forename type="first">I</forename><surname>Segura-Bedmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>De La Peña González</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Martínez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of BioNLP 2014</title>
				<meeting>BioNLP 2014</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="98" to="106" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Exploring deep learning for named entity recognition of tumor morphology mentions</title>
		<author>
			<persName><forename type="first">G</forename><surname>De Vargas Romero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Segura-Bedmar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2020)</title>
				<meeting>the Iberian Languages Evaluation Forum (IberLEF 2020)</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note>CEUR Workshop Proceedings</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">-Vaya, De-identifying spanish medical texts-named entity recognition applied to radiology reports</title>
		<author>
			<persName><forename type="first">I</forename><surname>Perez-Diez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Perez-Moraga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lopez-Cerdan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-M</forename><surname>Salinas-Serrano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De La Iglesia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Biomedical Semantics</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="1" to="13" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Extracting clinical terms from radiology reports with deep learning</title>
		<author>
			<persName><forename type="first">K</forename><surname>Sugimoto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Takeda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-H</forename><surname>Oh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Konishi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Yamahata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Manabe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tomiyama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Matsunaga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Nakanishi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Biomedical Informatics</title>
		<imprint>
			<biblScope unit="volume">116</biblScope>
			<biblScope unit="page">103729</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Optuna: A next-generation hyperparameter optimization framework</title>
		<author>
			<persName><forename type="first">T</forename><surname>Akiba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Yanase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ohta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Koyama</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery &amp; data mining</title>
				<meeting>the 25th ACM SIGKDD international conference on knowledge discovery &amp; data mining</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="2623" to="2631" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Named entity recognition in chinese clinical text using deep neural network</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Xu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Studies in health technology and informatics</title>
		<imprint>
			<biblScope unit="volume">216</biblScope>
			<biblScope unit="page">624</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Korean clinical entity recognition from diagnosis text using bert</title>
		<author>
			<persName><forename type="first">Y.-M</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T.-H</forename><surname>Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMC Medical Informatics and Decision Making</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page" from="1" to="9" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Gorinski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Grover</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Tobin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Talbot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Whalley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sudlow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Whiteley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Alex</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1903.03985</idno>
		<title level="m">Named entity recognition for electronic health records: a comparison of rulebased and machine learning approaches</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Semehr: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research</title>
		<author>
			<persName><forename type="first">H</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Toti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">I</forename><surname>Morley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">M</forename><surname>Ibrahim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Folarin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Jackson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Kartoglu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Agrawal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Stringer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Gale</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Medical Informatics Association</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="530" to="537" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Impact of translation on named-entity recognition in radiology texts</title>
		<author>
			<persName><forename type="first">L</forename><surname>Campos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Pedro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Couto</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Database</title>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<title level="m" type="main">Conditional random fields: Probabilistic models for segmenting and labeling sequence data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lafferty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mccallum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">C</forename><surname>Pereira</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">Development of a named entity recognition system to automatically assign tumor morphology entity mentions to health-related documents in Spanish</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D V</forename><surname>Romero</surname></persName>
		</author>
		<imprint>
			<biblScope unit="page" from="2019" to="2020" />
		</imprint>
		<respStmt>
			<orgName>Universidad Carlos III de Madrid</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Master Thesis dissertation</note>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Named entity recognition based on conditional random fields</title>
		<author>
			<persName><forename type="first">S</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Huang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cluster Computing</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="5195" to="5206" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Generative vs discriminative models</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">M</forename><surname>Joshi</surname></persName>
		</author>
		<ptr target="URL:shorturl.at/yEH26" />
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Conditional random fields: An introduction</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>Wallach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">CIS)</title>
		<imprint>
			<biblScope unit="page">22</biblScope>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
	<note type="report_type">Technical Reports</note>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<author>
			<persName><forename type="first">Z</forename><surname>Zhai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">Q</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Verspoor</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1808.08450</idno>
		<title level="m">Comparing cnn and lstm character-level embeddings in bilstm-crf models for chemical and disease named entity recognition</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Effective use of bidirectional language modeling for transfer learning in biomedical named entity recognition</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Sachan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sachan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Xing</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Machine learning for healthcare conference</title>
				<imprint>
			<publisher>PMLR</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="383" to="402" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Bagén</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">R</forename><surname>Toni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">R</forename><surname>Anna</surname></persName>
		</author>
		<title level="m">Deep learning: principios y fundamentos, Deep learning</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1" to="260" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Bidirectional recurrent neural networks</title>
		<author>
			<persName><forename type="first">M</forename><surname>Schuster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">K</forename><surname>Paliwal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE transactions on Signal Processing</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="2673" to="2681" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Gutiérrez-Fandiño</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Armengol-Estapé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">P</forename><surname>Carrino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><forename type="middle">D</forename><surname>Gibert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gonzalez-Agirre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Villegas</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2102.12843</idno>
		<title level="m">Spanish biomedical and clinical language embeddings</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Linguistic regularities in continuous space word representations</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>-T. Yih</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zweig</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2013 conference of the north american chapter of the association for computational linguistics: Human language technologies</title>
				<meeting>the 2013 conference of the north american chapter of the association for computational linguistics: Human language technologies</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="746" to="751" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/N19-1423</idno>
		<ptr target="10.18653/v1/N19" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1423">2019. -1 4 2 3</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Annotation of entities and relations in Spanish radiology reports</title>
		<author>
			<persName><forename type="first">V</forename><surname>Cotik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Filippo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Roller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Uszkoreit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Xu</surname></persName>
		</author>
		<idno type="DOI">10.26615/978-954-452-049-6_025</idno>
		<idno>-9 5 4 -4 5 2 -0 4 9 -6 _ 0 2 5</idno>
		<ptr target="10.2" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017</title>
				<meeting>the International Conference Recent Advances in Natural Language Processing, RANLP 2017<address><addrLine>Varna, Bulgaria</addrLine></address></meeting>
		<imprint>
			<publisher>INCOMA Ltd</publisher>
			<date type="published" when="0978">2017. 6 6 1 5 / 9 7 8</date>
			<biblScope unit="page" from="177" to="184" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Spanish pre-trained bert model and evaluation data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Cañete</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Chaperon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Fuentes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-H</forename><surname>Ho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Kang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pérez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">PML4DC at ICLR</title>
				<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
