<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">TextAI: Enhancing TextAE with Intelligent Annotation Support</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Maximilian</forename><surname>Grundke</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Johannes</forename><surname>Jasper</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Mariya</forename><surname>Perchyk</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jan</forename><forename type="middle">Philipp</forename><surname>Sachse</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ralf</forename><surname>Krestel</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Mariana</forename><surname>Neves</surname></persName>
							<email>mariana.neves@hpi.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">TextAI: Enhancing TextAE with Intelligent Annotation Support</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">48D8FF64FAD8363CFA3385F8DBFE4060</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>We present TextAI, an extension to the annotation tool TextAE, that adds support for named-entity recognition and automated relation extraction based on machine learning techniques. Our learning approach is domain-independent and increases the quality of the detected relations with each added training document. We further aim at accelerating and facilitating the manual curation process for natural language documents by supporting simultaneous annotation by multiple users.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Faced with rapidly growing numbers of publicly available natural language documents, it is becoming increasingly difficult to extract the underlying knowledge in a structured manner. Thus, annotation of documents for the purpose of extracting this information is an important task in many research domains today. Creating these annotations is mostly done manually, even though it is a very time consuming work and requires deep understanding and domain knowledge <ref type="bibr" target="#b4">(Hirschman et al., 2012)</ref>.</p><p>TextAI<ref type="foot" target="#foot_0">1</ref> is a tool developed to support annotators as a first step on the way to minimize the effort of extracting information from written texts. There exists already a wide range of annotation editors <ref type="bibr" target="#b6">(Neves and Leser, 2012)</ref>. Thus, we chose not to implement an additional standalone editor, but to build our system around the TextAE<ref type="foot" target="#foot_1">2</ref> tool, which is an existing open source editor. We extended TextAE by providing additional features to manage documents of multiple users and predicting entities and relations, which can then be adopted by users into their set of annotations. The system learns from its users and improves prediction of relations over time based on previously annotated documents. We use the biomedical domain as use case for our system and as basis for evaluation. The developed approach, however, can be used across different domains that require manual annotation. With its automated annotation suggestions and its multi-user support, TextAI introduces capabilities that are conceptually different from other annotation tools.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head><p>Automatic Annotation Suggestions. Various studies on prediction of annotations confirm that automated recommendations increase the speed and improve the quality of annotations. Lingren et al. <ref type="bibr" target="#b5">(Lingren et al., 2014)</ref> determined that automatic annotation suggestions for named entities result in 13.83% to 21.5% time saving without reducing the inter-annotator agreement (IAA) or qualitative annotator performance. Additionally, Fort and Sagot <ref type="bibr" target="#b2">(Fort and</ref><ref type="bibr" target="#b2">Sagot, 2010) and</ref><ref type="bibr" target="#b7">South et al. (South et al., 2014)</ref> showed a significant gain in quality. <ref type="bibr" target="#b3">Hernandez et al. (Hernandez et al., 2014)</ref> reported an improvement of non-expert annotator performance using automated namedentity recognition. The WebAnno annotation tool <ref type="bibr" target="#b11">(Yimam et al., 2014)</ref> includes automatic suggestions for three generic structures: spans, relations, and chains. It integrates an external machine learning tool, which requires users to configure the features themselves. While this allows for domain specific optimization, it excludes non-expert users from using this functionality. By allowing definition of custom annotation labels, WebAnno increases its flexibility, however eliminates labels as a suitable machine learning feature.  Annotation Editors. TextAE is a browser-based annotation tool which comes without a server backend, but supports importing documents and annotations based on a simple JSON format. Being implemented using HTML and JavaScript, TextAEs functionality can be easily expanded. Furthermore, it supports both a wide range of keyboard shortcuts and usability improvements that aim to increase performance of its users. The brat rapid annotation tool <ref type="bibr" target="#b8">(Stenetorp et al., 2012)</ref> features a client-server architecture, supports annotation of documents by multiple users simultaneously and allows comparison of different sets of annotations for one document. We chose to adapt several of these features and enhance TextAE to support them, while leaving it a simple and easy to use editor, as feature-rich tools, such as brat, quickly become complicated to use as they require complex configuration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">System Architecture</head><p>Annotating documents with entities and relations traditionally involves manually highlighting entities in the document and marking relations between them. Our system employs machine learning techniques to automate these steps by using the workflow depicted in Figure <ref type="figure" target="#fig_0">1</ref>. The user interacts with a central front-end component, which is used to manage annotation tasks, documents and users, as well as importing documents from multiple sources. Users also interact with the annotation editor for editing and reviewing documents and their annotations. In the back end, we use an in-memory database (IMDB), which provides document storage, text analysis features and integration with machine learning algorithms. The middleware layer mediates between the different interfaces and the IMDB and deals with all logic concerning user and document management.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Front end</head><p>The user interacts with a central front-end component, which allows importing documents from multiple sources, such as the local file system, e.g., files in the BioC file format <ref type="bibr" target="#b1">(Comeau et al., 2013)</ref> or by querying PubMed<ref type="foot" target="#foot_2">3</ref> , based on plain text. After importing documents into TextAI, these can be loaded from the database (cf. Back end). Besides document storage, the IMDB further offers basic text mining and analysis features that we use and expand. Domain independence is achieved through the introduction of another level of hierarchy called "tasks", which can be used to organize documents under one semantic group. This has the technical implication that each task has its own machine learning models, hence annotations within one task have no impact on predictions in another one.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Annotation Editor</head><p>Users can review and edit documents and their annotations through the annotation editor. The later also allows to trigger NER, relationship prediction, and other methods aimed at improving annotation quality and speed. We extended TextAE, which is a powerful standalone annotation tool based on JSON-formatted input, that allows loading text and creating annotations and relations. It allows adding multiple annotations to each position in the document and displaying different layers of annotations through color coding. Instead of displaying different kinds of annotations, such as POS tags and domain-specific information, e.g., medical terms, we use different colors for annotations made by different users on the same document. Because of the HTML span-tag-based implementation for annotation rendering, TextAE displays overlapping annotations in a stacked way instead of inline. Users can create custom labels for their annotations in TextAI. However, we ask the users to map their custom labels to one of the UMLS semantic types<ref type="foot" target="#foot_3">4</ref> , in order to improve the ability of our system to learn based on annotations made by users and to normalize the annotations made by different users and tasks. For instance, when annotating the DDI corpus <ref type="bibr" target="#b4">(Herrero-Zazo et al., 2013)</ref>, users could create four labels, one for each drug type included in this corpus, but they are asked map every type to a UMLS semantic type, for instance, "T2000 -Clinical Drug" or "T121 -Pharmacologic Substance". Currently, we only allow mapping a label to one single UMLS semantic type.</p><p>Annotating large corpora is a task that can involve multiple experts, which requires user management to be available in a collaborative annotation system. A problem that occurs when multiple people do the same work is the creation of annotations with similar, but not identical meaning. Therefore, our system nudges users towards better annotations by asking them to map their labels to the UMLS semantic types. Further, during annotation, users can choose to hide their annotations from both other users and the machine learning part of our system, while still being able to use all of its features.</p><p>Users can correct the predictions made by the system by selectively adding them to their own annotation sets and editing them if necessary. At any point in the process, users can also manually add, remove, and alter entities and relations to their own set.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Back end</head><p>We use Rserve<ref type="foot" target="#foot_4">5</ref> , which provides an interface to the statistical computing language R with its extensive text mining and machine learning capabilities. Further, our system relies on the text analysis functionalities of an IMDB for entity and relation predictions. Besides the documents, taskspecific metadata, such as trained models, domain specific stopword lists and NER dictionaries are stored in the database, allowing fast access to im-portant domain-dependent information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Information Extraction</head><p>The automated suggestion of annotations involves two major steps, NER and relation prediction. For NER we apply a simple dictionary-based approach using the UMLS dictionary and part-of-speech (POS) tagging of the documents.</p><p>We train two support vector machines<ref type="foot" target="#foot_5">6</ref> to detect relations between two entities: one to determine whether a relation exists between two entities, and in case of positive, one to classify the type of relation. The relations are represented by n-dimensional feature vectors. We get descriptive characteristics of the relation of two entities by combining lexical and syntactical features In the pre-processing step, the document text is split in sentences and every word is tokenized, lemmatized and POS-tagged. Similar to the approach described in <ref type="bibr" target="#b0">(Bui et al., 2014)</ref>, we divided the sentence into three groups by adding the prefix "b" to each token appearing before the first entity, the prefix "i" to the words in between the two entities and the prefix "a" to the words after the second entity. While we remove stop words and ignore entities in the context, we do not filter out punctuation, i.e. comma, colon and semicolon, since their appearance between two entities can be a strong indicator that there is no relation between these entities. We only consider three tokens on either side of each entity in order to emphasize the near context around the annotations. The annotated types of both entities are considered as a feature as well. We also include the distance, i.e. the number of words and the number of characters between two entities, as a feature. We avoid using trigger words as proposed by <ref type="bibr" target="#b0">(Bui et al., 2014)</ref>, since this would contradict with our domain-independence objective.</p><p>Every newly annotated document is used to retrain the models for relationship extraction, thus learning over time to improve its performance. Further, using the UMLS predefined set of types also improves the learning capabilities of our system, as entities can now be chosen from a finite set and the entity type becomes a stronger feature for relation prediction. Since training these models is only possible and reasonable given a certain amount of information, TextAI needs a set of preannotated documents, either by importing or manually annotating a few documents of the corpus in advance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5">Middleware</head><p>As both database and front-end components have highly independent schemata and interfaces, transforming data between them is a key role of the middleware. The middleware implements a RESTful interface representing users, tasks, documents, user-documents and their content to pass on information about data objects to the browserbased front end. Additionally, user management and access rights management are handled here. An integrated user model ensures that every action taken is properly authenticated and authorized. Our annotation editor can load and export data provided in JSON format, thus the middleware is also responsible for transforming information between the different representations in our system. In addition, the middleware provides an interface to import and export documents. Internally, we use the BioC format <ref type="bibr" target="#b1">(Comeau et al., 2013)</ref>, as it is general enough to be used across different annotation domains.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Experiments</head><p>We focused on the medical domain and used the DDI corpus <ref type="bibr" target="#b4">(Herrero-Zazo et al., 2013)</ref> to evaluate the performance of our NER and relation extraction procedures.</p><p>NER. In our evaluation, we consider not only exact matches of the gold-standard and the predicted entities, but also overlapping entity label markers are accepted. Our average F-1 score was 77.3% with a recall of 85.62% and a precision of Relation Extraction. In total, the DDI corpus names five different relation types: non-relation, mechanism, effect, advise and a general interaction. We achieved a precision of 72.95% on the test set of the DDI corpus, averaged over 10 iterations. Other researchers who performed relation prediction on the DDI corpus achieve comparable performance results of 60.9% and 62.99% macro averaged F-score <ref type="bibr" target="#b9">(Thomas et al., 2013a)</ref>.</p><p>Training Set Size. When users annotate new relations, the systems performance increases since the SVM models are retrained with new user input. We measure the impact of this input on the efficiency of our classifier with a setup as follows. Starting with only one annotated document, we train both SVM classifiers on the given data and evaluate on 20 randomly selected documents. We then select 10 more documents for training and again test on 20 test documents. This cycle repeats until 500 documents are included in the training set. Figure <ref type="figure" target="#fig_2">3</ref> illustrates the F-score for each of the DDI interaction types averaged over 10 runs with varying training documents.</p><p>In comparison to our performance for other relations, the "interaction" type obtained lower results and these have oscillated over our experiments. This relation type is under-represented in the DDI corpus as it constitute only 6% of the annotated DDI relations. As discussed in previous work <ref type="bibr" target="#b10">(Thomas et al., 2013b)</ref>, this resulted in lower performance of the systems for this relation type on the test set. We believe that the oscillation on the results occurs for those test sets which con-tained more or less instances of the "interaction" relation type that could not be correctly detected by our system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusions and Future Work</head><p>We have presented a prototype that extends the annotation editor TextAE with multi-user functionality and annotation prediction. This was achieved by creating a concept of per-user annotation sets and tasks, as well as a NER framework and relation prediction algorithm. Our system provides users with functionality for annotations prediction without interfering with their day-to-day annotation work.</p><p>As future work, we plan on conducting a user study on annotation speed and quality. Further, we also want to explore NER algorithm based on machine learning and on the labels which are normalized to the UMLS semantic types and not only on the current dictionary-based approach. Finally, semi-supervised learning approaches, such as active learning to leverage user feedback, could improve NER and relation extraction even further.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Components of the TextAI system.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Screen-shot of the TextAI annotation editor showing annotations from different users (1), a drug-drug-interaction relation (2), the user-defined label (3) and the prediction functionality (4).</figDesc><graphic coords="3,72.00,62.81,453.54,141.25" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Influence of training set size on prediction performance.</figDesc><graphic coords="4,318.19,62.81,196.44,141.73" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://github.com/LearningToNote</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://textae.pubannotation.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.ncbi.nlm.nih.gov/pubmed</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://metamap.nlm.nih.gov/ SemanticTypesAndGroups.shtml</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://rforge.net/Rserve/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">e1071 (https://cran.r-project.org/ package=e1071), an R interface to LIBSVM with its default RBF kernel</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A novel featurebased approach to extract drugdrug interactions from biomedical text</title>
		<author>
			<persName><forename type="first">Quoc-Chinh</forename><surname>Bui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Peter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Erik</forename><forename type="middle">M</forename><surname>Sloot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jan</forename><forename type="middle">A</forename><surname>Van Mulligen</surname></persName>
		</author>
		<author>
			<persName><surname>Kors</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bioinformatics</title>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Bioc: a minimalist approach to interoperability for biomedical text processing</title>
		<author>
			<persName><forename type="first">C</forename><surname>Donald</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rezarta</forename><surname>Comeau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Islamaj Doan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kevin</forename><surname>Ciccarese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martin</forename><surname>Bretonnel Cohen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Florian</forename><surname>Krallinger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhiyong</forename><surname>Leitner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yifan</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabio</forename><surname>Peng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Manabu</forename><surname>Rinaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alfonso</forename><surname>Torii</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Karin</forename><surname>Valencia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><forename type="middle">C</forename><surname>Verspoor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cathy</forename><forename type="middle">H</forename><surname>Wiegers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">John</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><surname>Wilbur</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2013">2013. 2013</date>
		</imprint>
	</monogr>
	<note type="report_type">Database</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Influence of pre-annotation on pos-tagged corpus development</title>
		<author>
			<persName><forename type="first">Karën</forename><surname>Fort</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Benoît</forename><surname>Sagot</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourth Linguistic Annotation Workshop, LAW IV &apos;10</title>
				<meeting>the Fourth Linguistic Annotation Workshop, LAW IV &apos;10<address><addrLine>Stroudsburg, PA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="56" to="63" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Testing pre-annotation to help non-experts identify drug-drug interactions mentioned in drug product labeling</title>
		<author>
			<persName><forename type="first">Harry</forename><forename type="middle">S</forename><surname>Andres M Hernandez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">John</forename><forename type="middle">R</forename><surname>Hochheiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rebecca</forename><forename type="middle">S</forename><surname>Horn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Richard</forename><forename type="middle">D</forename><surname>Crowley</surname></persName>
		</author>
		<author>
			<persName><surname>Boyce</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Second AAAI Conference on Human Computation and Crowdsourcing</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The ddi corpus: An annotated corpus with pharmacological substances and drug-drug interactions</title>
		<author>
			<persName><forename type="first">María</forename><surname>Herrero-Zazo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Isabel</forename><surname>Segura-Bedmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paloma</forename><surname>Martínez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thierry</forename><surname>Declerck ; Lynette ; Hirschman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P</forename><surname>Gully</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Martin</forename><surname>Burns</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cecilia</forename><surname>Krallinger</surname></persName>
		</author>
		<author>
			<persName><surname>Arighi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Text mining for the biocuration workflow</title>
				<editor>
			<persName><forename type="first">Alfonso</forename><surname>Bretonnel Cohen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Cathy</forename><forename type="middle">H</forename><surname>Valencia</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Andrew</forename><surname>Wu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Karen</forename><forename type="middle">G</forename><surname>Chatr-Aryamontri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Eva</forename><surname>Dowell</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Anlia</forename><surname>Huala</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Robert</forename><surname>Loureno</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Anne-Lise</forename><surname>Nash</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Thomas</forename><surname>Veuthey</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Andrew</forename><forename type="middle">G</forename><surname>Wiegers</surname></persName>
		</editor>
		<editor>
			<persName><surname>Winter</surname></persName>
		</editor>
		<imprint>
			<publisher>Database</publisher>
			<date type="published" when="2012">2013. 2012. 2012</date>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="914" to="920" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements</title>
		<author>
			<persName><forename type="first">Todd</forename><surname>Lingren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Louise</forename><surname>Deleger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Katalin</forename><surname>Molnar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Haijun</forename><surname>Zhai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jareen</forename><surname>Meinzen-Derr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Megan</forename><surname>Kaiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laura</forename><surname>Stoutenborough</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Qi</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Imre</forename><surname>Solti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Medical Informatics Association</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="406" to="413" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A survey on annotation tools for the biomedical literature</title>
		<author>
			<persName><forename type="first">Mariana</forename><surname>Neves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ulf</forename><surname>Leser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Briefings in bioinformatics</title>
		<imprint>
			<biblScope unit="page">84</biblScope>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text</title>
		<author>
			<persName><forename type="first">R</forename><surname>Brett</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Danielle</forename><surname>South</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ying</forename><surname>Mowery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jianwei</forename><surname>Suo</surname></persName>
		</author>
		<author>
			<persName><surname>Leng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephane</forename><forename type="middle">M</forename><surname>Ferrndez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wendy</forename><forename type="middle">W</forename><surname>Meystre</surname></persName>
		</author>
		<author>
			<persName><surname>Chapman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Special Issue on Informatics Methods in Medical Privacy</title>
		<imprint>
			<biblScope unit="volume">50</biblScope>
			<biblScope unit="page" from="162" to="172" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
	<note>Journal of Biomedical Informatics</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">brat: a web-based tool for nlp-assisted text annotation</title>
		<author>
			<persName><forename type="first">Pontus</forename><surname>Stenetorp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sampo</forename><surname>Pyysalo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Goran</forename><surname>Topić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tomoko</forename><surname>Ohta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sophia</forename><surname>Ananiadou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jun'ichi</forename><surname>Tsujii</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics</title>
				<meeting>the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics<address><addrLine>Avignon, France</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2012-04">2012. April</date>
			<biblScope unit="page" from="102" to="107" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Wbi-ddi: Drug-drug interaction extraction using majority voting</title>
		<author>
			<persName><forename type="first">Philippe</forename><surname>Thomas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mariana</forename><surname>Neves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tim</forename><surname>Rocktäschel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ulf</forename><surname>Leser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013)</title>
				<meeting>the Seventh International Workshop on Semantic Evaluation (SemEval 2013)<address><addrLine>Atlanta, Georgia, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2013-06">2013a. June</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="628" to="635" />
		</imprint>
	</monogr>
	<note>Second Joint Conference on Lexical and Computational Semantics (*SEM)</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Wbi-ddi: drug-drug interaction extraction using majority voting</title>
		<author>
			<persName><forename type="first">Philippe</forename><surname>Thomas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mariana</forename><surname>Neves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tim</forename><surname>Rocktäschel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ulf</forename><surname>Leser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Second Joint Conference on Lexical and Computational Semantics (* SEM)</title>
				<imprint>
			<date type="published" when="2013">2013b</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="628" to="635" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Automatic annotation suggestions and custom annotation layers in webanno</title>
		<author>
			<persName><forename type="first">Chris</forename><surname>Seid Muhie Yimam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Richard</forename><surname>Biemann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Iryna</forename><surname>Eckart De Castilho</surname></persName>
		</author>
		<author>
			<persName><surname>Gurevych</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations</title>
				<meeting>52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="91" to="96" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
