<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Automatic Semantic Annotation for Abstracts of Scientific Discourses</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Lan</forename><surname>Huang</surname></persName>
							<email>huanglan@jlu.edu.cn</email>
							<affiliation key="aff0">
								<orgName type="department">College of Computer Science and Technology</orgName>
								<orgName type="institution">Jilin University</orgName>
								<address>
									<postCode>130012</postCode>
									<settlement>Changchun</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jinchao</forename><surname>Zhu</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">College of Computer Science and Technology</orgName>
								<orgName type="institution">Jilin University</orgName>
								<address>
									<postCode>130012</postCode>
									<settlement>Changchun</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yang</forename><surname>Chi</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">College of Software</orgName>
								<orgName type="institution">Jilin University</orgName>
								<address>
									<postCode>130012</postCode>
									<settlement>Changchun</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hao</forename><surname>Xu</surname></persName>
							<email>xuhao@jlu.edu.cn</email>
							<affiliation key="aff0">
								<orgName type="department">College of Computer Science and Technology</orgName>
								<orgName type="institution">Jilin University</orgName>
								<address>
									<postCode>130012</postCode>
									<settlement>Changchun</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Automatic Semantic Annotation for Abstracts of Scientific Discourses</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">7C495BF4EEBD113D5F4EB120B887BE3C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T16:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>semantic structure</term>
					<term>automatic annotation</term>
					<term>abstract</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The abstract of scientific papers has strong semantic structure, which contains abundant meaningful information, such as the background, research problem, solution, and result. Marking it out can help the computer understand and use this underlying information, which can provide great help for searching and scanning papers. In order to annotate the semantics of the paper automatically, we modeled the rhetorical structure of an abstract by linguistic clues and position information.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>We can complete the acquisition, dissemination, and communication of knowledge through scientific papers. The exponential growth of electronic scientific papers has made finding and selecting them difficult. Modeling and annotating the rhetorical structure of scientific articles can improve the efficiency of searching and reading <ref type="bibr" target="#b9">[10]</ref>. On one hand, it can help search engines to quickly retrieve insight into the core of scientific article <ref type="bibr" target="#b8">[9]</ref>; on the other hand, it can help the reader quickly browse and understand those articles. The study of the rhetorical structure and annotation of discourse has long-standing traditions. Automatic annotation of full articles under the existing stateof-the-art is difficult to achieve <ref type="bibr" target="#b2">[3]</ref>. The abstract of scientific articles is briefer than full articles, which makes the annotation possible.</p><p>There are numbers of well-known approaches to modeling the rhetorical structure of publications, such as Harmsze <ref type="bibr" target="#b7">[8]</ref>, ABCDE (Annotations,Background, Contribution, Discussion and Entities) <ref type="bibr" target="#b12">[13]</ref>, and SALT(Semantically Annotated L A T E X) <ref type="bibr" target="#b6">[7]</ref>. Based on the previous approaches, we provide a new model for abstract semantic analysis that includes the background, research problem, solution, and result. The model includes a preliminary verification by means of data crawling from the Internet for testing purposes. According to the methods for machine annotation of scientific discourse, Argumentative zoning <ref type="bibr" target="#b10">[11]</ref>, XIP(Xerox incremental parser ) <ref type="bibr" target="#b0">[1]</ref>, and SemTag <ref type="bibr" target="#b3">[4]</ref>, this paper use linguistic clues and location information to annotate the scientific publication's abstract automatically.</p><p>The predecessors of semantic technology have made many contributions for semantic annotated structure, which can improve the efficiency of publications' searching and reading <ref type="bibr" target="#b9">[10]</ref>. There are well-known approaches to modeling the rhetorical structure of publications. Harmsze proposed one of the first and the most comprehensive models for extracting the rhetoric and argumentation within scientific papers. This model focused on developing a modular representation for the creation and evaluation of scientific publications <ref type="bibr" target="#b7">[8]</ref>.De Waard and Tel introduced a different model for representation of discourse called ABCDE, which developed a L A T E X style sheet to identify five components in a discourse <ref type="bibr" target="#b12">[13]</ref>. They proposed finer-grained annotation to complement these structures and relationship types <ref type="bibr" target="#b2">[3]</ref>. A semantic authoring framework to enrich scientific publications with semantic metadata was called SALT, offering an improved coarse-grained rhetorical structure and a fine-grained semantic network <ref type="bibr" target="#b6">[7]</ref>.</p><p>After modeling the rhetorical structure of publications, we proceed to the automatic annotating of the scientific discourse. The first attempt to automatically annotate rhetorical expressions in research papers is called argumentative zoning <ref type="bibr" target="#b10">[11]</ref>. XIP detect rhetorical expressions from language uses of the authors, targeting salient sentences within scientific articles <ref type="bibr" target="#b0">[1]</ref>. Both models use clear linguistic clues to annotate the scientific articles. SemTag is a system offering an automatic ontology of semantic information, which identifies the candidate instance's keywords needed annotation. The system is based on TAP, a knowledge base from Stanford University which constructs two text vectors -context (before and after each 10 words) and candidate instance-calculating similarity and selecting best matches <ref type="bibr" target="#b3">[4]</ref>. The automatic annotation of research papers should capture and represent the evolution of ideas and findings that authors described in the articles <ref type="bibr" target="#b11">[12]</ref>. The main line of the above research aims at extracting factual information from the texts of the articles and transforming them into structured data <ref type="bibr" target="#b1">[2]</ref> <ref type="bibr" target="#b5">[6]</ref>. Semantic structure models and machine annotation models are used for full scientific publications, but the abstracts of papers contain more standardized semantic structures and rhetoric <ref type="bibr" target="#b4">[5]</ref>. Some journals like Nature provide a constant structure for their papers, which makes automatic annotation of abstracts possible.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Design</head><p>Based on the models for full publications mentioned in Section 2-Harmsze, ABCDE, SALT-we provide a new model for abstracts that includes background, research problem, solution, and result. We use linguistic clues extracted from abstract annotation and position information to annotate it automatically.</p><p>The flowchart for our system is in Figure <ref type="figure" target="#fig_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Framework of Process Model of System</head><p>The system is divided into the following seven main phases: Phase 1: Data Acquisition Module The four semantic tags were added manually to every sentence of scientific papers' abstracts, which includes background, research problem, solution, and result.</p><p>Phase 3: Data Preprocessing Module This module cleaned the text of publications' abstracts, such as removing numbers and punctuation, reducing word roots, converting all words to lower case, and so on.</p><p>Phase 4: Features Selection and Weighted Module In this module, TF-IDF algorithm was used to manage the numerous and complicated information of texts, which helped us discover key words that are more important and appeared more frequently in one text.</p><p>Phase 5: Sentences Segmentation and Preprocessing Module This module was used to segmented the abstract text by period, cleaned the sentences as in Phase 3, and stored them as feature vectors with an ID, which described the sentences' positions in the paper's abstract.</p><p>Phase 6: Similarity Calculation Module This module acquired the similarity level of two feature vector quantities by cosine similarity algorithm, which calculated the value of the included angle of two vectors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Phase 7: Annotation Based on Similarity and Position Module</head><p>The automatic annotation the semantics of the paper,by linguistic clues(They propose...) and position information(the position of the sentence).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Feature Selection and Weight</head><p>The results of TF-IDF revealed the information of words as well as the texts in the form of a matrix, and every word had a TF-IDF weight value in each text, which presented the significance of the word in that text. The calculation of the weighted value became the core of the arithmetic.</p><p>TF-IDF algorithm was divided into the concepts TF and IDF. TF(Term frequency) reflects that the words with higher rates will attain a higher TF value. The IDF is the acronym for inverse document frequency. Compared with these words in common use, the ones with a high frequency acquired a higher IDF value, which did not often appear in other texts. As a result, the TF-IDF weight value was calculated with the formula TF*IDF. The key words with high values were chosen according to the integrated conditions of their TF and IDF.</p><p>The weighted feature words of the research problem are in Table <ref type="table" target="#tab_0">1</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Similarity Calculation</head><p>We calculated the similarity using the cosine formula as follows:</p><p>(A=[A1,A2,...,An],B=[B1,B2,...,Bn])</p><p>Coupled with the growth of the value, the similarity level of two vectors was reduced. In this way, we were able to learn whether the two vectors were similar. In our system, every text in the TF-IDF matrix was considered a word vector, and the test word-frequency vector was another. We used this arithmetic to reveal the similarity level of the two texts. Table <ref type="table" target="#tab_1">2</ref> revealed the similarity between the sentence and tags that we proposed in the system. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Annotation Based on Similarity and Position</head><p>Similarity was calculated by featuring words in Phase 6. We found that the difficult point was how to select the sentences from the abstract, which should add the research problem or solution tag. We discovered that the label of the sentence was relevant to their position. We processed information of abstract location using 88 articles from DMKD. The first one to two sentences were the background, the next one to two sentences were research questions, and then the solution was discussed in the next two to six sentences, and the final one to two sentences were the result. Different tags were labeled according to the location of the sentence. For example, for the third sentence mentioned in Phase 6, the system annotated the solution label to it because it is the fifth sentence of the paper's abstract and the sentence in front of it has been marked as the research problem.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Model Application</head><p>This paper studies how to annotate semantic structure to the abstract of scientific papers automatically, based on real data from DBLP DB. We developed a small experimental system in which the abstract text is the system's input and the abstract with four labels is the output. The system's interface is as shown in Figure <ref type="figure" target="#fig_1">2</ref>. Eighty articles were used to test the model proposed, the test results were as Table <ref type="table" target="#tab_2">3</ref>: This paper established a model to annotate the semantic structure for scientific papers' abstracts automatically. The first step of the process is to build a rhetorical structure that includes background, research problem, solution, and result. Then, through feature extraction and calculating weight for every segment of an abstract, we attain the feature vector. Next, according to the cosine algorithm, the system computes the score of the similarity. Last, the abstract was annotated by similarity and sentence positions.</p><p>The correct rate the system proposed in this paper could only reach about 67.5% of the automatic annotation of the semantic structure, and the effect will be better if the ontology system is introduced in the future, and the rhetorical words of scientific articles in different journals and different areas may be different; therefore, the next focus is to improve the accuracy and create a cross-domain model.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. system's flowchart This system developed a web crawler with Python to obtain data from DBLP DB. We obtained 208 articles from two different sources, of which 88 were from the journal Data Mining and Knowledge Discovery (DMKD), and the rest were from the European Conference on Principles of Data Mining and Knowledge Discovery (PKDD). Phase 2: Annotation of Semantic Structure Module The four semantic tags were added manually to every sentence of scientific papers' abstracts, which includes background, research problem, solution, and result. Phase 3: Data Preprocessing Module This module cleaned the text of publications' abstracts, such as removing numbers and punctuation, reducing word roots, converting all words to lower case, and so on. Phase 4: Features Selection and Weighted Module In this module, TF-IDF algorithm was used to manage the numerous and complicated information of texts, which helped us discover key words that are more important and appeared more frequently in one text. Phase 5: Sentences Segmentation and Preprocessing Module This module was used to segmented the abstract text by period, cleaned the sentences as in Phase 3, and stored them as feature vectors with an ID, which described the sentences' positions in the paper's abstract. Phase 6: Similarity Calculation Module This module acquired the similarity level of two feature vector quantities by cosine similarity algorithm, which calculated the value of the included angle of two vectors. Phase 7: Annotation Based on Similarity and Position Module The automatic annotation the semantics of the paper,by linguistic clues(They propose...) and position information(the position of the sentence).</figDesc><graphic coords="3,165.95,79.97,283.47,158.69" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. System interface</figDesc><graphic coords="5,137.60,382.50,340.15,150.43" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Weighted feature words of research problem.</figDesc><table><row><cell>Word</cell><cell>Value</cell><cell>Word</cell><cell>Value</cell><cell>Word</cell><cell>Value</cell></row><row><cell cols="4">generate 0.11142059 discrimination 0.08177003</cell><cell>output</cell><cell>0.06733483</cell></row><row><cell cols="2">use 0.11142059</cell><cell>answer</cell><cell cols="3">0.06733483 association 0.06733483</cell></row><row><cell cols="2">process 0.1090267</cell><cell>attempt</cell><cell cols="3">0.06733483 category 0.06733483</cell></row><row><cell cols="2">success 0.10100225</cell><cell>classic</cell><cell>0.06733483</cell><cell>choice</cell><cell>0.06733483</cell></row><row><cell cols="2">neighbor 0.10100225</cell><cell>criterion</cell><cell>0.06733483</cell><cell>design</cell><cell>0.06733483</cell></row><row><cell cols="2">cluster 0.08913647</cell><cell>move</cell><cell>0.06733483</cell><cell>lack</cell><cell>0.06733483</cell></row><row><cell cols="2">match 0.0854057</cell><cell cols="4">outcome 0.06733483 publish 0.06733483</cell></row><row><cell cols="2">require 0.08177003</cell><cell>space</cell><cell>0.06733483</cell><cell>warp</cell><cell>0.06733483</cell></row><row><cell cols="6">become 0.08177003 condition 0.06733483 subsequence 0.06733483</cell></row><row><cell cols="2">change 0.08177003</cell><cell>critic</cell><cell>0.06733483</cell><cell>warp</cell><cell>0.06733483</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Similarity between the sentences and tags. Sentence Background Research problem Solution Result 1. It defines the possible world model with probability intervals, and proves that probability intervals of all possible worlds are feasible.</figDesc><table><row><cell>0.1007</cell><cell>0.0401</cell><cell>0</cell><cell>0.0427</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Test results.</figDesc><table><row><cell>Tags</cell><cell cols="4">Background Research problem Solution Result</cell></row><row><cell>Correct number</cell><cell>55</cell><cell>52</cell><cell>44</cell><cell>65</cell></row><row><cell>Error number</cell><cell>25</cell><cell>28</cell><cell>36</cell><cell>15</cell></row><row><cell>Correct rate</cell><cell>68.75%</cell><cell>65%</cell><cell cols="2">55% 81.25%</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Acknowledgements</head><p>This work is supported by the National Natural Science Foundation of China (No. 61300147), China Postdoctoral Science Foundation (No. 2014M551185), and Science and Technology Program of Changchun (No. 14GH014).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Robustness beyond shallowness: incremental dependency parsing</title>
		<author>
			<persName><forename type="first">S</forename><surname>Ait-Mokthar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Chanod</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Special issue of NLE Journal</title>
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Biorat: extracting biological information from full-length papers</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Corney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Buxton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">B</forename><surname>Langdon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">T</forename><surname>Jones</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bioinformatics</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">17</biblScope>
			<biblScope unit="page" from="3206" to="3213" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Modeling scientific research articles-shifting perspectives and persistent issues</title>
		<author>
			<persName><forename type="first">A</forename><surname>De Waard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kircz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. ELPUB2008 Conference on Electronic Publishing</title>
				<meeting>ELPUB2008 Conference on Electronic Publishing</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="234" to="245" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Semtag and seeker: Bootstrapping the semantic web via automated semantic annotation</title>
		<author>
			<persName><forename type="first">S</forename><surname>Dill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Eiron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Gibson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Gruhl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Guha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jhingran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kanungo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Rajagopalan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tomkins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Tomlin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 12th international conference on World Wide Web</title>
				<meeting>the 12th international conference on World Wide Web</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="178" to="186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The rhetoric structure of research article abstracts in english studies journals</title>
		<author>
			<persName><forename type="first">K</forename><surname>Dor</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Prague Journal of English Studies</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="119" to="139" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Garten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">B</forename><surname>Altman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMC bioinformatics</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">S6</biblScope>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
	<note>Suppl</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Salt-semantically annotated\ mbox {\ LaTeX} for scientific publications</title>
		<author>
			<persName><forename type="first">T</forename><surname>Groza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Handschuh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Möller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Decker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Semantic Web: Research and Applications</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="518" to="532" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A P</forename><surname>Harmsze</surname></persName>
		</author>
		<title level="m">A modular structure for scientific articles in an electronic environment</title>
				<imprint>
			<date type="published" when="2000">2000</date>
		</imprint>
		<respStmt>
			<orgName>University of Amsterdam</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Mapping knowledge domains</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>Shiffrin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>B02rner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences of the United States of America</title>
		<imprint>
			<biblScope unit="volume">101</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="5183" to="5185" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
	<note>suppl. Suppl</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">09gnes Sndor: Scientific discourse on the semantic web: A survey of models and enabling technologies</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Shum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Groza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Handschuh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Semantic Web Journal Interoperability Usability Applicability</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Argumentative zoning: Information extraction from scientific text</title>
		<author>
			<persName><forename type="first">S</forename><surname>Teufel</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2000">2000</date>
			<publisher>Citeseer</publisher>
		</imprint>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Identifying the epistemic value of discourse segments in biology texts</title>
		<author>
			<persName><forename type="first">A</forename><surname>De Waard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Eigner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eighth International Conference on Computational Semantics</title>
				<meeting>the Eighth International Conference on Computational Semantics</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="351" to="354" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">The abcde format enabling semantic conference proceedings</title>
		<author>
			<persName><forename type="first">A</forename><surname>De Waard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tel</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>SemWiki</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
