<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Short Survey of Discourse Representation Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tudor Groza</string-name>
          <email>tudor.groza@deri.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Siegfried Handschuh</string-name>
          <email>siegfried.handschuh@deri.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tim Clark</string-name>
          <email>twclark@nmr.mgh.harvard.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Buckingham Shum</string-name>
          <email>S.Buckingham.Shum@open.ac.uk</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anita de Waard</string-name>
          <email>A.dewaard@elsevier.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DERI, National University of Ireland</institution>
          ,
          <addr-line>Galway, IDA Business Park, Lower Dangan, Galway</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Elsevier Labs</institution>
          ,
          <addr-line>Radarweg 29, 1043 NX Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Initiative in Innovative Computing, Harvard University 60</institution>
          <addr-line>Oxford Street, Cambridge, MA 02138</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Knowledge Media Institute, The Open University Milton Keynes</institution>
          ,
          <addr-line>MK7 6AA</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>With the advancement of technology and the wide adoption of ontologies as knowledge representation formats, in the last decade, a handful of models were proposed for the externalization of the rhetoric and argumentation captured within scientific publications. Conceptually, most of these models share a similar representation form of the scientific publication, i.e. as a series of interconnected elementary knowledge items. The main differences are given by the terminology used, the types of rhetorical and / or argumentation relations connecting the knowledge items and the foundational theories supporting these relations. This paper analyzes the state of the art and provides a concise comparative overview of the five most prominent discourse representation models, with the goal of sketching an unified model for discourse representation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Dissemination, an important phase of scientific research, can be seen as a
communication process between scientists. They expose and support their findings, while
discussing claims stated in related scientific publications. This communication takes
place over the course of several publications, where each paper itself contains a
rhetorical discourse structure laying out supportive evidence for the raised claims.
This discourse structure is usually hidden in the semantics expressed by the writer
within the publication’s content and thus hard to discover by the reader.</p>
      <p>
        Externalization, as defined by Nonaka [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], represents the process of articulating
tacit knowledge into explicit concepts. It holds the key to social knowledge creation
and crystallization, through a process of sharing, discussion and testing with others.
In the scientific dissemination context, externalization has a dual form. On the one
hand, scientific publications represent intrinsically a form of cognitive
externalization, making explicit the scientists’ thoughts. On the other hand, in order to make
these publications much more accessible to computation, and more specifically on
the Web, so that information can be easier navigated, compared and understood,
we need for a formal externalization, i.e. stepping from the freely expressed text
to machine-processable structures. In the case of the rhetorical and argumentation
discourse based on claims and evidence, the degree of formalization can be a couple
of keywords, or a weakly structured text, both possibly including direct references
to the publications first stating the actual claims or providing evidence for new
claims.
      </p>
      <p>
        In the last decade, a handful of models targeting the externalization of the
rhetoric and argumentation captured within the discourse of scientific publications,
were proposed. Conceptually, most of these models share a similar representation
form of the scientific publication, i.e. as a series of interconnected elementary
knowledge items. The main differences are given by the terminology used, the types of
rhetorical and / or argumentation relations connecting the knowledge items and the
foundational theories supporting these relations. While the argumentation side is,
in general, inspired from IBIS (Issue Based Information Systems) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the rhetorical
structuring uses different foundations, such as Cognitive Coherence Relations [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
or the Rhetorical Structure Theory (RST) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. A number of discourse-relationship
approaches, including a comparison between RST and other taxonomies has been
discussed by Hovy [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        This paper provides a concise comparative overview of some of these discourse
representation models. We will focus on five such models: SWAN (Semantic Web
Applications in Neuromedicine) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], SALT (Semantically Annotated LATEX) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and
the models proposed by the Scholarly Ontologies project [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], Harmsze [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and de
Waard [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We would like to point out that in addition to these
representationdriven approaches, and in the same category of analysis of scientific publications
content, research was also performed on automatic extraction of epistemic items.
Relevant work includes the efforts of Teufel [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], Mizuka [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] or Lisacek [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
Nevertheless, in this paper, we concentrate only on the former approaches with the goal
of trying to find a common denominator that could lead to an unified model for
discourse representation.
      </p>
      <p>The remainder of the paper is structured as follows: Sect. 2 lists the aspects
on which we have focused for the comparative analysis, Sect. 3 details the five
above-mentioned models and before concluding in Sect. 5, we discuss the overall
comparison of the representations in Sect. 4.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Analysis features</title>
      <p>For building a comprehensive comparison of the discourse representation models,
we compiled a list of features to be observed throughout the overall analysis. The
list consists of the following elements:
– Course-grained rhetorical structure – identifies the existence of a
coursegrained rhetorical structure representation within the model. Its goal is to
capture the semantics of larger blocks of text inside the publication’s content that
have an associated rhetorical role.
– Fine-grained rhetorical structure – as opposed to the previous feature,
this feature considers the fine-grained content composing the discourse (i.e.
restricted discourse knowledge items in forms of claims, positions, arguments, etc)
between which usually emerges a network arrangement driven by the different
types of relations that connect the fine-grained elementary items.
– Relations – looks at the types of relations used for linking the fine-grained
structure into an unitary network.
– Polarity – specifies if the model includes explicitly the polarity of the relations
(i.e. positive or negative). For example, a supports relation would have a
positive polarity attached, while a refutes relation would have a negative polarity.
Generally, this polarity is to some extent similar to the polarity extracted in
the opinion mining and sentiment analysis field, which, we will not focus on,
since it is out of the scope of this paper.
– Weights – specifies if the model considers explicitly the weights of the relations,
i.e. if some relations are stronger than others. This feature can be tightly coupled
to the polarity. For example, the supports relation might be considered stronger
than the agrees with relation, both being positive from the polarity perspective.
– Provenance – indicates whether the model encapsulates also the provenance
information attached to the fine-grained rhetorical structure (i.e. the accurate
localization of the text span that represents the textual counterpart of the
discourse knowledge item).
– Shallow metadata support – shows if the model has embedded support for
shallow metadata (e.g. authors, titles, etc)
– Domain knowledge – analyses the close coupling of the model to particular
domain knowledge areas.
– Purpose – presents the purpose, or intended use, of the model as envisioned
by their creators.
– Evaluation and uptake – mentions the evaluation and uptake status of the
model.</p>
      <p>These last two features in the list try to capture the “practicality” dimension of the
discourse representation models, with the last one pointing in essence to a
realitycheck, in terms of deployment, adoption and adequacy of the models in actual use
by scientists.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Discourse representation models</title>
      <sec id="sec-3-1">
        <title>Harmsze’s Model</title>
        <p>
          One of the first and probably the most comprehensive models for capturing the
rhetoric and argumentation within scientific publications was introduced by
Harmsze [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. She focused on developing a modular representation for the creation and
evaluation of scientific articles. Although the corpus used as a foundation for the
analysis was about experimental molecular dynamics, the resulted model is
uniformly valid for any scientific domain.
        </p>
        <p>The author models the discourse by means of a coarse-grained structure split
into modules and a series of links to connect these modules. The six modules
proposed by Harmsze are as follows: (i) Meta-Information is a support module that
keeps the entire publication glued together. It consists of several parts, such as, the
bibliographic information, abstract, lists of references or acknowledgements; (ii)
Positioning sets the context of the research presented in the publication. It describes
the situation in which the research issues are considered and the central problem of
the research. (iii) Methods acts as a container for the authors’ response to the
central problem. The model provides three types of possible methods, i.e. experimental,
numerical and theoretical methods. (iv) Results details the results achieved with
the methods previously mentioned. It consists of raw data and the treated results.
(v) Interpretation contains the authors’ interpretation of the results. It usually
deals with the process of interpreting the results and the argumentation of the
plausibility and on the relevance of the interpretation. (vi) Outcome aggregates the
authors’ findings and the leads to further research.</p>
        <p>To connect the above mentioned modules, the model introduces two types of
relations: (i) organizational links, and (ii) scientific discourse relations. The
organizational links provide the reader with the means to easily navigate between the
modules composing the scientific publication. They connect only modules as
entire entities and do not refer to the segments encapsulated in them, which in turn
would identify the content. Harmsze distinguishes six types of organizational links:
hierarchical, proximity, range-based, administrative, sequential and representational.
On the other hand, regarding the links between segments of modules (scientific
discourse relations), the model describes two main categories: relations based on the
communicative function, that have the goal of increasing the reader’s understanding
and maybe acceptance of the publication’s content, and content relations, that allow
the structuring of the information flow within the publication’s content. The first
category is split into: Elucidation, as Clarification and Explanation, and
Argumentation. The second category contains: Dependency in the problem-solving process,
Elaboration, as Resolution and Context, Similarity, Synthesis, as Generalization and
Aggregation, and Causality. Generally, the relations present an implicit polarity and
don’t have attached explicit weights or temporal aspects.</p>
        <p>From the evaluation perspective, the authors performed a preliminary
evaluation of the model, which showed that the model satisfies the purpose for what it
was designed, but in reality, to our knowledge, it was not deployed in an actual
application and consequently it failed to be adopted.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>The Scholarly Ontologies project</title>
        <p>
          A much more focused approach was the one followed by Buckingham Shum et al. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
in the Scholarly Ontologies (ScholOnto) project. They were the first to propose the
decomposition of a scientific publication into elementary discourse knowledge items
and their connection via a set of relations emerged from an established theoretical
foundation, i.e. Cognitive Coherence Relations [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. As opposed to other
representations, Buckingham Shum et al. do not model the coarse-grained rhetorical or linear
structure of the publications, but rather concentrate strictly on organizing the
coherence among the content segments. Their research resulted in a series of tools for
the annotation and visualization of argumentation (the latest with accent on Web
2.0 technologies [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]), that acted as inspiration to other approaches.
        </p>
        <p>The elementary discourse knowledge items introduced by Buckingham Shum’s
model are the atomic nodes, that represent short pieces of text, within the
publication, succinctly summarizing the authors’ contribution. The granularity of these
nodes is left for decision to the author, and thus, can vary from parts of sentences
to blocks of sentences. Nodes can have several types (e.g. Data, Language, Theory),
encoded in the links that connects them. Two such connected nodes form a Claim.
In addition to nodes, the model contains also two kinds of composite elements:
(i) sets that group several nodes sharing a common type (or theme), and (ii) claim
triples formed by linking sets or atomic nodes.</p>
        <p>
          In terms of relations, Buckingham Shum’s Discourse Ontology comprises six
main types: (i) causal links, e.g. predicts, envisages, causes or prevents; (ii) problem
related links, e.g. addresses or solves; (iii) similarity links, e.g. is identical to, is
similar to, or shares issues with; (iv) general links, e.g. is about, improves on, or
impairs; (v) supports / challenges links, e.g. proves, refutes, is evidence for, or agrees
with; (vi) taxonomic links, e.g. part of, example of, or subclass of Each relation has
attached an explicit polarity (positive or negative), and a specific weight. The link’s
polarity denotes explicitly the author’s position in regards to particular statements
present in the related work, similar to Teufel’s approach [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. At the same time,
the weight indicates how strong or weak is the author’s position. For example,
the causal links envisages and causes have both a positive polarity, but different
weights, the former being considered weaker that the latter. Similarly, is unlikely to
affect and prevents have a negative polarity with different weights, again the latter
being considered stronger than the former.
        </p>
        <p>The authors performed an extensive evaluation of their approach, the model
being deployed in several applications developed by the authors, such as,
Compendium 5 and Cohere 6, applications that are currently widely used.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>De Waard’s Model</title>
        <p>
          A different discourse representation model was proposed by de Waard [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. They
started with a rhetorical block structure for scientific publications called ABCDE,
similar to the IMRAD (Introduction, Material and Methods, Results and
Discussion) 7 structure. The title holds the acronym of the five types of blocks present
in the model: (i) Annotations, representing the set of shallow metadata associated
with each publication (usually expressed in DublinCore 8 terms) (ii) Background,
describing the positioning of the current research and the ongoing issues; (iii)
Contribution, describing the work performed by the authors; (iv) Discussion, comparing
the current work to other approaches, including implications and next steps; (v)
Entities, denoting references, personal names, project websites, etc.
        </p>
        <p>
          At a later stage, the authors enriched their model with a fine-grained
representation of the discourse, by identifying seven basic discourse segment types [A]: Fact,
Hypothesis, Goal, Method, Result, Implication, and Problem. These types
correspond to those found by Mizuka et al.[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] using automated techniques based on the
argumentation zoning approach developed by Teufel et al.[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] in a corpus of biology
texts. A first attempt was made to find these segment types computationally [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
3.4
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>The SWAN Ontology</title>
        <p>
          The SWAN (Semantic Web Applications in Neuromedicine) 9 project focuses on
developing a semantically structured framework for representing biomedical discourse.
5 http://compendium.open.ac.uk/institute/
6 http://cohere.open.ac.uk/
7 http://www.uio.no/studier/emner/hf/imk/MEVIT4725/h04/resources/imrad.xml
8 http://dublincore.org/
9 http://swan.mindinformatics.org/ontology.html
SALT (Semantically Annotated LATEX) 12 [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] represents a semantic authoring
framework targeting the enrichment of scientific publications with semantic metadata.
SALT adopts elements from the Rhetorical Structure Theory (RST) [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] with the
goal of modeling discourse knowledge items and their intrinsic coherence relations.
The framework comprises three ontologies: (i) the Document Ontology, modeling
the linear structure of a document, in terms of Sections, Paragraphs or TextChunks
(ii) the Rhetorical Ontology, capturing the rhetorical and argumentation structure
of the publication, and (iii) the Annotation Ontology, that connects the rhetorical
structure present within the document’s content to the actual content of the
document. This ontology acts as a semantic bridge between the other two ontologies
and in addition it re-uses well-known concepts and properties for exposing shallow
metadata from the FOAF vocabulary.
10 http://www.foaf-project.org/
11 http://bibliontology.com/
12 http://salt.semanticauthoring.org/
        </p>
        <p>The Rhetorical Ontology consists of three major sides: (i) rhetorical relations
side that models elementary rhetorical elements (e.g. claims or supports) and the
relations connecting them (e.g. antithesis, circumstance, concession or purpose);
(ii) rhetorical blocks side that provides a coarse-grained structure for modeling
the discourse (e.g. abstract, motivation, background or conclusion); (iii)
argumentation side that captures the argumentation present in the publication via concepts
like Issue, Position or Argument.</p>
        <p>
          Applying SALT on a scientific publication leads to a local instance model
capturing the inter-connected linear, rhetorical and argumentation structures within
that publication. At a later stage, the authors dealt with the global scope of
modeling discourse knowledge items, i.e. items and relations that span across multiple
publications. In [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], following a semiotic approach inspired from Peirce’s direction
of semiotics (the semiotic triangle [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]), the authors introduce a model for
externalizing argumentative discourse networks.
        </p>
        <p>Different aspects of SALT were evaluated in a series of experiments in the last
three years, while the model as a whole was recommended for creating semantic
metadata for scientific publications by different workshops, such as the SemWiki
(Semantic Wiki) workshop series, or the Scripting and Development for the
Semantic Web workshop series.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion: Towards an unified discourse representation</title>
      <p>Fig. 1 presents a concise comparative overview of the five discourse representation
models we have previously described. To make the first steps towards an unified
discourse representation model, we believe that we have to find a proper balance
between the features each of the currently existing models presents. In the following
we will try to sculpt the skeleton of such an unified model, to be later discussed
within the community.</p>
      <p>The first aspect to be considered is the overall structure of the model. By
following a layered approach, such as the one proposed by SWAN and SALT, the unified
model will gain flexibility, which in turn will be reflected in a more straightforward
evolution. This would clearly decouple the rhetoric and argumentation from the
provenance information, and from the shallow metadata and domain knowledge,
while at the same time providing the opportunity for a modular enrichment of the
model as a whole.</p>
      <p>
        The second aspect is the discourse structuring level. To be able to capture the
complete semantics hidden within the discourse, the model needs to address it at
different levels. Consequently, it needs to present both a coarse-grained structure,
as well as, a fine-grained structure. The former can be easily created by adopting a
mixture of rhetorical blocks from de Waard’s model and SALT, or from a different
source, such as Teufel’s zones [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. These blocks would have the role of organizing
the publication’s rhetoric at a high level. The latter structure refers to decomposing
the publication’s content into fine-grained discourse knowledge items, to be
connected via different types rhetorical and argumentation relations. This will lead to
a network of inter-linked elementary items that will externalize the content’s
coherence and argumentation thread. Fortunately, all the presented models contain such
a fine-grained structure, the only difference being the terminology used. We believe
that a core term such as Discourse element, with an underlying synonymy to claim
or hypothesis should be easily acceptable.
      </p>
      <p>Fig. 1. Comparative overview of the five analyzed discourse representation approaches</p>
      <p>Another remaining open question is the set of relations used to connect the
elementary discourse knowledge items, as this is the point where the divergence
between the existing models is the biggest. Having a closer look at the five sets
of relations, we observe two distinct tendencies which can lead to a common
denominator. On one side we have a mixture of cognitive coherent and argumentative
relations (in the Scholarly Ontologies project, SWAN and Harmsze), while on the
other side we have a more linguistic approach materialized in the rhetorical relations
used by SALT. Both directions can be used in a complementary fashion. After a
refinement of the rhetorical relations, we envision a co-existence of both sets, one
modeling the argumentative support of the discourse, while the other capturing the
coherence and rationale of the argumentation.</p>
      <p>From the properties that relations can carry, we believe that polarity should
be featured in the unified model, as it is extremely useful both for analysis and
visualization of the discourse. The relations’ weights are dependent on the extraction
mechanism, and therefore should be defined by the corresponding approach and
not included in the model, as such discrete quantifiers do not really provide a direct
added value for an author / reader. The model also needs to contain the provenance
information in addition to the shallow metadata describing the authorship and
references.</p>
      <p>Finally, the most important “non-functional” element to be considered when
designing such an unified model is the adoption from the existing models of the
lessons learned with regards to evaluation and uptake. The practical evaluation of
the features to be selected for the model should play a crucial role in the overall
design. Consequently, the resulting framework needs not only to be elegant and
to satisfy all the requirements of a proper formal externalization, but also to be
attractive for the average Web user. Contrarily, it will fail to achieve an appropriate
community uptake and will remain just an elegant model on paper.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper we presented a succinct overview of five of the existing discourse
representation models: de Waard’s and Harmsze’s models, ScholOnto, SWAN and
SALT. In addition, we have also made a brief comparative analysis of their main
features in terms of organizational structure, types and attributes of the relations
between the discourse knowledge items and openness to complementary models or
domain knowledge.</p>
      <p>Starting from the guidelines proposed in our discussion, we intend to pursue
our goal of designing an unified discourse representation model. The next steps will
include a series of open discussions with the members of our community and the
creation and exposure of a common model, achieved via a shared agreement and
understanding.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>The work presented in this paper has been funded by Science Foundation Ireland
under Grant No. SFI/08/CE/I1380 (Lion-2).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Nonaka</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Takeuchi</surname>
          </string-name>
          , H.:
          <article-title>The Knowledge-Creating Company: How Japanese Companies Create the Dynamics of Innovation</article-title>
          . Oxford University Press (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kunz</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rittel</surname>
          </string-name>
          , H.:
          <article-title>Issues as elements of information system</article-title>
          .
          <source>Working paper 131</source>
          , Institute of Urban and Regional Development, University of California (
          <year>1970</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Sanders</surname>
            ,
            <given-names>T.J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spooren</surname>
            ,
            <given-names>W.P.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noordman</surname>
            ,
            <given-names>L.G.M.</given-names>
          </string-name>
          :
          <article-title>Coherence Relations in a Cognitive Theory of Discourse Representation</article-title>
          .
          <source>Cognitive Linguistics</source>
          <volume>4</volume>
          (
          <issue>2</issue>
          ) (
          <year>1993</year>
          )
          <fpage>93</fpage>
          -
          <lpage>133</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mann</surname>
            ,
            <given-names>W.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          :
          <article-title>Rhetorical Structure Theory: A theory of text organization</article-title>
          .
          <source>Technical Report RS-87-190</source>
          , Information Science Institute (
          <year>1987</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hovy</surname>
          </string-name>
          , E.:
          <article-title>Automated discourse generation using discourse structure relations</article-title>
          .
          <source>Artificial Intelligence</source>
          (
          <volume>63</volume>
          ) (
          <year>1993</year>
          )
          <fpage>341</fpage>
          -
          <lpage>385</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ciccarese</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ocana</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kinoshita</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The SWAN biomedical discourse ontology</article-title>
          .
          <source>J. of Biomedical Informatics</source>
          <volume>41</volume>
          (
          <issue>5</issue>
          ) (
          <year>2008</year>
          )
          <fpage>739</fpage>
          -
          <lpage>751</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Groza</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Handschuh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , M¨oller,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Decker</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.:</surname>
          </string-name>
          <article-title>SALT - Semantically Annotated LATEX for Scientific Publications</article-title>
          .
          <source>In: Proceedings of the 4th European Semantic Web Conference (ESWC</source>
          <year>2007</year>
          ), Innsbruck, Austria (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Mancini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shum</surname>
            ,
            <given-names>S.J.B.</given-names>
          </string-name>
          :
          <article-title>Modelling discourse in contested domains: A semiotic and cognitive framework</article-title>
          .
          <source>International Journal of Human-Computer Studies</source>
          <volume>64</volume>
          (
          <issue>11</issue>
          ) (
          <year>2006</year>
          )
          <fpage>1154</fpage>
          -
          <lpage>1171</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Harmsze</surname>
            ,
            <given-names>F.A.P.:</given-names>
          </string-name>
          <article-title>A modular structure for scientific articles in an electronic environment</article-title>
          .
          <source>PhD thesis</source>
          , University of Amsterdam (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>de Waard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tel</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>The ABCDE format - enabling semantic conference proceeding</article-title>
          .
          <source>In: Proceedings of 1st Workshop:</source>
          ”
          <fpage>SemWiki2006</fpage>
          - From Wiki to Semantics”, Budva,
          <string-name>
            <surname>Montenegro.</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Teufel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moens</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>28</volume>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Mizuka</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Korhonen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mullen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Collier</surname>
          </string-name>
          , N.:
          <article-title>Zone analysis in biology articles as a basis for information extraction</article-title>
          .
          <source>International Journal of Medical Informatics</source>
          <volume>75</volume>
          (
          <year>2006</year>
          )
          <fpage>468</fpage>
          -
          <lpage>487</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Lisacek</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chichestera</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaplan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sandor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Discovering Paradigm Shift Patterns in Biomedical Abstracts: Application to Neurodegenerative Diseases</article-title>
          .
          <source>In: Proceedings of the 1st International Symposium on Semantic Mining in Biomedicine (SMBM)</source>
          .
          <article-title>(</article-title>
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Shum</surname>
            ,
            <given-names>S.J.B.</given-names>
          </string-name>
          :
          <source>Cohere: Towards Web 2.0 Argumentation. In: Proceedings of 2nd International Conference on Computational Models of Argument</source>
          , IOS Press (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Teufel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carletta</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moens</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>An annotation scheme for discourse-level argumentation in research articles</article-title>
          .
          <source>In: Proc. of the 9th Conf. on European Chapter of the ACL</source>
          , Morristown, NJ, USA, ACL (
          <year>1999</year>
          )
          <fpage>110</fpage>
          -
          <lpage>117</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>de Waard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buitelaar</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eigner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Identifying the Epistemic Value of Discourse Segments in Biology Texts</article-title>
          .
          <source>In: Proc. of the 8th Int. Conf. on Computational Semantics (IWCS-8</source>
          <year>2009</year>
          ).
          <article-title>(</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Groza</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , M¨oller,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Handschuh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Trif</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Decker</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.:</surname>
          </string-name>
          <article-title>SALT: Weaving the claim web</article-title>
          .
          <source>In: Proceedings of the 6th International Semantic Web Conference (ISWC</source>
          <year>2007</year>
          ), Busan, Korea (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Ogden</surname>
            ,
            <given-names>C.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richards</surname>
            ,
            <given-names>I.A.</given-names>
          </string-name>
          :
          <article-title>The Meaning of Meaning: A Study of the Influence of Language upon Thought and of the Science of Symbolism</article-title>
          . Magdalene College, University of Cambridge (
          <year>1923</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>