<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Daniela Baiamonte</string-name>
          <email>daniela.baiamonte01@uni</email>
          <email>daniela.baiamonte01@uni versitadipavia.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommaso Caselli Irina Prodanof</string-name>
          <email>irina.prodanof@gmail.com</email>
          <email>t.caselli@vu.nl</email>
          <email>t.caselli@vu.nl irina.prodanof@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Pavia</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>VU University Amsterdam University of Pavia</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. This paper presents a methodology for the annotation of the semantic and functional components of news articles (Content Zones, henceforth CZs). We distinguish between narrative and descriptive zones and, within them, among finergrained units contributing to the overall communicative purpose of the text. Furthermore, we show that the segmentation in CZs could provide valuable cues for the recognition of time relations between events.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Italiano. In questo lavoro viene
presentata una metodologia per l’annotazione
delle componenti semantiche e
funzionali del testo giornalistico (Zone di
Contenuto). Distinguiamo tra zone narrative
e descrittive e, al loro interno, tra
ulteriori unità che contribuiscono al
dispiegamento dello scopo comunicativo del testo.
Inoltre, mostriamo che la segmentazione
in Zone di Contenuto offre preziosi indizi
per il riconoscimento delle relazioni
temporali tra eventi.</p>
    </sec>
    <sec id="sec-2">
      <title>1 Introduction</title>
      <p>The logical structure of a document, i.e. its
hierarchical arrangement in sections, paragraphs,
sentences and the like, reflects a functional
organization of the information flow and creates
expectations on where the desired information may be
located. As it is often the case, however, breakups in
sections and paragraphs are motivated by style or
even arbitrary choices.</p>
      <p>
        The segmentation of the text in Content Zones
(CZs, henceforth), i.e. functional categories
contributing to the overall message or purpose, as
induced by the genre of the text1, provides more
reliable and fine-grained cues to access the
structure of its types of functional content. Previous
attempts to annotate CZs have mainly focused
on highly standardized texts like scientific articles
        <xref ref-type="bibr" rid="ref10 ref11 ref16 ref23">(Teufel et al., 2009; Liakata et al., 2012; Liakata
and Teufel et al., 2010)</xref>
        and scheduling dialogues
        <xref ref-type="bibr" rid="ref22">(Taboada and Lavid, 2003)</xref>
        , or on semi-structured
texts like film reviews
        <xref ref-type="bibr" rid="ref21 ref5">(Bieler et al., 2007; Taboada
et al., 2009)</xref>
        . Other work
        <xref ref-type="bibr" rid="ref13 ref14 ref15">(Palmer and Friedrich,
2014; Mavridou et al., 2015)</xref>
        adopts the theory of
discourse modes
        <xref ref-type="bibr" rid="ref19">(Smith, 2003)</xref>
        to distinguish
between the different types of text passages in a text
document.
      </p>
      <p>
        To the best of our knowledge, no efforts have
been undertaken to devise an annotation scheme
targeting the functional structure of news articles
in terms of their content: the inverted pyramid
structure, i.e. the gathering of key details at the
beginning, followed by supporting information in
order of diminishing importance, is too
coarsegrained to be effectively used for information
extraction purposes. Our hypothesis is that modeling
the document’s content via CZs could yield
benefits for high-level NLP tasks such as Temporal
Processing, Summarization, Question-Answering,
among others. In addition to this, CZs qualify as
a higher-level analysis of a text/discourse which
captures different information with respect to
Discourse Relations. The remainder of the paper is
structured as follows: in Section 2 the motivations
of this work are presented, together with related
studies. Section 3 reports on our inventory of CZs,
used to annotate a corpus of English news
articles. Details on the corpus are provided in
Section 4. In Section 5, we describe a case-study
on the correlation between CZs and temporal
re1We adopt Systemic Functional Linguistics’ view of
genre as “a staged, goal oriented, purposeful activity in
which speakers engage as members of our culture"
        <xref ref-type="bibr" rid="ref12">(Martin,
1984:25)</xref>
        .
lations to show that the segmentation in CZs can
provide cues in recognizing temporal relations
between events. Finally, Section 6 draws on
conclusion and suggests directions for future work.
two macro CZs is further divided into more
finegrained categories.
      </p>
      <p>The class NARRATION (NARR) includes the
following zones:
2</p>
    </sec>
    <sec id="sec-3">
      <title>Motivations and related work</title>
      <p>
        The bulk of the work on discourse structures has
focused on low-level structures corresponding to
Discourse Relations holding between textual
segments pairs. CZs take a different view on texts,
as they perform a function towards the text as
a whole. As an instance of a particular genre,
every text is meant to accomplish a
culturallyestablished communicative purpose, e.g. a news
article reports on events happening in the world.
This goal is not accomplished all at once:
separate functional stages (i.e. CZs) convey fragments
of its overall meaning
        <xref ref-type="bibr" rid="ref7">(Eggins and Martin, 1997)</xref>
        .
Therefore, the knowledge about the typical
functional structure of genres can be exploited to
predict the internal organization of a text. This kind
of information can be of help to produce balanced
summaries or to select the passages most likely to
contain the answer to a question.
      </p>
      <p>
        <xref ref-type="bibr" rid="ref23">Teufel et al. (2009)</xref>
        and
        <xref ref-type="bibr" rid="ref11">Liakata et al. (2010)</xref>
        ’s
works present two complementary perspectives
on scientific papers: the former models their
argumentative/rhetorical structure (following the
knowledge claims made by the authors); the
latter treats them as the humanly readable
representations of scientific investigations. In the works of
        <xref ref-type="bibr" rid="ref5">Bieler et al. (2007)</xref>
        and
        <xref ref-type="bibr" rid="ref21">Taboada et al. (2009)</xref>
        , two
different kinds of zones are recognized in film
reviews: formal zones (required by the genre, e.g.
credits and cast) and functional zones (reflecting
the abstract functions of describing and
commenting).
      </p>
      <p>
        In the elaboration of news articles’ CZs, we
were mostly inspired by
        <xref ref-type="bibr" rid="ref9">Labov (2013)</xref>
        ’s study of
oral narratives of personal experiences and by
        <xref ref-type="bibr" rid="ref4">Bell
(1991)</xref>
        ’s analysis of the structure of news stories.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>Annotation Schema</title>
      <p>
        The opposition between dynamicity and staticity,
mainly realized by grammatical and lexical aspect,
is adopted as the basic parameter for
differentiating between two macro CZs: NARRATION and
DESCRIPTION. The former is aimed at reporting
temporally interrelated (dynamic) events, the latter
is used to comment by focusing on selected
entities, properties, and states of affairs. Each of these
Foreground (FGR): text span containing
the most salient events, i.e. those in the
focus of attention
        <xref ref-type="bibr" rid="ref6">(as intended by Boguraev
and Kennedy, 1999)</xref>
        . The information it
conveys is both referentially and relationally new
        <xref ref-type="bibr" rid="ref8">(Gundel and Fretheim, 2005)</xref>
        , as it is usually
mentioned at the beginning of the article.
Background (BGR): ancillary,
referentially and relationally old information
performing an explanatory function (through
causal and temporal precedence relations)
towards FGRs.
      </p>
      <p>Follow-up (FUP): reactions and
consequences to FGR events (to whom they’re
related through cause-effect and temporal
succession relations), i.e. relationally new
information moving the discourse forward.</p>
      <p>Expectation (EXP): assumptions and
probable or possible outcomes, i.e. non
factual information (e.g. conditionals,
modality).</p>
      <p>The class DESCRIPTION (DSCR) includes the
following zones:</p>
      <p>Description (DES): characteristics of
a person or an object, customary
circumstances, or states of affairs.</p>
      <p>Evaluation (EVL): subjective
descriptions, explicit judgements showing the author
or some other agent’s attitude towards a
target.</p>
      <p>In addition, a third macro-class is posited, OTHER
(OTHR), containing categories performing
auxiliary functions towards the other CZs:</p>
      <p>
        Attribution (ATT): text span containing
the source and, if present, the cue of an
attribution
        <xref ref-type="bibr" rid="ref16">(as intended by Pareti and Prodanof,
2010)</xref>
        - while the content is assigned the
relevant CZ(s).
      </p>
      <p>Metatext (MTX): text span guiding the
reader’s attention towards metatextual
elements like figures or tables.</p>
      <p>Interrogative (INT): questions directly
addressed to the reader, e.g. to introduce a
new topic or to prompt a reaction.</p>
      <p>Tense
PRESENT
PAST
FUTURE
PRESPART
PASTPART
NONE
46
21
6
2
6</p>
      <sec id="sec-4-1">
        <title>INFINITIVE 32 51 26 18</title>
        <p>Major approaches to functional discourse
structuring adopt the sentence or the paragraph as unit of
annotation. On the other hand, we have opted for
a clause level annotation as this allows us to
better deal with news articles’ high level of
information density. Although CZs are conceptually
nonoverlapping, empirical analysis indicates that an
annotation unit may fit into more than one
category, that is a clause may represent complex
contents. Cases as such suggest that the more
informative content should be preferred. In the
example below, the tag ATT is assigned, even though a
descriptive content may as well be recognized.
1. [On an office wall of the Senate intelligence
committee hangs a quote from Chairman
David Boren,]AT T {PDTB2, wsj_0771}
The annotation of CZs is further complicated
by the fact the distribution of the zones does not
follow the linear order of the text. In most cases,
CZs are discontinuous, that is either their
contiguity may be “broken” by the presence of other CZs
or the same CZ may appear in different sentences
along the entire document (see example ?? for the
FGR zone).</p>
        <p>2. [South Korea registered a trade deficit of
$101 million in October,]F GR [reflecting
the country’s economic sluggishness,]EV L
[according to government figures released
Wednesday.]AT T [Preliminary tallies by the
Trade and Industry Ministry showed
another trade deficit in October, the fifth
monthly setback this year,]F GR [casting
a cloud on South Korea’s export-oriented
economy.]EV L {PDTB, wsj_0011}</p>
        <p>In other cases, due to the use of the clause as
minimal annotation span of a CZ, nested CZs may
occurr (see example ??).</p>
        <p>3. [South Korea’s economic boom, [which
began in 1986,]BGR stopped this year
because of prolonged labor disputes, trade
conflicts and sluggish exports.]BGR {PDTB,
wsj_0011}
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Description of the corpus</title>
      <p>
        We used the CZs annotation schema and the
annotation tool CAT
        <xref ref-type="bibr" rid="ref3">(Bartalesi Lenzi et al., 2012)</xref>
        to construct a small corpus of 57 news articles
      </p>
      <sec id="sec-5-1">
        <title>2Penn Discourse TreeBank (Prasad et al., 2008).</title>
        <p>
          (20 from the test section of TempEval-3
          <xref ref-type="bibr" rid="ref24">(UzZaman et al., 2013)</xref>
          , 20 shared between the PDTB
          <xref ref-type="bibr" rid="ref17">(Prasad et al., 2008)</xref>
          and the training section of the
TimeBank
          <xref ref-type="bibr" rid="ref18">(Pustejovsky et al., 2003)</xref>
          , 17 from the
PDTB). The corpus contains 2059 annotation units
and it is dominated by narrative sections (57%).
Within them, the most frequent CZ is the BGR
(26.5%), followed by FGR (12.4%), EXP (9.6%)
and FUP (8.4%). These figures show that news
articles mostly consist of redundant information,
only mentioned in order to help the reader to
anchor the new data to the prior knowledge.
Descriptive sections constitute the 25.5% of the
corpus: EVLs are slightly more frequent than DESs
(14.8% vs. 8.9%) — contradicting the alleged
objectivity expected in news reports (note,
however, that EVLs tend to occur in association with
ATTs). As to the OTHER macro CZ, it makes up
the 17.4% of the corpus: this percentage almost
entirely refers to ATTs, since MTXs and INTs are
only marginal zones (0.19% and 0.33%,
respectively).
        </p>
        <p>To test our hypotheses about some formal
properties of CZs, we carried out a corpus study. The
results are reported below.</p>
        <p>Position in the text. 71.7% of FGRs are
located in the opening sections of the articles and
their occurrence decreases towards the central
(18.4%) and closing sections (9.8%). BGRs show
a fairly complementary distribution to FGRs, as
they mostly occupy the central (31.6%) and
closing sections (27.3%) of the articles. As expected,
ATTs are quite evenly distributed among the three
sections. The remaining CZs do not show any
clear-cut tendencies.</p>
        <p>Verbal tenses. Table ?? shows the distribution
target
of verbal tenses, as annotated in the TimeBank
corpus, among CZs. BGRs and ATTs are
dominated by the past tense, this is in accordance with
our expectations as the former is characterized
by temporal precedence relations to FGR events
and the second mostly contains events of saying.
CZs belonging to the DSCR class are significantly
dominated by the present tense, usually associated
with imperfective aspect and staticity. The high
frequency of present tenses in FGRs and BGRs
doesn’t necessarily defy our expectations, since
FGRs contain both dynamic and static events and
the tag PRESENT is also used to refer to instances
of present perfect.</p>
        <p>Modality markers. The majority of modality
markers is located in EXPs and, more broadly, in
the narrative CZs, as shown in Figure ??. In the
TimeBank corpus, the MODALITY tag is mostly
assigned to modal auxiliaries, we believe that the
annotation of modal adverbs would further raise
the percentages observed in EXPs and in the NARR
class.</p>
        <p>Pronouns. Looking at Figure ?? we can see
that almost 50% of all pronouns is located in
BGRs. The percentages are consistent with our
expectations as BGRs convey referentially old
information and, although FUPs and EXPs elaborate
on FGR events, they often introduce new referents.
Note that the distribution of pronouns is not, alone,
a sufficient indicator of referential oldness since
also lexical and zero anaphoras should be taken
into account.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5 Interactions between CZs and time relations</title>
      <p>In news articles events are not iconically presented
in the linear order of their real succession, this
poses a challenge to systems aimed at
uncover43:93
47:32
0 20 30 40 50</p>
      <p>Percentage (%)
Figure 1: Distribution of modality markers and
pronouns among CZs.
ing their temporal event structure. Therefore, we
used the annotations available for the TimeBank
section of the corpus to check whether some
connections between CZs and temporal relations
between event pairs exist. The full set of temporal
relations specified in TimeML contains 14 types
of relations: BEFORE, AFTER, IBEFORE, IAFTER,
BEGINS, BEGUN_BY, ENDS, ENDED_BY,
DURING, DURING_INV, INCLUDES, IS_INCLUDED,
SIMULTANEOUS and IDENTITY. We simplified
the set as follows: the relation types that invert
each other were collapsed into a single type; given
the low frequency of the relation type IBEFORE,
it was mapped to the corresponding more
coarsegrained type BEFORE.</p>
      <p>Given the narrative shape of news articles, the
corpus is considerably dominated by precedence
source - target
(BEFORE) and succession (AFTER) relations.
Table ?? shows that the majority of temporal
relations holds between events belonging to the same
CZ types: events tend to precede, include,
occur during, begin, end, be simultaneous to and
anaphorically evoke (through TimeML IDENTITY
temporal relations) other events belonging to the
same zone.</p>
      <p>FGR events precede rather than follow ATT,
FUP and EXP events. BGR events, the most
involved in BEFORE relations, tend to precede other
events, especially if located in ATTs and FGRs.
Unexpected outcomes mostly occur in cases like
the following, where the FGR event precedes the
BGR one. This is because conflicting contents
may be expressed in the same unit (in this case
a reaction to the FGR event and the list of its
premises):
4. [Delta Air Lines earnings soared to 33%
to a record in the fiscal first quarter,]F GR
[bucking the industry trend toward declining
profits.]F UP [The Atlanta-based airline, the
third largest in the U.S., attributed the
increase to higher passenger traffic, new
international routes and reduced service by Rival
Eastern Airlines...]BGR {PDTB, wsj_1011}
As highlighted in Table ??, NARR events begin or
end other NARR or DSCR events (more specifically,
these relations hold between events belonging to
instances of the same CZ) and DSCR events
include (rather than being included in) other events.</p>
      <p>IDENTITY relations mostly involve FGRs: as
6</p>
    </sec>
    <sec id="sec-7">
      <title>Conclusions and future work</title>
      <p>We have developed an inventory of zone labels
for the genre news article and shown that the
so-generated content structure could help
narrowing down the range of time relations connecting
events.</p>
      <p>
        Future work would involve testing the
stability and reproducibility of the annotation scheme
through the measurement of inter-annotator
agreement and elaborating a separate annotation
scheme for editorials, whose argumentative style
reflects different structuring principles than those
acting in news reports. Finally, we would like
to automatize the process of annotation and test
the effectiveness of the approach in texts
belonging to different genres, e.g. novels
        <xref ref-type="bibr" rid="ref14 ref15">(Ouyang and
McKeown, 2014)</xref>
        and historical essays. Even the
basic distinction between narrative and
descriptive zones could facilitate the performance of more
complex NLP tasks by targeting the relevant
informational zones. The corpus and the annotation
guidelines are publicly available3.
      </p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgment</title>
      <p>This has been partially supported by the Erasmus
+ Traineeship Program 2015/2016 from
University of Pavia and the NWO Spinoza Prize project
Understanding Language by Machines (sub-track
3).</p>
      <p>3https://github.com/cltl/ContentZones.
git</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Baiamonte</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Annotazione di Zone di Contenuto: una strutturazione funzionale del testo giornalistico</article-title>
          .
          <source>Thesis of the Master in Theoretical and Applied Linguistics</source>
          . University of Pavia, Pavia.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Bärenfänger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hilbert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lobin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lüngen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puskás</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2006</year>
          .
          <article-title>Cues and constraints for the relational discourse analysis of complex text types - the role of logical and generic document structure</article-title>
          . Sidner C.L.,
          <string-name>
            <surname>Harpur</surname>
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Benz</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kühnlein</surname>
            <given-names>P</given-names>
          </string-name>
          . (eds.),
          <source>Proceedings of the Workshop on Constraints in Discourse. Maynooth, Ireland</source>
          .
          <fpage>27</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Bartalesi</given-names>
            <surname>Lenzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Moretti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Sprugnoli</surname>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          <year>2012</year>
          .
          <article-title>CAT: the CELCT Annotation Tool</article-title>
          .
          <source>Proceedings of LREC 2012</source>
          . Istanbul.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Bell</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>1991</year>
          .
          <article-title>The Language of News Media</article-title>
          . Blackwell Publishers, Oxford.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Bieler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dipper</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stede</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>Identifying Formal and Functional Zones in Film Reviews</article-title>
          . Keizer S.,
          <string-name>
            <surname>Bunt</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paek</surname>
            <given-names>T</given-names>
          </string-name>
          . (eds.),
          <source>Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue</source>
          . Antwerp, Belgium.
          <fpage>75</fpage>
          -
          <lpage>78</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Boguraev</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kennedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>1999</year>
          .
          <article-title>Salience-based content characterisation of text documents</article-title>
          . Inderjeet M. and
          <string-name>
            <surname>Maybury M. T.</surname>
          </string-name>
          (eds.),
          <source>Advances in Automatic Text Summarization</source>
          . MIT Press, Cambridge, MA.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Eggins</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Martin</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          <year>1997</year>
          .
          <article-title>Genres and registers of discourse</article-title>
          . van Dijk T. (ed.),
          <string-name>
            <given-names>Discourse</given-names>
            <surname>Studies</surname>
          </string-name>
          .
          <article-title>Discourse as structure and process</article-title>
          , volume
          <volume>1</volume>
          .
          <string-name>
            <surname>Sage</surname>
          </string-name>
          , London (UK) and Thousand Oaks (CA).
          <volume>230</volume>
          -
          <fpage>257</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Gundel</surname>
            ,
            <given-names>J. K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Fretheim</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <year>2005</year>
          .
          <article-title>Topic and Focus</article-title>
          . Horn L. and
          <string-name>
            <surname>Ward</surname>
            <given-names>G</given-names>
          </string-name>
          . (eds.),
          <source>The Handbook of Pragmatics. Blackwell Publishing</source>
          ,
          <fpage>175</fpage>
          -
          <lpage>196</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Labov</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>The Language of Life and Death</article-title>
          . Cambridge University Press, Cambridge, UK.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Liakata</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dobnik</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batchelor</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rebholz-Schuhmann</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2012</year>
          .
          <article-title>Automatic recognition of conceptualization zones in scientific articles and two life science applications</article-title>
          .
          <source>Bioinformatics</source>
          <year>2012</year>
          , volume
          <volume>28</volume>
          .
          <fpage>991</fpage>
          -
          <lpage>1000</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Liakata</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teufel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siddharthan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batchelor</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>Corpora for conceptualisation and zoning of scientific papers</article-title>
          .
          <source>Proceedings of the 7th conference on Internation Language Resource and Evaluation (LREC10).</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Martin</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          <year>1984</year>
          .
          <article-title>Language, register and genre</article-title>
          . Christie F. (ed.),
          <article-title>Language studies: Children writing</article-title>
          . Reader. Deakin University Press, Geelong, Australia.
          <fpage>21</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Mavridou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peate</surname>
            <given-names>Sørensen</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Palmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            , and
            <surname>Pinkal</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2015</year>
          .
          <article-title>Linking discourse modes and situation entity types in a crosslinguistic corpus study</article-title>
          .
          <source>September</source>
          <year>2015</year>
          .
          <source>In Proceedings of Linking Models of Lexical</source>
          .
          <article-title>Sentential and Discourse-level Semantics (LSDSem)</article-title>
          . , Lisbon, Portugal.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Ouyang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>McKeown</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Towards automatic detection of narrative structure</article-title>
          .
          <source>Proceedings of LREC14</source>
          , Reykjavik, Iceland.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Palmer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Genre distinctions and discourse modes: Text types differ in their situation type distributions</article-title>
          .
          <source>Proceedings of the Workshop on Frontiers and Connections between Argumentation Theory and Natural Language Processing</source>
          . Forlí-Cesena, Italy.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Pareti</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Prodanof</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>Annotating Attribution Relations: Towards an Italian Discourse Treebank</article-title>
          .
          <source>Proceedings of LREC10.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Prasad</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinesh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miltsakaki</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robaldo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joshi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Webber</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>The Penn Discourse TreeBank 2.0</article-title>
          .
          <source>Proceedings of the 6th International Conference on Language Resources</source>
          and
          <string-name>
            <surname>Evaluation (LREC). Marrakech</surname>
          </string-name>
          , Morocco.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Pustejovsky</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanks</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sauri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>See</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Day</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaizauskas</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lazo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setzerr</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sundheim</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>The TimeBank Corpus</article-title>
          .
          <source>Corpus Linguistics</source>
          .
          <fpage>647</fpage>
          -
          <lpage>56</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>S. Carlota</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>Modes of discourse: The local structure of texts</article-title>
          . Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Stede</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2011</year>
          .
          <string-name>
            <given-names>Discourse</given-names>
            <surname>Processing</surname>
          </string-name>
          . Morgan &amp; Claypool Publishers.
          <fpage>7</fpage>
          -
          <lpage>38</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Taboada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brooke</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stede</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Genre based paragraph classification for sentiment analysis</article-title>
          .
          <source>Proceedings of SIGDIAL</source>
          <year>2009</year>
          :
          <article-title>the 10th Annual Meeting of the Special Interest Group in Discourse and Dialogue</article-title>
          . Queen Mary University of London. 62-
          <fpage>70</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Taboada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lavid</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>Rhetorical and Thematic Patterns in Scheduling Dialogues: A Generic Characterization</article-title>
          .
          <source>Functions of Language</source>
          ,
          <volume>10</volume>
          (
          <issue>2</issue>
          ).
          <fpage>147</fpage>
          -
          <lpage>179</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Teufel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siddharthan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batchelor</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Towards discipline-independent argumentative zoning: Evidence from chemistry and computational linguistics</article-title>
          .
          <source>Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing</source>
          ,
          <string-name>
            <surname>EMNLP</surname>
          </string-name>
          <year>2009</year>
          . Singapore.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>UzZaman</surname>
          </string-name>
          , N.,
          <string-name>
            <surname>Llorens</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Derczynski</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Allen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Verhagen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pustejovsky</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2013</year>
          . SemEval-2013 Task 1:
          <fpage>TempEval</fpage>
          -3:
          <string-name>
            <given-names>Evaluating</given-names>
            <surname>Time</surname>
          </string-name>
          <string-name>
            <surname>Expressions</surname>
          </string-name>
          , Events, and
          <string-name>
            <given-names>Temporal</given-names>
            <surname>Relations</surname>
          </string-name>
          .
          <source>Second Joint Conference on Lexical and Computational Semantics (*SEM)</source>
          , Volume
          <volume>2</volume>
          :
          <source>Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval</source>
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>