<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Approach for Agile Interdisciplinary Digital Humanities Research - a Case Study in Journalism</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eetu Mäkelä</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anu Koivunen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antti Kanner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maciej Janicki</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Auli Harju</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julius Hokkanen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olli Seuri</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Digital Humanities University of Helsinki</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty of Information Technology and Communication Studies Tampere University</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Faculty of Social Sciences Tampere University</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>4</fpage>
      <lpage>14</lpage>
      <abstract>
        <p>In this paper, we present insights into how a research process facilitating fluent interdisciplinary collaboration was developed for a project joining together 1) computer scientists, 2) linguists and 3) media scholars. In lieu of describing the actual results from our analyses in the project, we instead describe our approach, and how it led into a versatile general template for an iterative and discursive approach to digital humanities research, which moves toward questions of interest both fast, as well as with high capability to truly capture the phenomena from the viewpoints of interest.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Despite an opening up of large primary source datasets, wide-scale research done
on such data has not permeated the core disciplines of humanities or social
science [
        <xref ref-type="bibr" rid="ref17 ref23 ref24 ref6">6,17,23,24</xref>
        ]. We argue that one reason for this is that a gap exists between
what is in the data, or what can be produced through established automated
means (e.g. word counting, topic modelling or sentiment analysis), and the
nuanced human categories of interest [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Reaching more meaningful conclusions
requires developing novel analysis methods specifically tuned for both the data
as well as the questions asked of it. Developing such is however by no means
easy, and requires input from all participating disciplines.
      </p>
      <p>
        Drawing on scholarship on cross-disciplinary work cultures and practices [
        <xref ref-type="bibr" rid="ref12 ref13 ref7">7,13,12</xref>
        ],
we argue that the usual means for collaboration (e.g. user group workshops
for requirements and testing, intermediate communication between disciplinary
workgroups) are not enough to obtain a good result. What is needed instead
is constant, integrated, iterative collaboration between the computer scientists
developing the methods and processes, and the humanities and social science
end-users using them for actual research.
      </p>
      <p>In this paper, we present insights into how a research process facilitating
fluent interdisciplinary collaboration was developed for a project joining together
1) computer scientists, 2) linguists and 3) media scholars. Concretely, the work
relates to a pilot study in the project Flows of Power: media as site and agent
of politics4 (Academy of Finland 2019–2022). As a whole, the project
investigates the agency of journalistic media in the flows of information, public opinion
and power. Through a large-scale empirical analysis of Finnish news journalism
between 1998 and 2018, the project explores the strategies of journalistic news
media in staging and managing political processes, as well as seeks to find out if
they have changed with the advent of social media.</p>
      <p>While the full time scale of the project is vast, for our first pilot we decided to
focus on a more tightly-defined case study: how was afectivity mediated,
modulated and managed by the media in the reporting of, and commentary on, a
particular yearlong political conflict between the right-wing conservative
government and the trade unions in Finland (2015–2016). Focusing on the government’s
goal to increase the competitiveness of the Finnish economy by lowering labour
costs, this yearlong, multi-peak process was interesting, because while it was
centred around a single issue, it also had enough duration and variability to be
on the upper boundary for manual research, and thus amenable to benefit from
scalable computational means of analysis.</p>
      <p>
        On the other hand, in choosing the management of afectivity (or
emotiveness) as our object of interest, we were setting the bar quite high. Journalism
in general has an innate culture of seeking to project an appearance of balance
and objectivity, of dealing with issues fairly and without bias [
        <xref ref-type="bibr" rid="ref19 ref4">19,4</xref>
        ].
According to Tuchman [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], this objectivity norm manifests so strongly that it can be
described as a ’strategic ritual’, whereby journalists utilise various strategies to
appear dispassionate, disembodied and impartial. Given such an environment, it
was clear that we could not use for example of-the-self approaches to sentiment
analysis to obtain a credible and nuanced analysis of the phenomena in question.
Instead, we needed to work closely together between the domain experts,
linguists and computer scientists to derive new indicators for capturing the unique
ways by which afect was expressed in this domain. Further, we also needed
to develop tools to identify the ways by which this afect was then mediated,
modulated and displaced in the service of the ritual of objectivity.
      </p>
      <p>In the following, we will not describe the actual results from our analysis, but
instead describe our approach in this pilot study, and how it led into a versatile
general template for an iterative and discursive approach to digital humanities
research, which moves toward questions of interest both fast, as well as with high
capability to truly capture the phenomena from the viewpoints of interest.</p>
      <sec id="sec-1-1">
        <title>4 http://flopo.rahtiapp.fi/</title>
        <p>5/143</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Facilitating Discussion in a Shared Space</title>
      <p>At the core of our approach is to, as early as possible, build a common
environment where all partners of the project can not only view, but highlight to
each other and discuss what is interesting in the data from their perspective. By
doing this, for example, humanities scholars are able to highlight to the
computer scientists new phenomena of interest in the data derived from their close
reading, while the computer scientists can show what they are currently
automatically able to bring forth from the data. Through this, everyone is kept on
the same page, misunderstandings are avoided, and the most fruitful avenues
for development can be negotiated in a shared space where everyone contributes
equally.</p>
      <p>
        In the FLOPO project, this began with making use of existing tools. First,
a Slack chat space was established for discussion amongst project participants.
Second, the whole 20 years of FLOPO data from three diferent news sources was
loaded into Octavo [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], a custom-built Lucene index for rich queries, which also
includes a web user interface for easy end-user access to query results. Crucially,
the state of the query and the interface are serialised in the URL, facilitating
easy passing of a particular view between the project participants through Slack
messages.
      </p>
      <p>In deciding what to focus on for the pilot case study, this interface was
used for qualitatively verifying the interestingness of the material for analysis.
Further, through passing query state URLs back and forth between the media
scholars and the computer scientists, the Octavo interface was used to iteratively
develop and verify the query constraints used to extract the relevant subset of
competitiveness pact -related articles for further study.</p>
      <p>Soon after, a repeatable pipeline was designed which downloaded the results
of a given Octavo query and converted them into plain text and CSV files, which
could then be further processed computationally, but also close read as is by the
media scholars to get an overview of the data subset. Further, the metadata
CSVs for each data source were loaded into a shared Google Drive folder, which
the media scholars then used Google Sheets to record their notes in a shared
location for discussion on Slack.</p>
      <p>After multiple iterations using a combination of the Octavo interface, Google
Drive sheets and the local plain text files, we arrived at a much more refined
definition of what to include and exclude from the pilot study data set. In this,
the ability to flexibly rerun the pipeline to produce new versions of the pilot
data set proved crucial. Later, it also proved crucial in enabling us to flexibly
add a fourth data source to our analysis, which became available to us only later
in the research process.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Building a Pipeline for the Agile Development of</title>
    </sec>
    <sec id="sec-4">
      <title>Computational Indicators</title>
      <p>While the Octavo interface and shared text and metadata files allowed us to zero
in on an interesting subset of the data, as well as explore its basic structure, the
6/143
adequate identification of the handling of afectivity required the development
of novel computational indicators. Eficiently creating such indicators in a
collaborative manner required two things. First, for communication purposes, an
environment was required where the computer scientists could highlight to the
media scholars the results of their development, and where the media scholars
on the other hand could mark what was interesting to them in the texts. Second,
to enable iterative development, our environment was required to facilitate easy
updates of data in all directions between the enrichment, evaluation and later
analysis environments.</p>
      <p>
        For visualising to our media scholars what our computational approaches
could extract from the texts, we decided to employ WebAnno [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>While WebAnno was originally intended for the creation of datasets for
language technology tasks, its functionality is designed to be very general, which
enabled its use in a wide variety of projects involving text annotation5. In
WebAnno, annotations to the texts are shown as highlighted text spans, with feature
values shown as colourful bubbles over the text (see Figure 1). Crucially, in
addition to the usual linguistic layers of annotation, like lemma or head, it allows
the creation of custom layers and feature sets. Further, the various annotation
layers can be shown or hidden on demand. For us, this provided the means to
show what our computational tools could dig up from the data in an easy way.
Similarly, the media scholars could use the manual annotation functionalities of
WebAnno to both explicitly highlight to us new features of interest, as well as
to provide corrections and further training data for our algorithms.</p>
      <p>What remained to be done was to figure out how WebAnno could be hooked
into the rest of our components. To function as a communicative tool (and later</p>
      <sec id="sec-4-1">
        <title>5 see: https://webanno.github.io/webanno/use-case-gallery/</title>
        <p>7/143
to function as part of our analytical environment), we needed to be able to link
to the view of a particular document from outside. Here, while WebAnno did
provide URLs taking the user to a particular document, these were based on
internal document ids instead of any external identifies we used. However, we
were able to derive a mapping between the two, which was then loaded into the
shared Google Sheets that tracked our pilot study data. When this later proved
too cumbersome to maintain, due to WebAnno being open source, we could add
functionality for external identifiers straight to the tool, and contributed our
additions also back upstream.</p>
        <p>As the second requirement, we needed to be able to iteratively synchronise
the data between WebAnno and our environment for both running the
computational enrichments as well as later computational analyses. This required
identifying a core data format that would be easy to transform both to and
from the formats required by particular tools.</p>
        <p>
          Our primary toolbox for statistical analysis is R. This motivated using a
‘tidy data’ CSV-based format [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] as our main data format. On the other hand,
as most linguistic analysis is currently based on variations of the CONLL-U6
format, we sought to design our core data model in similar terms. However, we
could not use CONLL-U directly due to two reasons. First, while being based
on TSV, the CONLL formats are an extension to it, with non-tabular means
to denote paragraph and sentence boundaries. Second, even within the tabular
part of CONLL-U, the semantics of all its columns are predetermined, with just
a single ”miscellaneous” column for recording additional information.
        </p>
        <p>As a result, we ended up with a core tidy table format that contains all
CONLL-U columns, but with added columns for documentID, paragraphId and
sentenceId. Together, these facilitate combining the per-document CONLL-like
ifles often produced by NLP tools into a single unified table, as well as remove
the need to separately handle the way CONLL codes paragraph and sentence
boundaries. Given this base format, all annotation layers could also be relegated
to separate CSV files, where tuples like ( documentId, sentenceId, spanStartId,
spanEndId, annotationValue) were stored. This way, new annotations could easily
be produced separately, and joined only as needed for analysis or visualisation.</p>
        <p>
          Concretely, to act as bases for our own annotation enrichments, we crafted
converters to turn the CONLL-U output from the TurkuNLP pipeline [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] to this
format, as well as the output of the rule-based FINER named entity recognition
tool [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]). Then, all of our own enrichments were programmed to operate from
this data, although some of them for example converted from this common base
to Prolog files and back to facilitate rule-based deduction.
        </p>
        <p>Coming back to WebAnno, it also supported several data formats for
import and export, all of them assuming one document per file. Among others,
diferent variants of the CONLL format were supported. However, while in
the CONLL formats, the semantics of all fields are predefined, WebAnno also
provided WebAnno-TSV as its own tab-separated text format, which included
support for custom annotation layers. Because it is a text format and is well</p>
      </sec>
      <sec id="sec-4-2">
        <title>6 https://universaldependencies.org/format.html</title>
        <p>8/143
documented, we were able to implement a fully automatic bidirectional
conversion between our corpus-wide, per-annotation CSV files and the per-document
WebAnno-TSV files.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Indicator Development and Analysis</title>
      <p>To reiterate, now that WebAnno was working as part of our iterative development
environment, we crucially had an interface that could highlight to the user all
results from our automated computational annotation. Through this interface,
the media scholars evaluated, corrected and further taught our computational
approaches. However, they could equally also use the interface to highlight new
aspects of interest for the computer scientists to try to capture. Making use
of these tools, we were finally able to develop indicators for the study of the
management of afectivity in our news corpora, as well as to develop the axes by
which we’d compare them.</p>
      <p>
        Previous studies, especially corpus-based studies based on appraisal theory
(e.g. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]) and those applying computational methods such as sentiment analysis,
have mostly approached afectivity and emotions through lexemes signalling clear
negative or positive evaluations. While we knew this would not sufice for our
purposes, we also started from this established baseline in our approach.
      </p>
      <p>We first started with the word class of adjectives, from which we extracted
those that had evaluative meaning. To ensure coverage, we automatically
extracted from the corpora all 4,732 adjective lexeme (word) types. Evaluative
lexemes (N = 2,857) were then picked manually. The excluded, non-evaluative
lexemes consisted mostly of technical and temporal qualifications and other
classifications such as nationalities.</p>
      <p>
        However, in a given setting, almost any linguistic structure can express
affectivity. Particularly given the importance of the ”ritual of objectivity” [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
and neutrality in news language, our hypothesis was that we’d also need other,
more context-dependent and subtle indicators of afect apart from the
universally evaluative lexemes. Thus, we also close-read the news articles in our corpus
to obtain a data-informed handle on their emotive and evaluative vocabulary
beyond adjectives. We were struck by the large amount of metaphorical language
– an evidently afective practice [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], but more indirect than the evaluative
expressions. In the news texts, using metaphors that transposed events from the
source domain (the labour market) into e.g. one of sports or armed conflict
enabled writers to utilise afective intensity in a safe manner without appearing
confrontational.
      </p>
      <p>While such metaphors were an interesting indicator of afective intensity, they
are also content specific and cannot be identified using general-purpose word
lists; nor can they be described using syntactic rules. In order to operationalise
the identification of metaphors for this particular case, we manually analysed
a sample of the articles and marked all passages with metaphorical language.</p>
      <p>Because we started this phase early, most of this work was done purely manually
9/143
through the media scholars copy-pasting sentences into Google Sheets, but as
soon as we had WebAnno set up, we migrated to using that.</p>
      <p>We then examined the central vocabulary of these annotations and manually
extracted a seed word list of around 500 words that marked strongly
metaphorical discourse. Using pre-trained word embeddings to discover other words that
were used in similar ways, we then automatically expanded this list to 2,100
words. For analysis, we further divided this set into two subsets, one for verbs
and the other for nouns.</p>
      <p>
        To further study how the ritual of objectivity interacted with expressing
afect in the texts, we also wanted to study whether and how afect was
modulated or ‘hedged’ in the material. Here, on advice from our linguist, we turned to
grammatical and lexical structures used to express ‘evidentiality’ and epistemic
modality. In communication, these structures are typically used to express
reservations or to communicate degrees of uncertainty, and have been shown to be
heavily involved, especially in academic genres, in politeness strategies [
        <xref ref-type="bibr" rid="ref1 ref16 ref26">1,16,26</xref>
        ].
The logic of their face-protecting pragmatic function is that the afective content
of the message appears to be less strong if it is expressed as a mere possibility
instead of a certainty. Our hypothesis was that, as with academic text, these
structures would also appear in journalistic texts in managing and containing
afect in the service of the ritual of objectivity. In our corpora, we identified these
structures using a list of grammatical structures that expressed evidentiality and
epistemic modality in Finnish, based on a review of established research [
        <xref ref-type="bibr" rid="ref10 ref15 ref5">15,10,5</xref>
        ].
Concretely, these expressions were identified in the corpora using Prolog rules
that operated on a dependency-parsed version of the material as produced by
TurkuNLP.
      </p>
      <p>
        After listing the linguistic markers used in the study, we delineated the axes
by which to compare them. First, we wished to study the hypothesis of the
ritual of emotionality [
        <xref ref-type="bibr" rid="ref21 ref22">21,22</xref>
        ]. This hypothesis starts from the understanding that
appealing to emotions is a good way to rouse and maintain audience interest,
and thus should be available also in the repertoire of journalists. However, due
to the needs of the ritual of objectivity, this afectivity cannot appear in the
journalists’ own discourse, but will be outsourced to quotes. We thus developed
methods to identify both direct and indirect quotations in the articles. Due to
the nature of the material, where the accurate attribution of information and
claims is deemed extremely important, this was a relatively easy matter. First,
each source had their own clear and consistent markers used to identify direct
quotations. Identifying indirect quotations was based on automatic syntactic
dependency parsing to extract syntactic structures used for reported speech in
Finnish.
      </p>
      <p>We next sought to examine possible external factors that could have
influenced the quantity of evaluative and emotive words. We tested the ‘click-bait’
hypothesis, which posits that afectively intensive content will be located at the
beginning of the article as a means to solicit reader interest. We also investigated
whether the afectivity and intensity of the actual events afected the journalism
by 1) comparing the reporting on the competitiveness pact with baseline
po10/143
litical journalism of a randomly chosen other period and 2) comparing articles
written during periods of peak activity with those written at other times.</p>
      <p>Next, we analysed the diferences among diferent news media, ranging from
the tabloid Iltalehti to the Finnish news agency STT, whose direct clients are
not the public but other news outlets, which may choose to republish or extend
their stories on their own sites.</p>
      <p>Finally, we focused on afectivity in various types of news articles: 1) news
reporting focused on facts; 2) news analyses with an emphasis on
contextualising, explaining and interpreting events; 3) news commentaries or columns where
the writer’s voice and opinion are foregrounded; and 4) editorials and leading
articles, either signed or non-signed, that express the viewpoints of the paper.
To distinguish between these, we used the iterative capabilities of our pipeline
to draft classification rules for each news source, making use of both structured
metadata such as section information, as well as explicit article type markers in
for example the article headlines.</p>
      <p>After all of the above, in the end we had more than twelve diferent indicator
signals available for analysis, combined with six axes or dimensions of analysis.
Naturally, this leads to an explosion of possible combinations to analyse, only
some of which are interesting. Identifying which aspects to focus on again
required closely working together between the computer scientists, the linguists
and the media scholars. To facilitate this, we added one final piece of software
to our ecosystem: a web-based tool for explorative statistical analysis, which
al11/143
lowed anyone in the project to 1) get statistical overviews of a desired indicator
variable across a custom combination of dimensions and 2) to also get a
statistical overview of how each data point was placed with respect to others from
the viewpoint of an indicator (see Figure 2). Naturally, this interface was also
programmed to facilitate the sharing of queries and UI state through copying
and pasting the URL visible in the browser. In addition, the individual articles
visible in the second view were all linked to WebAnno, so that outliers could be
investigated, and reasons for particular distributions qualitatively explored.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Discussion</title>
      <p>
        As already said, the actual results of our analyses will not be discussed here. For
them, the reader is referred to our article in Journalism [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Instead, we will
summarise the aspects pertinent to our process here.
      </p>
      <p>First, in this paper, we’ve presented an example of a general approach to
digital humanities that allows combining computational analysis with the knowledge
of domain experts in all steps of the process, from the development of
computational indicators to final analysis.</p>
      <p>Once fully functioning, our solution rests on three pillars. The first of these is
an interface for close reading, but crucially one which is able to highlight to the
user all results from automated computational annotation. Beyond pure close
reading, through this interface, the user is thus also able to evaluate the quality
of computational analysis. Further, the interface supports manual annotation of
the material, facilitating correction and teaching of machine-learned approaches.</p>
      <p>The second of our pillars is an interface for statistical analysis, where the
phenomena of interest can be analysed en masse. However crucially, this
interface is also linked to the close-reading one to further let the users delve into
interesting outliers. Through this, they are not only able to derive hypotheses
and explanations of the phenomena, but can also identify cases where outliers
are more due to errors and omissions in our computational pipeline.</p>
      <p>Finally, our third pillar is an agile pipeline to move data between these
interfaces and our computational environment. In application, this third pillar is
crucial, as it allows us to iteratively experiment with diferent computational
indicators to capture the objects of our interest, with the results quickly
making their way to experts for evaluation and explorative analysis. Through this
analysis and evaluation, we then equally quickly get back information on not
just the technical accuracy of our approach, but also if it captures the question
of interest. Further, beside direct training data, we also get suggestions on new
phenomena of interest to try to capture.</p>
      <p>While in any given project, some of these pillars will not be available from the
very start or will be replaced with better ones during the course of the project, the
idea and principles remain the same. By maintaining from the start environments
and interfaces that allow both computer scientists and humanities scholars to not
only view, but highlight to each other all aspects of the data, we further a shared
understanding between the participants. For example, humanities scholars are
12/143
easily able to highlight to the computer scientists new phenomena of interest
in the data derived from their close reading, while the computer scientists can
easily show what they are currently automatically able to bring forth from the
data. Through this, everyone is kept on the same page, misunderstandings are
avoided, and the most fruitful avenues for development can be negotiated in a
shared space where everyone contributes equally.</p>
      <p>In fact, we argue that it is precisely getting such an environment going as early
as possible (even if by ready tools that are not completely ideal) that facilitates
a fruitfully directed cycle of discussion, agile development and experimentation.
Together, we feel that these insights provide a versatile template for an iterative
and discursive approach to digital humanities research, which moves toward
questions of interest both fast, as well as with high capability to truly capture
the phenomena from the viewpoints of interest.
13/143</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Brown</surname>
          </string-name>
          , P.,
          <string-name>
            <surname>Levinson</surname>
            ,
            <given-names>S.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levinson</surname>
            ,
            <given-names>S.C.</given-names>
          </string-name>
          :
          <article-title>Politeness: Some universals in language usage</article-title>
          , vol.
          <volume>4</volume>
          . Cambridge university press (
          <year>1987</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Eckart de Castilho, R., Mújdricza-Maydt, É.,
          <string-name>
            <surname>Yimam</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurevych</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frank</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Biemann</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures</article-title>
          .
          <source>In: Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</source>
          . pp.
          <fpage>76</fpage>
          -
          <lpage>84</lpage>
          . Osaka,
          <string-name>
            <surname>Japan</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Edmond</surname>
          </string-name>
          , J.:
          <article-title>Strategies and Recommendations for the Management of Uncertainty in Research Tools and Environments for Digital History</article-title>
          .
          <source>Informatics</source>
          <volume>6</volume>
          (
          <issue>3</issue>
          ),
          <volume>36</volume>
          (Sep
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Epstein</surname>
            ,
            <given-names>E.J.</given-names>
          </string-name>
          : News from Nowhere. New York: Vintage. Random
          <string-name>
            <surname>House</surname>
          </string-name>
          (
          <year>1973</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hakulinen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Korhonen</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vilkuna</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koivisto</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Iso suomen kielioppi [Descriptive Grammar of Finnish]</article-title>
          .
          <source>Suomalaisen kirjallisuuden seura</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>M.J.:</given-names>
          </string-name>
          <article-title>Invisible interpretations: reflections on the digital humanities and intellectual history</article-title>
          .
          <source>Global Intellectual History</source>
          <volume>1</volume>
          (
          <issue>2</issue>
          ),
          <fpage>130</fpage>
          -
          <lpage>150</lpage>
          (May
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Hine</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>New Infrastructures for Knowledge Production: Understanding E-science</article-title>
          . Idea Group Inc (IGI) (
          <year>Jan 2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Huan</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>The strategic ritual of emotionality in Chinese and Australian hard news: a corpus-based study</article-title>
          .
          <source>Critical Discourse Studies</source>
          <volume>14</volume>
          (
          <issue>5</issue>
          ),
          <fpage>461</fpage>
          -
          <lpage>479</lpage>
          (
          <year>2017</year>
          ). https://doi.org/10.1080/17405904.
          <year>2017</year>
          .
          <volume>1352002</volume>
          , https://doi.org/10. 1080/17405904.
          <year>2017</year>
          .1352002
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kanerva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ginter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miekka</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakoski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Turku neural parser pipeline: An end-to-end system for the CoNLL 2018 shared task</article-title>
          .
          <source>In: Proceedings of the CoNLL</source>
          <year>2018</year>
          <article-title>Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies</article-title>
          . pp.
          <fpage>133</fpage>
          -
          <lpage>142</lpage>
          . Association for Computational Linguistics, Brussels, Belgium (Oct
          <year>2018</year>
          ). https://doi.org/10.18653/v1/
          <fpage>K18</fpage>
          -2013, https: //www.aclweb.org/anthology/K18-2013
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kangasniemi</surname>
          </string-name>
          , H.:
          <article-title>Modal expressions in Finnish</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Helsinki (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kanner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marjanen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaara</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roivainen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lähteenoja</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>TarkkaRobinson</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mäkelä</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lahti</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tolonen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Octavo - analysing early modern public communication</article-title>
          .
          <source>In: Digital Humanities at Oxford Summer School posters (Jul</source>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kemman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Boundary practices of digital humanities collaborations</article-title>
          .
          <source>DH Benelux journal 1(1)</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Kemman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kleppe</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>User required? on the value of user research in the digital humanities</article-title>
          .
          <source>In: Selected Papers from the CLARIN 2014 Conference, October 24-25</source>
          ,
          <year>2014</year>
          , Soesterberg, The Netherlands. pp.
          <fpage>63</fpage>
          -
          <lpage>74</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Koivunen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Janicki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harju</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hokkanen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mäkelä</surname>
          </string-name>
          , E.:
          <article-title>Emotive, evaluative, epistemic: a linguistic analysis of afectivity in news journalism</article-title>
          .
          <source>Journalism</source>
          (
          <year>2020</year>
          ),
          <article-title>in press, available as a preprint from the project website</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Laitinen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Välttämättömyys ja persoona. Suomen murteiden nesessiivisten rakenteiden morfosyntaksia ja semantiikkaa</article-title>
          .
          <source>Licentiate thesis</source>
          , University of Helsinki (
          <year>1989</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Myers</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>The pragmatics of politeness in scientific articles</article-title>
          .
          <source>Applied linguistics 10(1)</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>35</lpage>
          (
          <year>1989</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Poole</surname>
            ,
            <given-names>A.H.</given-names>
          </string-name>
          :
          <article-title>The conceptual ecology of digital humanities</article-title>
          .
          <source>Journal of Documentation</source>
          <volume>73</volume>
          (
          <issue>1</issue>
          ),
          <fpage>91</fpage>
          -
          <lpage>122</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Ruokolainen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kauppinen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silfverberg</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lindén</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A Finnish news corpus for named entity recognition</article-title>
          .
          <source>Lang Resources &amp; Evaluation (August</source>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Schudson</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The objectivity norm in american journalism*</article-title>
          .
          <source>Journalism</source>
          <volume>2</volume>
          (
          <issue>2</issue>
          ),
          <fpage>149</fpage>
          -
          <lpage>170</lpage>
          (
          <year>2001</year>
          ). https://doi.org/10.1177/146488490100200201, https://doi.org/ 10.1177/146488490100200201
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Tuchman</surname>
          </string-name>
          , G.:
          <article-title>Objectivity as strategic ritual: An examination of newsmen's notions of objectivity</article-title>
          .
          <source>American Journal of Sociology</source>
          <volume>77</volume>
          (
          <issue>4</issue>
          ),
          <fpage>660</fpage>
          -
          <lpage>679</lpage>
          (
          <year>1972</year>
          ), http://www. jstor.org/stable/2776752
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Wahl-Jorgensen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Emotions, media and politics</article-title>
          . John Wiley &amp; Sons (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Wahl-Jorgensen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Questioning the Ideal of the Public Sphere: The Emotional Turn</article-title>
          .
          <source>Social Media + Society</source>
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <volume>2056305119852175</volume>
          (
          <year>2019</year>
          ). https://doi.org/10.1177/2056305119852175, https://doi.org/10.1177/ 2056305119852175
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Wallach</surname>
          </string-name>
          , H.:
          <article-title>Computational social science ̸= computer science + social data</article-title>
          .
          <source>Commun. ACM</source>
          <volume>61</volume>
          (
          <issue>3</issue>
          ),
          <fpage>42</fpage>
          -
          <lpage>44</lpage>
          (
          <year>Feb 2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Watts</surname>
            ,
            <given-names>D.J.:</given-names>
          </string-name>
          <article-title>Computational social science: Exciting progress and future directions</article-title>
          .
          <source>The Bridge on Frontiers of Engineering</source>
          <volume>43</volume>
          (
          <issue>4</issue>
          ),
          <fpage>5</fpage>
          -
          <lpage>10</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Wetherell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Afect and Emotion: A New Social Science Understanding</article-title>
          . Sage, London (
          <year>2012</year>
          ). https://doi.org/10.4135/9781446250945
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>White</surname>
            ,
            <given-names>P.R.</given-names>
          </string-name>
          :
          <article-title>Beyond modality and hedging: A dialogic view of the language of intersubjective stance</article-title>
          .
          <source>Text - Interdisciplinary Journal for the Study of Discourse</source>
          <volume>23</volume>
          (
          <issue>2</issue>
          ),
          <fpage>259</fpage>
          -
          <lpage>284</lpage>
          (
          <year>2003</year>
          ). https://doi.org/10.1515/text.
          <year>2003</year>
          .
          <volume>011</volume>
          , https://www.degruyter. com/view/j/text.1.
          <year>2003</year>
          .
          <volume>23</volume>
          .issue-2/text.
          <year>2003</year>
          .011/text.
          <year>2003</year>
          .
          <volume>011</volume>
          .xml
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Wickham</surname>
          </string-name>
          , H.:
          <article-title>Tidy data</article-title>
          .
          <source>Journal of Statistical Software</source>
          <volume>59</volume>
          (
          <issue>10</issue>
          ) (
          <year>August 2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>