<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>FEIL: Emotion Lexicons for Finnish</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Emily S. Öhman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Waseda University</institution>
          ,
          <addr-line>Nishiwaseda 1-6-1, Shinjuku, Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <fpage>424</fpage>
      <lpage>432</lpage>
      <abstract>
        <p>This paper introduces a Sentiment and Emotion Lexicon for Finnish (SELF) and a Finnish Emotion Intensity Lexicon (FEIL). Sentiment analysis and emotion detection require annotated data regardless of the chosen approach, but most existing resources are for the English language. To overcome this, the SELF and FEIL lexicons use projected annotations from existing resources with carefully edited translations and domain adaptations. In this paper the creation process and translation issues are explained in detail to allow others to create similar lexicons for other languages. The usefulness of SELF and FEIL are demonstrated via several interdisciplinary afect-related projects. To our best knowledge, this is the first comprehensive sentiment and emotion lexicon for Finnish.</p>
      </abstract>
      <kwd-group>
        <kwd>sentiment analysis</kwd>
        <kwd>lexicon creation</kwd>
        <kwd>emotion detection</kwd>
        <kwd>lexicon validation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        There are three main approaches to sentiment analysis and emotion detection: machine learning,
lexicon-based, and hybrid methods that combine the first two approaches. The one thing all
of these approaches have in common is the need for annotated data. This annotated data
can consist of labeled datasets for training and testing classifiers, or lexicons to be used with
diferent types of word-matching approaches. Usually, the annotation process requires human
annotators in an iterative validation process to confirm the validity of each and every label and
is therefore labor-intensive and expensive [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        The benefit of lexicon-based methods is in the re-usability of them for multiple domains,
and therefore also the lower cost as there is no need to re-annotate the lexicons for each
project. Although both machine learning datasets and lexicons are somewhat context-dependent,
lexicons are slightly less so [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and they are typically easier and cheaper to edit for a new
domain than machine learning datasets. Emotion and sentiment lexicons can be used “as is”
in purely lexicon-based approaches or as a feature extraction tool for data-driven classifiers,
increasing their context-sensitivity (see e,g. Schmidt et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]). However, most emotion lexicons
have been created for the English language and there are very few quality emotion lexicons for
other languages, including Finnish.
      </p>
      <p>This paper introduces the SELF (Sentiment and Emotion Lexicon for Finnish) and FEIL
(Finnish Emotion Intensity Lexicon) lexicons.These are to the author’s best knowledge the
2022.
CEUR
ifrst and only emotion and sentiment dictionaries that have been made widely available for
Finnish and for which translations have been manually verified by independent annotators.
In this paper the focus is on the manual creation of lexicons, specifically the augmentation of
existing ones that were originally created manually and later automatically translated. The
goal of this pilot project is to help other researchers create similar lexicons for their languages,
give step-by-step directions on how to increase domain-specificity for their projects, and to
demonstrate the usability of these lexicons in interdisciplinary research projects. In the next
section, lexicon-based sentiment analysis and lexicon creation in general is explained, after
which an overview of the lexicon creation process is presented. Section 4 illustrates how these
lexicons can be used in interdisciplinary projects and serves as a real-world evaluation of the
quality of the lexicons followed by a concluding discussion in section 5.</p>
      <p>The SELF (Sentiment and Emotion Lexicon for Finnish) and FEIL (Finnish Emotion Intensity
Lexicon) lexicons are available on GitHub1. Domain- and project-specific versions of the lexicons
will be made available after the publication of the respective papers they were used in.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>
        Emotion lexicons can be created automatically or manually, but both approaches often, at least
partially, utilize dictionaries that list emotion categories in some way. For automatic creation
no manual verification is applied. Instead, other features are used to determine the sentiment
or emotion. For example, Kimura and Katsurai [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] automatically created an emoji-lexicon
by leveraging cosine similarity values and co-occurence from WordNet Afect [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].The manual
creation of lexicons involves using human annotators and asking them to determine how specific
words relate to diferent sentiments and/or emotions. This process needs to be repeated and
each data point annotated by multiple independent annotators as humans rarely agree with
each other on any labeling task, but even less so with emotion labeling [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ]. The agreement
between annotators is calculated using inter-rater agreement scores such as Krippendorf’s  .
      </p>
      <p>
        One of the most widely-used and well-known emotion lexicons is the NRC (National Research
Council Canada) Emotion Lexicon [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ]. The first version of the NRC Emotion Lexicon (a.k.a.
EmoLex) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] was created using Mechanical Turk, a platform for crowd-sourcing annotations
from humans. The lexicon was later significantly augmented to its current 14,182 lexical items
also using Mechanical Turk [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. For the first version, the annotators were asked to annotate
for emotions evoked by the words, and for the later version, they were asked to annotate for
emotions associated with the words as the developers of the lexicon discovered this led to
better agreement scores. Although, it too was created based on English data, the English words
have been translated using Google Translate and there are now 14,182 entries in total translated
into 104 languages. Even though very few of these translations have been manually verified,
some of the multilingual versions of the NRC Emotion Lexicon have been tested previously on
at least Spanish, Portuguese, and Arabic [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ] to evaluate emotion preservation in translation,
and the translations as well as the original English lexicon are also a built-in part of the syuzhet
package for R [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>1https://github.com/Helsinki-NLP/SELF-FEIL</p>
      <p>
        The annotations for the NRC Emotion Intensity Lexicon [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] were compiled using best-worst
scaling (BWS). BWS is an eficient method to collect massive amounts of scaled annotations
and has been proven to beat rating scales and other methods in both quality and cost2 [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
The main diference between the intensity lexicon and the emotion lexicon is that the intensity
lexicon does not include sentiments (positive, negative), and that instead of a Boolean option
for each emotion (0 for not associated with a particular emotion, 1 for being associated with a
particular emotion) the intensity lexicon gives a score as to how intense the emotion associated
with a word is (a score between 0 and 1, with 0 for no association and therefore no intensity, to
1 for the highest intensity).
      </p>
      <p>
        There are many considerations that need to be taken into account when using resources
created for one language with another language. Emotion words are closely linked with culture
and emotions and feelings are expressed quite diferently in diferent cultures and languages.
Although research shows the universality of afect categories [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ], Mohammad et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
list several error types when translating emotion words. These are, e.g., mistranslation, cultural
diferences and diferent word connotations, as well as diferent sense distributions. When
projecting annotations, particularly emotion annotations, from one language to another, it is
important to consider all of these aspects.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Lexicon Creation and Description</title>
      <p>The automatic Finnish translations of the NRC Lexicons were re-translated with the most
current version of Google Translate. First, within these translations, duplicates were marked
and all translations were carefully evaluated to match the English word’s meaning as well as
the associated emotions and in the case of the FEIL lexicon, intensity too. The problem with
Google Translate is that it chooses the most common translation, especially when translating
single words out of context rather than words in context. This process means that with an
emotion dictionary with many synonyms and near-synonyms, Google Translate provides the
same common word for all as the default translation.</p>
      <p>Naturally, this leads to issues with representativity as the Finnish lexicon, if left as is,
would only represent the most common words in Finnish, disregarding any synonyms or
nearsynonyms leading to a loss of nuances and insuficient coverage of emotion words. Hence, those
duplicates were carefully examined in order to find alternative translations that matched the
original word better in terms of meaning and connotation, both by human experts and synonym
dictionaries and thesauruses (see table 1 for an example). The original lexicon also included
several alternative spelling options, which were translated as the same word. Such duplicates
(e.g. tumor/tumour) were also removed. Another type of over-generalization was also
discovered: connotative-generalization, i.e. the meaning of the original word was over-generalized so
that the translation had lost all original connotations and many emotion associations (see the
example of ‘emaciated’ in table 2 in the Appendix).</p>
      <p>The reverse was also found with some instances where English has a more fine-grained
separation of some concepts where Finnish does not. E.g. Finnish does not diferentiate between
poison-venom-toxin. It is all the same word myrkky. Some English words can be both nouns and
2Cost because other methods take exponentially more time to collect the same amount of annotations.
verbs, and therefore, if not specified it is impossible to tell which one is being evaluated 3 or if
both are. One such word was rape. In this case the verb form was added with the same emotion
and intensity associations as the noun form. There were also cases of clear mistranslations,
some of which occurred with polysemic and homonymous words where the Finnish translation
was clearly not the one meant based on the associated emotions and/or intensities. For example
birch had been translated as ‘koivu’, a birch tree, but the negative associations suggested that
what had been meant was the act of flogging. In many cases, the least contentious solution was
to remove ambiguous entries; a recommendation if there is only one annotator. With multiple
annotators, agreement scores can be calculated and contested annotations can be solved by
various means, including, but not limited to removal. See table 2 for an overview of common
corrections.</p>
      <p>For emaciated, the automatic translation was too general. The corrected form is less common
but almost identical in meaning and connotation to the original English. As the term rabble has
such negative connotations in English, the automatic translation was much too neutral and was
changed to a word with similar connotations. As for corroborate the issue was with there not
existing a one-word translation for corroborate in Finnish and the meaning of the automatically
translated word being much closer to strengthen. The meaning of the original English words
cede, relinquish are somewhat synonymous so it makes sense to have both words in the English
lexicon with nearly identical emotional intensities, but there is only one one-word translation
3In the original task for English, the annotators were given a test for each word to check that they understood
the meaning of the word as intended. This information is not available in the published dictionary.
for Finnish so the duplicate was removed. Another example is furor which in English had anger
intensities of 0.9 and in Finnish had been translated into villitys - fad, but the best translation
was already paired with rage and as no suitable alternative translation was found, the entry
was deleted.</p>
      <p>The best translation is not always necessarily the best choice for a lexicon entry. If one
translated word truly is the best translation for several English words, an evaluation of the best
match needs to be made. If possible, the duplicate is not removed, but altered to still match the
English word, but using a diferent word, even if not as accurately translated to increase the
diversity of the lexicon while remaining true to the original emotion.</p>
      <p>
        Cultural diferences were also evident in the lexicon. North-Americans are on average more
religious than the Nordic people [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], which means that many religious words seem to have
much more positive connotations in the US and Canada than they do in Northern Europe, and
Scandinavia in particular (see the example of hurskas in table 1). This was most evident with
religious words, but other cultural connotations seemed to be present as well. In these cases
though, unless the association was glaringly wrong, they were kept as is: It is a slippery slope
to try and push one’s own judgment onto the intensity scores and emotion-word associations.
Everyone is inherently biased and relies on their own experience when making judgments.
Only when enough people agree on a judgment, can it be seen as culturally representative.
Since in this case only one person did most of the corrections, words were rather deleted than
letting a lone annotator’s subjective judgment overly influence the lexicon 4.
      </p>
      <p>The final distribution of the SELF and FEIL lexicons can be seen in table 3. All the adjustments
mentioned in this section resulted in an overall reduction in lexicon size by 12.2% from 14182
word-emotion association pairs to 12448 entries in Finnish. The final distribution of the FEIL
lexicon with all the adjustments mentioned in this section resulted in an overall reduction in
lexicon size by 10.5% from 8149 intensities to 7291 entries for Finnish.</p>
      <p>positive negative anger anticipation disgust fear
sadness surprise trust Lexicon
2117
2938</p>
      <p>The distribution of the types of corrections is presented in table 4. It is easier to detect sense
and specificity mistranslations in the intensity lexicon when the intensities of the associated
emotions are clear. This is likely the reason for the higher relative occurrence of such corrections
in the intensity lexicon. However, these types of errors are also the hardest to detect overall,
and therefore, this is the category that is most likely to increase the most when the lexicon is
updated and revised further.</p>
      <p>4Newer versions of the lexicon and domain adaptations have been evaluated by four annotators including
additions and intensity changes.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Lexicon Evaluation</title>
      <p>
        identical target words with no alternative translation
identical target words with alternative translation
birch to birch tree instead of flogging
emaciated to laihtunut instead of riutunut
part-of-speech diference
overly positive connotations of religious words
SELF was first evaluated against the original EmoLex by using the R package syuzhet [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] on
the Finnish novel Rautatie by Juhani Aho.The results can be seen in figure 1 where on the left
is the original EmoLex in Finnish and on the right SELF. The two plots show similar overall
patterns, but are decidedly diferent in detail, with SELF providing the more accurate plot when
evaluated by a literary expert on Aho [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>The emotion word distributions (see table 5) reveal that in general the SELF lexicon finds
more emotion words in the novel despite having a reduction in lexical items of 10%, but that
anger, disgust, and sadness, and therefore also negative were over-represented in the original
lexicon, likely by duplicates.</p>
      <p>
        Both SELF and FEIL were used on a project where Finnish political party manifestos between
the years of 1945-2019 were analyzed for patterns of emotion [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. In this case, FEIL proved to be
the more useful lexicon, as it allowed for us to examine more detailed expressions of emotions.
The output using the lexicon-based approach was evaluated against manual annotations as well
as by checking for statistical significance of the results. The validity of this evaluation process
has been discussed in detail in Ohman [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The most interesting finding was perhaps that
positive negative anger anticipation disgust fear joy
sadness surprise trust
populist parties use the same amount of emotion words as other parties, but that the intensity
of the emotion words they use is significantly higher.
      </p>
      <p>
        Finally, FEIL has also successfully been used in conjunction with structural topic modeling
(STM) to measure change in attitudes towards specific entities during the COVID-19 pandemic
[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. In this project we also made sure that the most common words in the massive dataset we
collected were included in FEIL. This resulted in some additions to the lexicon, the emotional
intensity of which were carefully evaluated by 4 humans. This COVID-19 enhanced version of
FEIL will also be made public after the publication of the original article.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Concluding Discussion</title>
      <p>The SELF and FEIL lexicons have proved to be valuable augmentations of the NRC Emotion and
Intensity Lexicons. The lexicons have shown that they can match more words in real-world texts
than the original translations of EmoLex/NRC lexicon despite being approximately 10% smaller
in size. The problems with the use of the NRC lexicons’ automatically translated versions do not
seem to be caused by any issues with the original annotations (also attested by the thousands
of projects that have successfully used them for English), nor with the annotation projection as
such, but mostly by Google Translate’s algorithms which are not optimal for translating single
words out of context. With these issues fixed, SELF and FEIL have proven to be useful in several
interdisciplinary tasks and likely many more in the future.</p>
      <p>
        Both SELF and FEIL have been evaluated against the original automatically translated NRC
EmoLex as well as had their results validated against human annotations of texts. In all cases
SELF and FEIL produced reliable, real-world congruent results that closely aligned with human
perceptions of emotions expressed in those texts. It is the recommendation of the author to
use FEIL over SELF whenever methodologically possible as it showed closely matching results
to human impressions of longer texts [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and if simply comparing emotion presence, is more
real-world accurate than SELF. It is also recommended to confirm SELF and FEIL results with
statistical significance testing or similar. If the method makes it possible to validate against
human annotations, it is of course always prudent to do so.
      </p>
      <p>Both lexicons work especially well on longer texts with semi-informal registers. To increase
the domain-specific coverage of the lexicon, it is also recommended to compile a list of the
most common tokens in the dataset that is being examined and check to see that all words that
are associated with emotions are added to the lexicon. Their emotion associations should be
confirmed by at least 3 annotators to reduce the risk of a single annotator’s biases influencing
the results.</p>
      <p>As for future work, it would be interesting to simultaneously use multiple emotion lexicons to
examine if they could be used together to automate some of the manual verification required for
translated data. The lexicon could also be used in conjunction with machine learning methods,
particularly with literary works and similar, where manually annotating large texts to create
ifne-grained training and testing data is usually infeasible.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>I would like to thank Dr. Saif Mohammad, the creator of the NRC emotion lexicons, for taking
the time to discuss the SELF and FEIL lexicons with me, and for his helpful suggestions in
improving the draft version of this paper.</p>
      <p>I would also like to thank the anonymous reviewers for their helpful comments that helped
improve this paper.</p>
      <p>This work was in part supported by JSPS KAKENHI Grant Number 22K18154.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E.</given-names>
            <surname>Öhman</surname>
          </string-name>
          ,
          <article-title>Challenges in Annotation: Annotator Experiences from a Crowdsourced Emotion Annotation Task</article-title>
          ,
          <source>in: Digital Humanities in the Nordic Countries</source>
          <year>2020</year>
          , CEUR Workshop Proceedings,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Taboada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brooke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tofiloski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Voll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stede</surname>
          </string-name>
          ,
          <article-title>Lexicon-based methods for sentiment analysis</article-title>
          ,
          <source>Computational linguistics 37</source>
          (
          <year>2011</year>
          )
          <fpage>267</fpage>
          -
          <lpage>307</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dennerlein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wolf</surname>
          </string-name>
          ,
          <article-title>Using deep learning for emotion analysis of 18th and 19th century german plays (</article-title>
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kimura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Katsurai</surname>
          </string-name>
          ,
          <article-title>Automatic construction of an emoji sentiment lexicon</article-title>
          ,
          <source>in: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining</source>
          <year>2017</year>
          , ASONAM '17,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery,
          <year>2017</year>
          , p.
          <fpage>1033</fpage>
          -
          <lpage>1036</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Strapparava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Valitutti</surname>
          </string-name>
          , et al.,
          <article-title>Wordnet afect: an afective extension of wordnet</article-title>
          .,
          <source>in: Proceedings of the International Conference on Language Resources and Evaluation (LREC)</source>
          , volume
          <volume>4</volume>
          ,
          <year>2004</year>
          , p.
          <fpage>40</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Bayerl</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. I. Paul</surname>
          </string-name>
          ,
          <article-title>What determines inter-coder agreement in manual annotations? A meta-analytic investigation</article-title>
          ,
          <source>Computational Linguistics</source>
          <volume>37</volume>
          (
          <year>2011</year>
          )
          <fpage>699</fpage>
          -
          <lpage>725</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <article-title>A practical guide to sentiment annotation: Challenges and solutions</article-title>
          ., in: WASSA@
          <string-name>
            <surname>NAACL-HLT</surname>
          </string-name>
          ,
          <year>2016</year>
          , pp.
          <fpage>174</fpage>
          -
          <lpage>179</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Turney</surname>
          </string-name>
          ,
          <article-title>Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon</article-title>
          ,
          <source>in: Proceedings of the NAACL HLT</source>
          <year>2010</year>
          <article-title>workshop on computational approaches to analysis and generation of emotion in text,</article-title>
          <year>2010</year>
          , pp.
          <fpage>26</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Turney</surname>
          </string-name>
          ,
          <article-title>Crowdsourcing a word-emotion association lexicon</article-title>
          ,
          <source>Computational Intelligence</source>
          <volume>29</volume>
          (
          <year>2013</year>
          )
          <fpage>436</fpage>
          -
          <lpage>465</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Salameh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kiritchenko</surname>
          </string-name>
          ,
          <article-title>Sentiment after translation: A case-study on Arabic social media posts, in: Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: Human language technologies</article-title>
          ,
          <year>2015</year>
          , pp.
          <fpage>767</fpage>
          -
          <lpage>777</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Salameh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kiritchenko</surname>
          </string-name>
          ,
          <article-title>How translation alters sentiment</article-title>
          ,
          <source>Journal of Artificial Intelligence Research</source>
          <volume>55</volume>
          (
          <year>2016</year>
          )
          <fpage>95</fpage>
          -
          <lpage>130</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jockers</surname>
          </string-name>
          ,
          <source>Syuzhet 1.0</source>
          .4 now
          <string-name>
            <surname>on</surname>
            <given-names>CRAN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matthew</surname>
            <given-names>L</given-names>
          </string-name>
          .
          <string-name>
            <surname>Jockers</surname>
          </string-name>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <article-title>Word afect intensities, in: Proceedings of the 11th Edition of the Language Resources and Evaluation Conference (LREC-</article-title>
          <year>2018</year>
          ), Miyazaki, Japan,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kiritchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <article-title>Best-worst scaling more reliable than rating scales: A case study on sentiment intensity annotation</article-title>
          ,
          <source>in: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>2</volume>
          :
          <string-name>
            <surname>Short</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>465</fpage>
          -
          <lpage>470</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Cowen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Laukka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Elfenbein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Keltner</surname>
          </string-name>
          ,
          <article-title>The primacy of categories in the recognition of 12 emotions in speech prosody across two cultures</article-title>
          ,
          <source>Nature human behaviour 3</source>
          (
          <year>2019</year>
          )
          <fpage>369</fpage>
          -
          <lpage>382</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Scherer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. G.</given-names>
            <surname>Wallbott</surname>
          </string-name>
          ,
          <article-title>Evidence for universality and cultural variation of diferential emotion response patterning</article-title>
          .,
          <source>Journal of personality and social psychology 66</source>
          <volume>2</volume>
          (
          <year>1994</year>
          )
          <fpage>310</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>V.</given-names>
            <surname>Skirbekk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Connor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stonawski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. P.</given-names>
            <surname>Hackett</surname>
          </string-name>
          ,
          <source>The future of world religions: Population growth projections</source>
          ,
          <year>2010</year>
          -
          <fpage>2050</fpage>
          , Pew Research Center,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>E.</given-names>
            <surname>Öhman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <article-title>Afect and Emotions in Finnish Literature: Combining Qualitative and Quantitative Approaches</article-title>
          , forthcoming).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Koljonen</surname>
          </string-name>
          , E. Öhman,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ahonen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mattila</surname>
          </string-name>
          ,
          <article-title>Strategic sentiments and emotions in post-Second World War party manifestos in Finland</article-title>
          ,
          <source>Journal of Computational Social Science (2022-forthcoming).</source>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>E. Ohman,</surname>
          </string-name>
          <article-title>The validity of lexicon-based emotion analysis in interdisciplinary research</article-title>
          ,
          <source>in: Proceedings of the ICON 2021 workshop NLP4DH</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>J.</given-names>
            <surname>Paakkonen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Ohman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-M.</given-names>
            <surname>Laaksonen</surname>
          </string-name>
          ,
          <article-title>Unconventional communicators in the covid-19 crisis</article-title>
          . (Forthcoming).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>