=Paper=
{{Paper
|id=Vol-3792/paper10
|storemode=property
|title= Problematic Cases of Attitude Annotation in Diplomatic Speeches
|pdfUrl=https://ceur-ws.org/Vol-3792/paper10.pdf
|volume=Vol-3792
|authors=Mariia Anisimova,Šárka Zikánová
|dblpUrl=https://dblp.org/rec/conf/itat/AnisimovaZ24
}}
== Problematic Cases of Attitude Annotation in Diplomatic Speeches
==
Problematic cases of attitude annotation in diplomatic
speeches
Mariia Anisimova1,* , Šárka Zikánová1
1
Charles University
Faculty of Mathematics and Physics
Institute of Formal and Applied Linguistics, 6 Malostranské náměstí 25, 118 00 Praha, Czech Republic
Abstract
This paper provides an overview of challenges that occurred during the creation of the annotation scenario for the attitude
annotation in diplomatic speeches of the UNSC. The scenario followed the attitude part of the Appraisal theory. The various
challenges in annotating the speeches such as the extent of arguments, identification of attitude in verbal forms, and complex
structures, were classified, and, in part, resolved. This paper would be useful for anyone considering this type of attitude
analysis when working with diplomatic and political texts.
Keywords
Appraisal theory, attitude analysis, manual annotation, diplomatic discourse, corpus linguistics
1. Introduction types of discourse. The prominent characteristics of these
texts are the understated tone [5] and indirectness. These
The research on annotating attitudes [1] and its various pragmatic features prove to be very important to how
categories has long been an ongoing process [2], [3]. As diplomats express opinions, which are most frequently
described before, annotating attitudes is a complex pro- not of their own but of the political body they represent.
cedure not only due to elaborated annotation schemes It is also because of them, that the diplomatic attitudes
but also due to the lack of definitive criteria for the iden- form a separate group of attitude-bearing expressions
tification and categorization of attitudes and the other and require a comprehensive approach in the process of
appraisal labels. annotation.
Another problematic side of this annotation type is In our previous publications, we have discussed the
that the researchers often omit the step of creating a notion of attitude in diplomatic discourse and described
formalized annotation scenario, which is why motivation our view on the most suitable annotation schemes for
and consistency of their choices are hard to follow [2]. its evaluation [6], explained the annotation process and
This, in turn, causes issues with testing the consistency environment, as well as the criteria for selecting the data
of the annotations, and test-retest reliability [4]. for our corpus of diplomatic speeches. We have then
This study aims to present a discussion on the issues provided the outcomes of the first batch of annotation
observed while creating the discourse-specific annota- [7], which was then utilized for redefining the annotation
tion scenario suited for annotating types of attitudes and scenario based on problematic and unclear cases of anno-
their subcategories as defined by [1]. We classify the tation. In the most recent addition to our project [8], we
issues and particularities observed during the creation have also presented the Corpus of Diplomatic Attitudes
of the scenario as well as the annotation process and (CoDipA 1.0), which includes a description of the selected
compare them to the issues observed by [2]. We then speeches from the United Nations Security Council anno-
offer our perspective on the choice of a possible solution tated with the attitude part of the Appraisal theory and
and discuss further experiments. expands on corpus and inter-annotator statistics, such as
Diplomatic speeches form a very particular and pe- our calculations and commentary on the Cohen’s kappa
culiar group of texts that are very different from other which varies between 0.44 and 0.32 depending on the
selected category.
ITAT (Information technologies – Applications and Theory), Workshop
The objectives of this research are to define and ex-
on Automata, Formal and Natural Languages 2024 (WAFNL 2024)
*
Corresponding author. emplify the challenging parts of the annotation scenario
$ anisimova@ufal.mff.cuni.cz (M. Anisimova); and offer solutions for the observed issues to make the
sarka.zikanova@mff.cuni.cz (Š. Zikánová) annotation process reliable and stable, resulting in a sat-
https://ufal.mff.cuni.cz/mariia-anisimova (M. Anisimova); isfactory inter-annotator agreement. The clarity and
https://ufal.mff.cuni.cz/sarka-zikanova (Š. Zikánová)
reproducibility of the annotation process would also help
0000-0002-2478-2815 (M. Anisimova); 0000-0002-7805-9649
(Š. Zikánová) further automatize the processing of attitudes.
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License
Attribution 4.0 International (CC BY 4.0).
The structure of the paper is outlined in three main
CEUR
Workshop
Proceedings
http://ceur-ws.org
ISSN 1613-0073
CEUR Workshop Proceedings (CEUR-WS.org)
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
sections, namely the brief description of the annotation more convenient to start with, while gradually adapting
scenario, the detailed description of the issues observed, a the evaluation criteria to the task.
brief outline of the future work, and the main conclusions.
Sections 5 and 6 correspond to the Limitations and Ethical 2.1. On identifying attitudes and the
considerations of the presented research.
scope of annotated fragment
The pilot annotation task completed to support the sce-
2. Description of the annotation nario creation has first shown a set of challenges, which
scenario vary in nature, and difficulty and have proven to be either
general to this type of text analysis [2], or particular to
The annotation scenario was developed to annotate the the chosen text type.
attitudes [1] in the diplomatic speeches of the United The first task that the annotator should be able to do is
Nations Security Council [9]. The scenario first offers a to find the attitudinal fragments. This task is not straight-
brief introduction to the Appraisal theory by first describ- forward and lacks direct instructions and explicit descrip-
ing its three main subsystems: attitude, engagement, and tion in the original Appraisal theory documentation [1].
graduation. The three subsystems refer to the descrip- This has been posing a general problem for researchers
tion of our feelings and emotional reactions, sourcing of working with various discourse types [2], in diplomatic
attitudes and the play of voices around opinions in dis- discourse this issue is also of critical importance.
course, and grading the phenomena [1]. It then focuses
on the detailed description of the subsystem of attitude,
which was selected to define emotion and subjectivity
2.2. Attitude identification in diplomatic
in diplomatic speeches. Finally, a practical how-to guide texts
for setting up the annotation environment in the doc- From our empirical experience, there can be several ap-
cano annotation tool [10] is offered along with a set of proaches to the process of identification of an attitude.
annotation labels and references. It is, first of all, possible to judge the presence or ab-
The subsystem of attitude according to the Appraisal sence of an attitude based on the token’s polarity, as if
theory provides a framework for the analysis of evalua- judging each token individually by deciding if it may
tive expressions by categorizing them as being an affect signify the presence of an attitudinal expression. Such
(an emotional reaction), a judgement (an expression of an approach, however, does not allow for capturing all
ethical evaluation), or an appreciation (an evaluation of of the available attitudinal meanings. Let’s take a look at
aesthetics) and defining their polarity and explicitness. Example 1. and analyze it by first highlighting all of the
Each category is then subdivided into a separate tree of positive and negative connotations available to estimate
choices making the system a complex and informative if the available sentiment in separate tokens correlates
structure. with the identification of expressions of attitude.
The dataset that is to be annotated, is also described Example 1: Even items subject to control would go to Iraq
in detail in the scenario. It consists of a corpus of 100 once there is [confidence: POS] that they would not be used
diplomatic speeches, selected according to a set of crite- to rebuild Iraq’s [[weapons: NEG] of [mass [destruction:
ria that allows for proportional representation of diver- NEG]: NEG]] or [[improve: POS] its military capabilities].
sity according to the topic of a speech, the year of the Such complex lexical structures may contain multiple
meeting, and the country the speaker represents. The layers of semantic connotations, which may lead to the
language of the data is English, and the speeches were annotator’s confusion due to the superposition of the la-
either originally presented in English, or were the offi- bels. The annotators may be drawn to find the attitudinal
cial UN translations. The information about the original meaning of every available token conveying attitudinal
language of the speech, as well as the speaker’s affilia- polarity which often leads to viewing an annotated frag-
tion and sex, the topic of the session, and its year are ment in isolation of the broader text meaning.
stored in the metadata of each text. The corpus consists Another approach to attitude identification is based
of 105592 tokens and 7296 types, and the average length on the subjective evaluation of a textual fragment under
of a sentence is 32 tokens. consideration based on the contextual boundaries of the
Our initial goal was for the guidelines to be of a qual- syntactic structure that bounds it, as well as the context
ity that would allow for the full disambiguation of the of the whole text (which in the case of our study is a
task, which means the annotators would select the same speech).
text span and assign completely identical labels to it in Let us take another look at the same sentence, where
terms of attitude type and subtype, sentiment polarity, the possible locations of attitude are defined by its sub-
and explicitness. We therefore apply the strictest ap- jective interpretation and context of the whole structure:
proach to annotations assessment at the time as it is
Even items subject to control would go to Iraq [once Let’s analyze an excerpt from one of the diplomatic
there is confidence that they would not be used to rebuild speeches provided in Example 4:
Iraq’s weapons of mass destruction or improve its military Example 4: Experts in our countries must immediately
capabilities: JUDGEMENT-INVOKED]. begin to analyse it and then draw the appropriate conclu-
Here, the speaker provides an implicit evaluation of the sions.
future possible behaviour of Iraq, expressing their hope Here, there is a definite presence of positive judgement
and inclination towards Iraq not rebuilding its weapons coded by the directive ’must’. experts must begin to anal-
of mass destruction. This scenario is presented as a sub- yse would be our first choice to annotate. The problem
jective evaluation of the possible future behaviour, which is, such a fragment, if non-explicit, does not convey any
means that the attitude in question should be perceived attitude on its own, therefore inclusion of the whole con-
as an implicit evaluation of the desired appropriate action text necessary for understanding of the chosen label is
of Iraq. advised: beginning of analysis is only good if it leads to
Even though this way of finding attitudes leads us drawing appropriate conclusions.
to a somewhat more vague definition of the borders of
the annotated expression, it also provides us with more 2.3.1. On annotating articles
contextual meaning as opposed to the cumulative polarity
of separate tokens. Another required decision is related to the annotation of
Let us exemplify this approach again with the Example determiners. As most of the annotated entities constitute
2: either an evaluative adjective or a collocation of an eval-
Example 2: [A number of these have been under discus- uative adjective and an evaluative noun, it is necessary
sion for some time in this Council: AFFECT-INVOKED], and to decide whether or not articles defining the evaluative
[if we were able to agree on this package, it would be an im- phrase should be included in the annotation or not.
portant step forward in that regard as well: JUDGEMENT- Our solution to this question is related to the overall
INVOKED]. viewpoint on the attitude framework and depends on t
In this example, the speaker implies their emotional If an attitude is expressed explicitly (inscribed tag)
state of unsatisfaction by the amount of time that an issue as in ’a capable management’, ’we are happy about the
is already under discussion while providing their implicit resolution’, ’the valuable lesson’, etc.) and the perceived
evaluation of the appropriate course of future actions. meaning of the adjective+noun collocation is enough to
Our preferred approach is to judge the span of the understand it in the full, annotation of any additional
annotated fragment by the context and the explicitness tokens should be omitted. Usually, it is enough to only
of the evaluation, which would lead to a clearer logic and mark one token or collocation that is used for expressing
accountability of the annotation process. an attitude. There is no need to include any additional
tokens in the annotated fragment, for example, if an at-
titude is found within a phrase [they have thought of a
2.3. On identifying the borders of wonderful solution], we would advise only marking the
inscribed and invoked attitudes evaluative adjective and omit to annotate the article that
Attitude examples observed in the corpus were proven precedes it and the noun that follows. We also advise
to make use of both explicit and implicit (inscribed and against article annotation when a noun that is in colloca-
invoked) ways of expressing an attitude. The diplomatic tion with the attitude-bearing adjective has no attitudinal
discourse is known for its indirectness [5] and a partic- meaning itself (the unfortunate circumstances).
ularly subtle expression of subjectivity. However, the However, if an attitude is a part of the superlative
inscribed and invoked attitudes are not the same regard- form (the best) or is expressed implicitly and the broader
ing the additional context needed for discerning the mes- context is necessary for understanding the expression,
sage’s meaning. any needed number of tokens within one syntactic clause
The inscribed attitudes only require the explicitly sub- could be considered (as in the veto today will not prevent
jective tokens to be annotated, as shown in Example 3: that).
Example 3:France [fully supports: AFFECT-INSCRIBED]
the search for a political solution. 2.4. Overlapping attitudes
This decision would lead to a sufficient increase in the
Another situation that may cause confusion and subse-
expected average length of the annotated fragment for
quent mistakes in annotation includes the cases, where
the invoked annotated attitudes in comparison to the
the two annotated fragments overlap. An overlap hap-
inscribed attitudes.
pens if one attitude occurs within the scope of the an-
Our main solution for annotating invoked attitudes is
notated fragment of another attitude. This may happen
to capture as much meaning as possible by annotating
as few tokens as needed.
in many ways, however, if there are only two attitudi- has led to a parallel annotation experiment as well as
nal elements, the overlap may look one of the following to the creation of a background for the newly-published
ways: corpus of evaluation in diplomatic speeches of the UNSC
1) The first option is for the second label to overlap (CoDipA 1.0).
with the first one as in (a(b)); We are specifically interested in investigating the atti-
2) The second option is for the first element to overlap tude development processes throughout the selected time
with the second one as in ((a)b); frame and in comparing the findings on an inter-conflict
3) The third option is the inclusion of an element (b) scale. Another expected development is the practical
in the middle of the first element (a) following a scheme application of the acquired data on a fine-tuned large lan-
(a(b)a) as in Example 5: guage model (such as Bert or GPT 4) to establish whether
Example 5: There is no doubt that we all want there is a potential for expanding the analysis to a bigger
[Affect-inclination] to resolve this problem [Judgement- scale.
tenacity]. The results of both quantitative and qualitative eval-
Here, the whole sentence is an invoked judgement with uation of the annotation outcomes are expected to be
an inclusion of an inscribed affect (in bold) corresponding published as well.
to the expression of the speaker’s desire.
To technically deliver this solution it is necessary to
allow for overlapping annotation in doccano [10] when 4. Conclusions
creating a project.
In this research paper, we have outlined some of the
challenging parts of annotating attitudes according to the
2.5. Interrupted attitudes Appraisal theory scheme [1] in the diplomatic speeches
The issue of interrupted annotation refers to a situation, and provided our viewpoint on their resolution, as well
where a span of attitudinal text includes a sequence of as technical comments on how to implement this type of
non-attitudinal tokens. Annotating unnecessary tokens annotation in practice.
may lead to a decrease in the quality and reliability of the The first problem that an annotator would encounter
future corpus. As per our observations, such an annota- if they would seek attitude in diplomatic speeches, would
tion scenario occurs solely with the explicitly formulated be attitude identification and deciding on the textual
attitudes, therefore deciding on this type of label is ad- borders of an attitudinal fragment. We have outlined
vised as a first step. these processes and provided examples from our corpus.
Here is an example of an interrupted attitude (Exam- In the next sections, we have offered an overview of the
ple 6), an excerpt from another diplomatic speech. An technically challenging parts of the attitude annotation
invoked judgment of the previous lack of actions of the process, such as overlapping and interrupted types of
Security Council is interrupted by a referral to one of attitudes, as well as their solution.
the resolutions, which does not add any additional atti-
tudinal meaning to the annotated fragment and should Limitations
therefore be excluded from it.
Example 6: Unfortunately, in the past hundred days, the This paper presents an up-to-date analysis of the ongo-
very [limited suspension of the sanctions: JUDGEMENT- ing research based on the annotation project, including
INVOKED-PART1] established by the Security Council res- the development of a discourse-specific annotation sce-
olution 943 (1994) [has also not been entirely fulfilled: nario. The findings from the presented annotation sce-
JUDGEMENT-INVOKED-PART2]. nario should not yet be considered as final, they may
Our solution to resolving it is purely technical and be updated in the final version of the scenario before its
includes the creation of an additional label [none] that is publication.
to be assigned to the tokens excluded from the annotated
fragment.
Ethics Statement
3. Related ongoing and future We honour the ethical code set out in the ACL Code of
Ethics and there are no special ethical issues involved
work during the creation of this research paper.
The acquired annotation scenario has helped us further
define our approach to the category of attitude as it is
understood by the Appraisal theory [1] and adapt it to
the needs of the diplomatic discourse analysis, which
5. Acknowledgements [10] H. Nakayama, T. Kubo, J. Kamura, Y. Taniguchi,
X. Liang, doccano: Text annotation tool
The research reported in this paper was supported by 207- for human, 2018. URL: https://github.com/
01/207PROV and the Czech Science Foundation (project doccano/doccano, software available from
no. 24-11132S, Disagreement in Corpus Annotation and https://github.com/doccano/doccano.
Variation in Human Understanding of Text), and partially
supported by SVV project number 260 698; a part of the
used data comes from the project no. LM2018101 by the
Czech Ministry of Education, Youth and Sports (Digital
Research Infrastructure for Language Technologies, Arts
and Humanities).
References
[1] J. R. Martin, P. White, The Language of Evaluation:
Appraisal in English, Springer, 2005. doi:10.1057/
9780230511910.
[2] M. Fuoli, A step-wise method for annotating ap-
praisal, Functions of Language 25 (2018) 229 – 258.
doi:10.1075/fol.15016.fuo.
[3] M. O’Donnell, Exploring identity through appraisal
analysis: A corpus annotation methodology, Lin-
guistics and the Human Sciences 9 (2013) 95–116.
doi:10.1558/lhs.v9i1.95.
[4] P. C. Cozby, Methods in behavioral research,
McGraw-Hill, New York, 2012.
[5] N. Stanko, Use of language in diplomacy, in: H. S.
J. Kurbalija (Ed.), Language and Diplomacy, Diplo-
Projects, Mediterranean Academy of Diplomatic
Studies, University of Malta, Msida MSD 06, Malta,
2001, pp. 39–48.
[6] M. Anisimova, An introductory overview of
evaluating facts and attitudes in diplomatic dis-
course, in: 2nd Workshop on Automata, For-
mal and Natural Languages – WAFNL 2021
Open Session Proceedings, Faculty of Mathe-
matics and Physics, Charles University, Prague,
2021, pp. 1–4. URL: https://itat.ics.upjs.sk/public/
WAFNL2021OpenSessionProc.pdf.
[7] M. Anisimova, S. Zikánová, Attitude in diplomatic
speeches: a pilot study, 2022, pp. 151–158. URL:
https://ceur-ws.org/Vol-3226/paper17.pdf.
[8] M. Anisimova, Š. Zikánová, Attitudes in diplo-
matic speeches: Introducing the CoDipA UNSC
1.0, in: H. Bunt, N. Ide, K. Lee, V. Petukhova,
J. Pustejovsky, L. Romary (Eds.), Proceedings of
the 20th Joint ACL - ISO Workshop on Interopera-
ble Semantic Annotation @ LREC-COLING 2024,
ELRA and ICCL, Torino, Italia, 2024, pp. 17–26. URL:
https://aclanthology.org/2024.isa-1.3.
[9] M. Schoenfeld, S. Eckhard, R. Patz, H. v. Meegden-
burg, A. Pires, The UN Security Council Debates,
2019. URL: https://doi.org/10.7910/DVN/KGVSYH.
doi:10.7910/DVN/KGVSYH.