<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ATAnalysis - Toward a psycholinguistic method to analyze video textual information</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Helder Yukio Okuno</string-name>
          <email>helder.okuno@eic.cefet-rj.br</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fla´vio Carvalho</string-name>
          <email>flavio.carvalho@eic.cefet-rj.br</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gustavo Paiva Guedes</string-name>
          <email>gustavo.guedes@cefet-rj.br</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marcelle Torres</string-name>
          <email>torres.m.a@hotmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>RJ - Brazil</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>73</fpage>
      <lpage>79</lpage>
      <abstract>
        <p>Political statements of world leaders may affect many lives, so it is important to study what they express through language. We propose a method to do psycholinguistic analysis of statements extracted from videos. To show the relevance and some interesting information, we conducted some experiments in video subtitles of world leaders Donald Trump and Kim Jong-un amid imminent agreement that could lead to peace in the Korean peninsula. Results suggest less security in statements of the North Korean leader while threatening to unleash an “unimaginable strike” at the US territory. Moreover, the US president shows less honesty by saying he hopes never to use the nuclear arsenal. This approach may be useful in future studies to reveal what the language used by candidates can show.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Social networks applications allow users to share ideas and inform about events, while
the platform that provides the service stores sentiment expressions in many formats (e.g.,
written and video records). For example, YouTube1 has established itself as a social
network providing user-generated content for entertainment and information [
        <xref ref-type="bibr" rid="ref10">Moghavvemi
et al., 2018</xref>
        ]. YouTube uses content popularity (“thumbs-up” button), comments and
number of subscribers to observe social aspects [
        <xref ref-type="bibr" rid="ref6">de Arantes et al., 2015</xref>
        ]. Offering users the
ability to upload, view, rate, share, and comment on videos, it became an important
social media that also enables people to engage more directly with political issues [
        <xref ref-type="bibr" rid="ref8">Howard
et al., 2011</xref>
        ].
      </p>
      <p>
        Social networks are also being used to propagate positioning and statements of
world leaders. With the attention, credibility and other resources that they have at their
disposal, what they say matters because it can show how they intend to set the tone of
their administration [
        <xref ref-type="bibr" rid="ref5">Cohen, 1995</xref>
        ]. Thereby, it is relevant to watch what has been said
by protagonists of one of the most drastic war of words and rhetoric of attacks in the
international scene.
      </p>
      <p>Donald Trump (DJT) and Kim Jong-un (KJU), after more than a year of threats
and insults, agreed to meet in a summit to discuss the denuclearization of the Korean
peninsula. It can be the first presidential direct talks between the United States and North
Korea since the end of Korean War in 1953, and the beginning for a path toward the peace
in Northeast Asia. The world leaders face larger and more decisive international crisis,
not only for their own nations but also for the future of the humanity.</p>
      <p>
        In this work, we propose a method for the psycholinguistic analysis of oral
discourses extracted from videos. Some global leaders, such as the KJU, do not use written
language on social networks. However, there are speeches recorded in video format that
can be found on YouTube. In this scenario, we conducted experiments using YouTube
automatic subtitles extraction to collect DJT and KJU videos subtitles. Next, we
submitted these subtitles in the 2015 version of the Linguistic Inquiry Word Count (LIWC2015)
[
        <xref ref-type="bibr" rid="ref11">Pennebaker et al., 2015</xref>
        ] program for text analysis. The results show higher authenticity
and confidence in DJT speeches than in KJU speeches. On the other hand, KJU speeches
shows higher analytical thinking. All the values related to analytical thinking were very
high in KJU speeches.
      </p>
      <p>This work is structured so that after this introductory section, we present in
section 2 works related to how computational linguistics analysis can be applied to obtain
information about world leaders. We then explain some aspects of LIWC2015, an
available tool for computational linguistics analysis in text, in section 3. In section 4, we
describe how we obtained the data. The results are presented in section 5. The conclusion
and future work are shown in section 6.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Recently, studies investigating language analysis of DJT’s statements as a political
figure can be found in the literature [
        <xref ref-type="bibr" rid="ref1">Ahmadian et al., 2017</xref>
        ,
        <xref ref-type="bibr" rid="ref15">Savoy, 2017</xref>
        ]. By adopting
LIWC2015,
        <xref ref-type="bibr" rid="ref9">Jordan and Pennebaker [2017</xref>
        ] intended to determine how US candidates
differed across linguistic style categories analyzing the language of political figures from
DJT back to George Washington. They found that, compared to other politicians, a very
low score in analytic thinking for DJT was indicated in the analysis of documents in text
format. It was possible to notice also that presidents and presidential candidates have been
becoming less analytic [
        <xref ref-type="bibr" rid="ref9">Jordan and Pennebaker, 2017</xref>
        ].
      </p>
      <p>
        Regarding psycholinguistic behavior in Korean leaders’ statements, it is fitting to
bring a study using the Korean Key Words in Context (KrKwic2) and Korean Linguistic
Inquiry and Word Count (K-LIWC) [
        <xref ref-type="bibr" rid="ref4">Chung and Park, 2010</xref>
        ]. It tested the literary
characteristics of inaugural addresses from linguistic, quantitative and psychological
perspectives of two presidential inaugural addresses, Moo-Hyun Roh (2003–2008) and
MyungBak Lee (2008–2013). It had, according to the authors, significance as an investigative
tool to support discovering the relationship between rhetorical substance and style, and
also the characteristics of presidents’ political and social viewpoints.
      </p>
      <p>Our approach in this work is different from those previous publications as it uses
exclusively speech and statements from content available in video format on YouTube.
The process involves also, in KJU case, the automatic translation before submitting it to
LIWC2015. Another difference is that it compares the speeches of two distinct current
world leaders which have been trading threats during the last times, and recently DJT has
publicly agreed to meet KJU.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Computational Text Analysis</title>
      <p>
        Content analysis is a methodology that uses categorization and classification of
communication such as speech, written text, pictures, audio or video [
        <xref ref-type="bibr" rid="ref3">Bryman and Bell, 2015</xref>
        ].
All around the world, an increasing number of people is using communications and
social network applications, uninterruptedly generating text, images, audio and video files
by recording a lot of events, feelings and emotions. As a result, researchers are using
computer-assisted methods, like text analysis, in social sciences studies to address many
issues. Not only that, but also official documents, technical reports, theatre play scripts,
lesson plans, books and a variety of texts are of social scientific interest [
        <xref ref-type="bibr" rid="ref2">Brier and Hopp,
2011</xref>
        ].
      </p>
      <p>LIWC2015 is an available tool for computerized text analysis. It consists of the
main program, which has a text analysis module with a user interface, and the dictionary
file. This dictionary is used to associate words from any given text under the categories
of important linguistic, psychological, and social processes.</p>
      <p>
        In addition to the dictionary developed at 2001, two more were produced for the
English language: one in 2007, containing 4, 500 words; the other in 2015, with a total
of 6, 400 words. The 2015 version also brings summary variables, which are percentiles
based on other categories, rather than raw frequencies. The summary categories refer to
aspects like analytical thinking (Analytic), clout, authenticity (Authentic) and emotional
tone (Tone) [
        <xref ref-type="bibr" rid="ref11">Pennebaker et al., 2015</xref>
        ].
      </p>
      <p>Analytic is drawn through the high use of nouns, articles, and prepositions,
indicating how people use words that suggest thought patterns that are formal, logical and
hierarchical. Clout reflects the confidence, which is indicated by higher use of first
person plural pronouns and words related to social processes, like the ones in the categories
“family” and “friend”. On the other hand, it is related to a lower use of first person
singular pronouns, negations and swear words. Authentic is related to when people reveal
themselves in an authentic or honest way, being more personal, humble and vulnerable
(i.e., words with I, me, my and present tense verbs). Emotional tone uses values from
categories of positive and negative emotions in a single summary variable.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Materials and procedures used to obtain psycholinguistic behavior in statements</title>
      <p>For this study, we used speeches and statements of DJT and KJU from YouTube videos.
The illustration in Figure 1 shows the process used in this study. It starts with the
transcription of videos containing DJT and KJU speeches using a free web application
(DownSub3) that can download and save subtitles directly from YouTube.</p>
      <p>
        After the extraction of subtitles, the Korean text was submitted to a
computerassisted translation from Korean to English using a free online tool named NAVER4. This
was chosen to use a sole method of computerized text analysis at the end, since translation
to English does not significantly interfere with the accuracy or comprehensiveness of
methods for text analysis [
        <xref ref-type="bibr" rid="ref13">Reis et al., 2015</xref>
        ]. In Table 1, we present basic information
about the text content submitted into LIWC2015.
      </p>
      <p>3Available at downsub.com
4Available at translate.naver.com; a free online language translation service.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>Figure 2 illustrates analytic values as the size of the plot marker, against values
for authentic category in the X axis and clout in the Y axis. In the discourse style of DJT,
lower values were observed in the variables reflecting analytical thinking, where higher
values of authenticity and confidence were associated. In the speeches of KJU, on the
other hand, all the values related to analytical thinking were very high.</p>
      <p>Analyzing the results, during the New Year’s speech (KJU1), which provides the
country’s guideline for this year, KJU had the lowest Clout revealing his uncertainties
regarding the goals of the country in 2018. Speaking with less confidence compared to
other speeches, KJU expressed that the entire United States territory is under the range
of North Korean nuclear missiles emphasizing a secured powerful deterrence against the
nuclear threat from the United States.</p>
      <p>The most recent speech of the North Korean leader (KJU4), just on the eve of
the Winter Olympic Games also known as Olympics for Peace, showed higher values in
Tone and Clout representing his most confident and positive declaration. In that one, KJU
declared North Korea as a global military power and asked for the military permanent high
alert to ensure the sovereignty and national security against any foreign threat, revealing
his behavior even in an apparent moment of progress in reducing military tensions in the
region. Even with the pause in North Korean nuclear launches and the recent opening
for diplomatic negotiations on peace and security in Northeast Asia, the North Korean
strategy and alert remain the same.</p>
      <p>
        Also, comparing the speeches of both leaders, it is possible to observe that KJU
maintains a more analytical style while DJT reveals to be more intuitive suggesting that he
is more impulsive in making decisions. For example, in December 2017, the US Senate
Foreign Relations Committee examined the DJT allusion to the use of military force or a
preventive strike against North Korea, when DJT said North Korean threats would be met
with “fire and fury like the world has never seen”. Some US Senators took attention for
the words and personality of the US president raising the discussion on the importance of
the US Congress to impose limits in DJT sole decisions that could harm the US security
interests [
        <xref ref-type="bibr" rid="ref12">Reif, 2017</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>We conducted experiments in which we extracted subtitles and performed
psycholinguistic analysis from text of an important world leader that does not use social networks, like
KJU. This demonstrated the relevance of the approach, and some interesting information
could be integrated examining dimensions of each speech. Studies from the international
relations fields could benefit from the identification of text dimensions like thinking style,
clout, and authenticity to see how these expressions in words by leaders may suggest an
ulterior political aggression.</p>
      <p>
        International relations scholars agree that states opt for developing nuclear
weapons when facing security threats and do not find other solutions [
        <xref ref-type="bibr" rid="ref14">Sagan, 1997</xref>
        ]. This
can essentially explain part of one of the main North Korean policy, the byungjin line,
which results in nuclear and economic development [
        <xref ref-type="bibr" rid="ref7">Han and Joo, 2014</xref>
        ]. However, they
have the will to remain non-nuclear states once potential threats are solved [
        <xref ref-type="bibr" rid="ref14">Sagan, 1997</xref>
        ].
      </p>
      <p>Given the recent military tensions in Northeast Asia, the work sought to use
LIWC2015 to identify the psycholinguistic behavior of world leaders DJT and KJU by
analyzing their recent speeches. The results of LIWC2015 suggest that KJU is less secure
in discourses in which he threatens nuclearly the entire US territory. On the other hand,
he reveals more confidence and sense of certainty in describing his country as a global
military power and by demanding continued readiness from its military, even in a
moment that the international community observes as conducive to advances in diplomatic
negotiations for the denuclearization of the Korean peninsula.</p>
      <p>In relation to DJT, results show his narrative style as more intuitive, revealing
impulsiveness in his decision making. Also, during the State of the Union speech, a high
value for clout was observed (i.e., speaking confidently with a sense of certainty), when
he emphasized the need to increase the national defense sector spendings to allow
modernization and reconstruction of the US nuclear arsenal. A lower value for authenticity,
which may suggest a lesser degree of honesty, was observed when DJT stated his hopes
of never needing to use the nuclear arsenal.</p>
      <p>As future studies, we intend to analyze verbal expression of candidates in
Brazilian presidential election debates, looking at how these candidates compare on three
dimensions: thinking style, clout, and authenticity. We also intend to analyze tone of voice
and non-verbal expressions in videos as a complementary approach. The results can be
used to compare the emotions expressed verbally to the ones that appear in expressions
and gestures.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The authors thank FAPERJ for partial funding support.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Ahmadian</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Azarshahi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Paulhus</surname>
            ,
            <given-names>D. L.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Explaining Donald Trump via communication style: Grandiosity, informality, and dynamism</article-title>
          .
          <source>Personality and Individual Differences</source>
          ,
          <volume>107</volume>
          :
          <fpage>49</fpage>
          -
          <lpage>53</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Brier</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Hopp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Computer assisted text analysis in the social sciences</article-title>
          .
          <source>Quality &amp; Quantity</source>
          ,
          <volume>45</volume>
          (
          <issue>1</issue>
          ):
          <fpage>103</fpage>
          -
          <lpage>128</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Bryman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Bell</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Business research methods</article-title>
          . Oxford University Press, USA.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Chung</surname>
            ,
            <given-names>C. J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>H. W.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Textual analysis of a political message: the inaugural addresses of two Korean presidents</article-title>
          .
          <source>Social science information</source>
          ,
          <volume>49</volume>
          (
          <issue>2</issue>
          ):
          <fpage>215</fpage>
          -
          <lpage>239</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>J. E.</given-names>
          </string-name>
          (
          <year>1995</year>
          ).
          <article-title>Presidential rhetoric and the public agenda</article-title>
          .
          <source>American Journal of Political Science</source>
          , pages
          <fpage>87</fpage>
          -
          <lpage>107</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>de Arantes</surname>
          </string-name>
          , M. V. S.,
          <string-name>
            <surname>de Figueiredo</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Uma caracterizac¸a˜o dos padro˜es de navegac¸a˜o de usua´rios em uma aplicac¸a˜o social de streaming de v´ıdeo</article-title>
          .
          <source>In IV Brazilian Workshop on Social Network Analysis and Mining (BraSNAM</source>
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>T. S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Joo</surname>
            ,
            <given-names>J. K.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Can North Korea catch two rabbits at once: Nuke and economy? one year of the Byungjin line in North Korea and its future</article-title>
          .
          <source>The Korean Journal of Defense Analysis</source>
          ,
          <volume>26</volume>
          (
          <issue>2</issue>
          ):
          <fpage>134</fpage>
          -
          <lpage>136</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Howard</surname>
            ,
            <given-names>P. N.</given-names>
          </string-name>
          et al. (
          <year>2011</year>
          ).
          <article-title>The Arab Spring's cascading effects</article-title>
          .
          <source>Pacific Standard</source>
          ,
          <volume>23</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>K. N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>The exception or the rule: Using words to assess analytic thinking, Donald Trump, and the American presidency</article-title>
          .
          <source>Translational Issues in Psychological Science</source>
          ,
          <volume>3</volume>
          (
          <issue>3</issue>
          ):
          <fpage>312</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Moghavvemi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sulaiman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaafar</surname>
            ,
            <given-names>N. I.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Kasem</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Social media as a complementary learning tool for teaching and learning:</article-title>
          <source>The case of YouTube. The International Journal of Management Education</source>
          ,
          <volume>16</volume>
          (
          <issue>1</issue>
          ):
          <fpage>37</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Pennebaker</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyd</surname>
            ,
            <given-names>R. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Blackburn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>The development and psychometric properties of LIWC2015</article-title>
          .
          <source>Technical report.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Reif</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Senate examines launch authority</article-title>
          .
          <source>Arms Control Today</source>
          ,
          <volume>47</volume>
          (
          <issue>10</issue>
          ):
          <fpage>30</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Reis</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          , Gonc¸alves,
          <string-name>
            <surname>P.</surname>
          </string-name>
          , Arau´jo,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. C.</given-names>
            , and
            <surname>Benevenuto</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Uma abordagem multilıngue para ana´lise de sentimentos</article-title>
          .
          <source>In IV Brazilian Workshop on Social Network Analysis and Mining (BraSNAM</source>
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Sagan</surname>
            ,
            <given-names>S. D.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Why do states build nuclear weapons? Three models in search of a bomb</article-title>
          .
          <source>International security</source>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          ):
          <fpage>54</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Savoy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Trump's and Clinton's style and rhetoric during the 2016 Presidential Election</article-title>
          .
          <source>Journal of Quantitative Linguistics</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>