<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The predictive power of the sentiment of financial reports</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jan-Hendrik Meier</string-name>
          <email>jan-hendrik.meier@fh-kiel.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Walid Esmatyar</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rasmus Frost</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kiel University of Applied Sciences</institution>
          ,
          <addr-line>Sokratesplatz 2, 24149 Kiel</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The present study examines the predictive power of the tone or sentiment of 10-K annual and 10-Q quarterly financial statements for future corporate development. The sentiment indicator was calculated using word lists developed for financial texts by Loughran and McDonalds [23] and Henry [14] and applying a conventional and a tf-idf weighted word count. The results show that the sentiment indicator is of significant incremental prognostic quality both for the next quarter and the quarter following it. Unlike suggested by previous literature, neither the scope and content of the word lists nor the weighting method applied had a significant influence on forecasting quality.</p>
      </abstract>
      <kwd-group>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Sentiment</kwd>
        <kwd>Text Mining</kwd>
        <kwd>Text Analysis</kwd>
        <kwd>Prediction</kwd>
        <kwd>Tone</kwd>
        <kwd>tf-idf</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Both professional financial analysts and interested private investors have been able to
develop their skills in analyzing financial statements in recent years. Sophisticated
computer-aided analysis tools help them to process increasing amounts of data
efficiently and free from bias to create decision-relevant information. Of interest in this
context is that the technical solutions almost exclusively focus on quantitative
financial data and figures, while qualitative financial data, mainly texts, are hardly
considered in the analyses. The reason is that figures are considered to be more reliable and
less prone to manipulation than textual data because of their better verifiability; in
contrast, texts offer more room for interpretation.</p>
      <p>Numerous studies have shown that the texts of annual financial statements contain
important information that cannot be obtained directly from key figures. While
complex verbal statements – despite all advances in the field of natural language
processing – cannot yet be adequately interpreted and evaluated by applying a software,
implicit information in the tone of printed texts can be detected and evaluated by
means of sentiment analysis. The present analysis deals with the question of whether
a positive or negative tone in annual and quarterly financial statements can be used to
forecast future earnings figures, thus allowing insight into a company’s future
development. In order answer this question, a sentiment analysis based on a total of 19,390
annual and quarterly financial statements of the companies listed in the Standard &amp;
Poor's 500 Index (S&amp;P 500) for the years 2005 to 2015 was conducted. The results
show that the sentiment indicators for annual and quarterly financial statements
provide a highly significant incremental contribution to predicting the return on assets
(ROA) for the next two quarters. The present study provides empirical evidence on
the basis of a very broad random sample and confirms its findings by using two
established text-mining methods and comparing their results. The study also assesses the
extent to which the scope of the word lists used and the weighting method influence
forecasting quality.</p>
      <p>Section 2 of this study presents a literature review and provides an overview of the
current state of research. Section 3 describes the methodology used and describes the
sample. Section 4 presents and discusses the results. This study concludes with a
summary.
2</p>
      <p>
        Literature Review and Current State of Research
Two different methods dominate the field of sentiment analysis in empirical
accounting and capital market research. Early research used test subjects or the researchers’
own subjective assessment to categorize the tone of company publications as positive
or negative [
        <xref ref-type="bibr" rid="ref11 ref16 ref19">11, 16, 19</xref>
        ]. Core [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] points out that the work load necessary for this
manual approach is too high to evaluate sufficiently large samples, and notes that a
computer-assisted evaluation of the texts can help to reduce the workload and to
increase the analyses’ accuracy and objectivity. Regarding computer-assisted
evaluation, two approaches can be identified in the literature, a fully-automated and a
semiautomated approach [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The statistical approach uses machine learning and is
applied among others by Li [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], who uses information from management discussion
and analysis sections (MD&amp;A) in 10-K annual and 10-Q quarterly reports. The author
evaluates 30,000 randomly selected sentences and subjectively determines if their
tone is positive, negative, neutral or uncertain. The words of this training data set are
then used to train a naive Bayesian algorithm to determine the largest statistical
correlation of the remaining texts with one of the categories. However, research using text
analysis prefers a second method, which employs predefined word lists which
determine for individual words whether they imply a positive or negative tone. In this
method, the number of positive and / or negative words in each text is counted, and it
is assumed that the order of the words is irrelevant for the tone [
        <xref ref-type="bibr" rid="ref22 ref24">22, 24</xref>
        ].
      </p>
      <p>
        Four different word lists have been established in accounting and capital market
research by the following authors: Osgood [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], Hart [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], Henry [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], and Loughran
and McDonald [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. While word lists by Loughran/McDonald and Henry were
developed specifically for financial texts, the word lists by Osgood and Hart were
originally designed for research in psychology and, respectively, in political communication.
      </p>
      <p>
        Empirical accounting and capital market research primarily analyzes the tone of
10-K annual financial statements, 10-Q quarterly statements, company press releases,
articles in print media, articles on Internet platforms, and other company publications
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The sentiment of these publications is used to predict the development of
companies, that is, either the price development of their securities or the development of
assets, earnings, and financial ratios.
      </p>
      <p>
        Tetlock [33] based his analysis on the word list by Osgood and showed that a
pessimistic tone in press releases results in a negative investor reaction and thus a
decrease in stock returns. The press releases were taken from the Wall Street Journal
column "Abreast of the Market". A number of other studies have also confirmed
Tetlock's results with respect to short-term market responses to tone [
        <xref ref-type="bibr" rid="ref1 ref26 ref3 ref8 ref9">1, 3, 8, 9, 26</xref>
        ]. In
addition, both Solomon [32] and Huang, Teoh and Zhang [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] conclude that this
effect is reversed over time: an initial reaction normalizes over time and is therefore to
be considered as a market overreaction. Garcia [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] points out that a negative tone in
newspaper articles results in a much stronger stock-market reaction when the
economy is in recession.
      </p>
      <p>
        The two profit figures regularly forecasted in empirical studies are the return on
assets and the unexpected earnings. The latter figure is calculated as the difference
between earnings per share and the average of earnings per share expected by analysts.
Tetlock, Saar-Tsechansky and Macskassy [34] come to the conclusion that a negative
tone in press releases causes negative earnings in the following quarters. Other studies
arrive at similar results based on the tone in 10-K annual financial statements or of
social media posts [
        <xref ref-type="bibr" rid="ref3 ref9">3, 9</xref>
        ]. Loughran and McDonald [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] on the other hand find that a
negative tone in 10-K financial statements follows a positive development of
unexpected earnings. A distortion of the results due to the applied word list can be ruled
out since both results were obtained using the Osgood as well as the Loughran and
McDonald word list. Thus, the studies differ mainly due to the analyzed text types.
Thus, Loughran and McDonald [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] conclude with reference to the results of Tetlock,
Saar-Tsechansky and Macskassy [34] that due to the independency of journalists the
tone of newspaper articles is a better indicator of future profits, while managers of
companies can use 10-K annual financial statements to pursue impression or
expectation management.
      </p>
      <p>
        Davis, Piger and Sedor [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] show based on Hart's word list that the tone of corporate
earnings announcements in the press is a good predictor of the company's return on
assets for the next four quarters. Furthermore, they present evidence that the stock
market reacts directly to the sentiment of these communications. The authors argue
that managers can report their own earnings expectations more subtly by tone choice
than by simply presenting figures. Using tone, managers provide the capital market
with important signals.
      </p>
      <p>
        Davis and Tama-Sweet [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] investigate 10-K and 10-Q quarterly statements in
addition to press releases and confirm Davis, Piger and Sedor's results based on their
expanded sample. In addition, the authors find that managers’ tone choice is with
significant frequency more optimistic and less pessimistic in earnings announcements in the
press than in the MD&amp;A sections of financial statements. The authors conclude that
managers use the freedom afforded to them by press releases to strategically influence
the stock market.
      </p>
      <p>
        In their analysis, Ferris, Hao and Liao [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] evaluate securities prospectuses of
companies in the technology sector that are about to go public. They show a forecast
of the return on assets for a period of three years into the future to be possible with
this information. Huang, Teoh and Zhang present opposite findings for the predictive
capacity of press sentiment for the return on assets. According to their results, the
return on assets shows a negative sign for three years if the press sentiment was
previously positive. Davis, Piger and Sedor [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Davis and Tama-Sweet [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] use the
Hart word list in their investigations, whereas Huang, Teoh and Zhang [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] apply the
other three word lists.
      </p>
      <p>Against the background of the inconsistency in the results of previous research, the
present analysis aims at investigating the predictive power of the sentiment in 10-K
and 10-Q financial statements for the return on assets on the basis of a representative
sample. For this purpose, two different word lists were used to assure that the results
are not influenced by word-list choice. The lists by Loughran/McDonald and Henry
were applied since they were designed for the analysis of financial texts.
3</p>
    </sec>
    <sec id="sec-2">
      <title>Sample and Method</title>
      <p>In order to analyze the predictivity of the sentiment in financial reports, a sentiment
analysis based on English-language 10-K annually and 10-Q quarterly US financial
statements was conducted and the predictivity of the sentiment on the company's
performance in terms of return on assets (ROA) was examined. The analysis was based
on the 10-K and 10-Q filings of all companies listed in the Standard &amp; Poor's 500
Index (S&amp;P 500) as of March 31, 2016, i.e., the 500 largest listed companies in the
United States. For these companies, data from 2005 to 2015 were retrieved. The data
were obtained from the Securities and Exchange Commission's (SEC) Electronic File
Gathering, Analysis and Retrieval System (EDGAR). The initial sample contained a
total of 20,435 annually and quarterly financial statements. The size of the sample
was reduced by 1,045 to 19,390 documents as not all relevant financial data
pertaining to an annual report were available, which was partly due to incomplete financial
years or quarters. Associated financial data were retrieved from the Thomson Reuters
Eikon database and randomly tested using annual report data.</p>
      <p>
        The preparation of the texts for processing was based on the so-called parsing
procedure by Loughran and McDonald [
        <xref ref-type="bibr" rid="ref27 ref28">27, 28</xref>
        ]. Since the documents were partly HTML
documents, HTML tags were removed [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Likewise, tables and figures were
removed with the exception of text tables.1 Furthermore, the texts were cleared of
frequently recurring words and words not related to the content (stopwords) [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. In a
final step, numbers, special characters, and single letters were removed.
      </p>
      <p>
        As mentioned above, this study used the word list by Loughran and McDonald and
– to countercheck the results – the much shorter list by Henry. The Loughran and
McDonald word list contains a total of six word categories for positive, negative,
uncertain, litigious, strongly modal, and weakly modal expressions [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. This study
uses only the 355 positive and 2,355 negative words of this list. The word list by
Henry contains a total of 105 positive and 85 negative words [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Loughran and
McDon1 As in Loughran and McDonald [
        <xref ref-type="bibr" rid="ref27 ref28">27, 28</xref>
        ], tables embedded in HTML Code are analyzed to
ascertain if the proportion of numerals contained in them is greater than 15% of the total of
all numerals and letters. If this is the case, the entire table, including its content, is deleted
since it is assumed that it only presents financial-reporting figures and, therefore, is
irrelevant for textual analysis.
ald [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] argue that managers have far more ways of implying positive or negative
sentiment than can be captured by Henry's word list. In particular, Henry's list lacks
entries such as loss, losses, adverse and impairment, all of which are considered
negative in the corporate context.
      </p>
      <p>
        The relevance of a positive or negative word for a certain text can be determined
unweighted, that is, by ordinary counting of the words, or by so-called term frequency
– inverse document frequency (tf-idf) weighting. The latter attributes more influence
to a positive or negative word if it occurs frequently in the document (term frequency,
tf), but lessens its influence if the word occurs in many documents and, as a result,
appears "ordinary" (inverse document frequency, idf). The tf-idf weighting logic used
in this study follows Chrisholm and Kolda [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The word frequency tfw,d is calculated
as follows [
        <xref ref-type="bibr" rid="ref22 ref24 ref4">4, 22, 24</xref>
        ]:
, =
      </p>
      <p>,
( )
=
Here, tf´w,d reflects the frequency of word w in document d, while ad represents the
average word count in document d. The inverse document frequency idfw is calculated
based on the following equation:</p>
      <p>represents the total number of documents and dfw the number of documents in
which the word w occurs at least once. The tf-idf weight results from the
multiplication of both terms, tfw,d∙idfw. The resulting weight measure is then used as the count
for the respective word, while in the unweighted approach each occurrence of a word
is counted equally.</p>
      <p>
        The tf-idf weighting has been criticized by Henry and Leone [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] because its
results are contingent upon the totality of documents used in a sample, and, thus, the
result for a particular document can vary significantly depending on the other
documents included in the sample. The authors showed that the weighting does not lead to
improved results, thus refuting the findings of Loughran and McDonald [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], who
observed a significant improvement in the results if tf-idf weighting was applied. Due
to contradictory results, this study applied both methods, the unweighted count and
the tf-idf weighting, in parallel and tests them against each other.
      </p>
      <p>
        Since for every company i only one 10-K annual or 10-quarter financial statement
is available in each quarter t, the document index d can clearly be replaced by the
index i,t. To determine the sentiment of each quarter or annual document, the
sentiment indicator SENTi,t or, respectively, SENTd can be calculated according to the
following equation:
, =
, =
,
, ,
, ,
, ,
, ,
(3)
tf´pos,d denotes the number of weighted or, respectively, unweighted positive words
in the document d, tf´neg,d denotes the number of weighted or, respectively,
un(1)
(2)
weighted negative words [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Accordingly, the sentiment indicator can have a value
between 1 for a perfectly positive sentiment and -1 for a perfectly negative sentiment.
      </p>
      <p>This study analyzed the predictivity of the sentiment for the return on assets;
therefore, the return on assets ROAi,t+n with a lag of n quarters was considered as a
dependent variable in the econometrics. The return on assets ROAi,t+n was calculated
based on earnings before interest and taxes (EBIT) and the average assets of the
observed quarter.</p>
      <p>
        The control variables used in this study were based on the relevant literature [
        <xref ref-type="bibr" rid="ref17 ref6 ref7">6, 7,
17</xref>
        ]. The natural logarithm of the market capitalization at the time t was used as an
indicator for the size of the company SIZEi,t. The correction for company size is
necessitated due to the small firm effect according to Banz [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In order to control for
expected growth as well as other key balance sheet figures and company news, the
market to book ratio MBRi,t was used as an indicator that incorporates a variety of
publicly known information and news. To take the leverage effect and the findings of
Modigliani and Miller [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] into account, the leverage ratio LEVi,t is also used as a
control variable in the model. Since the return on assets ROAi,t is a good indicator of
the return on assets in the following quarters ROAi,t+n, it is also used a control
variable. Finally, the Boolean variables DLOSSi,t for loss quarters and D10Ki,t for annual
reports are integrated into the model. They compensate for the effects of differing
company responses in loss quarters as well as the potential differences in the
predictivity of annual financial statements compared to quarterly financial statements. To
avoid outlier-induced results, all control variables were winsorized at the first and
99th percentile. For an overview of the variables used please refer to Table 1.
aV xpE
ir l
ab ia
      </p>
      <p>n
le ed</p>
      <p>E
V xp
ra la
lieab troan
y</p>
      <p>C
r on
ia t
b ro
lse lV
a</p>
      <p>Variable</p>
      <sec id="sec-2-1">
        <title>ROAi,t Ti,t</title>
        <p>SEN</p>
      </sec>
      <sec id="sec-2-2">
        <title>SIZEi,t</title>
        <p>MBRi,t
ln(
, ,
, ,
,
, )</p>
      </sec>
      <sec id="sec-2-3">
        <title>TR.CompanyMarketCap</title>
      </sec>
      <sec id="sec-2-4">
        <title>TR.CompanyMarketCap TR.TotalEquity</title>
        <p>Si,t
,t</p>
      </sec>
      <sec id="sec-2-5">
        <title>DLOS</title>
        <p>D10Ki
,
,
1, if Net Incomei,t &lt; 0
0, else
1, if 10 K-Filing
0, if 10 Q-Filing</p>
      </sec>
      <sec id="sec-2-6">
        <title>TR.TotalLiabilities TR.TotalAssetsReported TR.NetIncome</title>
        <p>In the analysis, the word lists by Loughran and McDonald and Henry were each
used both with and without tf-idf weighting to predict the return on assets for one and
two quarters. This results in four explainatory variables. While the list of Loughran
and McDonald returns negative sentiment indicators in most cases, the list of Henry
returns almost positive sentiments. This effect can be attributed to the strong deviation
of the two lists. All other variables show inconspicuous behavior.</p>
        <p>As in the analysis, the word lists by Loughran and McDonald and Henry were each
used both with and without tf-idf weighting to predict the return on assets for one and
two quarters, this results in a total of eight variants to be calculated. The
corresponding pooled OLS, fixed-effects, and random-effects panel data models were calculated
for all variants. However, the F- and Hausmann specification tests used to compare
the models show the superiority of the fixed-effects model with fixed-effects for each
company and every quarter ; the model is applied according to the following
equation:
5 ∙
, +
=
, +
+
6 ∙
+ 1 ∙
, +</p>
        <p>, + 2 ∙
7 ∙ 10 , + ,
, +
3 ∙
, +
4 ∙
, +
(3)</p>
        <p>In addition to the panel data models, all variants were calculated by means of a
robust least absolute deviations (LAD) regression in order to rule out outlier-induced
results. However, the panel data models and LAD regression were not significantly
different from each other; thus, the validity of the panel data models can be assumed.
Multicollinearity problems were excluded by the analysis of the variance inflation
factors that all showed numbers below three. All eight variants were tested for
heteroscedasticity using the Breusch-Pagan test. All variants show heteroskedasticity
problems. Therefore, the results shown in Table 1 include the heteroskedasticity-robust
standard errors according to White (HC 0) and the significance values calculated on
their basis. Endogeneity problems were precluded due to the time lag between the
explained variable and explanatory variables and, therefore, no separate consideration
is required.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussion</title>
      <p>The descriptive statistics and the correlation matrix are presented in Table 2 and
Table 3 respectively. The results of the analysis are summarized in Table 4 for one
quarter (Lag 1) and Table 5 for two quarters (Lag 2). For the purpose of clarity, individual
fix effects, which do not show any abnormalities, have been omitted. The same
applies to the fixed quarterly effects, which, albeit particularly significant in financial
crisis quarters, do not show abnormalities.</p>
      <p>All models show significant model statistics, which implies a high prediction
character. The models with Lag 1 have an explained variance R2 of – depending on the
model – around 20%, which can be considered as high in this type of models. The
Lag 2 models are naturally weaker in terms of the explained variance as forecasts for
the distant future are generally associated with weaker prediction qualities.
Accordingly, experiments with higher lags only sporadically show significant models, which
is why they are not presented here.</p>
      <p>The sentiment indicator has a significant positive impact in all tested models.
Therefore, it can be used as a predictor for future corporate development forecasts. It
is important to underline that in the present analysis the sentiment indicator was
corrected for the influence of several significant indicators. In particular, the
market-tobook ratio already contains the aggregated information of a large number of balance
sheet ratios and company news, which are incorporated in the price by the market.
The sentiment indicator thus contains incremental information beyond the information
imparted by balance sheet figures and thereby provides additional forecasting power.
This implies that companies use texts to impart information about positive or negative
developments which cannot yet be expressed in figures.</p>
      <p>For sentiment analysis, it does not seem to matter whether the extensive word list
by Loughran and McDonald or the much shorter list by Henry is used. The quality
measures of both word lists seem to differ only marginally. It also appears to be
irrelevant if the sentiment indicator is determined by mere counting or by applying the
tfidf weighting. The advantages of tf-idf weighting in other areas of text analysis, such
as the creation of word clouds for faster collection of content, can hardly be
transferred to sentiment analysis. Although the method is capable of identifying relevant
words in documents, this advantage is mainly limited to nouns that are frequent in
individual documents but rare in other documents. Adjectives and adverbs, which
often carry a positive or negative connotation in the corporate publications in
question, tend to be underweighted by the method because they are commonplace in
everyday language.</p>
      <p>Explanatory Variable s
Min.
1st Qu.</p>
      <p>Me dian
Me an
3rd Qu.</p>
      <p>Max.</p>
      <p>Min.
1st Qu.</p>
      <p>Me dian
Me an
3rd Qu.</p>
      <p>Max.</p>
      <p>Min.
1st Qu.</p>
      <p>Me dian
Me an
3rd Qu.</p>
      <p>Max.</p>
      <p>Control-Variable s</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>The present study demonstrated the predictive power of the sentiment of 10-K annual
and 10-Q quarterly statements for future corporate development. The sentiment
indicator has significant incremental prognostic quality both for the following quarter and
the subsequent quarter. If the forecast horizon was three quarters or more, only
isolated cases of significant predictiveness were detected. As a result, analysts and
investors should include the sentiment of corporate publications into their analyses to
gather latent information from company and detect subtle signals from management that is
of value in their decision-making. Corporate publications contain valuable "between
the lines" information which may be relevant to the assessment of potential
opportunities and risks. Accordingly, a professionalized evaluation of textual data can provide
an information advantage.</p>
      <p>The present study drew on two very different word lists, the Loughran and
McDonald and the Henry word lists, which are both geared toward financial texts.
Both were used in conventional word-counting as well as tf-idf-weighting approaches.
However, the quality of the word lists used and the weighting approach taken
influenced forecasting quality to a much lesser extent than suggested by the existing
literature. The calculated models were almost identical in terms of quality.
31. Price, S. M., Doran, J. S., Peterson, D. R., Bliss, B. A.: Earnings conference calls and stock
returns: The incremental informativeness of textual tone. Journal of Banking &amp; Finance,
36, 992-1011 (2012).
32. Solomon, D. H.: Selective Publicity and Stock Prices. The Journal of Finance, 67(2),
599637 (2012).
33. Tetlock, P. C.: Giving Content to Investor Sentiment: The Role of Media in the Stock
Market. The Journal of Finance, 62(3), 1139-1168 (2007).
34. Tetlock, P. C., Saar-Tsechansky, M., Macskassy, S.: More Than Words: Quantifying
Language to Measure Firms’ Fundamentals. The Journal of Finance, 63(3), 1437-1467 (2008).
35. Twedt, B., Rees, L: Reading between the lines: An empirical examination of qualitative
attributes of financial analysts’ reports. Journal of Accounting and Public Policy, 31, 1-21
(2012).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Allee</surname>
          </string-name>
          , K. D., DeAngelis, M. D.:
          <article-title>The Structure of Voluntary Disclosure Narratives: Evidence from Tone Dispersion</article-title>
          .
          <source>Journal of Accounting Research</source>
          ,
          <volume>53</volume>
          (
          <issue>2</issue>
          ),
          <fpage>241</fpage>
          -
          <lpage>274</lpage>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Banz</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>The relationship between return and market value of common stocks</article-title>
          .
          <source>Journal of Financial Economics</source>
          ,
          <volume>9</volume>
          (
          <year>1981</year>
          ),
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Chen</surname>
            , H., De,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hwang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Wisdom of Crowds: The Value of Stock Opinions Transmitted Through Social Media</article-title>
          .
          <source>Review of Financial Studies</source>
          ,
          <volume>27</volume>
          (
          <issue>5</issue>
          ),
          <fpage>1367</fpage>
          -
          <lpage>1403</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Chrisholm</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolda</surname>
          </string-name>
          , T. G.:
          <article-title>New Term Weighting Formulas for the Vector Space Model in Information Retrieval</article-title>
          .
          <year>1999</year>
          . Oak Ridge, Tennessee.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Core</surname>
            ,
            <given-names>J. E.</given-names>
          </string-name>
          :
          <article-title>A review of the empirical literature discussion</article-title>
          .
          <source>Journal of Accounting and Economics</source>
          ,
          <volume>31</volume>
          ,
          <fpage>441</fpage>
          -
          <lpage>456</lpage>
          (
          <year>2001</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>A. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piger</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedor</surname>
            ,
            <given-names>L. M.</given-names>
          </string-name>
          :
          <article-title>Beyond the Numbers: Measuring the Information Content of Earnings Press Release Language</article-title>
          .
          <source>Contemporary Accounting Research</source>
          ,
          <volume>29</volume>
          (
          <issue>3</issue>
          ),
          <fpage>845</fpage>
          -
          <lpage>868</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>A. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tama-Sweet</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          : Managers'
          <article-title>Use of Language Across Alternative Disclosure Outlets: Earnings Press Releases versus MD&amp;A</article-title>
          . Contemporary Accounting Research,
          <volume>29</volume>
          (
          <issue>3</issue>
          ),
          <fpage>804</fpage>
          -
          <lpage>837</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Dougal</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Engelberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsons</surname>
            ,
            <given-names>C. A.</given-names>
          </string-name>
          :
          <article-title>Journalists and the stock market</article-title>
          .
          <source>Review of Financial Studies</source>
          ,
          <volume>25</volume>
          (
          <issue>3</issue>
          ),
          <fpage>639</fpage>
          -
          <lpage>679</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Feldman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Govindaraj</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Livnat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Segal</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Management's tone change, post earnings announcement drift and accruals</article-title>
          .
          <source>Review of Accounting Studies</source>
          ,
          <volume>15</volume>
          (
          <issue>9</issue>
          ),
          <fpage>915</fpage>
          -
          <lpage>953</lpage>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ferris</surname>
            ,
            <given-names>S. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liao</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Effect of Issuer Conservatism on IPO Pricing and Performance</article-title>
          .
          <source>Review of Finance</source>
          ,
          <volume>17</volume>
          ,
          <fpage>993</fpage>
          -
          <lpage>1027</lpage>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Francis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Philbrick</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schipper</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Shareholder Litigation and Corporate Disclosures</article-title>
          .
          <source>Journal of Accounting Research</source>
          ,
          <volume>32</volume>
          (
          <issue>2</issue>
          ),
          <fpage>137</fpage>
          -
          <lpage>164</lpage>
          (
          <year>1994</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Sentiment during Recessions</article-title>
          .
          <source>The Journal of Finance</source>
          ,
          <volume>68</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1267</fpage>
          -
          <lpage>1300</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hart</surname>
            ,
            <given-names>R.P.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Redeveloping</surname>
            <given-names>DICTION</given-names>
          </string-name>
          :
          <article-title>Theoretical Considerations</article-title>
          . In: West, M. D. (Eds.):
          <article-title>Theory, method, and practice in computer content analysis</article-title>
          , New York,
          <fpage>43</fpage>
          -
          <lpage>60</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Henry</surname>
          </string-name>
          , E.:
          <article-title>Are Investors Influenced by How Earnings Press Releases are Written</article-title>
          ?
          <source>Journal of Business Communication</source>
          ,
          <volume>45</volume>
          (
          <issue>4</issue>
          ),
          <fpage>363</fpage>
          -
          <lpage>407</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Henry</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leone</surname>
            ,
            <given-names>A. J.</given-names>
          </string-name>
          :
          <article-title>Measuring Qualitative Information in Capital Markets Research: Comparison of Alternative Methodologies to Measure Disclosure Tone</article-title>
          .
          <source>The Accounting Review</source>
          ,
          <volume>91</volume>
          (
          <issue>1</issue>
          ),
          <fpage>153</fpage>
          -
          <lpage>178</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Hoskin</surname>
            ,
            <given-names>R. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hughes</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ricks</surname>
          </string-name>
          , W. E.:
          <article-title>Evidence on the Incremental Information Content of Additional Firm Disclosures Made Concurrently with Earnings</article-title>
          .
          <source>Journal of Accounting Research</source>
          ,
          <volume>24</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>32</lpage>
          (
          <year>1986</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teoh</surname>
            ,
            <given-names>S. H.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>Y.: Tone</given-names>
            <surname>Management</surname>
          </string-name>
          .
          <source>The Accounting Review</source>
          ,
          <volume>89</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1083</fpage>
          -
          <lpage>1113</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Kearney</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Textual sentiment in finance: A survey of methods and models</article-title>
          .
          <source>International Review of Financial Analysis</source>
          ,
          <volume>33</volume>
          ,
          <fpage>171</fpage>
          -
          <lpage>185</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Lang</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lundholm</surname>
          </string-name>
          , R. J.:
          <article-title>Voluntary Disclosure and Equity Offerings: Reducing Infor-mation Asymmetry or Hyping the Stock?</article-title>
          .
          <source>Contemporary Accounting Research</source>
          ,
          <volume>17</volume>
          (
          <issue>4</issue>
          ),
          <fpage>623</fpage>
          -
          <lpage>662</lpage>
          (
          <year>2000</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Textual Analysis of Corporate Disclosures: A Survey of the Literature</article-title>
          .
          <source>Journal of Accounting Literature</source>
          ,
          <volume>29</volume>
          ,
          <fpage>143</fpage>
          -
          <lpage>165</lpage>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>The Information Content of Forward-Looking Statements in Corporate Filings - A Naïve Bayesian Machine Learning Approach</article-title>
          .
          <source>Journal of Accounting Research</source>
          ,
          <volume>48</volume>
          (
          <issue>5</issue>
          ),
          <fpage>1049</fpage>
          -
          <lpage>1102</lpage>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Loughran</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>When Is a Liability Not a Liability? Textual Analysis, Diction-aries, and 10-Ks</article-title>
          .
          <source>The Journal of Finance</source>
          ,
          <volume>66</volume>
          (
          <issue>1</issue>
          ),
          <fpage>35</fpage>
          -
          <lpage>65</lpage>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Loughran</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>B.: 2014 Master</given-names>
          </string-name>
          <string-name>
            <surname>Dictionary</surname>
          </string-name>
          .
          <volume>03</volume>
          .
          <year>2015</year>
          . As of:
          <volume>01</volume>
          .
          <fpage>11</fpage>
          .
          <year>2017</year>
          . URL: https://www3.nd.edu/~mcdonald/Word_Lists.html
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Loughran</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Textual Analysis in Accounting and Finance: A Survey</article-title>
          .
          <source>Journal of Accounting Research</source>
          ,
          <volume>54</volume>
          (
          <issue>4</issue>
          ),
          <fpage>1187</fpage>
          -
          <lpage>1230</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Mayew</surname>
            ,
            <given-names>W. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sethuraman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkatachalam</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>MD&amp;A Disclosure and the Firm's Ability to Continue as a Going Concern</article-title>
          .
          <source>The Accounting Review</source>
          ,
          <volume>90</volume>
          (
          <issue>4</issue>
          ),
          <fpage>1621</fpage>
          -
          <lpage>1651</lpage>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Mayew</surname>
            ,
            <given-names>W. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkatachalam</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Power of Voice: Managerial Affective States and Future Firm Performance</article-title>
          .
          <source>The Journal of Finance</source>
          ,
          <volume>67</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>43</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Documentation for Stage One Parse</article-title>
          .
          <source>Last updated: 03.09</source>
          .
          <year>2013</year>
          . As of:
          <volume>13</volume>
          .
          <fpage>08</fpage>
          .
          <year>2017</year>
          , URL: https://www3.nd.edu/~mcdonald/Word_Lists.html.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>McDonald</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Documentation of Master Dictionary and Document Dictionaries</article-title>
          .
          <source>Last updated: 18.03</source>
          .
          <year>2014</year>
          . As of:
          <volume>13</volume>
          .
          <fpage>08</fpage>
          .
          <year>2017</year>
          , URL: https://www3.nd.edu/~mcdonald/Word_Lists.html.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Modigliani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Corporate Income Taxes and the Cost of Capital: A Correction</article-title>
          .
          <source>The American Economic Review</source>
          ,
          <volume>53</volume>
          (
          <issue>1</issue>
          ),
          <fpage>433</fpage>
          -
          <lpage>443</lpage>
          (
          <year>1963</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>N.N.</surname>
          </string-name>
          :
          <article-title>Descriptions of Inquirer Categories and Use of Inquirer Dictionaries</article-title>
          .
          <source>As of 01.11</source>
          .
          <year>2017</year>
          . URL: http://www.wjh.harvard.edu/~inquirer/homecat.htm.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>