<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Task-offload Tools Improve Productivity and Performance in Geopolitical Forecasting</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ion Juvina</string-name>
          <email>ion.juvina@wright.edu</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Othalia Larue</string-name>
          <email>othalia.larue@wright.edu</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Colin Widmer</string-name>
          <email>colin@kairos-research.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Subhashini Ganapathy</string-name>
          <email>subhashini.ganapathy@wright.edu</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Srikanth Nadella</string-name>
          <email>srikanth@kairos-</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brandon Minnery</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lance Ramshaw</string-name>
          <email>lance.ramshaw@raytheon.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Emile Servan-Schreiber</string-name>
          <email>emile@lumenogic.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maurice Balick</string-name>
          <email>mbalick@lumenogic.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ralph Weischedel</string-name>
          <email>weisched@isi.edu</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Hypermind, LLC</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Kairos Research</institution>
          ,
          <addr-line>Fairborn, OH</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Raytheon BBN Technologies Corp.</institution>
          ,
          <addr-line>Cambridge, MA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Collective Intelligence</institution>
          ,
          <addr-line>Mohammed VI Polytechnic Univ., Ben Guerir</addr-line>
          ,
          <country country="MA">Morocco</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Southern California, Information Sciences Institute</institution>
          ,
          <addr-line>Los Angeles, CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Wright State University</institution>
          ,
          <addr-line>Dayton, OH</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recent studies in geopolitical forecasting have identified psychological variables that predict forecasting accuracy. We studied the effect of providing human forecasters with automated information search and task management support tools. Our research aimed to determine whether use of the support tools could explain additional variance in forecasting performance above and beyond psychological variables. We found that the provided tools encouraged participants to do more work (i.e., information search, communication, reflection, etc.), which in turn resulted in improved forecasting performance.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Background</title>
      <p>
        Forecasting and other forms of intelligence analysis are
information-intensive tasks that rely heavily on
information foraging and sense-making tools
        <xref ref-type="bibr" rid="ref12">(Pirolli &amp; Card,
2005)</xref>
        . However, forecasting is more challenging than
other investigational search and sense-making tasks. In a
typical investigational task, the answer exists somewhere, and
the users have to find their way to that answer or assemble
an answer from pieces of information found in different
locations. In forecasting tasks, the answers do not exist yet;
they have to be constructed by the users. An element of
novelty is always present in forecasting; no forecasting
solution applies to more than one problem, even though
general strategies may exist. Typically, real-world
forecastCopyright © 2020 for this paper by its authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).
ing occurs over an extended time course, during which the
world changes and potentially relevant but also irrelevant
or misleading evidence accumulates. Further adding to the
complexity, forecasters often engage in multiple
forecasting tasks, and each task may be attempted by a group of
cooperating and/or competing forecasters.
      </p>
      <p>
        The symbiosis between humans and machines
        <xref ref-type="bibr" rid="ref10">(Licklider, 1960)</xref>
        holds great promise for tackling the unparalleled
complexity of the forecasting task. The science and
practice of human-technology coordination have departed from
the traditional function allocation methods
        <xref ref-type="bibr" rid="ref5">(who-does-what
or men-are-better-at / machines-are-better-at,
MABA/MABA; Fitts, 1951)</xref>
        and is currently moving
toward a human-technology teaming approach in which the
focus is on how machines can become effective team
players
        <xref ref-type="bibr" rid="ref4">(Dekker &amp; Woods, 2002)</xref>
        and how humans and
technology co-evolve (Ackerman, 2000).
      </p>
      <p>The tools used in our study are called hybrid because
they are intended to combine human and machine
capabilities (Rahwan, Cebrian, et al., 2019) to improve the
performance of the whole socio-technical system that generates
forecasts. Using hybrid tools to assist forecasting serves
three purposes: (1) correct for cognitive biases; (2) reduce
the cognitive load of forecasters; and (3) increase the
amount of relevant information available to the forecaster.
These goals can be complementary and mutually
reinforcing: providing humans with machine-made forecasts and
making the relevant information easier to search and
interpret may reduce cognitive load and cognitive biases, which
in turn facilitates high quality forecasts, which via various
aggregation methods result in better “hybrid” forecasts.</p>
      <p>
        Cognitive workload and fatigue have been shown to
affect judgment quality, with forecast quality decreasing as
the number of forecasts made in a day increased. As they
get fatigued, forecasters exhibit more herding behavior and
less granularity in their forecasts
        <xref ref-type="bibr" rid="ref7">(Hirshleifer et al., 2019)</xref>
        .
Task-offload tools can be used to delegate some task
demands to automation
        <xref ref-type="bibr" rid="ref9">(Kirlik, 1993)</xref>
        . However,
externalizing too much task-related information can reduce the user’s
ability to meaningfully engage in high-level processes such
as planning and reasoning and may harm motivation and
performance
        <xref ref-type="bibr" rid="ref15">(Van Nimwegen, Burgos, Van Oostendorp, &amp;
Schijf, 2006)</xref>
        . Thus, a hybrid tool that aims to support
human forecasting must strike a balance between offloading
task demands and maintaining user engagement.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Method</title>
      <p>Human forecasters had to solve forecasting problems (FPs)
about real-world events in the following domains: conflict,
economics, health, politics, science, and technology.
Participants were asked to provide an initial forecast and
update it as many times as necessary based on information
they searched for, updates of this information, or new
information. Each forecasting problem had between two and
five discrete, mutually exclusive outcome options.
Outcome options had to be assigned a probabilistic forecast
with probabilities over all options adding up to 1.</p>
      <p>Two samples of participants were recruited for this
study. The first sample consisted of volunteers with
interest in geopolitical analysis. They were mostly U.S. citizens
(76%), males (82%), with an average age of 43, and with a
relatively high level of education (53% had received a
postgraduate degree). A second sample of participants was
recruited from the members of the web service TurkPrime,
typically referred to as workers. To simplify our language,
we will refer to the participants from the first sample as
Volunteers and to the participants from the second sample
as Turkers.</p>
      <p>
        Forecasting performance was measured with the Brier
score
        <xref ref-type="bibr" rid="ref2">(Brier, 1950)</xref>
        and a relative accuracy score. The Brier
score provides a measure of the error of a probability
forecast: the further a forecast probability is from the actual
outcome, the larger the error:
      </p>
      <p>Brier score = ∑(pi − oi)2
Where pi is the probability assigned to answer i, and oi is 1
if answer i is correct, or 0 if it is not. The Brier score is
between 0 (perfect forecast) and 2 (worst possible
forecast).</p>
      <p>The accuracy score is a relative score based on one’s
Brier scores compared to the median Brier scores of all
participants. The accuracy on a particular day is cd - yd ,
where yd is the participant’s Brier score on that day, and cd
is the crowd's median Brier score on that day. The
accuracy score varies between -2 (worst) and 2 (best).</p>
      <p>
        The participants accessed a dedicated website containing
hybrid features designed to assist them with information
search and task management1. The use of the available
features was optional to users. We assumed users would
strategically
        <xref ref-type="bibr" rid="ref9">(Kirlik, 1993)</xref>
        choose the features they needed
depending on what stage of the task they needed more
support with
        <xref ref-type="bibr" rid="ref8">(Huurdeman, Kamps, &amp; Wilson, 2019)</xref>
        or what
costs and benefits they attributed to using automated tools
        <xref ref-type="bibr" rid="ref12">(Pirolli &amp; Card, 2005)</xref>
        . Only a subset of these features was
used in the study that we report here. The participants
could access numerical indicators relevant to the selected
FP, other user forecasts, forum conversations, news, links,
tabs, and so on (see Table 1).
      </p>
      <p>The Indicators tool displays a list of indicators, which
are statistics relevant to the FP. Indicators can be economic
statistics, Internet search term frequency, information from
databases, etc. A participant can monitor how their
indicators change over time to see when something changes
about a FP and decide to update their forecast.</p>
      <p>The Query tool allows the participants to extract data
from several relevant sources. Query bots automatically
access web sites and databases providing the current and
past trends of indicators underlying many of the FPs. To
guide participants to queries that would help them answer a
given FP, the system automatically recommends databases
to participants. A set of databases was pre-compiled by
subject mater experts for each FP category (e.g. conflict,
economy, or health); when a FP from a certain category is
posted, the system automatically recommends the
databases for that category. Participants can edit a suggested
query, for example, by modifying some of the suggested
values. Participants can also manually add databases they
deem relevant to a given FP. Then they can create queries
on databases using a query editor that allows them to
specify a date, location, type, actor, etc.</p>
      <p>Forecasters can save a query in order to automatically
track its results in time. A saved query becomes an
Indicator. Every six hours, the system automatically reruns the
query. Forecasters can also manually rerun their queries.
Indicators can be shared among forecasters by making
them public. Indicators updated over the course of an FP’s
lifecycle are viewable to participants as time-series graphs.</p>
      <p>Another important feature allows participants to create
custom alarms (also called alerts) based on indicators.
Alarms can alert a participant when key statistics have
updated that may affect his or her forecast. Alarms are
created with the Alarm Rule Editor. They are written in the
form of IF condition, THEN action. That is, the participant
1 The original experiment included a control group that did not have
access to these hybrid features. However, the data from the control group
were not available at the time this paper was written.
specifies the conditions that trigger the alarm and what
actions should be taken once the alarm is triggered (i.e.,
forecast recommendations). The participants can create
three types of alarms: crowd-based, indicator-based, and
time-based alarms. Crowd-based alarms track the average
forecast among all forecasters for a specific outcome and
will alert the participant when the crowd’s prediction has
changed. Indicator-based alarms track the value of one or
more indicators. Once an indicator reaches a pre-specified
value, the participant is notified. Time-based alarms
remind the user to review their forecast after a specified
period has passed. Email updates were sent to the participants
when their alarms fired.</p>
      <p>Provide aggregate information about
how all forecasters have answered the
question.</p>
      <p>Display current value and time course
of statistics relevant to the FP.</p>
      <p>Allow participants to discuss the
question and share information
Display a list of useful links to
sources relevant to the question.</p>
      <p>Suggest relevant news and allow
news search.</p>
      <p>Allow participants to extract data
from relevant sources. Query bots
automatically recommend relevant
data sources and queries. Query
editor supports creation and reruns.</p>
      <p>Notify participant when relevant
information (e.g., the value of a
particular indicator) changes and
recommend a forecast update.</p>
      <p>Detect change in relevant
information, automatically update
forecast, and notify the participant.</p>
      <p>Provide participants with general
information and customized
recommendations and feedback about their
forecasts.</p>
      <p>
        Most of the participants completed the Cognitive
Reflection Test
        <xref ref-type="bibr" rid="ref6">(Frederick, 2005)</xref>
        , the Actively Open-minded
Thinking scale
        <xref ref-type="bibr" rid="ref13">(Stanovich and West, 1997)</xref>
        and the Need
for Cognition scale
        <xref ref-type="bibr" rid="ref3">(Cacioppo et al., 1984)</xref>
        . These
variables were found in previous studies to correlate with
forecasting performance
        <xref ref-type="bibr" rid="ref11">(Mellers et al., 2015)</xref>
        . In addition, we
collected extensive data on participant behavior, such as
the number of FPs forecasted, the number of forecast
updates per FP, frequency of usage for each hybrid feature,
etc.
      </p>
      <p>
        We expected that the provided suite of hybrid features
would improve forecasting productivity and quality, that is,
the number of forecasts participants can generate, the
frequency at which these forecasts can be updated, and the
accuracy of these forecasts. The hybrid features should
allow participants to reduce the cognitive load associated
with monitoring their forecasts and updates, which in turn
should allow them to make more forecasts and focus on
evaluating information quality and relevance. For example,
when alarms trigger, they remind participants to update
their forecasts, and a higher frequency of updating has in
turn been linked to better forecasting performance
        <xref ref-type="bibr" rid="ref14">(Tetlock
&amp; Gardner, 2015)</xref>
        . When users create alarms, they are
implicitly encouraged to employ a top-down (model-driven)
strategy. They need to develop intuitive causal models of
what factors determine the occurrence of the event to be
forecasted. Due to the nature of the forecasting task,
modeling and understanding the (hidden) causes of events are
critical for performance.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>To evaluate if the use of hybrid features improved
forecasting productivity and performance, we split the forecasters
into two groups: one that used no hybrid features (queries,
indicators, or alarms) composed of 519 participants and a
group of participants who used one or more hybrid
features, 319 participants.</p>
      <p>The average number of forecasts per FP was higher for
participants using the hybrid tools, t(371.67) = -6.44, p &lt;
0.001. Thus, participants who used hybrid tools made more
forecast updates. The average number of FP topics
forecasted was also higher for participants who used hybrid
tools, t(737.78) = -8.81, p &lt; 0.001. Thus, the participants
who used hybrid tools attempted to forecast a wider range
of IFP topics. The total number of forecasts submitted was
higher for participants who used hybrid tools, t(328.61) =
4.89, p &lt; 0.001. Forecasting performance as measured by
the Brier score and the relative accuracy measure
(described above) was higher for the participants who used
hybrid tools, t(835.29) = 1.99, p = 0.05 for Brier scores and
t(834.07) = -4.58, p&lt; 0.001 for relative accuracy.</p>
      <p>Thus, as expected, forecasting productivity and accuracy
were higher in those participants who used the provided
hybrid features. However, it remains unclear whether these
findings are driven by the availability of hybrid features or
by motivation. Mellers at al. (2015) found that the
frequency of forecast updating, which they considered to be a
behavioral indicator of motivation, was a significant predictor
of forecasting performance. Arguably, the direction of
causality could go both ways: (1) the highly motivated
participants made a larger number of forecast updates and used
the provided hybrid tools, which in turn increased
performance, or (2) the hybrid tools increased the participants
motivation to make updates, which in turn increased
performance.</p>
      <p>To test these two possibilities, we constructed and tested
two structural equation modeling (SEM) models
attempting to explain the structural relations between hybrid tools
usage (a sum of queries, indicators, and alarms used),
psychometric measures (cognitive reflection – cRS and
actively open-minded thinking – aTS), motivation (number of
topics forecasted – N and average number of forecasts per
IFP - aFI) and performance (Brier score – Brr and accuracy
– Acc).</p>
      <p>Model 1 (Fig. 1) hypothesizes a direct causal link
between hybrid feature use and forecasting performance,
whereas model 2 (Fig. 2) hypothesizes an indirect causal
link (via motivation) between hybrid feature use and
forecasting performance. Model 1 assumes that motivation
causes hybrid tool usage, which in turn causes increased
performance. It also includes the known associations
between psychometrics, motivation, and forecasting
performance. Model 2 assumes that hybrid tool usage causes
motivation, which in turn causes increased performance.
Similar to model 1, model 2 also includes the known
associations between psychometrics, motivation, and
forecasting performance.</p>
      <p>We compared the two models using the Akaike
information criterion (AIC) and Bayesian information criterion
(BIC). Model 2 had AIC = 15913 and BIC = 15994,
whereas Model 1 had AIC = 15941 and BIC = 16022, thus
Model 2 fits the data slightly better than Model 1.</p>
      <p>Model 2 supports the hypothesis that the use of hybrid
tools has a direct effect on motivation. Conceivably, email
alerts about indicator changes and crowd changes
motivated participants to update their own forecasts and perhaps
do additional information searches. In agreement with
previous studies, motivation had a direct effect on
performance, as did the psychometric variables actively
openminded thinking and the tendency to engage in cognitive
reflection.</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion and Conclusion</title>
      <p>
        Previous studies
        <xref ref-type="bibr" rid="ref11 ref14">(Mellers et al., 2015; Tetlock &amp; Gardner,
2015)</xref>
        reported dispositional and behavioral predictors of
forecasting performance. These findings were replicated in
our study: cognitively reflective and open-minded
participants made better forecasts. In addition,
        <xref ref-type="bibr" rid="ref11">Mellers et al.
(2015)</xref>
        showed that participants who updated their
forecasts more often achieved better forecasting performance.
This finding was also replicated in our study.
      </p>
      <p>
        Our study added a suite of hybrid feature to assist
forecasters with the laborious tasks of information search,
sense making, and decision-making. The use of these tools
was optional. We assumed users would act strategically
        <xref ref-type="bibr" rid="ref9">(Kirlik, 1993)</xref>
        and use these tools as needed. The
expectation was that forecasters equipped with hybrid tools would
become more productive and more accurate. The effect of
the hybrid tools was expected to be independent of the
effects that were already known (i.e., cognitive ability,
cognitive style, and motivation). For example, hybrid tools
were expected to be helpful above and beyond a
participant’s motivation or cognitive ability. What we found does
not entirely support this expectation. We did find that the
use hybrid features improve forecasting performance, but
this relationship is most likely mediated by motivation.
The use of hybrid features increased the forecasters’
productivity, as indicated by the number and the variety of
IFPs they forecasted and the frequency of forecast updates.
Since the use of hybrid features was optional, the
relationship between the use of hybrid features and forecasting
performance must be interpreted with caution, as only a
minority of participants used the provided hybrid tools
(319 of 839) and the decision to use hybrid features might
be confounded by other factors such as trust in automation
and in other forecasters (Juvina, Collins et al., in press).
      </p>
      <p>Our SEM analysis provided support for the
interpretation that the provided hybrid features encouraged the
participants to do more work (i.e., information search,
communication, reflection, etc.), which in turn resulted in
improved forecasting performance.</p>
      <p>
        We focused here on a subset of hybrid tools, namely
queries, indicators, and alarms. They appear to be useful in
driving improvements in forecasting performance. While it
is not surprising that supporting users information foraging
and sense making improves forecasting performance, our
unique contribution emphasizes the importance of
engaging users in creating their own support tools. We provided
the alarm editor to encourage participants to create
customized alarms that would alert them when potentially relevant
information changes and recommend a forecast update.
The participants who chose to create an alarm had to
specify the conditions that would trigger the alarm (i.e., specific
changes in one or more indicators) and the action to be
recommended (i.e., a specific change in the forecast).
Arguably, the alarm editor challenged participants to create
their own intuitive models of information search and
forecasting and turn these models into support tools. The
results highlight the importance of providing tools that are
not only useful and useable, but are able to engage users
and enhance their cognitive activity, aiming to strike a
balance between user effort and information search
automation
        <xref ref-type="bibr" rid="ref1">(Bates, 1990)</xref>
        , ultimately achieving the goals of human
machine symbiosis and co-evolution
        <xref ref-type="bibr" rid="ref10">(Licklider, 1960;
Ackerman, 2000)</xref>
        .
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This research was supported by the Office of the Director
of National Intelligence (ODNI), Intelligence Advanced
Research Projects Activity (IARPA), via contract no.
2017-17072100002 to Raytheon-BBN, through subaward
to Kairos Research. The views and conclusions contained
herein are those of the authors and should not be
interpreted as necessarily representing the official policies, either
expressed or implied, of ODNI, IARPA, or the U.S.
Government or BBN. The U.S. Government is authorized to
reproduce and distribute reprints for governmental
purposes notwithstanding any copyright annotation therein.
Ackerman, M.S. (2000). The intellectual challenge of CSCW: the
gap between social requirements and technical feasibility.
Human-Computer Interaction, 15(2): 179-203.
Appendix 2: Higher-resolution diagram for SEM model 2.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Bates</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          (
          <year>1990</year>
          ).
          <article-title>Where should the person stop and the information search interface start?</article-title>
          <source>Information Processing &amp; Management</source>
          ,
          <volume>26</volume>
          (
          <issue>5</issue>
          ):
          <fpage>575</fpage>
          -
          <lpage>591</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Brier</surname>
            ,
            <given-names>G. W.</given-names>
          </string-name>
          (
          <year>1950</year>
          ).
          <article-title>Verification of forecasts expressed in terms of probability</article-title>
          .
          <source>Monthey Weather Review</source>
          ,
          <volume>78</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Cacioppo</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petty</surname>
            ,
            <given-names>R. E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Feng Kao</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>1984</year>
          ).
          <article-title>The efficient assessment of need for cognition</article-title>
          .
          <source>Journal of personality assessment</source>
          ,
          <volume>48</volume>
          (
          <issue>3</issue>
          ),
          <fpage>306</fpage>
          -
          <lpage>307</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Dekker</surname>
            ,
            <given-names>S.W.A</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Woods</surname>
            ,
            <given-names>D.D.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>MABA-MABA or Abracadabra? Progress on Human-Automation Co-ordination.</article-title>
          <string-name>
            <surname>Cognition</surname>
          </string-name>
          ,
          <source>Technology &amp; Work</source>
          <volume>4</volume>
          :
          <fpage>240</fpage>
          -
          <lpage>244</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Fitts</surname>
          </string-name>
          , P.M. (ed.) (
          <year>1951</year>
          ).
          <article-title>Human engineering for an effective air navigation and traffic control system</article-title>
          .
          <source>National Research Council</source>
          , Washington, DC.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Frederick</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Cognitive reflection and decision making</article-title>
          .
          <source>Journal of Economic perspectives</source>
          ,
          <volume>19</volume>
          (
          <issue>4</issue>
          ),
          <fpage>25</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Hirshleifer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lourie</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Teoh</surname>
            ,
            <given-names>S. H.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Decision fatigue and heuristic analyst forecasts</article-title>
          .
          <source>Journal of Financial Economics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Huurdeman</surname>
            ,
            <given-names>H. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamps</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          &amp; Wilson,
          <string-name>
            <surname>M. L.</surname>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>The multistage experience: the simulated work task approach to studying information seeking task stages</article-title>
          .
          <source>In Proc. BIIRRR workshop at CHIIR</source>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Kirlik</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>1993</year>
          ).
          <article-title>Modeling strategic behavior in humanautomation interaction: Why an" aid" can (and should) go unused</article-title>
          .
          <source>Human factors</source>
          ,
          <volume>35</volume>
          (
          <issue>2</issue>
          ),
          <fpage>221</fpage>
          -
          <lpage>242</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Licklider</surname>
            ,
            <given-names>J. C. R.</given-names>
          </string-name>
          (
          <year>1960</year>
          ).
          <article-title>Man-Computer Symbiosis</article-title>
          .
          <source>IRE Transactions on Human Factors in Electronics, volume HFE-1</source>
          , pages
          <fpage>4</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Mellers</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stone</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Atanasov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rohrbaugh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metz</surname>
            ,
            <given-names>S. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ungar</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , ... and
          <string-name>
            <surname>Tetlock</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2015a</year>
          ).
          <article-title>The psychology of intelligence analysis: Drivers of prediction accuracy in world politics</article-title>
          .
          <source>Journal of experimental psychology: applied</source>
          ,
          <volume>21</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Pirolli</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Card</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>The sensemaking process and leverage points for analyst technology as identified through cognitive task analysis</article-title>
          .
          <source>In Intelligence Analysis</source>
          ,
          <fpage>2</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Stanovich</surname>
            ,
            <given-names>K. E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>West</surname>
            ,
            <given-names>R. F.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Reasoning independently of prior belief and individual differences in actively open-minded thinking</article-title>
          .
          <source>Journal of Educational Psychology</source>
          ,
          <volume>89</volume>
          (
          <issue>2</issue>
          ),
          <fpage>342</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Tetlock</surname>
            ,
            <given-names>P.E.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Gardner</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Superforecasting: The Art</article-title>
          and Science of Prediction. New York: Broadway Books.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Van Nimwegen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burgos</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Oostendorp</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Schrijf</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>The paradox of the assisted user: guidance can be counterproductive</article-title>
          .
          <source>In Proc. CHI</source>
          <year>2006</year>
          ,
          <volume>917</volume>
          -
          <fpage>926</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <article-title>Appendix 1: Higher-resolution diagram for SEM model 1</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>