=Paper=
{{Paper
|id=Vol-268/paper-2
|storemode=property
|title=A Tool that Uses Human Factors to Simplify Model Building and Facilitate More Accurate Strategic Decisions
|pdfUrl=https://ceur-ws.org/Vol-268/paper2.pdf
|volume=Vol-268
|dblpUrl=https://dblp.org/rec/conf/uai/Kipersztok07
}}
==A Tool that Uses Human Factors to Simplify Model Building and Facilitate More Accurate Strategic Decisions==
A Tool that Uses Human Factors to Simplify Model Building and
Facilitate More Accurate Strategic Decisions
Oscar Kipersztok,
Mathematics & Computing Technology
Boeing Phantom Works
P.O.Box 3707, MC: 7L-44
Seattle, WA 98124
oscar.kipersztok@boeing.com
Abstract summarizes those human factors and
processes and investigates how those can be
This paper describes a tool to validate used for building systems or computational
hypotheses in strategic decision making. The
system builds on experimental evidence of
tools that facilitate better decision making.
human factors that lead to more accurate There has been a recent surge in technology
decisions. The paper shows how those
used to support decisions. Graphical
factors are translated into specifications for
a user interface that simplifies the capture of probabilistic networks (Bayesian networks),
expert knowledge to create complex models for example, are used to model large
of strategic domains of interest. The tool complex domains from the aggregation of
uses graphical probabilistic reasoning, smaller, local, probabilistic dependencies
combined with text classification methods, to
between the variables of a domain [Pearl,
expedite the search for relevant information
in a large number of documents to help 1988; Lauritzen and Spiegelhalter, 1988].
validate hypotheses. The methodology uses efficient probability-
update algorithms that guarantee consistent
1 INTRODUCTION and correct propagation of uncertainty when
This paper describes a tool that facilitates the models are queried. In Data Mining
strategic decision making, i.e., decisions [Cabena, et. al., 1998], algorithms are used
that, if wrong, can have significantly to search for information in very large
negative or even catastrophic consequences. databases to discover patterns and trends
The questions addressed are what type of that can explain new phenomena. For
information decision makers need to know, example, for strategic decisions,
and how they process such information to classification algorithms [Langley, 1996] are
arrive at decisions? Are there processes that contributing to the ability to retrieve
lead to better decisions? What specific pertinent information in support of critical
quantities do decision makers need to be decisions. In many instances, not having the
able to make decisions? right information available at the right time
can have serious consequences. The large
There is evidence from experimental amounts of information that can be accessed
psychology that suggest trends and today make it difficult for decision makers
behaviors that, on the average, result in
more accurate decisions. This work
to search for specific information that may into a decision process, increases the
be needed for time-critical decisions. confidence of the decision maker without
necessarily increasing the accuracy of the
In this paper, a prototype tool is presented
decision. People tend to believe that they use
that uses human factors found to lead to
considerably more information in their
more accurate decisions. The system uses
decisions than they actually do [Summers et
graphical probabilistic reasoning, combined
al., 1970; Slovic and Lichtenstein, 1971].
with text classification methods, to facilitate
People compile vast amounts of information,
the search through a large number of
over a life time of making decisions.
documents. It allows the capture of user’s
Although it may appear that all that
knowledge in a straight forward manner to
information is brought to bear in each new
create complex models of domains of
decision; actually, only a relatively small
interest. These are also referred to as
amount is relevant for each individual
“mental models” of the domain.
decision [Shepard, 1964].
As the user builds a model, a graphical
Over time, people develop “mental models”
probabilistic network is automatically
of the world that are comprised of ideas and
created that allows the user to raise
concepts learned, and the relationships
hypotheses by making queries to the model.
between them [Heuer, 1999]. Such mental
At the same time a text classifier is also
models become the “filter” through which
created for search and retrieval from a large
people interpret the world around them.
volume of documents. Once built, the user
There is a continuous refinement of the
queries the model and the system responds
mental model based on new experiences
by predicting the likelihood of a hypothesis.
gained that reinforce certain relationships
In prediction-driven mode, the classifier will
and weaken others.
search for relevant documents that can be
used to substantiate or negate a given People tend to utilize two distinct modalities
hypothesis. In evidence-driven mode, the for decision making. One modality is
evidence is retrieved ahead of formulating referred to as the “mosaic” model [Heuer,
the hypothesis. In both cases the search is 1999] where pieces of information are
driven by the most likely hypotheses or the accumulated, in no particular pattern, on the
most likely evidence. The tool creates a belief that a unified picture will, eventually,
summary report describing the formulated emerge revealing the solution to a problem
hypothesis and the evidence gathered to query. The emphasis is in accumulating as
substantiate or refute it. much information on the subject of interest
as possible.
This paper shows that it is possible to embed
in the GUI design features based on the The second modality is using the mental
human factors mentioned to guide the model to create a hypothesis for the query
operational use of the tool to flow in a and then use the hypothesis to guide the
manner that facilitates more expedient and retrieval of information. The search for
effective strategic decision making. information is driven by the model and the
information retrieved is used to validate,
2 HUMAN FACTORS FROM refute, or reformulate the hypothesis.
EXPERIMENTAL PSYCHOLOGY Experimental psychology provides evidence
Evidence from experimental psychology that the latter is the modality that tends to
[Oskamp, 1965; Goldberg, 1968; Heuer, conduce to more accurate decisions [Elstein,
1999] shows that factoring more information et. al., 1978]. The process of scientific
2
discovery, for example, follows the second knowledge engineer. The communication
modality where theories are created about language of the system with the user is
unexplained phenomena and experiments limited exclusively to the language of the
are designed to validate or refute the user’s domain of interest. All technical
theories. In turn, the data are also used to terminology is hidden from the user. The
improve theories and the process continues user defines the domain as a list of concepts
where experiment and theory affect each with appropriate labels and defines
other. relationships between them with a weight of
causal belief. The model is captured, as a
The described system uses these results to
directed graph and hidden from the user.
guide the user in the process of making
decisions. Firstly, it allows the user to build The second principle is to allow the user to
a mental model of their domain of interest in attach specific qualitative and quantitative
a fairly simple and unrestricted manner. information to nodes and links in the graph.
Secondly, as the user queries the model, a The intent is to associate the same type of
hypothesis is proposed, and the system information uniformly across all nodes
automatically identifies the essential keeping the amount of added information to
parameters that are relevant to it. The system a minimum. The new information, although
also conducts a search to retrieve of the same type, contains different semantic
information for use in validating or negating meaning for each individual concept or
the hypothesis. The gathered evidence is relation.
used to redefine the query and to propose a
better and more likely hypothesis. This cycle 4 DESCRIPTION OF THE SYSTEM
of operation patterns itself after the human AND ITS USE
factor trends described in this section that
4.1 WHAT STRATEGIC DECISION
can lead to better decisions.
MAKERS WHANT TO KNOW
3 A KNOWLEDGE ACQUISITION Based on documents of strategic decisions
PERSPECTIVE FOR BUILDING and discussions during meetings of strategic
COMPLEX MODELS nature, the observation was postulated that a
common set of prediction parameters that
From a knowledge acquisition perspective,
decision makers are interested are the
the translation of the aforementioned human
occurrence of events and the emergence of
factors into system design specifications
trends. Moreover, for each predicted event
must conform to proper practices of user
or trend, decision makers are interested in
interface design. Domain models that
knowing their likelihood of occurrence, their
facilitate strategic decisions can become
magnitude or impact, and the time when
quite complex. How can such complexity be
they expected to occur.
managed during the knowledge acquisition
stage, was an important question that needed 4.2 BUILDING THE “MENTAL MODEL”
to be addressed.
Specifying the use of mental models in
Two basic principles are defined and applied computing architectures of decision support
to the user interface design to manage the system requires defining methodologies of
building of complex models. The first how such models are to be built. The type of
principle is driven by the goal to create a model suggested here is a cognitive causal
system that will allow direct and iterative model in the form of a directed graph made
interaction with the user, and will not up of nodes representing concepts and links
necessitate the presence of an intermediary
3
representing relationships. Concepts are model enables the system to acquire as
ideas represented by descriptive labels or much information as possible.
phrases that convey a specific meaning or
Figure 1, shows the model building screen
thought that is relevant to the domain of
for building the mental model. The model is
interest. Such model can be built by making
built by defining concepts and their
a list of concepts and identifying, for each
relations. As an example, concepts are
concept, all other concepts that may affect it.
shown for the “Aviation Safety” domain. In
This type of model is referred to as an Figure 2, “Traffic Control Errors” and
“unconstrained model” because there are no “Public concern” represent state transition
restrictions as to the number of concepts or concepts that affect the “Occurrence of
directional relations that can be defined in Accidents”, a discrete event concept.
the graph. Any number of cycles is allowed.
Since not all relations have the same degree
of belief, a “weight of causal belief” is
introduced and attached to each parent-child
relation. Such weight represents the degree
of belief that if the parent is true, it will
influence the child to become true as well.
An arbitrary scale between [0, 1] is used to
assign such weight. In order to make Figure 2- Examples of concepts, relations
inference about proposed hypotheses and and weight of causal belief
make predictions about the domain, the The unconstrained model can be as large
unconstrained model is converted into a and complex as needed. The model is
graphical probabilistic network by reducing captured as a directed graph. Figure 3 shows
the directed graph into an acyclic graph. a simple version of the Aviation Safety
model where the negative signs at the end of
the arrows representing negative weights of
belief.
Figure 3- Simplified graph of Aviation
Figure 1 – Model building screen Safety domain
During model creation the system allows as 4.3 INFERENCE AND PREDICTION
much flexibility for free association, without
To be able to make predictions that decision
much concern about its ability perform
makers are interested, such as the likelihood,
inference. The use of the unconstrained
magnitude and time of expected event and
4
trend occurrences, Bayesian networks are 5 MENTAL MODEL GUIDING
used. The tool computes likelihoods and EVIDENCE SEARCH
probabilities in response to queries. The user As mentioned in Section 2, the iterative
can add information to concepts to define process of using mental models to raise
them in term of dimensional quantities. hypotheses and the subsequent search for
Characteristic time quantities are also added information to help validate them has been
for use in temporal reasoning [Nodelman, et. shown by experiments to result in more
al., 2002]. To transform the directed graph accurate decision making.
into an acyclic Bayesian network, two steps
are necessary. The first step is to eliminate
cycles in the directed graph. This is done by
minimizing information loss as a trade-off
between the full expressiveness of the
unconstrained model and the ability to make
predictive inference. The second step is to
utilize the weights of causal belief to create
conditional probability tables between each
node in the acyclic graph and its parent
nodes. Similar approaches are found in
[Rosen and Smith, 1996]
4.4 QUERY AND HYPOTHESIS CREATION
A query presents a question of how evidence Figure 4- Query response with predicted
or beliefs about current events (or trends) hypothesis
can predict the occurrence of future events
of interest. The current evidence is the state While the user builds the model, the tool
of occurring events and on-going trends. also provides added utilities to facilitate the
Experimental psychology factors described automated building of a text classifier. The
in Section 2, suggests that only a few creation of the classifier is based mostly on
parameters are relevant in addressing a the concept labels and the text descriptions
specific query in support of a decision. To that are attached to the nodes and links of
answer a query the system identifies the sub the model graph.
model from the unconstrained model The tool’s prediction capability using the
containing only those parameters that are mental model allows the user to discover the
relevant to the query [Druzdzel and most likely hypotheses that justify the search
Suermondt, 1994]. The screen in Figure 4 for validating information. The capability to
shows the results of such query with the identify the parameters most relevant to the
corresponding prediction. hypotheses further narrows the space of
The response to the query is in the form of a possible search. And the text classifier built
hypothesis that predicts the likelihood of an using the domain language, introduced by
event or trend given current evidence or the user, helps conduct the search in an
beliefs. The user can simulate various ‘what efficient manner as is shown next. The
if’ scenarios to gain insight into which of the search requires the presence of a large
postulated hypotheses will result in the most corpus of text documents. The text classifier
likely future events. built by the tool is used to rank text content
relevant to concepts and relations associated
5
with the hypotheses that resulted from the hypothesis. In this manner, the user can
query. search through the documents and build a
case by selecting the pertinent document
Figure 5 shows the screen with the results of
paragraphs that help support or reject the
the classification process. Every document
hypothesis.
has been classified into the relevant concepts
for the query. A score is computed by the
classifier for each relevant concept. The
documents are ranked according to content
most relevant to the concepts in the
hypothesis.
Figure 6 - Query response with predicted
hypothesis
Figure 7 shows a screen with the ranked
paragraphs for each specific document. The
Figure 5- Query response with predicted search process is driven, from the top down,
hypothesis by the most relevant content. Tens of
The user can inspect each concept and thousands of documents can be processed in
identify the specific documents that have this manner, considerably reducing the
been associated with it. The screen in Figure search time.
6 shows the specific documents that have
been assigned to a concept. A second score
is also computed to rank the documents with
the most relevant content. Shown on the
right pane of Figure 6 is information about
author, source, title of each document, and a
level of reliability that the user selects for
each source. Such reliability factor is used to
re compute the ranking of each document.
Each document associated with a concept is
further broken down into ranked paragraphs,
as shown in Figure 7.
The highest ranked paragraph contains the Figure 7 - Query response with predicted
content most closely associated with the hypothesis
concept. The user can, quickly, inspect the The last step is summarization of the
most relevant paragraphs of each document evidence gathered from the selected
and select the paragraphs that contain documents and paragraphs which content
evidence to help substantiate or reject the helps to substantiate the hypotheses. The
6
paragraphs selected by the user are compiled Applications to Expert Systems. Journal of the Royal
automatically and captured in a file with the Statistical Society, B, Vol. 50, No. 2, 1988.
postulated hypothesis and the conclusions [Nodelman, et. al., 2002] Nodelman U., Shelton, C.
drawn by the user. R., and Koller, D. Continuous Time Bayesian
Networks. Proceedings of the Eighteenth Conference
on Uncertainty in Artificial Intelligence, pp. 378-387,
6 SUMMARY
[Oskamp, 1965] Stuart Oskamp, Overconfidence in
Experimental evidence from human factors
Case-Study Judgments, Journal of Consulting
studies describe behaviors that lead to more Psychology, 29, 1965, pp. 261-265.
accurate decisions. A prototype system is
[Pearl, 1988] Pearl. J. Probabilistic Reasoning in
presented that utilizes those factors in the Intelligent Systems, Morgan Kaufmann, San Mateo,
design of the user-interface to help the user California, 1988.
step through the creation of a model,
[Rosen and Smith, 1996] Julie A. Rosen and Wayne
presentation of a query, postulation of a L. Smith. Influence Net Modeling with Causal
hypothesis and, lastly, validation of the Strengths: An Evolutionary Approach, Proceedings
hypothesis by expediting the search through of the Command and Control Research and
a large corpus of documents in the context Technology Symposium, Naval Post Graduate
of the most likely evidence. School, Monterey, CA, June 25-28, 1996.
[Shepard, 1964] R. N. Shepard, On Subjectively
References Optimum Selection Among Multiattribute
[Allison, 1971] Graham Allison, Essence of Alternatives, in M. W. Shelly, II and G. L. Bryan,
Decision, 1971, The Case Study Anthology, Editor eds., Human Judgments and Optimality, New York:
Robert K. Yin, Sage Publications, 2004. Wiley,1964, p. 166.
[Cabena, et. al., 1998] Cabena P., P. Hadjinian, R. [Slovic and Lichtenstein, 1971] Paul Slovic and
Stadler, J. Verhees, and A. Zanasi. Discovering Data Sarah Lichtenstein, Comparison of Bayesian and
Mining: From Concept to Implementation. Upper Regression Approaches to the Study of Information
Saddle River, NJ: Prentice Hall, 1998. Processing in Judgment, Organizational Behavior
and Human Performance, 6 ,1971, p. 684.
[M. Druzdzel and H. Suermondt, 1994] Relevance in
Probabilistic Models: “Backyards” in a “Small [Summers et al., 1970] David A. Summers, J. Dale
World”. In Working Notes of the AAAI 1994 Fall Taliaferro, and Donna J. Fletcher, Subjective vs.
Symposium Series: Relevance, pp 60-63, New Objective
Orleans, Louisiana, Nov 4-6, 1994. Description of Judgment Policy, Psychonomic
[Elstein, et. al., 1978] Arthur S. Elstein et.al, Science, 18 , 1970, pp. 249-250.
Medical Problem Solving: An Analysis of Clinical
Reasoning. Cambridge, MA and London: Harvard
University Press, 1978, pp. 270 and 295.
[Goldberg, 1968] Lewis R. Goldberg, Simple
Models or Simple Processes? Some Research on
Clinical Judgments, American Psychologist, 23,
1968, pp. 261-265.
[Heuer, 1999] Richards J. Heuer, Jr., Psychology of
Intelligence Analysis. Center for the Study of
Intelligence, Central Intelligence Agency, 1999.
[Langley, 1996] Langley P. Elements of Machine
Learning. San Francisco: Morgan Kaufmann, 1996.
[Lauritzen and Spiegelhalter, 1988] Lauritzen S. L.,
and D.J. Spiegelhalter. Local Computation with
Probabilities in Graphical Structures and Their
7