Automated Narrative Extraction from Administrative Records*,**


     Karine Megerdoomian                                                 Karl Branting                              Charles E. Horowitz
       The MITRE Corporation                                        The MITRE Corporation                           The MITRE Corporation
          McLean, VA, USA                                              McLean, VA, USA                                 McLean, VA, USA
         karine@mitre.org                                            lbranting@mitre.org                             chorowitz@mitre.org

            Amy B. Marsh                                                   Nick Modly                                  Stacy J. Petersen
       The MITRE Corporation                                        The MITRE Corporation                           The MITRE Corporation
          McLean, VA, USA                                              McLean, VA, USA                                 McLean, VA, USA
         amarsh@mitre.org                                             nmodly@mitre.org                               spetersen@mitre.org

              Eric O. Scott                                            Sujit B. Wariyar
       The MITRE Corporation                                        The MITRE Corporation
          McLean, VA, USA                                              McLean, VA, USA
         escott@mitre.org                                            swariyar@mitre.org


ABSTRACT                                                                            history have allowed the probation office to have a better
                                                                                    understanding of their client population and to perform
The U.S. Probation and Pretrial Services Office staff produce                       analyses that were previously unavailable to the organization.
billions of pages of information on defendants’ and offenders’                      This technical approach can be applied across organizations,
profile and conduct. While it is critical for probation officers                    legal institutions, clinical administrations, and government
and district chiefs to have up-to-date knowledge on their                           agencies that maintain large amounts of information in the
clients to better assist and reduce risk of recidivism, the data                    form of free text narratives.
are often stored in narrative texts in multiple large documents.
As a result, these records remain mostly out of reach without
the use of painstaking manual review. This paper describes an                       1 Introduction
analytic prototype developed to automatically acquire
                                                                                    The U.S. Probation and Pretrial Services Office (PPSO) staff
structured information from natural language text in probation
                                                                                    supervise more than 300,000 people a year and collect and
office documents through the application of PDF content
                                                                                    produce billions of pages of information on defendants’ and
extraction, text mining, and language analytics. Since serious
                                                                                    offenders’ profile and conduct, as well as on the strategies and
mental illness is very prevalent in the U.S. corrections system,
                                                                                    actions of officers and their outcomes. While it is critical for
the first phase of the project focused on extracting information
                                                                                    probation officers to have up-to-date knowledge on their
and constructing timelines from narrative text regarding the
                                                                                    clients to reduce the risk of recidivism, the data are often stored
defendants’ mental health conditions, substance use and
                                                                                    in narrative texts in multiple large documents, making it very
treatment history.
                                                                                    challenging and time-consuming to collect all relevant case
                                                                                    information manually. This renders 70 terabytes of mostly
Automated narrative extraction and the construction of an
                                                                                    unstructured data on more than a million defendants, and
event timeline for defendants’ mental and emotional health
                                                                                    strategies used by thousands of officers over decades, mostly
                                                                                    unusable by PPSO [1]. As a result, policy makers, program
*In Proceedings of the Workshop on Artificial Intelligence and the Administrative
State (AIAS 2019), June 17, 2019, Montreal, QC, Canada.
                                                                                    evaluators, and probation and pretrial services staff have been
Copyright © 2019 for this paper by The MITRE Corporation. Use permitted under       denied valuable data with which to do their jobs.
Creative Commons License Attribution 4.0 International (CC BY 4.0).
Published at http://ceur-ws.org
** Approved for Public Release; Distribution Unlimited 19-1482. Throughout this     A significant number of offenders supervised by the U.S.
document, all names of people, places, facilities and dates are replaced with       probation services have a current mental health condition,
fictitious ones to anonymize the information.
                                                                                    most of them with co-occurring substance use disorders.
 AIAS’19, June, 2019, Montreal, Quebec Canada                                                                  K. Megerdoomian et al.

Defendants who suffer from mental disorders often require            2 Background
more intensive monitoring and specialized treatment [2]. We
                                                                     Past clinical information extraction systems have tended to rely
therefore focus on addressing important PPSO business
                                                                     on shallow NLP techniques (pattern-matching, simple parses,
questions to better understand the nature of the mental
                                                                     linear pattern interpretation rules). More recently, however,
conditions in the officers’ caseload and gain knowledge of the
                                                                     several projects have adopted knowledge-based approaches
defendants’ diagnosis and treatment history. The information
                                                                     adapted for the clinical domain.
was automatically obtained from the free text sections of
Presentence Investigation Reports (PSIR), which represent
                                                                     While the advantages of machine learning methods for
investigations into the history of the person convicted of a
                                                                     information extraction cannot be denied, they also present a
crime before sentencing to determine if there are extenuating
                                                                     number of limitations in applications for narrative extraction
circumstances. To automatically extract and analyze the free
                                                                     from clinical data. To begin with, machine learning algorithms
text information in the PSIRs, we applied language analytics
                                                                     require large amounts of training data which are pre-tagged for
technology to detect the events of interest (substance use,
                                                                     the relevant features and parameters. Preparing the pre-
diagnosis, treatment sessions, prescriptions) in the defendant’s
                                                                     annotated data sets can be time-consuming and expensive. In
life and visualized them as a timeline of activities that could be
                                                                     addition, such probabilistic approaches might miss rare
reviewed by the probation and parole officers.
                                                                     phenomena that need to be identified since they do not occur
                                                                     often enough in the training data to be picked up by the
The system leverages Apache cTAKES (clinical Text Analysis
                                                                     learning algorithms. Another challenge for using machine
and Knowledge Extraction System), an open-source Natural
                                                                     learning methods in the clinical domain is that users often
Language Processing (NLP) system developed specifically to
                                                                     expect high level of consistency in the results and precise
extract and analyze clinical information from unstructured text
                                                                     information on how the computational decisions were made. In
[3]. cTAKES identifies clinical terms such as drugs, diseases and
                                                                     such instances, a rule-based approach might be more
disorders, symptoms, and medical and treatment procedures.
                                                                     transparent and easier to understand and modify.
It also performs deep textual analysis and can identify, for
instance, if a sentence is negated or not, or if the person being
                                                                     The approach described in this paper leverages in-depth
discussed is the patient or a family member. The prototype
                                                                     linguistic and semantic analysis to detect the domain
system combines the results of cTAKES with rich linguistic
                                                                     information in narrative text, more in line with recent
analysis from other open source systems such as concept
                                                                     knowledge-based approaches [5] [6]. Machine learning
ontologies and the Stanford CoreNLP parser and entity
                                                                     approaches often require a large amount of pre-annotated data
recognizer [4]. These syntactic and semantic analyses are then
                                                                     on which to train the algorithms. Since the PSIR data had not
enhanced to adapt to the use case, by identifying significant
                                                                     previously been tagged for the events of interest and mental
terms for the events of interest for the mental health domain,
                                                                     conditions, a purely machine learning approach was not readily
applying linguistic analysis to improve argument and negation
                                                                     available. Hence, the prototype applies a hybrid method. It
detection, and implementing recent advances in NLP to
                                                                     leverages rich linguistic and semantic information through the
improve precision (e.g., vector space semantics, algorithms for
                                                                     application of open-source Natural Language Processing
building a narrative timeline).
                                                                     systems, adapted for the existing use case by applying a
                                                                     combination of rule-based linguistic analysis, vector space
All extracted information on a defendant’s narrative is stored
                                                                     semantics, and machine learning techniques to enhance the
in a graph database and displayed on a dynamic map, allowing
                                                                     results. These were used to improve negation detection and
filtering of results based on judicial district, defendants’
                                                                     argument identification (i.e., entities the events refer to), and to
demographic information (age, education, citizenship),
                                                                     develop temporal reasoning algorithms. Ontologies (lexicons)
criminal category, mental conditions or medications
                                                                     of mental health and medication terms, vetted by a subject
prescribed.
                                                                     matter expert, were used for concept identification. The rest of
                                                                     this section provides a detailed description of the technical
As large amounts of information in business, government and
                                                                     steps in building the analytic prototype.
administration are maintained in the form of narratives
(clinical records, legal and financial summaries, progress
reports, human resources assessments, etc.), the approach            3 Technical Approach
described in this paper for acquiring structured information
                                                                     The technical approach is a hybrid one, leveraging open source
from narrative text can be reapplied across organizations and
                                                                     NLP applications often developed by training machine learning
government agencies.
                                                                     algorithms, and refining the syntactic and semantic analyses
                                                                     with a combination of knowledge-based and probabilistic
                                                                     approaches.
 Automated Narrative Extraction for Administrative Records                            AIAS’19, June, 2019, Montreal, Quebec Canada

3.1 Analytic Pipeline                                                 5.   User Interface (UI): This component interacts with the
                                                                           Neo4j database and displays results on a Google Earth
The presentence reports undergo several steps in order to
                                                                           map. The UI allows the user to run queries, to review the
extract the defendant’s mental health and substance use
                                                                           details on particular defendants, and to see aggregate
narratives. These are shown in Error! Reference source not
                                                                           results on the data set.
found. and are described in detail in the rest of this section. The
specific steps involved are:
                                                                      3.2 Content Extraction
1.   Content Extraction: parsing the different sections of the        The Content Extraction component parses the PDF presentence
     PDF documents and extracting the structured profile and          reports, identifies all subsections and extracts the textual
     criminal information as well as all free text content. This      content. To analyze the mental health and substance use
     component also “cleans” the data by normalizing the              information of defendants, the text content of the Mental and
     textual content to maximize processing.                          Emotional Health (MEH) and Substance Abuse (SA) sections in
2.   Language Analytics: The extracted text for each PSIR is          presentence reports are automatically extracted. In addition,
     run through the Natural Language Processing                      this step identifies and extracts all federal charges from the
     components, providing a full linguistic parse, a list of         cover sheet of the PSIR, criminal history information from the
     entities and events of interest, and semantic relationships.     Juvenile Adjudications and Adult Criminal Convictions sections
3.   Knowledge Discovery: This step is the heart of the               of the report, Arrest Dates and associated charges from the
     textual analytics where the system identifies all concepts,      Criminal History information, and Criminal History Score and
     events, and their relationships for the domain of interest.      Category from the Criminal History Computation section.
     •    Identifies the events of interest associated with the
          defendant      (arrests,     diagnoses,    treatments,      The prototype’s Content Extraction component successfully
          prescriptions, drug use, suffering from a mental            extracted information from 92% of the original PDF
          condition);                                                 documents, providing us with a data set of 11,243 extracted
     •    Determines whether the information is obtained from         narrative text documents to analyze. Given that some
          medical records or if it is reported by the defendant,      defendants have more than one presentence report associated
          by a medical professional, or by a third party;             with them, the successfully extracted content corresponds to
     •    Provides full event description including date,             10,973 defendants. The free text content extracted from the
          location, persons involved, treatment provider,             MEH and SA sections amount to 22,486 text items. These can
          nature of treatment and medication prescribed;              range from a few sentences to several paragraphs depending
                                                                      on the report.
     •    Computes the temporal relationships between the
          various events to build a narrative timeline for a
          defendant.
                                                                      3.3 Language Analytics
                                                                      The Language Analytics component leverages existing Natural
                                                                      Language Processing software to perform various linguistic
                                                                      analyses on a piece of text. NLP is a subset of Artificial
                                                                      Intelligence (AI) and is fast becoming an essential technology
                                                                      in modern-day organizations to gain significant insights from
                                                                      unstructured content, such as email communications, social
                                                                      media, videos, customer reviews, customer support request,
                                                                      and administrative records in business and government.
                                                                      Natural Language Processing tools and techniques help to
                                                                      automatically process, analyze, and understand large amounts
                                                                      of data, providing structure and meaning to information that
Figure 1: Analytic pipeline for narrative extraction and timeline
                                                                      originally was in unstructured form.
                          development
                                                                      In this step of the analysis, the texts extracted from the Mental
4.   Neo4j Database: Neo4j is a graph database management             and Emotional Health and Substance Abuse sections of the
     system and is available as open source software. All             PSIRs are run through several NLP software tools. The software
     extracted information from the Knowledge Discovery               packages currently in use are Apache cTAKES (clinical Text
     component, as well as the client demographic metadata,           Analysis and Knowledge Extraction System), Stanford Named
     and structured information on arrest history and federal         Entity Recognizer, and FONS (Framework for Operation NLP
     offenses extracted from the presentence reports are              Services) – a software package pipeline leveraging open source
     loaded into the database.                                        tools and was built by a research team at MITRE to detect
                                                                      events of interest to national security.
 AIAS’19, June, 2019, Montreal, Quebec Canada                                                                 K. Megerdoomian et al.

                                                                      1.   Identify concepts (entities and events) of interest
cTAKES output forms the primary basis for further analytics. It            associated with the client, including mentions of a client
was chosen primarily because of its entity recognition                     suffering from a mental condition, diagnoses, treatments,
capabilities in the clinical domain, which aligned with the                prescriptions and drug use.
desire to obtain data about PPSO clients’ mental and emotional        2.   Detect the event description such as the date and location
health and substance use. Entities identified by cTAKES include            when it occurred, the persons involved, the treatment
medical conditions, drugs/medications, medical procedures,                 provider, the nature of treatment (e.g., inpatient or
and medical symptoms. The entities identified by cTAKES out-               outpatient, anger management, drug rehabilitation) and
of-the-box were supplemented with additional entities                      the medication prescribed.
frequently encountered by analysts in PSIRs. We worked                3.   Detect the source of the information – was the information
closely with a PPSO subject matter expert to review the list of            reported by the client, was it obtained from medical
conditions and medications that cTAKES recognized, and                     records or a medical professional, or reported by a third
identify the ones that were of interest in the mental and                  party?
emotional health and substance use domain. The subject
matter expert also identified a more general superclass for each      As described, cTAKES detects these entities of interest in the
of these specific mental and emotional conditions so that             mental and emotional health domain. However, to identify
further analysis could be conducted at the appropriate level of       whether a client is suffering from a mental condition, it does
granularity. For example, conditions such as depression, chronic      not suffice to simply retrieve sentences with a mental condition
depression, and major depressive disorder were all mapped to          mention. It is also important to detect the subject of the
the more general term depressive disorder.                            sentence to distinguish cases where a family member is
                                                                      mentioned to suffer from a mental condition (e.g., “the
cTAKES also provides domain-independent NLP capabilities of           defendant’s mother suffered from Schizophrenia”), and to
syntactic parsing, dependency parsing, and semantic role              exclude any negated events (e.g., “the defendant does not suffer
labelling – it can give the base forms of words, their parts of       from a severe mental disease or defect”). Fortunately, when
speech, mark up the structure of sentences in terms of phrases        cTAKES identifies a concept, it also identifies that sentence’s
and syntactic relations, detect negation in the sentence and          polarity (whether the entity appears in a negated context or
identify the role of the entities in a sentence (e.g., agent of       not), and the event’s subject (whether that event or concept
event). The results of all these capabilities were used to identify   should be ascribed to the client described in the text, a family
events of interest in a client’s mental and emotional health and      member of the client, or someone else). Some modifications to
substance use history. However, we found it useful to                 the cTAKES source code were made to improve the accuracy of
supplement the cTAKES output with other natural language              these attribute identifications.
processing systems to achieve the most accurate analysis. The
Stanford Named Entity Recognizer was applied to identify              While the cTAKES entities can be counted to obtain statistics on
people, places, organizations, dates, times, and locations, none      the prevalence of various mental conditions among the
of which are identified by cTAKES. Additionally, the FONS             defendant population, further processing is necessary to
system, which also generates entities, syntactic parsing and          identify more complicated events, such as receiving a
dependency parsing output, was used to supplement cTAKES’             diagnosis, attending treatment, being prescribed medication,
output to obtain a higher level of accuracy. In particular, FONS      or using drugs. To identify the events of interest, a small sample
was applied to the PSIR text data to tag entities (people,            of PSIRs was reviewed to identify the verbs commonly
facilities, locations, dates and times), and to categorize all        associated with these events. An iterative process was used in
events into conceptual classes by detecting event types (e.g.,        reviewing the event detection results and updating the
state, transfer, communication) and different verb meanings           predicates for the domain. The verbal predicates associated
(e.g., prescribe can either be the verb denoting the prescription     with each type of event are listed in Table 1.
of medication by a medical professional or a communication
event meaning ‘to advise’, ‘to recommend’).                            Event Type            Predicate
                                                                       Diagnosis             diagnose
3.4 Domain-Specific Entity and Event
                                                                       Prescription          prescribe, treat (with)
   Identification                                                      Treatment             admit, attend, complete, discharge,
The Knowledge Discovery phase of the analytics involves                                      enroll, enter, hospitalize, meet,
processing the output from the Natural Language Processing                                   participate, place, receive, see, seek,
systems to perform several steps in knowledge discovery in                                   speak, treat, undergo
natural language text:                                                 Usage                 abuse,    addict,    consume,    drink,
                                                                                             experiment, ingest, inhale, relapse,
                                                                                             smoke, snort, take, try, use
 Automated Narrative Extraction for Administrative Records                            AIAS’19, June, 2019, Montreal, Quebec Canada

  Table 1: Verbs used to identify events related to mental and       of the source of information. The top verbs identified as
              emotional health and substance use                     Communication events are listed in Table 2.

Once the predicates are identified, the semantic roles
                                                                      Event Type                 Predicate
associated with each occurrence of the predicate are
                                                                      Communication              state, indicate, note, explain,
automatically extracted to enable the identification of the
                                                                                                 report, say, acknowledge, discuss,
predicate’s agent, affected entity, and whether the predicate
                                                                                                 identify, confirm, deny, address,
was negated. The sentence in which the predicate appeared
                                                                                                 agree, communicate, question,
was also examined to identify medications, drugs, mental
                                                                                                 suggest, tell, describe, claim,
conditions, medical procedures, and treatments associated
                                                                                                 mention, inform, disclose
with that event.
                                                                      Other formulation          according to
                                                                        Table 2: Terms used to identify the source of information.
To detect the source of the information, all sentences with
Communication events identified by the FONS software
                                                                     This linguistically rich event-based narrative analysis
package were analyzed and the subject of the verbs extracted.
                                                                     methodology allows the Language Analytics component to
For example, in “Dr. Gray stated that the defendant has never
                                                                     extract information of interest including the people involved in
been hospitalized for emotional disorders of any kind”, the
                                                                     the event, the time it occurred, and the places mentioned. A
communication verb stated is detected and its subject, Dr. Gray
                                                                     sample analyzed sentence is shown in the following example:
(a medical professional), is identified as the source of the
information. Similarly, in the example “the defendant’s mother           The       defendant<source-of-info>      reported
also reported he was diagnosed with Bi-Polar Disorder several            she<affected-entity/diagnose-event>           was
years ago”, the source of information is identified as the               diagnosed<diagnose-event> at the age of 14<time>
defendant’s mother (a third party).                                      with                 depression<mental-condition>,
                                                                         schizophrenia<mental-condition>     and   bi-polar
                                                                         disorder<mental-condition>     and     was     not
If the subject of the communication verb is mentioned as the
                                                                         prescribed<prescribe-event|NEGATIVE>           any
defendant, the system treats it as a self-reported event. In the
                                                                         medication<medication-mention>.
writing style of the presentence reports, mentions of he or she
tend to refer overwhelmingly to the defendant. Since the             3.5 Generalized Event Analysis
current version of the analytic system does not include a
                                                                     While cTAKES proved very useful for identifying events in the
“coreference resolution” component that can accurately
                                                                     clinical domain, it is not specifically tuned for identifying more
identify who the pronouns refer to, the assumption is made to
                                                                     general events. Events that are not directly related to
treat these cases as self-reported events. This can be seen in the
                                                                     diagnoses, prescriptions, substance abuse, or treatment may
following example where the events in both sentences are
                                                                     still be of interest when analyzing a client’s mental and
automatically labeled as self-reported: “The defendant
                                                                     emotional health history. For example, in “He became depressed
expressed feelings of depression, helplessness, and hopelessness.
                                                                     when his infant brother died”, the event of the infant brother’s
He also admitted to occasional auditory hallucinations.”
                                                                     death does not fall into one of the domain-specific event
                                                                     categories, but it is still relevant to indicate a trigger or risk
If the name of the defendant is mentioned as the subject of the
                                                                     factor. To try to capture these types of events, a more general
communication verb (e.g., “McKenna could not recall being
                                                                     approach to parsing free text was used, producing an event-
prescribed medication to treat his Depression”), an additional
                                                                     based analysis for every verb encountered in the Mental and
step is performed to verify the name McKenna against the
                                                                     Emotional Health and Substance Abuse sections.
defendant metadata information – if the system finds a match,
then the information is labeled as self-reported.
                                                                     As part of the Knowledge Discovery phase, the linguistic output
Certain automated enhancements had to be made to the                 from the NLP systems loaded into the Neo4j graph database is
Communication event detection, however, since the automatic          used as the basis for generating events that do not rely on a
classification by FONS included verbs such as stuttered and          domain-specific vocabulary. In this framework, events are
snorted. In order to improve the results, we computed semantic       generally identified by the presence of a verb and an event-
vector measures that capture the similarity in usage of verbs        based analysis is performed on the sentence. In simple
against canonical Communication events such as reported and          sentences, this means that one event corresponds to the entire
stated. The verbs that are closest in the context of use within      sentence. However, if a sentence contains multiple clauses,
the text and thus have closer meaning to the report/state verbs      each clause could potentially represent one event. In the
produce higher values and are thus more likely to be indicative      sentence “he became depressed when his infant brother died”,
                                                                     becoming depressed is one event, and his infant brother died is a
                                                                     separate event. The two clauses are linked by the conjunction
 AIAS’19, June, 2019, Montreal, Quebec Canada                                                                  K. Megerdoomian et al.

when, which indicates the temporal relationship between them.
To handle sentences such as these, a list of terms that signify a      To place events in order, TimeML uses the TLINK tag, which
subordinate clause was created and sentences were divided              records the id’s of two related events and the temporal
into clauses when one of these terms was found. The list of            relationship between the two. In this project, temporal
terms used is in Table 3 below. These terms are used in further        relationships are marked as an attribute of the event rather
analytics to identify temporal or causal relations between             than a separate entity, and an abbreviated set of seven
events.                                                                temporal relationships are used, rather than the fourteen
                                                                       defined in TimeML. The temporal relationships utilized are
 Relationship Type         Clause Marker Terms                         listed in Table 4.
 Temporal                  after, before, during, following, prior
                           to, throughout, until, upon, when,           Relationship         Description
                           while                                        AFTER                Event 1 occurs some time after Event 2
 Causal                    although, as a result of, because, due       BEFORE               Event 1 occurs some time before Event 2
                           to, in order to, since                       BEGINS_AT            Event 1 occurs immediately after Event
 Other                     according to, along with, in addition                             2
                           to, relating to                              ENDS_AT              Event 1 occurs immediately before
Table 3: Terms signifying the presence of a subordinate clause in                            Event 2
                           a sentence.                                  INCLUDES             Event 1 starts before and ends after
                                                                                             Event 2
After all clauses have been identified, an event is generated for       IS_INCLUDED          Event 1 starts after and ends before
each clause. If the clause contains a verb, the verb phrase forms                            Event 2
the basis of the event. If there is no verb phrase in the clause,       SIMULTANEOUS         Event 1 and Event 2 start and end at the
(e.g., in the sentence “while in prison, the defendant used heroin”,                         same time
“while in prison” is a clause without an explicit verb), the            Table 4: Temporal relationships used for timeline generation.
phrase after the clause marker forms an event description
which is the basis of the event. Then, information from the            Determining the values of the temporal type, start time, end
syntactic parses, dependency parses, semantic roles, and               time, and temporal relationships for the generated events is a
named entities are used to identify agents, affected entities,         three-step process. In the first pass through the events, any
indirect objects, locations, and temporal mentions related to          temporal mentions associated with each event were parsed
the basis of the event for a complete narrative analysis.              with regular expressions and used to set the event’s type, start
                                                                       date, and end date. Next, temporal relationships were
3.6 Temporal Reasoning                                                 identified by examining events to see if they contained any of
Once all relevant events have been extracted from the text of          the subordinate clause markers listed in Table 5. Rules were
the PSIRs, it is possible to make a timeline of the relevant events    then applied to relate two events connected by a subordinate
with temporal mentions in a client’s history. To accomplish            clause marker. One final pass through the events was used to
this, we adapted TimeML (Markup Language for Temporal and              set any additional start and end dates that could be inferred
Event Expressions) standards to the narratives generated [6].          after the temporal relationship was determined.
TimeML is designed to provide a standard way to annotate
events with a time stamp and place events in chronological             We can follow this entire process on the sentence “He began
order; it is thus optimal for the problem of timeline generation.      smoking marijuana at the age of 16 until his arrest in 2014”,
In TimeML, events are typically described by verbs, which              which contains the events he began smoking marijuana and his
aligns with our approach to narrative generation. In the actual        arrest in 2014. The first step after the identification of the two
TimeML specification, temporal expressions are marked as               clauses is to identify the presence of the temporal expressions
separate entities, falling into the categories of date (for events     in each clause – at the age of 16 in the smoking event (EV1) and
that take place at a specific time, which might be a date, month,      in 2014 in the arrest event (EV2). In EV1, the start time can be
or year), time (for events that take place at a specific time of       obtained from the defendant’s date of birth in the profile
day), duration (for events that have clear start and end points),      information available in the database. In EV2, the Knowledge
and set (for periodic events). In our adaptation, we recorded          Discovery component establishes that the temporal expression
the type of temporal expression as an attribute of the event it        is of type date, with a start and end time set to span the whole
was associated with, and did not use the category of time since        year as shown in Table 5, since the time is not more clearly
the specific time of day of various events is not typically            specified than that. The second step will identify the
specified in PSIRs. The category of set was recorded but is not        subordinate clause marker until, and follow a rule that
currently used for timeline generation. The start date and end         establishes that the smoking marijuana event ended at his
date of each event are also recorded as additional attributes.         arrest in 2014. The final step will use the presence of the
 Automated Narrative Extraction for Administrative Records                              AIAS’19, June, 2019, Montreal, Quebec Canada

ENDS_AT relationship to set the end time of he began smoking           The User Interface interacts with the Neo4j database to access
marijuana to the start time of his arrest in 2014. The final event     all content and narrative analytics output and displays the
analysis associated with a temporal range is then used to build        results on a Cesium Server. The web-based interface allows the
a timeline and visualize on the web-based interface.                   user to run queries of interest, filter based on the defendant’s
                                                                       profile information, and view the retrieved information on a
 Clause/Event           [He began smoking marijuana]clause1/EV1        spatial map of judicial districts or States.
 detection              until <clause-marker>
                        [his arrest in 2014] clause2/EV2               The UI displays an aggregate report of the data for the provided
 Step    1:  Detect     EV1: <type: “date”, startAt: {year: ‘1992’,    query as shown in Figure 2. This display can be further filtered
 temporal               month: ‘6’, day: ‘12’}, endAt: None>
                                                                       based on the mental conditions, medications and substances of
 expressions when       EV2: <type: “date”, startAt: {year: ‘2014’,
 available              month: ‘1’, day: ‘1’},                         interest, as well as the defendant’s demographic information
                        endAt: {year: ‘2015’, month: ‘1’, day: ‘1’}>   and criminal category.
 Step 2: Establish      id: “EV1”,
 temporal relation      relType: “ENDS_AT EV2”
 between events
 Step 3: Temporal     EV1: <type: “date”, startAt: {year: ‘1992’,
 reasoning to set     month: ‘6’, day: ‘12’},
 temporal             endAt: {year: ‘2014’, month: ‘1’, day: ‘1’}>
 expression           EV2: <type: “date”, startAt: {year: ‘2014’,
                      month: ‘1’, day: ‘1’},
                      endAt: {year: ‘2015’, month: ‘1’, day: ‘1’}>
 Table 5: Temporal reasoning process in Knowledge Discovery
                           phase.


4 Graph-Based Representation

The main motivation for using a graph database to store the
parse output is that syntactic parse outputs are often modeled          Figure 2: Narrative analytics results viewed by judicial district.
in linguistic theory in the form of trees (a graph in which each
node has a single parent) and dependency parses capture the            The user can then select to view the identified defendants on a
semantic relationship associated with two nodes, so storing the        map to the level of street detail. The user may also select to
parse outputs as a graph allows to use Neo4j API (Application          view a particular client’s information in more detail, such as
Programming Interface) and CQL (Cypher Query Language) to              mental conditions reported, and see associated text from the
directly access these grammatical relationships and handle the         Presentence Investigation Report with relevant sections
recursion inherent in language. Additionally, once the natural         highlighted. In addition, the data are used to visualize a
language parsing outputs are stored in graph format, it is easy        timeline of the defendant’s life events including arrests,
to align and merge the outputs from the different NLP systems          diagnoses, substance use, and treatments.
being used. Finally, Neo4j provides a visualization of the graph
for linguists and developers that assists in understanding the
structure of the language.                                             5 Results of Analytics

Once the output from the NLP systems is stored in the database,        The programmatically important questions of interest to PPSO
we apply several enhancements to the raw system output to              that are addressed in the current prototype are (i) determining
improve the parses’ accuracy and generalizability. These               how many defendants sentenced had a mental health
enhancements include labelling all nodes with a more coarse-           condition; (ii) the types of conditions present; (iii) the source
grained part-of-speech tag, grouping together multi-word verb          of the diagnosis; (iv) prior treatment exposure; and reporting
phrases into a single entity (e.g. merging the nodes for the           of that information by demographic, offense and prior criminal
terms in the phrase has been attending into a single node attend       history information.
with appropriate tense and aspect information), and combining
coordinated phrases with conjunctions into a single entity (e.g.       To identify the number of defendants with a mental illness, the
merging the nodes for the terms in the phrase mental and               system extracts all the client cases where a mental illness was
emotional into a single node to facilitate further analysis).          mentioned as attributed to the defendant (whether officially
                                                                       diagnosed or not). It was found that 3,959 defendants in the
                                                                       data set (about 36% of the studied population) had a history of
 AIAS’19, June, 2019, Montreal, Quebec Canada                                                               K. Megerdoomian et al.

one or more mental conditions. If Substance Use Disorder is
included as a mental health condition, that number increases to      System performance was evaluated by creating a small
58%. Figure 3 provides the heuristics for the mental health          reference sample of about 500 sentences to measure the
conditions mentioned in the Mental and Emotional Health              accuracy of the information extracted for each event type. The
sections of presentence reports studied. However, the total          500 sentences were manually annotated by team members
number of defendants who have officially been diagnosed with         indicating the expected mental conditions, event types
a mental condition is 2,238 (20% of the studied population). In      (diagnosis, treatment, prescription, usage), and medications.
addition, 82% of the defendants had a history of substance use       The annotations also included important event-related
(mainly Marijuana and alcohol), and 53% of cases had a prior         information such as the agent (prescriber, diagnoser), polarity
criminal record cited. Most common prescriptions are Prozac,         (whether the event is negated or not), and the temporal
Ritalin, Seroquel and Xanax, and top substances include              expression associated with the event. The language analytics
Marijuana, Alcohol, Cocaine, Methamphetamine and Heroin.             results were then compared to the pre-annotated reference set
                                                                     to measure how many of the detected elements were accurate
                                                                     and to also calculate how many of the expected elements were
                                                                     not picked up by the system.


Figure 3: Mental health conditions associated with about 11,000
                          defendants.

As described earlier, the analytic prototype identifies the source
of information for each detected event of interest. There are
                                                                         Figure 4: Correlation analysis shows defendants' onset of
five distinct categories for the source: (i) self (client self-
                                                                     substance use. The x-axis represents the defendant’s age and the
report), (ii) medical professional, (iii) medical records, (iv)      y-axis is the number of times the onset of substance consumption
report (official non-medical records, including evaluations and                              is found in the text.
assessments), and (v) third party (third party corroboration
such as a family member, defense counsel, probation agent, or        We also explored the aggregated national and district data for
pretrial services agency). In the presentence reports studied, the   potential correlations and analyses across defendants. Figure 4
majority of the events (about 89% of all events found) are self-     illustrates one such analysis, which shows the onset of
reported.                                                            substance use among the defendants studied. This examination
                                                                     automatically detects any mentions of the age of the defendants
The full set of results in response to the PPSO business questions   in the Substance Abuse texts and identifies any sentences that
is shown in Table 6.                                                 refer to the onset of using a drug or alcohol by the defendant.
                                                                     For instance, a sentence such as “he began using cocaine at age
                                                                     17”, is labeled as an “inception predicate” and associated with
                                                                     the age of the defendant (i.e., 17). The results show that the
                                                                     onset of substance use among defendants starts at age 10, with
                                                                     a steady increase to age 16 and peaks at age 18.

                                                                     This analysis is only one example of the types of aggregated
                                                                     correlations and computations that are available after the full
                                                                     language analytics have been performed on the data. Other
                                                                     correlations explored include automatically detecting
Table 6: Automatically obtained responses to the PPSO business       instances of co-morbidity to understand which mental
  questions on defendants' mental health and substance use           conditions tend to co-occur most often among the population,
                            history.                                 automatic detection of defendants with previous suicide
 Automated Narrative Extraction for Administrative Records                            AIAS’19, June, 2019, Montreal, Quebec Canada

attempts or history of suicidal ideation, and identification of       but risk/needs assessments for offenders who are on
events that may trigger mental health issues (e.g., death of a        supervised release are not normally referred to in judicial
family member, history of sexual or domestic abuse, fatal             decision-making.
medical diagnosis, divorce).
                                                                      In this work, we focus on the foundational question of
                                                                      extracting information from unstructured text that can inform
6 Application to Risk Assessment                                      the decisions of officers and analysts working within the
                                                                      federal probation system. We defer questions about automated
Analysts working in the Probation and Pretrial Services domain        risk assessment, and the fairness thereof, to future research.
leverage a variety of data-driven instruments to measure              The current work focuses instead on extracting and arranging
trends, train officers, and assess the recidivism risk in             raw facts from various sources in a visualization that a human
individual clients. At a high level, these efforts are typically      can use to support their professional judgement in a particular
described in terms of the popular Risk, Needs, and Responsivity       case and that an officer can potentially leverage in detecting
model, which dictates that effective offender supervision ought       patterns that had previously been unavailable.
to allocate more treatment resources to high-risk clients, that
treatment should target specific criminogenic needs in the
client’s case, and that officers should apply cognitive-              7 Future Directions
behavioral techniques to respond to the details of a client’s
particular situation [7, 8]. In recent years there has been a         The paper describes a successful approach to the automatic
trend toward using data-driven approaches for the first step,         extraction and analysis of narrative text in the mental health
and actuarial risk assessment instruments such as “Levels of          and substance use domain. The approach has since been
Service” surveys [9] and the federally developed Post-                applied to other domains such as employment history and
Conviction Risk Assessment (PCRA) [10] have played an                 financial history. The results provide evidence that the use of
increasingly important role in the allocation of treatment            technology in identifying important information in free
resources. These tools are typically based on survey questions        narrative text in administrative records is feasible and cost-
that must be administered and recorded by the officer, which          effective, and any adaptations to new domains can be
then serve as inputs to traditional statistical modeling              accelerated through probabilistic methods. These analytics can
techniques (e.g., logistic linear regression). Such tools are time-   be further developed in various directions, depending on the
consuming to use, and they offer only a limited, static snapshot      mission needs of the organization. This section provides some
of the specific criminogenic needs that are present in a client’s     directions to pursue.
case. Risk/needs assessment is an active area of research, and
efforts are ongoing to identify next generation tools that can        The current results of the analytics can further be improved
offer improved data-driven methods that can help support              upon by annotating more data and performing a larger-scale
probation officer responses during their regular interactions         evaluation and refinement cycle. Although event extraction
with clients. Leveraging the wealth of unstructured                   accuracy ranged in the 90-percentile, an evaluation conducted
information that is present in the existing documentation that        on a larger data set will provide better accuracy measures and
is available in probation case tracking systems is one promising      can identify low frequency events that may have been missed
approach to solving this problem.                                     in the current version of the analytics. Further work can also be
Any application of AI or data analysis to officer decision-           performed on negation and argument detection to achieve
making can end up having a significant impact on the                  higher precision. In addition, the analytics results have not yet
population under supervision, and so it is important to be            been fully validated by a subject matter expert to ensure that
aware of the various ethical concerns that surround the               the data identified and the way the results are presented are
application of data analysis software to social issues [11]. Such     valuable for the PPSO officer or mental health analyst.
concerns include the need for general algorithmic
accountability [12], the need for assurance that algorithms that      Building a timeline of a defendant’s life events from narrative
are used for such important tasks as recidivism prediction do         text is a very complex task and the topic of much current
not exhibit unacceptable biases [13], the need for judicial           research in the field of NLP. We successfully identified the
review of algorithm-assisted decision-making (where such              temporal expressions associated with events and introduced a
review may be called for), and more practically, the need to          temporal reasoning component which is tightly integrated into
inspire trust in users, who tend to be unwilling to rely on           the system’s syntactic parse and semantic relations output. Yet,
algorithms whose inner workings are poorly understood.                identifying the temporal relations between events is not an
Some of these issues are of greater concern than others in a          easy task and oftentimes, the system needs to infer a
probation domain. Judicial review, for example, is a legal            relationship that is not overtly mentioned in the sentence.
necessity when algorithms directly impact a judge’s decisions,        Building a hybrid method combining knowledge-based
 AIAS’19, June, 2019, Montreal, Quebec Canada                                                                          K. Megerdoomian et al.

linguistic analysis with a statistical machine learning approach     the Applied Technology Research and Development program
will provide more robust temporal relationship analyses.             managed by the Judiciary Engineering and Modernization
                                                                     Center operated by the MITRE Corporation.
One of the issues that were left unaddressed in the current
version of the analytics was the distinction between events          This (software/technical data) was produced for the U. S.
(e.g., diagnoses, treatments) that occurred in the past and those    Government under Contract Number USCA16D2019, and is
that are currently valid. This can be accomplished by leveraging     subject to Judiciary Policy Clause 6-60, Rights in Data—General
the tense and aspect information that the system computes and        (JUN 2012).
adding a filter on the UI to allow the user to view only events
that are current.                                                    No other use other than that granted to the U. S. Government,
                                                                     or to those acting on behalf of the U. S. Government under that
Building a complete timeline of a defendant’s life events will       Clause is authorized without the express written permission of
provide the important information at the individual level for        The MITRE Corporation.
PPSO officers to view and analyze, helping them identify
precursor events and triggering factors. For instance, in            For further information, please contact The MITRE
addition to the mental health and substance use information,         Corporation, Contracts Management Office, 7515 Colshire
the personal history of the defendant (e.g., whether he or she       Drive, McLean, VA 22102-7539, (703) 983-6000.
graduated high school, history of domestic violence or neglect),
existence of dependents (e.g., number of dependents and their        © 2019 The MITRE Corporation.
age, learning issues, custodian), family relations (e.g., siblings
and whether they have a criminal or substance abuse history),        REFERENCES
employment status, gang or terrorism activity, etc. are all          [1] Matthew G. Rowland (2018). Federal Probation and Pretrial Services:
                                                                          What’s Going On and Where Are We Going? (Presentation by the Chief of
important information elements that could shed light on the
                                                                          PPSO). Probation and Pretrial Services Office, Administrative Office of the
defendant’s situation and allow probation officers to provide             U.S. Courts.
more efficient supervision and intervention measures to              [2] Probation and Pretrial Services Office (2016). Overview of Probation and
                                                                          Supervised Release Conditions. Administrative Office of the United States
reduce recidivism. This requires fusing all events and                    Courts.
information extracted from presentence documents onto a              [3] Guergana K. Savova, James J. Masanz, Philip V. Ogren, Jiaping Zheng,
                                                                          Sunghwan Sohn, Karin C. Kipper-Schuler, and Christopher G. Chute (2010).
single timeline to view and analyze.                                      Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES):
                                                                          Architecture, component evaluation and applications. Journal of American
An important goal for analytics research is to leverage the large         Medical Informatics Association 17: 507–513.
                                                                     [4] Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J.
amount of data from diverse sources available to the probation            Bethard, and David McClosky (2014). The Stanford CoreNLP Natural
office—including treatment reports, Chrono notes, social                  Language Processing Toolkit. In Proceedings of 52nd Annual Meeting of the
                                                                          Association for Computational Linguistics: System Demonstrations, pp. 55-60
media, structured metadata, risk assessments, and court              [5] Shervin Malmasi, Nicolae L. Sandor, Naoshi Hosomura, Matt Goldberg,
documents—to obtain a more complete picture of the                        Stephen Skentzos, and Alexander Turchin (2017). Canary: An NLP Platform
defendant’s history, conduct and status. The data analytics               for Clinicians and Researchers. Applied Clinical Informatics 08(02): 447-453.
                                                                     [6] Hyuckchul Jung, James Allen, Nate Blaylock, Will de Beaumont, Lucian
methods will be applied to these data sources and all results             Galescu, and Mary Swift (2011). Building timelines from narrative clinical
combined into a unified database available for query and                  records: Initial results based on deep natural language understanding.
                                                                          Proceedings of BioNLP 2011 Workshop, pp. 146-54. Association for
analysis. Building on a multi-source analysis, the system can             Computational Linguistics.
begin identifying precursor events to criminal activity or           [7] Don A. Andrews, James Bonta, and Robert D. Hoge (1990). Classification for
                                                                          effective rehabilitation: Rediscovering psychology. Criminal justice and
noncompliance, or detecting triggers for mental health issues             Behavior 17.1: 19-52.
or substance use relapses, and leverage that information to          [8] Bonta, James, and Donald A. Andrews (2007). Risk-need-responsivity model
build a predictive model to forecast potential risk and generate          for offender assessment and rehabilitation. Rehabilitation 6.1: 1-22.
                                                                     [9] J. Stephen Wormith and James Bonta (2018). The Level of Service (LS)
automatic alerts. Such an alerting system can help direct an              Instruments. In Handbook of Recidivism Risk/Needs Assessment Tools, pp.
officer’s attention to elements of a client’s case history that           117-145. Wiley & Sons.
                                                                     [10] Christoper T. Lowenkamp, James L. Johnson, Alexander M. Holsinger, Scott
indicate a special cause for concern.                                     W. VanBenschoten, and Charles R. Robinson (2013). The federal Post
                                                                          Conviction Risk Assessment (PCRA): A construction and validation study.
                                                                          Psychological Services 10(1): 87-96.
                                                                     [11] Scott W. VanBenschoten (2008). Risk/needs assessment: Is this the best we
ACKNOWLEDGMENTS                                                           can do. Federal Probation 72: 38.
                                                                     [12] Nicholas Diakopoulos (2014). Algorithmic Accountability Reporting: On the
We would like to acknowledge the guidance and support of the              Investigation of Black Boxes. Columbia University, Tow Center for Digital
U.S. Probation and Pretrial Services Office throughout this               Journalism. New York: Columbia University Academic Commons.
project. In particular, we would like to thank Steve Levinsohn       [13] Alexandra Chouldechova (2017). Fair Prediction with Disparate Impact: A
                                                                          Study of Bias in Recidivism Prediction Instruments. Big Data(5): 153-163.
for providing essential subject matter expertise to the team.
The project was led and funded by the Technology Solutions
Office at the Administrative Office of the U.S. Courts as part of