=Paper= {{Paper |id=Vol-2931/ICBO_2019_paper_32 |storemode=property |title=Early steps of an Ontology for Magnetic Resonance Imaging: MRIO |pdfUrl=https://ceur-ws.org/Vol-2931/ICBO_2019_paper_32.pdf |volume=Vol-2931 |authors=Lucas M. Serra,Michael G. Dwyer,William D. Duncan,Alexander D. Diehl |dblpUrl=https://dblp.org/rec/conf/icbo/SerraDDD19 }} ==Early steps of an Ontology for Magnetic Resonance Imaging: MRIO== https://ceur-ws.org/Vol-2931/ICBO_2019_paper_32.pdf
              Early steps of an Ontology for Magnetic Resonance Imaging: MRIO

             Lucas M. Serra 1, Michael G. Dwyer 2, William D. Duncan 1, Alexander D. Diehl 1

   Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo,
   USA1; Buffalo Neuroimaging Analysis Center, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo,
                                                      Buffalo, USA2.

Abstract                                                                and timing how long it takes for the needle to re-right itself. As
                                                                        the protons re-align themselves with the applied magnetic field,
The Magnetic Resonance Imaging Ontology (MRIO) is an                    they release energy. Protons can release energy to their
application ontology that represents numerous entities in the           surroundings, which is referred to as spin-lattice relaxation or
domain of magnetic resonance imaging (MRI) including MRI                T1 relaxation. Alternatively, protons can become out of phase
analysis and MRI sequences. Data from clinical trials MRI               with each other. This is called spin-spin relaxation or T2
protocols were used to create the axioms of these MRI                   relaxation. Depending on which of these effects dominates an
sequences. We have also created means for automatically                 image determines whether we designate an image as a “T1
loading MRI headers as new ontology instances and                       image” or a “T2 image”. The aforementioned effects alter the
demonstrate the ability to query data in MRIO. The current              net magnetic vector within the machine, which is captured as
work represents the beginnings of a full-fledged imaging                electrical impulses by the RF coil. In addition to these
                                                                        “classical” image contrasts, the field of MRI physics has
ontology and automated analysis pipeline, which we plan to
                                                                        discovered many other sources of tissue contrast that can be
further develop. Future iterations of the project will include a
                                                                        elucidated by variations in the standard pulse sequence regime.
stream-lined user-interface for querying and improved                   Together, these various contrasts enable fine discrimination of
capability in classifying image types.                                  tissue composition that is not possible with other imaging
                                                                        modalities, which has cemented MRI as the premiere imaging
Keywords:                                                               option for pathologies affecting soft tissues.
MRI ontology; imaging informatics; MRIO.                                Expanding Use and Standards
Introduction                                                                MRI is an often-used component in a physician’s toolkit
                                                                        especially in the US which boasts the second highest number of
The Fundamentals of Magnetic Resonance Imaging                          MRI machines per capita globally (2). MRI has broad clinical
                                                                        and research applications ranging from traumatic brain injuries
    Magnetic resonance imaging (MRI) is a mainstay of modern            to osteoarthritis to malignancy. Within the past two decades, the
medicine that has rapidly integrated itself into a myriad of            use of imaging across healthcare has risen dramatically, and has
diagnostic algorithms and has proven itself as a valuable               been partly fueled by physicians who purchase MRI machines
component of healthcare due to its versatility and accuracy.            for their practices and consequently order more scans (3, 4). The
However, these features come at the cost of price and                   growing use of imaging data has necessitated improvements in
complexity. MRI is a nuanced technology and, when                       imaging standards and protocols. Healthcare professionals and
approaching methods for representing its components in an               researchers working within the field of imaging wisely adopted
ontology, merits an understanding of the fundamental principles         a standard file format for medical images decades ago. Digital
of magnetic resonance. MRI is based upon the same physical              Imaging and Communications in Medicine (DICOM) is used
principles that underlie nuclear magnetic resonance and is              worldwide to store and transmit medical images. (5). In order to
predicated upon on the notion of “spin”. Spin gives particles,          further augment the standardization and interoperability
like protons, their angular momentum and a magnetic moment              introduced by DICOM, centers involved in clinical trials often
(1). Protons therefore have magnetic fields which align with            adopt detailed protocols, which state specific image parameters
applied external magnetic fields. By interrogating these proton         and tolerances for use during data collection. These data exist as
spins with radiofrequency pulses and recording the responses,           numbers and text in the metadata fields of a DICOM header.
MRI is able to infer many different properties of the underlying
tissue.                                                                 Problems Facing the Field
The Anatomy of an MRI Machine                                             I.    With increased use and widening adoption comes ever-
                                                                        growing volumes of data that must be catalogued, managed, and
    Modern MRI machines are composed of a primary                       analyzed. Despite the progress made in standardizing medical
superconducting magnet that supplies the main magnetic field,           images, there exist numerous challenges in the management of
a gradient coil to alter the primary magnet’s field and encode          imaging data, which the use of an ontology helps to mitigate.
spatial information, and a set of radiofrequency (RF) coils to          The metadata fields of a DICOM file frequently represent non-
create pulses and receive signals. If we consider an analogy            explicit knowledge using ambiguous language. For instance,
where the protons are the needle of a compass, the RF coil’s            one of the fields in the DICOM header is labeled ‘PulseTime’.
function is somewhat similar to nudging the needle with a finger        The preceding fields deal with cardiac aspects of the scan such




Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
as ‘CardiacRepetitionTime’ and ‘ImagesPerCardiacCycle’                  “data siloes”. The ontology covering MRI simulations and
which may lead one to believe that ‘PulseTime’ relates to the           sequences did not publish their ontology in any form. The
pulse or heart rate of the patient. This is complicated by later        DICOM controlled terminology, although published alongside
fields that reference RF pulses but instead do so using language        ontologies on BioPortal, has a completely flat structure and
like ‘PulseSequence’. This makes it challenging for a user who          some of its definitions are not crafted in the style preferred by
is unfamiliar with the domain to use the data. Fully                    the OBO Foundry. Additionally, all these ontologies seem to not
understanding the intended meaning of the data fields involves          cover the higher levels of abstraction that we desire in our
deep knowledge of the latest version of the DICOM                       ontology.
specifications, use of a third-party website, or consultation with
                                                                            In the current work, we have developed the MRI ontology
a domain expert. Among the most important issues is a lack of
                                                                        (MRIO) to represent MRI analyses, sequences, images, and
consensus about the exact parameters that make up a specific
image type, which is partly confounded by intermachine and              machines using metadata from DICOM files to create axioms.
                                                                        We have also created methods for extracting this information
inter-operator     variability.   The     Alzheimer's       Disease
                                                                        from DICOM headers and automatically creating new ontology
Neuroimaging Initiative (ADNI) maintains highly detailed MRI
scanner protocols for use in its clinical trials and illustrates this   instances.
variability well (6). The ADNI 3 protocols define the MRI               Methods
acquisition parameters for capturing a sagittal 3D fluid-
attenuated inversion recovery image of a human brain in several
                                                                        Ontology Construction
different machines from different vendors. In a General Electric
25 MRI machine, the echo time (TE) is 119.0ms, the repetition               MRIO was created with the latest version of Protégé (5.5.0)
time (TR) is 4800.0ms and the inversion time (TI) is 1451ms             (11). The HermiT (1.4.3.456) reasoner plugin was used for
while in a Siemens Magnetom Verio machine the parameters are            inference (12). Our ontology was built with certain principles in
442ms, 4800ms, and 1650ms for TE, TR, and TI respectively.              mind, such as resource identifiers, textual definitions, and
Although both machines are 3 tesla MRI machines and                     openness, all of which are outlined by the Open Biological and
attempting to capture the same image, their TE parameters are           Biomedical Ontology (OBO) Foundry (13). Following these
quite different. Moreover, even small changes in these                  principles, MRIO uses BFO as its upper level ontology and re-
parameters can result in radically different images and                 uses existing ontologies like the Ontology for Biomedical
associated image types. Broadly speaking, we currently do not           Investigations (OBI) and the Information Artifact Ontology (14-
have effective methods for transitioning from these elementary          16). MRIO adds 70 new terms, most with well-constructed
imaging parameters to higher semantic levels. If we borrow an           textual and logical definitions to represent multiple aspects of
analogy from biology, these imaging parameters are similar to           MRI images. Around two dozen terms were reused from OBI
the nucleotides of DNA where different sequences can code for           and IAO as upper level terms or in relations. Our ontology was
the same codons and proteins. As of yet, we lack an elegant way         constructed in both a top-down and a bottom-up approach. The
to determine these proteins or their functions from their               entities we deemed most important in representing MRIs in an
constituents. These factors can result in problems with                 ontology are: the MRI image objective and the MRI sequences,
interoperability when combining large sets of MRI images and            followed by the MRI machine, the patient/evaluant, the MRI
the requirement to write complex and cumbersome queries to              assay, and the MRI image itself. We consulted with domain
create retrospective cohorts.                                           experts in order to create the MRI analysis hierarchy. The most
                                                                        salient metadata on DICOM image files are “parameter
Imaging Ontologies                                                      specifications” or “acquisition parameters”, which describe RF
    The current work is not the first ontology in the domain of         pulse sequences. These parameters, implemented as data
MRI images, and a handful of past studies have created MRI-             properties, were used in creating the axioms and computer-
related ontologies. NeuroLOG or OntoNeuroLOG is a French                readable definitions of sequences. A GitHub repository
multi-level ontology created to integrate neurological resources        containing the latest version of the ontology is available at:
from multiple academic centers and uses DOLCE as its upper              https://github.com/LucasSerra1/MRIO.git
level ontology (7). NeuroLOG covers a wide array of brain-
                                                                        Data Extraction
centric investigation-related entities including MRI (8). A more
recent MRI ontology covered MRI simulations and modeled the                 The scripts used in parsing MRI headers and MRI protocol
fundamental processes of the RF pulses that form sequences (9).         files were written using the Python programming language. In
Lastly, the DICOM controlled terminology is available on                essence, the scripts extract information from the DICOM
BioPortal and consists of every term used in the DICOM file             headers and transform the information into instances of MRIO
format along with their definitions (10). These works suffer            classes and relations. DICOM header data fields are first
from limitations in accessibility and usability. NeuroLOG is            transformed into a spreadsheet. These fields are mapped to
inaccessible through the paper’s provided links and what is             MRIO data properties. Numeric values are then read and
viewable through snapshots of the ontology show missing                 associated with these data properties. The RDFLib (4.2.2)
textual and logical definitions for represented entities.               Python library was used to facilitate this transformation and
NeuroLOG also uses DOLCE, which restricts its                           automatically add graph nodes and new instances to our
interoperability with the multitude of existing OBO Foundry             ontology from these mapped classes. To create the axioms that
ontologies that are grounded in the Basic Formal Ontology.              underlie the sequence types (Figure 2), a separate script was
Interoperability with OBO Foundry ontologies is an important            created that extracts parameter specifications from JSON files
feature that promotes reuse and prevents the creation of isolated
representing years of MRI study protocols used in clinical trials     Discussion
conducted at the Buffalo Neuroimaging Analysis Center. As no
exact definitions for consensus sequence parameters exist in the          Our work contributes to imaging informatics in a number of
DICOM specifications or in published literature, simple ranges        ways. The automatic creation of ontology instances mitigates the
were used to define sequence parameters and provide a survey          laborious task of data entry. Our system also enables precise
of the data available. Minimum and maximum values were                selection of cohorts from datasets of DICOMs and facilitates
extracted across hundreds of entries to create the ranges that        discovery of potential subgroups within imaging data. MRIO
constitute the axioms of our sequence classes.                        provides a structured semantic representation of many of the
    Our final output of the data extraction process was an OWL        metadata fields found in the DICOM format. To this end, MRIO
file containing 4 instances (representing a single DICOM              improves the interpretability of data field definitions without the
header), 70 MRIO-specific classes, and 8 new data properties.         need for external resources and elucidates some of the implicit
The original data consisted of 300 text files containing 1000         knowledge found within this domain. MRIO’s adherence to
entries for MRI protocols (17). This was distilled into 5 MRI         OBO Foundry principles also enhances interoperability with
pulse sequence classes in the final ontology. After modification      other similarly structured ontologies.
with RDFLib, the ontology was loaded as a triplestore into the            Despite these benefits, MRIO and its extraneous systems are
free version of GraphDB (8.9) Using SPARQL, we queried the            currently limited in some respects. At present, MRIO can only
data looking for images by their parameters (18, 19).                 process single DICOM headers, which must be loaded as text
                                                                      files. Furthermore, once new MRI instances are loaded, the
Results                                                               HermiT reasoning engine in Protégé takes minutes to sort
                                                                      individuals and infer relations. This occurs with only a handful
     The ensemble of these moving pieces is a pipeline that           of DICOMs loaded. We are investigating methods to speed up
automatically loads DICOM headers and inserts them into a             the reasoning so we can scale the ontology appropriately. Our
queryable MRI ontology created from a combination of domain           ontology also only captures a small selection of the vast number
expertise and parameter data extracted from clinical trial            of data fields found within the DICOM file standard. We would
protocols. Figure 3 provides an overview of the gross structure       also like to more fully develop the definitions of our classes. As
of the ontology. As MRIO is built upon the foundations of OBI,        a final limitation, our system requires that users understand
it takes a similar approach in establishing relationships between     SPARQL to write their queries and extract information from
the overall imaging process and the participants. Terms derived       data loaded in triplestores, although our long-term plans include
from OBI are in ovals while MRIO terms are in boxes. More             creating a web interface to simplify querying.
specifically, ‘magnetic resonance imaging pulse sequence’ is              MRIO represents the beginnings of a full-fledged imaging
define as a type of ‘processed material’ and stands in a ‘part of’    ontology and automated analysis pipeline. There are many
relation to the ‘magnetic resonance imaging radiofrequency            possibilities for future work and expanding the functionality of
coil’, which is an OBI ‘measurement device’. As shown in              MRIO. With thousands of MRIs loaded from disparate data sets
Figure 3, both the MRI machine and a ‘material entity’ with the       and institutions, it would be possible to better grasp which are
‘magnetic resonance imaging evaluant role’ are the specified          the exact elements that make a “T1 image”. This could occur
inputs of a ‘magnetic resonance imaging assay’. This ‘magnetic        either through community consensus or MRIO could provide
resonance imaging assay’ term resides under the ‘planned              high-quality data for machine learning or statistical treatments
process’ class and has ‘magnetic resonance imaging datum’ as          of this question. In later versions of our work, the query system
the specified output. This data undergoes a ‘magnetic resonance       could be improved with a natural language processing-based
imaging data transformation’, which in the real-world partly          query system and a more stream-lined user interface that would
takes the form of a Fourier transformation and results in the final   obviate the need for users to know SPARQL.
‘magnetic resonance imaging image’. The image is tied back to         Conclusion
the sequence used and the subject of the scan using ‘is about’
relations.                                                               MRIO is the only MRI ontology under active development.
     Figure 1 depicts the structure of the MRI pulse sequences.       At present, MRIO enjoys a number of useful features and
Several new data properties were needed to fully represent            these initial steps provide a proof-of-concept for a much larger
sequence parameters: ‘has TR’, ‘has TE’, ‘has inversion time’,        analytic platform with numerous uses.
‘has flip angle’, and ‘has echo train length’. These entities were
derived from BNAC MRI protocol specifications and represent
settings configured on an MRI machine for the creation of an                                ACKNOWLEDGMENT
MRI image.                                                            AD was supported by 5UL1TR001412 (NCATS).
     Figure 2 illustrates the type of query one is able to use with   MD has received consultant fees from Claret Medical and EMD
MRIO. With SPARQL, an investigator is able to hone in on              Serono, and research grant support from Novartis and Celgene.
well-crafted cohorts via sequence parameters as in this example
or via a number of other axes.
   Fig. 1. Sample MRI pulse sequence class                                                                 Fig 2. Example SPARQL query




   Fig. 3. Gross and relational structure of MRIO




         image creation
            device
                                                                                                                                                                     data
                           MRI machine       has_specified_input                          imaging assay                                                         transformation

                                                                                                                                                       MRI datum
                                                                            MRI assay                                          has_specified_input                      has_specified_output
                                                                                                                                                     transformation
                      has part    has part
                                                                                               has_specified_output
measurement                                            device
  device                                                                                                                  MRI datum                                              MRI image
                  MRI
                                         MRI magnet
          radiofrequency coil
                                                                         has_specified_input                                         measurement                                               image
                                                                                                                                       datum
                has part
                                                            material
               MRI pulse                                     entity
               sequence
                                                                                                                          is about
   processed
    material                                                inheres in




                                                       MRI evaluant role

                                                                          evaluant role                        is about
REFERENCES
1.    Plewes DB, Kucharczyk W. Physics of MRI: A primer. Journal
      of Magnetic Resonance Imaging. 2012;35(5):1038-54.
2.    Health expenditure indicators [Internet]. 2014. Available from:
      https://www.oecd-ilibrary.org/content/data/data-00349-en.
3.    Agarwal R, Bergey M, Sonnad S, Butowsky H, Bhargavan M,
      Bleshman MH. Inpatient CT and MRI utilization: trends in the
      academic hospital setting. Journal of the American College of
      Radiology : JACR. 2010;7(12):949-55.
4.    Baker LC. Acquisition of MRI equipment by doctors drives up
      imaging use and spending. Health affairs (Project Hope).
      2010;29(12):2252-9.
5.    Mildenberger P, Eichelberg M, Martin E. Introduction to the
      DICOM standard. European Radiology. 2002;12(4):920-7.
6.    Petersen RC, Aisen PS, Beckett LA, Donohue MC, Gamst AC,
      Harvey DJ, et al. Alzheimer's Disease Neuroimaging Initiative
      (ADNI): clinical characterization. Neurology. 2010;74(3):201-9.
7.    Masolo C, Borgo S, Gangemi A, Guarino N, Oltramari A.
      WonderWeb deliverable D18 ontology library. 2003.
8.    Michel F, Gaignard A, Ahmad F, Barillot C, Batrancourt B,
      Dojat M, et al. Grid-wide neuroimaging data federation in the
      context of the NeuroLOG project. Studies in health technology
      and informatics. 2010;159:112-23.
9.    Lasbleiz J, Saint-Jalmes H, Duvauferrier R, Burgun A. Creating
      a magnetic resonance imaging ontology. Studies in health
      technology and informatics. 2011;169:784-8.
10.   Salvadores M, Alexander PR, Musen MA, Noy NF. BioPortal as
      a Dataset of Linked Biomedical Ontologies and Terminologies
      in RDF. Semantic web. 2013;4(3):277-84.
11.   Noy NF, Crubezy M, Fergerson RW, Knublauch H, Tu SW,
      Vendetti J, et al. Protégé-2000: an open-source ontology-
      development and knowledge-acquisition environment. AMIA
      Annual Symposium proceedings AMIA Symposium.
      2003;2003:953-.
12.   Glimm B, Horrocks I, Motik B, Stoilos G, Wang Z. HermiT: An
      OWL 2 Reasoner. Journal of Automated Reasoning.
      2014;53(3):245-69.
13.   Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, et
      al. The OBO Foundry: coordinated evolution of ontologies to
      support biomedical data integration. Nature biotechnology.
      2007;25(11):1251-5.
14.   Bandrowski A, Brinkman R, Brochhausen M, Brush MH, Bug
      B, Chibucos MC, et al. The Ontology for Biomedical
      Investigations. PLOS ONE. 2016;11(4):e0154556.
15.   Smith B, Ceusters W. Aboutness: Towards Foundations for the
      Information Artifact Ontology2015.
16.   Arp R, Smith B, Spear AD. Building Ontologies with Basic
      Formal Ontology: The MIT Press; 2015. 248 p.
17.   Antoniou G, van Harmelen F. Web Ontology Language: OWL.
      In: Staab S, Studer R, editors. Handbook on Ontologies. Berlin,
      Heidelberg: Springer Berlin Heidelberg; 2004. p. 67-92.
18.   Gueting RH. GraphDB: A Data Model and Query Language for
      Graphs in Databases. Informatik-Bericht 155. 1994.
19.   Prud’hommeaux E, Seaborne, A. SPARQL Query Language for
      RDF. W3C Recommendation. January 2008.