<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Developing A Semantic Web-based Framework for Executing the Clinical Quality Language Using FHIR</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Guoqian Jiang</string-name>
          <email>jiang.guoqian@mayo.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eric Prud'Hommeaux</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guohui Xiao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Harold R. Solbrig</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Bolzano, 39100</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mayo Clinic</institution>
          ,
          <addr-line>Rochester, MN, 55905</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>W3C/MIT</institution>
          ,
          <addr-line>Cambridge, MA 02142</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The Clinical Quality Language (CQL) is a HL7 specification, aiming to provide a human-readable language to define clinical quality measures and decision support rules while it makes logic expressions independent of any specific data model (e.g., Quality Data Model, HL7 Fast Healthcare Interoperability Resources - FHIR). FHIR adopts RDF as its third representation/interchange format in addition to XML and JSON, and uses the Shape Expressions (ShEx) schema to standardize and validate the FHIR RDF graphs. In this presentation, we propose a Semantic Web-based framework that enables the execution of CQL using the FHIR RDF technologies. The framework comprises the following four modules: 1) a CQL-to-SPARQL transformation module; 2) a value set service module; 3) a FHIR RDF transformation module; and 4) a ShEx-based data validation module. We implemented a prototype to demonstrate the utility of the framework and discussed the challenges and ongoing tasks.</p>
      </abstract>
      <kwd-group>
        <kwd>Clinical Quality Language</kwd>
        <kwd>FHIR</kwd>
        <kwd>RDF</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Future advances in translational and precision medicine research will be increasingly
dependent on the creation of patient cohorts encompassing both highly detailed
phenotypic and molecular data. The growth of electronic health records (EHRs) has been
recognized as a viable and efficient model for enabling translational and precision
medicine research1. The distinct advantages of EHRs include cost efficiency, large
amounts of available clinical data, and the ability to analyze data over time, however,
the data are highly complex, inaccurate and frequently missing. The healthcare
informatics community faces huge methodological and computational challenges in
repurposing EHRs for translational and precision medicine research, with respect to
standards-based data normalization, effective data integration and accurate phenotyping.
The Clinical Quality Language (CQL)2, 3 is a HL7 specification providing a
humanreadable language to define clinical quality measures and decision support rules. As
the creation of EHR-driven phenotype algorithms shares many common requirements
with the definition of clinical quality measures/clinical decision support rules, there
are emerging interests in the clinical research informatics communities to explore the
CQL as a tool for the standard representation and execution of phenotype algorithms
(i.e., structured selection criteria designed to produce research-quality phenotypes)
driven by EHRs. Notably, the phenotype execution and modeling architecture
(PhEMA)4 consortium has been looking into the standard representation and execution of
phenotype algorithms using CQL and related standards5.</p>
      <p>HL7 Fast Healthcare Interoperability Resources (FHIR)6 is an emerging
nextgeneration standards framework for the exchange of electronic healthcare data. FHIR
adopted the Resource Description Framework (RDF) as its third
representation/interchange format in addition to XML and JSON, and uses the Shape
Expressions (ShEx) schema to standardize the structure of FHIR RDF graphs7. The HL7
ITS/W3C RDF Task Force has made initial decisions how the FHIR RDF graphs look
like, and our team at Mayo Clinic reviewed the existing FHIR RDF decision
documents and created a minimal set of elements, defined the FHIR ShEx transformation
rules for the elements. We implemented the FHIR ShEx transformation tools and
evaluated the utility of the FHIR ShEx schemas leveraging the ShEx validation tools
developed in the W3C ShEx community7. This enables promising opportunities for
the clinical research informatics communities to leverage existing semantic tools
developed in the Semantic Web communities for standards-based data integration and
phenotype algorithm definition. The objective of this study is to propose a Semantic
Web-based framework for executing the CQL using FHIR to enable a next-generation
EHR-driven phenotyping infrastructure.
2</p>
    </sec>
    <sec id="sec-2">
      <title>System Architecture</title>
      <p>ELM Translator3, the API of which is utilized for parsing and processing CQL. The
Jena SPARQL API8 is used for building target SPARQL queries.
The value set service module. Value set is one of key CQL constructs, specifying
that logic within the CQL library may reference the specified value set by the given
name. For instance, the statement [valueset "Antibiotic Medications":
'2.16.840.1.113883.3.464.1003.196.12.1001' ] means the value set of antibiotic
medications can by referenced by its name and resolved by its OID through external value
set services. By default, the Value Set Authority Center (VSAC)9 value set services
are recommended. In this module, a tool is developed to invoke the external VSAC
services and retrieve the coded values and associated metadata required for the
SPARQL query construction.</p>
      <p>The FHIR RDF transformation module. As mentioned above, the FHIR RDF has
become part of the official FHIR STU3 release and a FHIR RDF transformation tool
has been incorporated into the FHIR building toolkit. Our team at Mayo Clinic is
actively developing tools that help the FHIR RDF transformation from either XML or
JSON formats. In addition, large volumes of clinical data are stored in the relational
databases and the tools (e.g., Ontop14) that support the execution of SPARQL queries
over relational databases can be leveraged to transform the relational data into the
FHIR-based RDF representation.</p>
      <p>The ShEx-based data validation module. ShEx is a constraint language for formally
describing RDF structures and can serve the same role with RDF as that of XML
schema to XML. Validating an RDF node against a shape tests the adjacent nodes
against the constraints in the shape. As mentioned above, FHIR ShEx schemas have
become part of the official FHIR release. The FHIR ShEx transformation and
validation tools will be leveraged and extended to support the data validation needs in
executing the CQL.</p>
    </sec>
    <sec id="sec-3">
      <title>Prototype Implementation</title>
      <p>We are actively implementing each module of the framework for executing the
CQLbased phenotype algorithms using the FHIR RDF technology. We have first examined
available FHIR-based CQL examples and identified a number of patterns for building
the SPARQL queries. Fig. 2 shows such a SPARQL pattern. The VSAC value set
services were invoked to retrieve the codes for two value sets “Acute Pharyngitis” and
“Acute Tonsillitis”, in which 17 and 19 codes (from the code systems SNOMED CT
and ICD-10-CM) were retrieved respectively. A Java-based program was developed
for transforming identified patterns from CQL to SPARQL while invoking the VSAC
value set services to retrieve coded values in a value set.</p>
      <p>For the FHIR RDF transformation tools, we have implemented a stand-alone tool to
convert FHIR resources in the JSON format to their equivalent in the FHIR RDF
format10. We are also developing a FHIR-based data access framework to enable
exposing the clinical data stored in the OHDSI CDM-based data repositories in the
FHIR RDF format through leveraging an ontology of the FHIR metadata vocabulary
and an open-source Ontology-based Data Access (OBDA) system Ontop11,14, in
which an OHDSI virtual machine12 is used to provide sample clinical research data
for the testing purpose. A test query generated from Fig. 2 was run successfully
against the FHIR-Ontop-OHDSI platform and 1640 patients were identified. The
results were verified accurate using the OHDSI cohort identification tool.</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion and Conclusion</title>
      <p>In this study, we proposed a framework for executing CQL-based phenotype
algorithms using FHIR RDF technologies and demonstrated the feasibility of using CQL
and FHIR in support of EHR-driven phenotype algorithm creation and execution. It
turns out that temporal patterns (e.g., ages, intervals) are commonly used in CQL, and
existing SPARQL functions may need to be extended to capture and execute such
patterns effectively. While CQL uses the basic expression definition capabilities
defined by FHIRPath13 for its core expression terms, we noticed that there are some
discrepancies between FHIRPath expressions and FHIR RDF decisions, which needs
to be harmonized through a community-based collaboration in the future.
Acknowledgement: This study is supported in part by NIH grants U01 HG009450,
U01 CA180940, and R01 GM105688.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Hripcsak</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Albers DJ</surname>
          </string-name>
          .
          <article-title>Next-generation phenotyping of electronic health records</article-title>
          .
          <source>J Am Med Inform Assoc</source>
          .
          <year>2013</year>
          ;
          <volume>20</volume>
          (
          <issue>1</issue>
          ):
          <fpage>117</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>HL7</given-names>
            <surname>Cross-Paradigm</surname>
          </string-name>
          <string-name>
            <surname>Specification</surname>
          </string-name>
          :
          <source>Clinical Quality Language, Release</source>
          <volume>1</volume>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <source>The Clinical Quality Language GitHub Site</source>
          <year>2017</year>
          [Available from: https://github.com/cqframework/clinical_quality_language.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>4. The phenotype execution and modeling architecture (PhEMA) 2017</article-title>
          [Available from: http://projectphema.org.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Jiang</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiefer</surname>
            <given-names>RC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rasmussen</surname>
            <given-names>LV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solbrig</surname>
            <given-names>HR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mo</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pacheco</surname>
            <given-names>JA</given-names>
          </string-name>
          , et al.
          <article-title>Developing a data element repository to support EHR-driven phenotype algorithm authoring and execution</article-title>
          .
          <source>J Biomed Inform</source>
          .
          <year>2016</year>
          ;
          <volume>62</volume>
          :
          <fpage>232</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>HL7</given-names>
            <surname>Fast Healthcare Interoperability Resources (FHIR</surname>
          </string-name>
          )
          <year>2017</year>
          [Available from: http://hl7.org/fhir/.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Solbrig</surname>
            <given-names>HR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prud'hommeaux</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grieve</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McKenzie</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mandel</surname>
            <given-names>JC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharma</surname>
            <given-names>DK</given-names>
          </string-name>
          , et al.
          <article-title>Modeling and validating HL7 FHIR profiles using semantic web Shape Expressions (ShEx)</article-title>
          .
          <source>J Biomed Inform</source>
          .
          <year>2017</year>
          ;
          <volume>67</volume>
          :
          <fpage>90</fpage>
          -
          <lpage>100</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <article-title>8. ARQ - A SPARQL Processor for Jena 2017</article-title>
          [Available from: https://jena.apache.org/documentation/query/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>The</given-names>
            <surname>Value Set Authority Center (VSAC</surname>
          </string-name>
          )
          <year>2017</year>
          [Available from: https://vsac.nlm.nih.gov/.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>The</surname>
          </string-name>
          FHIR-to-
          <source>RDF Transformation Tool</source>
          <year>2017</year>
          [Available from: https://github.com/BD2KOnFHIR/fhirtordf.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Jiang</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiao</surname>
            <given-names>GH</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiefer</surname>
            <given-names>RC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prud'hommeaux</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solbrig</surname>
            <given-names>HR</given-names>
          </string-name>
          , editors.
          <article-title>Building an FHIR Ontology based Data Access Framework with the OHDSI Data Repositories</article-title>
          .
          <article-title>AMIA 2017 Annual Symposium</article-title>
          (in press); 2017; Washington DC.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <source>The OHDSI Virtual Machine</source>
          <year>2017</year>
          [Available from: http://www.ohdsi.org/web/wiki/doku.php
          <article-title>?id=documentation:software:ohdsi-in-a-box.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>13. FHIRPath 2017 [Available from: http://hl7.org/fhirpath/.</mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Calvanese</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cogrel</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Komla-Ebri</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontchakov</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lanti</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rezk</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez-Muro</surname>
            <given-names>M</given-names>
          </string-name>
          , and
          <string-name>
            <given-names>Xiao</given-names>
            <surname>GH</surname>
          </string-name>
          .
          <article-title>Ontop: Answering SPARQL Queries over Relational Databases</article-title>
          .
          <source>In: Semantic Web Journal</source>
          .
          <year>2017</year>
          :
          <volume>8</volume>
          .3:
          <fpage>471</fpage>
          -
          <lpage>487</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>