<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Health informatics visualisation engine: HIVE</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Senior Statistician CSIRO</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Norm Good</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chris Bain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Hansen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Gibson</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Health Informatics, Alfred Health</institution>
          ,
          <addr-line>The Alfred, Victoria</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>The Australian e-Health Research Centre, CSIRO</institution>
          ,
          <addr-line>Queensland</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>30</fpage>
      <lpage>31</lpage>
      <abstract>
        <p>Norm Good SUMMARY This project was designed to integrate, analyse, synthesise and present essential health and hospital information in a highly accessible, agile and visual form - because pictures are worth a thousand words. We developed a prototype software tool that is; • capable of drawing on standardised data files that replicate known industry standard, or are easily derivable from such standards • provides the user (analyst, operational manager, financial manger, executive) with a customisable view of the relative outcomes of, and resources used in, care in a number of dimensions- clinical (LOS, number of adverse events, number of drug doses, attending doctor etc) and financial (surgical, pharmacy, nursing etc) - in one setting • and identify outliers using advanced statistical modelling techniques. This tool will generate immediate value for a hospitals' endeavour in continuous operational improvement and will be of particular interest to potential customers throughout Australia given the move to nationally provided Activity Based Funding for hospital services. The tool is a useful way to harness the power of “big data” through advanced analytics.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Norm Good is a Statistician within the CSIRO’s
Computational Informatics Division. He has been based
in the Australian E-Health Research Centre (Brisbane) for
the past eight years conducting health related research.
Mr Good has considerable expertise in developing and
applying variable selection techniques and survival models
to health data. Examples of this include novel approaches
for estimating optimal colonoscopy screening intervals
and developing a risk profiles for patients who are likely
to be readmitted to hospital. He has also worked in the
field of confidential health data, developing risk and utility
measures for regression modelling and promoting the use
of remote server statistical querying. Recently Mr Good has
exploring the utility of visualisation methods for analysing
high-dimensional geometry and multivariate data.</p>
    </sec>
    <sec id="sec-2">
      <title>DESCRIPTION</title>
      <p>ANALYTICS ENGINE - Behind the software interface are two main analytics modules. In the first module we can
develop models of predicted costs for treating individual patients. In addition to the standard cost attached to a
specific DRG, extra costs based on clinical and demographic parameters can be estimated. The main purpose
of this modeling was to develop a profile for the “average” patient. The second module focused on identifying
outliers in a data set. Given the predicted cost of care calculated above, the actual cost of care is calculated. An
outlier in this context is the difference between the actual and predicted cost for a patient which is outside the
“normal” range expected. The current method for identifying outliers in Victoria is the L3H3 method3. It uses
three times the average length of stay for a particular DRG as the high cut point for outliers. We will be using a
more robust method based on statistical discordancy to identify outliers4. What is potentially more informative
is the detection of “inliers”. That is, people whose costs are much lower than predicted. Examination of
these patients may reveal insights into optimal care models. Given the multivariate nature of the cost and
clinical data, dimension reductions tools such as principal components analysis have been employed to project
patients and their associated cost of care and clinical status onto a viewable two or three dimensional space.
The baseline data comes from clinical and financial databases from The Alfred Hospital, statistical analyses are undertaken with the R statistical package and the
GUI is developed using web-based Java script.</p>
    </sec>
    <sec id="sec-3">
      <title>RESULTS</title>
      <p>Figure 1 is a four dimensional representation of some of the hospital data. A simple point and click screen in the software produced this plot. The x-axis represents
the costs associated with ICU, Allied and Pathology. These costs tend to be related to each other. However, the greatest cost is ICU. The y-axis represents costs
associated with Nursing and Medical non-surgery. These axes are derived from a method known as “principal components”. The size and colour of the bubbles are
representative of total costs. The “Yes” and “No” overlaid onto the bubbles indicate whether the patient was alive at discharge. As you can see some of the biggest
costs are associated with patients that died, “No”. Figure 2 is a screenshot from the software tool showing a parallel coordinates plot. This plot easily shows outliers
or inliers and associated clinical/financial measures. A “brush” can be applied to any vertical axis to select subsets of the data. This is only one example of showing
the data. Another option is to undertake statistical discordancy analysis on a subset of disease related groups. This effectively “standardises” costs so that we can
compare them in a relative way across patients.
1. Principal component bubble plot</p>
    </sec>
    <sec id="sec-4">
      <title>CONCLUSION</title>
      <p>The development of advanced visual analytics capabilities especially those in the bioinformatics sphere5 can give greater insight into an organisation data than
standard reports alone such as those given by platforms such as Tableau ® and Qlikview®. Such capabilities could serve a very useful purpose when it comes
to quickly gaining insights from “big data” data sets. Adding advanced multivariable reduction techniques such as principal components analysis and statistical
discordancy can add additional insights into identifying outliers and inliers in hospital administrative and clinical data. It is hoped that this tool will aid in the timely
identification of outliers and inliers and provide insight into reducing costs in the future.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>M</given-names>
            <surname>Harley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.A.</given-names>
            <surname>Mohammed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S</given-names>
            <surname>Hussain</surname>
          </string-name>
          , et al.
          <article-title>Was Rodney Ledward a statistical outlier? Retrospective analysis using routine hospital data to identify gynaecologists' performance, BMJ (</article-title>
          <year>2005</year>
          ),
          <volume>330</volume>
          :
          <fpage>929</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>E</given-names>
            <surname>Polverejan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.C.</given-names>
            <surname>Gardiner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.J.</given-names>
            <surname>Bradely</surname>
          </string-name>
          , et al.
          <article-title>Estimating mean hospital cost as a function of length of stay and patient characteristics, Health Economics (</article-title>
          <year>2003</year>
          ),
          <fpage>935</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>T.E. McGuire</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          <string-name>
            <surname>Bender</surname>
            ,
            <given-names>C</given-names>
          </string-name>
          <string-name>
            <surname>Maskel</surname>
          </string-name>
          .
          <article-title>Casemix episodic payment for private health insurance</article-title>
          .
          <source>Canberra: AGPS</source>
          (
          <year>1995</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>A.H.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.R.</given-names>
            <surname>Vemuri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>A discordancy test approach to identify outliers of length of hospital stay</article-title>
          ,
          <source>Statistics in Medicine 17</source>
          (
          <year>1998</year>
          ),
          <fpage>2199</fpage>
          -
          <lpage>206</lpage>
          5. Visualising Biological Data: VISBI, http://visbi.org [accessed 10/10/2014].
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>