<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GPU enables search for 2-way and 3-way interactions in GWAS</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dr Adam Kowalczyk</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Principal Scientist NICTA</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adam Kowalczyk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qiao Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fan Shi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrew Kowalczyk</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Rawlinson</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benjamin Goudey</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Richard Campbell</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Herman Ferra</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computing and Information Systems, The University of Melbourne</institution>
          ,
          <addr-line>Parkville, VIC 3010</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NICTA, Victorian Research Laboratories, The University of Melbourne</institution>
          ,
          <addr-line>Parkville, VIC 3010</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>32</fpage>
      <lpage>33</lpage>
      <abstract>
        <p>SUMMARY Genome-wide association studies (GWAS) probe millions of DNA loci in an attempt to associate DNA mutations with a given disease. Complex aetiologies of many common diseases involve combinations of different genes which require individual evaluation of trillions (non-additive) combinations of loci for association in an average size study. We have developed solutions using a single GPU to evaluate association of each and every one bivariate feature within minutes (available via free webserver). Although an exhaustive tri-variate analysis requires currently a medium size GPU cluster, many focused tri-variate analysis tasks can be accomplished routinely on a single GPU within hours of computation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Dr Adam Kowalczyk is currently a principal researcher in
Victorian Research Laboratories of National ICT Australia
(NICTA). He leads projects in molecular medicine and
biology, leveraging many years of research and commercial
experience in pure and applied mathematics, mathematical
physics, artificial intelligence, telecommunications and,
recently, bioinformatics.</p>
    </sec>
    <sec id="sec-2">
      <title>DESCRIPTION</title>
      <p>Spectacular progress in commodity computing technology in the last few years has led to development of a
number of algorithms capable of exhaustive bivariate analysis not only on moderate computer clusters but also
on standard desktop computers equipped with General Purpose Graphics Processing Units (GPUs). In order
to fully exploit the potential of these devices for medical and biotechnological research there is a need for
efficient software tools that reduce implementation and performance difficulties, so researchers can focus on
comparison and evaluation of results rather than software tools development and low level algorithm tweaking;
see a recent elaboration of this point in6. It has been demonstrated7 that the number of novel putative epistatic
loci can be detected using such techniques.</p>
      <p>In this talk we present an efficient library of GPU kernels that can be used for fast implementation of any
bivariate GWAS statistics that can be derived from contingency table counts. In order to illustrate the point we
have implemented nine different algorithms from the literature. All algorithms can execute exhaustive search on
typical case/control GWAS (500K SNPs, 5K samples) within 10 minutes. This performance makes comparative
analysis of different statistical methods easy. These results are also significantly improved compared to original
implementations of the nine algorithms considered. Speedup factors of over 300 are observed compared to
some original GPU implementations in literature and even larger factors of over 10,000 are seen with respect
to the CPU implementations, e.g. a popular Fast Epistasis algorithm in PLINK 1.07 software package.</p>
      <p>Consider that timing scales quadratically with the density of genotyping markers used. Future high-density
SNP arrays will include up to 5 million SNPs, and forthcoming GWAS based on NGS data will have even higher
marker density and may include other technology such as methylation markers. Together, these expectations
mean that the computational burden of exhaustive bivariate analysis will continue to be challenging: between
one hundred and one thousand times more complex than existing GWAS. Thus our GPU approach, which
efficiently applies a battery of statistical tests to exhaustive search of all SNP pairs in minutes for current GWAS data, will still require hours or days for near-future
data. This is achievable on a single GPU-equipped desktop or laptop computer, but time scales down accordingly if GPU clusters are used. Additionally, the users can
define and add their own statistics to our platform and make use of our high performance library that generates contingency tables and ranks scores. To facilitate
this we have insured compatibility with popular input data formats, a common interface for defining statistical tests. The runtime of ~10 minutes for a typical dataset
and computer is fast enough that researchers can experiment with methods interactively, reviewing the effects of varied algorithms almost immediately.
The practical consequences of such an improvement in productivity are not a matter of degree. By enabling researchers to conduct GWAS experiments in less
time than a coffee break it becomes possible to focus effort on statistical methods and results rather than avoiding performance bottlenecks. New ideas can be
implemented and evaluated in very fast cycles, without the need to book time on shared high end computing resources.</p>
      <p>Most recently, we have extended GWIS to exhaustive search for 3-way interactions, a previously impossible computational task. Using our methods, an exhaustive
3-way analysis of Celiac disease GWAS from UK containing ~310K SNPs and 2200 samples using a cluster of 200 GPUs requires 7 days of computing time. To our
knowledge this is the first time such an analysis has been shown to be practical. The runtime reduces significantly for more targeted analysis, for example a specific
DNA region or a preselected set of SNPs. Exhaustive filtering through all SNP-triplets in ~2500 SNPs, including the extended MHC region, requires &lt;3 minutes on
a standard PC with a single GTX470 NVIDIA GPU.</p>
    </sec>
    <sec id="sec-3">
      <title>CONCLUSION</title>
      <p>In conclusion, analysis of two-way and three-way interactions in modern GWAS using multiple methods is practical today. Once such analyses are practiced in
labs around the world faster progress in unveiling the genetic aetiology of complex diseases may result. To date, it has been difficult to compare methods across a
range of datasets due to implementation difficulties and prohibitive runtime. Many existing benchmark studies were forced to use only small size, typically synthetic,
datasets. What we claim is a paradigm shift, towards routine usage real life data for methods development, benchmarking and then “production” deployment of
novel GWAS data analysis paradigms.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cordel</surname>
          </string-name>
          ,
          <year>2009</year>
          , Nat.
          <source>Rev.Genet</source>
          ,
          <year>2009</year>
          ,
          <volume>10</volume>
          (
          <issue>6</issue>
          ),
          <fpage>392</fpage>
          -
          <lpage>404</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Marchini</surname>
          </string-name>
          et al.,
          <year>2005</year>
          ,
          <string-name>
            <given-names>Nat</given-names>
            <surname>Genet</surname>
          </string-name>
          ,
          <volume>37</volume>
          (
          <issue>4</issue>
          ),
          <fpage>413</fpage>
          -
          <lpage>417</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Zhang and Liu,
          <year>2007</year>
          ,
          <string-name>
            <given-names>Nature</given-names>
            <surname>Genetics</surname>
          </string-name>
          ,
          <volume>39</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1167</fpage>
          -
          <lpage>1173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Culverhouse</surname>
          </string-name>
          et al.,
          <year>2002</year>
          ,
          <string-name>
            <given-names>Am. J.</given-names>
            <surname>Hum</surname>
          </string-name>
          . Genet.,
          <volume>70</volume>
          ,
          <fpage>461</fpage>
          -
          <lpage>471</lpage>
          5. Zhang et al.,
          <year>2010</year>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Comp</surname>
          </string-name>
          . Biol.
          <volume>17</volume>
          (
          <issue>3</issue>
          ),
          <fpage>401</fpage>
          -
          <lpage>415</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          6. Wilson et al. (
          <year>2014</year>
          ,
          <string-name>
            <given-names>PLoS</given-names>
            <surname>Biol</surname>
          </string-name>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          7.
          <string-name>
            <surname>Godey</surname>
          </string-name>
          et al. (
          <year>2013</year>
          ),
          <source>BMC Genomics 14.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>