<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Formal Concept Analysis Applied to Transcriptomic Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mehwish Alam</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adrien Coulet</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amedeo Napoli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malika Smal-Tabbone</string-name>
          <email>malika.smailg@inria.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CNRS, LORIA, UMR 7503, Vandoeuvre-les-Nancy</institution>
          ,
          <addr-line>F-54506</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Inria, Villers-les-Nancy</institution>
          ,
          <addr-line>F-54600</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Universite de Lorraine, LORIA, UMR 7503, Vandoeuvre-les-Nancy</institution>
          ,
          <addr-line>F-54506</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Identifying functions or pathways shared by genes responsible for cancer is still a challenging task. This paper describes the preparation work for applying Formal Concept Analysis (FCA) to biological data. After gene transcription experiments, we integrate various annotations of selected genes in a database along with relevant domain knowledge. The database subsequently allows to build formal contexts in a exible way. We present here a preliminary experiment using these data on a core context with the addition of domain knowledge by context apposition. The resulting concept lattices are pruned and we discuss some interesting concepts. Our study shows how data integration and FCA can help the domain expert in the exploration of complex data.</p>
      </abstract>
      <kwd-group>
        <kwd>Formal Concept Analysis</kwd>
        <kwd>Knowledge Discovery</kwd>
        <kwd>Data Integration</kwd>
        <kwd>Transcriptomic Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Over past few years, large volumes of transcriptomic data were produced but
their analysis remains a challenging task because of the complexity of the
biological background. In the eld of transcriptomics, biologists analyze routinely the
transcription or expression of genes in various situations (e.g., in tumor samples
versus non-tumor samples).</p>
      <p>
        Some earlier studies aimed at retrieving sets of genes sharing the same
transcriptionl behaviour with the help of Formal Concept Analysis (see, e.g., [
        <xref ref-type="bibr" rid="ref10 ref11 ref7">7, 10,
11</xref>
        ]). Further studies analyze gene expression data by using gene annotations to
determine whether a set of di erentially expressed genes is enriched with
biological attributes [
        <xref ref-type="bibr" rid="ref1 ref13 ref2">1, 2, 13</xref>
        ]. Many useful resources are available online and several
e orts have been made for integrating heterogeneous data [
        <xref ref-type="bibr" rid="ref5 ref8">5, 8</xref>
        ]. A recent
example is of the Broad Institute where biological data were gathered from multiple
resources to get thousands of prede ned gene sets stored in the Molecular
Signature DataBase, MSigDB [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. A prede ned gene set is a set of genes known to have
a speci c property such as their position on the genome, their involvement in a
biological process (or a molecular pathway) etc. Subsequently, given an
experimental gene list as input the GSEA (Gene Set Enrichment Analysis) program
is used to asses whether each prede ned gene set (in the MSigDB database) is
signi cantly present in the input list by computing an enrichment score [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this paper, we are interested in applying knowledge discovery techniques
for analyzing a di erentially expressed gene set and identifying functions or
pathways shared by these genes assumed to be responsible for cancer.
Knowledge discovery aims at extracting relevant and useful knowledge patterns from
a large amount of data. It is an interactive and iterative process involving a
human (analyst or domain expert) and data sources. We show how various gene
annotations and domain knowledge are integrated in a database which is then
queried for building in a exible way formal contexts. We present here a
preliminary experiments using these data. It was performed on a core context with
the addition of domain knowledge (by context apposition). The considered
domain knowledge are the hierarchical relationships between molecular pathways.
Pruning the obtained lattices allows us to retrieve interesting concepts which we
discuss. The results obtained from both experiments are also compared.</p>
      <p>The plan of the paper is as follows: Section 2 introduces Formal Concept
Analysis, Section 3 explains the data resources which are integrated, Section 4
focuses on the application of FCA, Section 5 discusses the results and Section 6
concludes the paper and presents future Work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Formal Concept Analysis</title>
      <p>We introduce here the basics of Formal Concept Analysis that are needed to
understand what follows. Let G and M be the set of objects and set of attributes
respectively and I be the relation between the objects and the attributes I
G M , where g 2 G, m 2 M , gIm is true i the object g has the attribute
m. The triple K = (G; M; I) is called a formal context. If A G, B M are
arbitrary subsets, then a Galois connection denoted by 0 is given by:
A0 := fm 2 M j gIm 8 g 2 Ag
B0 := fg 2 G j g I m 8 m 2 Bg
(1)
(2)</p>
      <p>
        FCA framework is fully described in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. FCA helps in de ning concepts which
are composed of a maximal set of objects sharing a maximal set of attributes.
However, given an input context, the resulting concept lattice can be very large
leading to computational and interpretation problems. In order to have reduced
and meaningful concepts, one can select concepts whose support is greater than
a certain threshold, i.e., the iceberg lattice. For a concept (A,B), the support is
the cardinality of the extent A. An alternative is to use the notion of stability
that was proposed in [
        <xref ref-type="bibr" rid="ref12 ref9">9, 12</xref>
        ]. The stability index measures how much the concept
intent depends on particular objects of the extent.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Complex Biological Data Integration</title>
      <p>In this section, we introduce and describe the biological data on which we are
working.
3.1</p>
      <sec id="sec-3-1">
        <title>Molecular Signature Database (MSigDB)</title>
        <p>
          Molecular Signature Database (MSigDB) is an up-to-date database which
contains data from several resources such as KEGG, BIOCARTA, REACTOME,
and Amigo [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. It is a collection of 6769 prede ned gene sets. A prede ned gene
set is a set of genes having a speci c property such as their position on the
genome (e.g., the genes at position chr5q12, i.e., band 12 on arm q of
chromosome 5), their involvment in a biological process or a molecular pathway (e.g.,
the genes which are involved in the KEGG APOPTOSIS pathway)... A pathway
is a series of actions among molecules in a cell that leads to a certain change
in a cell. KEGG is a database storing hundreds of known pathways4. Besides,
the MSigDB gene sets are grouped into ve categories (Table 1). For instance,
all the gene sets which are de ned on the basis of gene position belong to the
category C1. The category C5 groups the gene sets de ned on Gene Ontology
(GO) terms annotating the genes (with respect to their molecular function or
their housing cellular component).
        </p>
        <p>For our study, we used MSigDB Version 3.0. One entry, shown below in XML
format, describes the gene set corresponding to the GO term 'RNA Polymerase
II Transcription Factor Activity Enhancer Binding' (all the attribute names are
underlined). The Members attribute contains the list of gene symbols belonging
to the gene set. MSigDB was chosen as the main source for describing genes
because it gathers up-to-date informations about many aspects of human genes.</p>
        <p>&lt;GENESET Standard Name =\RNA Polymerase II Transcription Factor
Activity Enhancer Binding" Systematic Name = \M900" Historical Names =\"
Organism =\Homo sapiens" Geneset Listing URL =\" Chip = \Human Gene
Symbol" Category Code =\c5" Sub Category Code =\MF" Contributor =\Gene
Ontology" Contributor Org =\GO" Description Brief =\Genes annotated by
the GO term GO:0003705. Functions to initiate or regulate RNA polymerase
II transcription by binding an enhancer region of DNA." Description Full =""
Members =\ MYOD1, TFAP4, EPAS1, RELA, MYF5, MYEF2, NFIX, PURA,
HIF1A" Members Symbolized = \MYOD1, TFAP4, EPAS1, RELA, MYF5,
MYEF2, NFIX, PURA, HIF1A" Members EZID =\ 7023, 2034, 5970, 3091"
Members Mapping = \ MYOD1, 4654-TFAP4, TFAP4, 7023-EPAS1, EPAS1,
2034-RELA, RELA, 5970-MYF5, MYF5, 4617-MYEF2, MYEF2, 50804-NFIX,
NFIX, 4784-PURA, PURA, 5813-HIF1A" Status =\public" &gt; &lt;/GENESET&gt;
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Domain Knowledge</title>
        <p>Besides the gene annotations included in MSigDB, many types of domain
knowledge are interesting to use when analyzing genes. The rst type of such
do4 http://www.genome.jp/kegg/pathway.html
main knowledge are the hierarchical relationships between GO terms or between
KEGG pathways. Indeed, the KEGG hierarchy for human groups the KEGG
pathways into 40 categories and 6 upper level categories. Figure 1 illustrates the
KEGG hierarchy detailing on one upper-level category and one category.</p>
        <p>In our study we have genes described by pathways involving them which
may in turn be present in some category of pathways. For example, if a gene
is involved in a pathway apoptosis it will also be in the category 'Cell Growth
and Death'. In order to facilitate the knowledge discovery, it is important to
identify the relevant data sources, organize, and integrate the data at one single
database. In our case, the relevant primary data sources are MSigDB, KEGG
PATHWAYS database, and AmiGO database.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>From Data to Knowledge</title>
      <p>Once the data are integrated in our database the next step is to build formal
contexts for applying FCA. Our experiment focuses on applying FCA to a core
context describing genes by MSigDB-based attributes and shows its extension
based on the addition of domain knowledge.
4.1</p>
      <sec id="sec-4-1">
        <title>Test Data Sets</title>
        <p>
          The experiments described here are based on three published sets of genes
corresponding to Cancer Modules de ned in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. The authors compiled gene sets from
various resources and a large collection of micro-array data related to cancers.
These modules correspond to gene sets whose expression signi cantly change in
a variety of cancer conditions (they are also de ned as MSigDB gene sets in the
C4 category). Our test data are composed of three lists of genes corresponding
to the Cancer Modules 1 (Ovary Genes), 2 (Dorsal Root Ganglia Genes), and 5
(Lung Genes).
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Using FCA for Analyzing Genes</title>
        <p>We apply FCA for analyzing a context describing genes of each Cancer Module
with MSigDB-based attributes. Table 2 shows ve genes (involved in Cancer
Module 1) as a set of objects described by attributes which are the memberships
to gene sets from MSigDB. For example, CCT6A is in the set of genes (gene
set) whose standard name is Reactome Serotonin Receptors. Interestingly, by
querying our integrated database the analyst is able to select the prede ned
gene sets to include in the formal context.</p>
        <p>In order to extend the analysis of a list of genes, we need to take into account
the domain knowledge. Hence, the same experiment was conducted with the
addition of the KEGG hierarchy knowledge to the core contexts resulting in
three extended contexts. All KEGG categories and upper-level categories were
added as a set of attributes. If a gene is member of a KEGG pathway which in
turn belongs to a category and an upper level category then a cross ' ' is added
in the corresponding cells in the extended context.</p>
        <p>Table 2 shows ve genes (from Cancer Module 1) with the addition of
one KEGG category (kc) and one KEGG upper level category (kuc). In the
given example CCT6A is involved in pathway KEGG PPAR Signaling Pathway
which belongs to the category kc:Endocrine System and upper level category
kuc:Organismal Systems. The lattices were generated and the statistics for each
Cancer Module are given in Table 3. The concepts were ltered and ranked based
on same criteria as in the rst experiment.
Data Sets No. of Genes No. of Attributes No. of Concepts Levels
Module 1 361 3496 9,588 12
Module 2 378 3496 6,508 11</p>
        <p>Module 5 419 3496 5,004 12
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>In this study, biologists are interested in links between the input genes in terms of
pathways in which they participate, relationship between genes and microRNAs
etc. We obtained concepts with shared transcription factors, pathways, positions
of genes and some GO terms. After the selection of concepts with higher support,
we observed that there were some concepts with pathways from KEGG and
REACTOME as their intent. These pathways are either related to cell proliferation
or apoptosis (cell death). The addition of domain knowledge e ectively gives an
opportunity to obtain the pathway categories shared by larger sets of of genes.
Table 4 shows the top-ranked concepts found in each module. For example, in
module 5, we have con rmation that Cytokine Cytokine Receptor Interaction
pathway comes under the category Signaling Molecules and Interaction and
upper level category Environmental Information Processing (Concept ID 4938).
The absolute support and stability of the concept containing only the category
Signaling Molecules and Interaction and upper level category Environmental
Information Processing as its intent are higher (Concept ID 4995, Table 4) .</p>
      <p>To sum up, we were able to discover interesting biological properties of
subsets of genes in the three test data sets. As for example, the Focal Adhesion
pathway was found to be associated to 17 genes in both modules 1 and 2; the
Dataset Concept Intents</p>
      <p>ID
Module 1 9585
9571
9566
KEGG category Immune System was found to be shared by 11 to 25 genes in the
three cancer modules (Table 4). Given the test data sets, these results are
hopeful and constitute interesting positive control. This con rms that FCA-based
analysis o ers a powerful procedure to deeply explore sets of genes.
Our study shows how Formal Concept Analysis can be applied to complex
biological data. Data integration and FCA give the exibility of using various
types of attributes (pathways, GO terms, positions, microRNAs and
Transcription Factor Targets) for analyzing a list of genes. Our approach gives an insight
into how domain knowledge can be introduced in the analysis with the help of
FCA. As for future work, we plan to apply our approach to experimental gene
lists and take into account gene-gene relationships (physical Protein Protein
Interactions), term-term relationships (Gene Ontology relationships, namely is-a,
part-of, and regulates) and relationships between gene positions. Moreover, in
order to e ciently deal with the relationships present within the data we can
use Relational Concept Analysis.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Gabriel F. Berriz,
          <string-name>
            <given-names>Oliver D.</given-names>
            <surname>King</surname>
          </string-name>
          , Barbara Bryant, Chris Sander, and
          <string-name>
            <given-names>Frederick P.</given-names>
            <surname>Roth</surname>
          </string-name>
          .
          <article-title>Characterizing gene sets with FuncAssociate</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>19</volume>
          (
          <issue>18</issue>
          ):
          <volume>2502</volume>
          {
          <fpage>2504</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Scott</given-names>
            <surname>Doniger</surname>
          </string-name>
          , Nathan Salomonis, Kam Dahlquist, Karen Vranizan, Steven Lawlor, and Bruce Conklin.
          <article-title>MAPPFinder: using Gene Ontology and GenMAPP to Create a Global Gene-expression Pro le from Microarray Data</article-title>
          .
          <source>Genome Biology</source>
          ,
          <volume>4</volume>
          (
          <issue>1</issue>
          ):
          <fpage>R7</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Aravind</given-names>
            <surname>Subramanian</surname>
          </string-name>
          et al.
          <article-title>Gene Set Enrichment Analysis: A Knowledge-based Approach for Interpreting Genome-wide Expression Pro les</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          ,
          <volume>102</volume>
          :
          <fpage>15545</fpage>
          {
          <fpage>15550</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Arthur</given-names>
            <surname>Liberzon</surname>
          </string-name>
          et al.
          <article-title>Molecular Signatures Database (MSigDB) 3.0</article-title>
          . Bioinformatics,
          <volume>27</volume>
          (
          <issue>12</issue>
          ):
          <volume>1739</volume>
          {
          <fpage>1740</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Michael</surname>
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Galperin and Xose M. Fernandez-Suarez</surname>
          </string-name>
          .
          <article-title>The 2012 Nucleic Acids Research Database Issue and the online Molecular Biology Database Collection</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>40</volume>
          (
          <string-name>
            <surname>Database-Issue</surname>
          </string-name>
          ):
          <volume>1</volume>
          {
          <issue>8</issue>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Bernhard</given-names>
            <surname>Ganter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rudolf</given-names>
            <surname>Wille</surname>
          </string-name>
          .
          <source>Formal Concept Analysis: Mathematical Foundations</source>
          . Springer, Berlin/Heidelberg,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Mehdi</given-names>
            <surname>Kaytoue-Uberall</surname>
          </string-name>
          , Sebastien Duplessis,
          <string-name>
            <surname>Sergei O. Kuznetsov</surname>
            , and
            <given-names>Amedeo</given-names>
          </string-name>
          <string-name>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Two FCA-Based Methods for Mining Gene Expression Data</article-title>
          . In Sebastien Ferre and Sebastian Rudolph, editors,
          <source>ICFCA</source>
          , volume
          <volume>5548</volume>
          of Lecture Notes in Computer Science, pages
          <volume>251</volume>
          {
          <fpage>266</fpage>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Purvesh</given-names>
            <surname>Khatri</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sorin</given-names>
            <surname>Draghici</surname>
          </string-name>
          .
          <source>Ontological Analysis of Gene Expression Data: Current Tools</source>
          , Limitations, and Open Problems. Bioinformatics,
          <volume>21</volume>
          (
          <issue>18</issue>
          ):
          <volume>3587</volume>
          {
          <fpage>3595</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Sergei</surname>
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
          </string-name>
          .
          <article-title>On stability of a Formal Concept</article-title>
          . Ann. Math. Artif. Intell.,
          <volume>49</volume>
          (
          <issue>1-4</issue>
          ):
          <volume>101</volume>
          {
          <fpage>115</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Francois</surname>
            <given-names>Rioult</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jean-Francois</surname>
            <given-names>Boulicaut</given-names>
          </string-name>
          , Bruno Cremilleux, and
          <string-name>
            <given-names>Jeremy</given-names>
            <surname>Besson</surname>
          </string-name>
          .
          <article-title>Using Transposition for Pattern Discovery from Microarray Data</article-title>
          .
          <source>In DMKD</source>
          , pages
          <volume>73</volume>
          {
          <fpage>79</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Francois</surname>
            <given-names>Rioult</given-names>
          </string-name>
          , Celine Robardet, Sylvain Blachon, Bruno Cremilleux,
          <string-name>
            <surname>Olivier</surname>
            <given-names>G</given-names>
          </string-name>
          , and
          <article-title>Jean-Francois Boulicaut. Mining Concepts from Large SAGE Gene Expression Matrices</article-title>
          . In In: Proceedings KDID03 co
          <article-title>-located with ECML-PKDD 2003, CatvatDubrovnik (Croatia</article-title>
          , pages
          <volume>107</volume>
          {
          <fpage>118</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Camille</surname>
            <given-names>Roth</given-names>
          </string-name>
          , Sergei A.
          <string-name>
            <surname>Obiedkov</surname>
          </string-name>
          , and
          <string-name>
            <surname>Derrick</surname>
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Kourie</surname>
          </string-name>
          .
          <article-title>Towards Concise Representation for Taxonomies of Epistemic Communities</article-title>
          .
          <source>In CLA.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Zhong</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Storch</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lipan</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kao</surname>
            <given-names>MJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weitz</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <article-title>and Wong WH</article-title>
          .
          <article-title>GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in Gene Ontology space</article-title>
          .
          <source>Applied Bioinformatics</source>
          ,
          <volume>3</volume>
          (
          <issue>4</issue>
          ):1{
          <issue>5</issue>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Eran</surname>
            <given-names>Segal</given-names>
          </string-name>
          , Nir Friedman, Daphne Koller, and
          <string-name>
            <given-names>Aviv</given-names>
            <surname>Regev</surname>
          </string-name>
          .
          <source>A Module Map Showing Conditional Activity of Expression Modules in Cancer. Nat.Genet</source>
          .,
          <volume>36</volume>
          :1090{
          <fpage>8</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>