<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ontology Modeling and Analysis of COVID-19 Associated Acute Kidney Injury and Its Underlying Molecular Mechanisms</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Easheta Shah</string-name>
          <email>shaheash@umich.edu</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roshan Desai</string-name>
          <email>roshand@umich.edu</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Suyuan Peng</string-name>
          <email>peng.suyuan@bjmu.edu.cn</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luxia Zhang</string-name>
          <email>luxia_zhang@163.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yongqun He</string-name>
          <email>yongqunh@med.umich.edu</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Advanced Institute of Information Technology, Peking University</institution>
          ,
          <addr-line>Hangzhou</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Medicine, Renal Division, Peking University First Hospital, Peking University Institute of Nephrology</institution>
          ,
          <addr-line>No.8 Xishiku St., Beijing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Institute of Health Data Science, Peking University</institution>
          ,
          <addr-line>No.38 Xueyuan Rd., Beijing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Public Health, Peking University</institution>
          ,
          <addr-line>No.38 Xueyuan Rd., Beijing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Michigan</institution>
          ,
          <addr-line>Ann Arbor, MI</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Acute kidney injury (AKI) is found to be common among COVID-19 patients. In this study, we performed extensive literature mining and used the BioGRID COVID-19 interaction data to bridge the mechanistic and molecular link between COVID-19 and AKI. DAVID GO enrichment analysis of the BioGRID data allowed for further filtration of COVID-19 related interactors by their relevance to untoward kidney manifestations. Key physiological processes involved in this pathway include Renin-Angiotensin system (RAS) activation, complement activation, and most importantly, systemic inflammation. Discovered interactors like CD147, CD209, CypA, and MASP2 were found to be heavily implicated in the mentioned processes. The Coronavirus Infectious Disease Ontology (CIDO) was used to represent our analyzed results, leading to further understanding of the COVID-19 associated AKI mechanisms.</p>
      </abstract>
      <kwd-group>
        <kwd>1 COVID-19</kwd>
        <kwd>Acute Kidney Injury (AKI)</kwd>
        <kwd>Coronavirus Infectious Disease Ontology (CIDO)</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The SARS-CoV-2 virus has many different phenotypic outcomes associated with the failure of
organs and organ systems. This multi-organ effect heavily involves untoward manifestations in the
kidney, ranging from mild proteinuria to progressive Acute kidney injury (AKI). It has been reported
that over 30 percent of hospitalized patients in New York with COVID-19 developed AKI as defined
by KDIGO criteria [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Many resources and ongoing research efforts are available to study SARS-CoV-2 pathogenesis,
including thousands of peer-reviewed journal articles that have been published with experimentally
verified scientific insights. Many databases have also been developed in this context. The BioGRID
database has compiled many coronavirus proteins and their interactions with specific host molecules
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. For SARS-CoV-2, BioGRID provides 32 viral proteins and the different host interactors each
protein is involved with. Additionally, the Coronavirus Infectious Disease Ontology (CIDO) has been
developed to represent an aggregate of experimentally verified knowledge [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. CIDO provides
important representations of coronavirus diseases with relation to their etiology, transmission,
pathogenesis, host-coronavirus interactions, and vaccine treatments [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this study, we hypothesize that AKI is manifested through molecular interactions between
SARSCoV-2 proteins and host proteins. BioGRID/literature mining and ontology modeling were used to
explore the possible protein-protein interactions (PPIs) leading to AKI in COVID-19 patients.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods</title>
    </sec>
    <sec id="sec-3">
      <title>2.1. Literature Mining and Annotation</title>
      <p>PubMed and PubMed Central were utilized in the search for peer-reviewed results in the context
of SARS-CoV-2, AKI and host-coronavirus interactions. The resulting discoveries of kidneyrelated
phenotypic outcomes were then collected in a highly specific Excel format in which each interaction
cause and outcome were precisely categorized. Specific focuses were on S and N proteins and their
interactions with host proteins. Specific pathways such as the Renin-Angiotensin system (RAS) and
complement activation were systematically studied.
2.2.</p>
    </sec>
    <sec id="sec-4">
      <title>BioGRID Data Analysis</title>
      <p>
        As of Jun 5, 2021, BioGRID collected 643 proteins interacting with SARS-CoV-2 S protein, and
345 proteins interacting with SARS-CoV-2 N protein. These interactors were downloaded and exposed
for DAVID Gene Ontology (GO) enrichment analysis. The bioinformatics resource DAVID is used to
create charts and lists of interactors and interactor functions for further functional analysis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. DAVID
allows for additional filtration by relevance to the kidney. By inputting BioGRID's 20 viral interactors
into DAVID, we were able to view specific interactors and determine which ones are related to the
kidneys and AKI by searching for kidney-related terms like "renal failure" and "nephropathy".
2.3.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Ontology Modeling</title>
      <p>
        Each interaction pair was tagged by its reference, and all together, this data modeled the ontological
structure required for infectious disease bioinformatics. These interactions were compared to rapidly
developing and existing COVID-19 ontologies like BioGRID and CIDO. Protege-OWL editor [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] was
used for CIDO modeling. The results are available at the CIDO GitHub website:
https://github.com/CIDO-ontology/cido.
      </p>
    </sec>
    <sec id="sec-6">
      <title>3. Results</title>
    </sec>
    <sec id="sec-7">
      <title>3.1. BioGRID Data Analysis and Further Literature Analysis</title>
      <p>BioGRID provides a list of known viral-host interactions for the SARS-CoV-2 virus. Each interactor
is listed in accordance with its relation to the S or N viral protein and each description is supported by
two sources regarding its specific coronavirus interaction and kidney manifestation. The provided table
is a compilation of these findings from BioGRID, DAVID, and further literature analysis.</p>
      <p>Our literature search found two key phenotypic processes in circulation that are associated with
COVID-19 related AKI: RAS activation and complement activation. Both processes are catalyzed by
specific viral-host interactions, currently identified as follows: ACE2 and RAS activation and MASP2
and complement activation. These two processes can both induce inflammation.</p>
      <p>
        The renin-angiotensin system (RAS) is one of the major control systems for blood pressure and fluid
balance. It plays an important role in the physiological regulation of the kidneys, heart, and blood
vessels. The activation of this system is a central part of many common pathological conditions,
including hypertension, heart failure, and kidney disease. One of the enzymes in RAS, ACE2, has been
identified as a cellular receptor for SARS-CoV-2. This pathway is activated by the binding of viral S
protein and the ACE2 human cell receptor, the latter of which is present in various organs, including
the kidney [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Viral entry may result in kidney involvement via systemic RAS activation and organ
crosstalk. These untoward effects result in kidney injury when left untreated.
      </p>
      <p>
        Another pathway to kidney injury is complement activation which is activated by binding of viral N
protein and the MASP-2 cell receptor, a mannan-binding lectin protease. Complement activation is a
signaling cascade event that can be activated by lectin binding, and it results in lung injury, which will
indirectly affect the kidney, and endothelial injury and thrombotic microangiopathy which will directly
affect the kidney [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ].
      </p>
      <p>By incorporating literature review results with our BioGRID and DAVID filtration analysis, we
were able to generate an integrative model of the AKI formation (Fig. 1). This model integrates systemic
inflammation through the processes of complement activation and RAS activation. Out of the ten
proteins in Table 1, three notable interactors were found to have strong relevance to the previously
mentioned systemic processes and SARS-CoV-2 infection: CD147 (BSG) and CD209 (DC-SIGN), S
protein interactors, and PPIA (CyPA), an N protein interactor (Table 1). Of note is the binding
relationship between CD147 and CyPa, which manifests into Acute Kidney Injury independent of viral
entry. This interaction manifestation may also be implicated with the SARS-CoV-2 virus, but this part
of the pathway requires further investigation. While explicit mechanisms leading to AKI remain
unclear, a large portion of the story has been uncovered by performing further literature studies on the
articles regarding these three interactors and combining the information to formulate a cohesive
conclusion. For DC-SIGN and BSG, in their respective sources, there was a strong correlation between
the molecular presence and the likelihood of AKI [10, 11]. Furthermore, all three interactors noted are
heavily involved in stimulating an inflammatory response, leading to the understanding that
inflammation is a common connection leading from Viral Infection to AKI.</p>
      <p>The terms and relations identified above (Table 1 and Fig. 1) were also represented in the CIDO.
After importing all the proteins defined in Table 1 from the Protein Ontology (PRO) [12], we generated
new axioms to define the relations between host and viral proteins as demonstrated below:
• 'S protein of SARS-CoV-2': 'capable of binding to' some 'basigin (human)'
• 'nucleoprotein (SARS-CoV-2)': 'capable of binding to' some 'peptidyl-prolyl cis-trans
isomerase A (human)'
• 'peptidyl-prolyl cis-trans isomerase A (human)': 'has receptor' some 'basigin (human)'
• 'cyclosporin A': 'chemical has protein target as inhibitor' some 'peptidyl-prolyl cis-trans
isomerase A (human)'
• Polyman26: 'chemical has protein target as antagonist' some 'CD209 antigen (human)'
Here we needed to define the meanings of the two relations 'capable of binding to' and 'has receptor,'
which are not available in Relation Ontology (RO). While the binding of a certain viral protein to some
host interactor can induce viral infection, this process will not always occur under varying conditions.
Thus, we must use a term like 'capable of binding to' rather than a relation term like bound_to
(http://purl.obolibrary.org/obo/OBI_1110119), the latter of which asserts that such binding is always
there. Similarly, the binding between CD147 and CyPA is a process that may occur under specific
conditions independent of SARS-CoV-2 viral infection. Therefore, we use the related term 'has
receptor.'</p>
    </sec>
    <sec id="sec-8">
      <title>4. Discussion</title>
      <p>The contributions of this study are multifold. Firstly, by BioGRID data analysis and literature
annotation, we identified many human proteins such as CD147, CD209, and CyPA that are capable of
interacting with viral S or N protein and likely contribute to COVID-19-associated AKI induction.
Secondly, through literature mining, we identified several important processes, including systemic
inflammation, RAS activation, and complement activation, which likely contribute to AKI formation
in COVID-19 patients. Thirdly, we were able to generate a new integrative model that explains the
COVID-19 associated AKI generation by linking our newly identified proteins and biological
processes. Lastly, we modeled and represented the knowledge identified in this study in the CIDO
ontology.</p>
      <p>While the BioGRID database provides a large number of host-coronavirus interactions, the
knowledge does not directly translate to our understanding of COVID-19 associated AKI generation.
This is due to the high throughput screening of the present BioGRID data. Our approach strategically
deciphers through new knowledge from BioGRID data to support developing hypotheses about this
pathogenesis. The hypotheses generated in this study are worth experimental verification.</p>
      <p>Ontologies have played significant roles in our study. We used the Gene Ontology (GO) enrichment
analysis through the DAVID tool. Later, we used CIDO for our ontology modeling of the
hostcoronavirus interactions to better understand how AKI occurs in COVID-19 patients. The logical
axioms added to CIDO in this project begin to represent the knowledge of COVID-19 related AKI
manifestation, but further axioms can be developed to link more downstream processes (e.g., systemic
inflammation) of this pathogenesis.</p>
      <p>More work is currently underway. Some COVID-19 patients experience early AKI symptoms (AKI
onset before multiorgan dysfunction) while other patients do not. Previously, we found that this AKI
generation might be due to direct viral infection or indirect influences from the viral infection of other
organs [13]. It is hypothesized that early AKI is a result of direct viral infection to the kidney. We are
currently extending our current methods to better address the hypothesis.</p>
    </sec>
    <sec id="sec-9">
      <title>5. Acknowledgements</title>
      <p>This study was supported by a grant to LZ and YH from Michigan Medicine–Peking University
Health Sciences Center Joint Institute for Clinical and Translational Research (71017Y2027) and
PKUBaidu Fund (2020BD032) to SP.
6. References
[10] T. Kosugi, K. Maeda, W. Sato, S. Maruyama, and K. Kadomatsu, CD147 (EMMPRIN/Basigin) in
kidney diseases: from an inflammation and immune system viewpoint, Nephrol Dial Transplant,
vol. 30, pp. 1097-103, Jul 2015.
[11] T. Zhou, X. Li, J. Zou, M. Cai, G. Sun, Y. Zhang, et al., Effects of DC-SIGN expression on renal
tubulointerstitial fibrosis in nephritis, Front Biosci (Landmark Ed), vol. 14, pp. 2935-43, Jan 1
2009.
[12] D. A. Natale, C. N. Arighi, J. A. Blake, C. J. Bult, K. R. Christie, J. Cowart, et al., Protein Ontology:
a controlled structured network of protein entities, Nucleic Acids Res, vol. 42, pp. D415-21, Jan
2014.
[13] S. Peng, H. Y. Wang, X. Sun, P. Li, Z. Ye, Q. Li, et al., Early versus late acute kidney injury among
patients with COVID-19-a multicenter study from Wuhan, China, Nephrol Dial Transplant, vol.
35, pp. 2095-2102, Dec 4 2020.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Hirsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. H.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Ross</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. H.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Barnett</surname>
          </string-name>
          , et al.,
          <article-title>Acute kidney injury in patients hospitalized with COVID-19</article-title>
          ,
          <string-name>
            <surname>Kidney</surname>
            <given-names>Int</given-names>
          </string-name>
          , vol.
          <volume>98</volume>
          , pp.
          <fpage>209</fpage>
          -
          <lpage>218</lpage>
          ,
          <year>Jul 2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Oughtred</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Stark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Breitkreutz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rust</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Boucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chang</surname>
          </string-name>
          , et al.,
          <source>The BioGRID interaction database: 2019 update, Nucleic Acids Res</source>
          , vol.
          <volume>47</volume>
          , pp.
          <fpage>D529</fpage>
          -
          <lpage>D541</lpage>
          , Jan 8
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Ong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Huffman</surname>
          </string-name>
          , et al.,
          <article-title>CIDO, a community-based ontology for coronavirus disease knowledge and data integration, sharing, and analysis</article-title>
          ,
          <source>Sci Data</source>
          , vol.
          <volume>7</volume>
          , p.
          <fpage>181</fpage>
          ,
          <string-name>
            <surname>Jun</surname>
            <given-names>12</given-names>
          </string-name>
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W.</given-names>
            <surname>Huang da</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. T.</given-names>
            <surname>Sherman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Lempicki</surname>
          </string-name>
          ,
          <article-title>Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources</article-title>
          ,
          <source>Nat Protoc</source>
          , vol.
          <volume>4</volume>
          , pp.
          <fpage>44</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Rubin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Noy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          ,
          <article-title>Protege: a tool for managing and using terminology in radiology applications</article-title>
          ,
          <source>J Digit Imaging</source>
          , vol.
          <volume>20</volume>
          <issue>Suppl 1</issue>
          , pp.
          <fpage>34</fpage>
          -
          <lpage>46</lpage>
          ,
          <year>Nov 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          , et al.,
          <article-title>Interaction network of SARSCoV-2 with host receptome through spike protein</article-title>
          , bioRxiv,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>I.</given-names>
            <surname>Hamming</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Timens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Bulthuis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Lely</surname>
          </string-name>
          , G. Navis, and
          <string-name>
            <surname>H. van Goor</surname>
          </string-name>
          ,
          <article-title>Tissue distribution of ACE2 protein, the functional receptor for SARS coronavirus. A first step in understanding SARS pathogenesis</article-title>
          ,
          <source>J Pathol</source>
          , vol.
          <volume>203</volume>
          , pp.
          <fpage>631</fpage>
          -
          <lpage>7</lpage>
          ,
          <year>Jun 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Pfister</surname>
          </string-name>
          , E. Vonbrunn,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ries</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Jack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Uberla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Lochnit</surname>
          </string-name>
          , et al.,
          <source>Complement Activation in Kidneys of Patients With COVID-19, Front Immunol</source>
          , vol.
          <volume>11</volume>
          , p.
          <fpage>594849</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E.</given-names>
            <surname>Asgari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Farrar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lynch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. M.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Roscher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Stover</surname>
          </string-name>
          , et al.,
          <article-title>Mannan-binding lectinassociated serine protease 2 is critical for the development of renal ischemia reperfusion injury and mediates tissue injury in the absence of complement C4</article-title>
          ,
          <string-name>
            <surname>FASEB</surname>
            <given-names>J</given-names>
          </string-name>
          , vol.
          <volume>28</volume>
          , pp.
          <fpage>3996</fpage>
          -
          <lpage>4003</lpage>
          ,
          <year>Sep 2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>