=Paper= {{Paper |id=Vol-1309/paper7 |storemode=property |title=Towards a common semantic representation of informed consent for biobank specimens |pdfUrl=https://ceur-ws.org/Vol-1309/paper7.pdf |volume=Vol-1309 }} ==Towards a common semantic representation of informed consent for biobank specimens== https://ceur-ws.org/Vol-1309/paper7.pdf
          Towards a Common Semantic Representation of
            Informed Consent for Biobank Specimens
          Frank J. Manion1*, Yongqun He2, Elizabeth Eisenhauer3, Yu Lin2, Alla Karnovsky4,
                                      Marcelline R. Harris3
                                   1
                                     Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI 48109
      2
          Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics,
                                                University of Michigan Medical School, Ann Arbor, MI 48109, USA
                    3
                      Division of Systems Leadership and Effectiveness Science, University of Michigan School of Nursing, Ann Arbor, MI 48109
                     4
                       Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109

    Abstract — Biospecimen-based research is rapidly growing in the                 may be collected under different models of consent. A typical
post genomic era, and includes the need to retrieve specimens from                  scenario might read something like this:
distributed biobanks of various size and complexity in a fashion that
ethically preserves the expressed wishes of specimen donors as                              “For   my   study,   I   want to use samples from my
represented by the informed consent process and its artifacts. This                     organizations’s  biobank, collected under a blanket biobank
paper briefly describes existing work along these lines, presents some                  informed consent form. I discover that I will need more
challenges unique to biobanks, and presents our own work on an                          samples, so I contact   another   organization’s   biobank   to  
ontology of informed consent.                                                           determine if they hold relevant and available specimens.
                                                                                        That   organization’s   samples   were   collected   under   a   tiered  
  Keywords— BioBank, Informed consent; ontology; ICO; OBO                               biobank informed consent form. While some samples are
Foundry, Basic Formal Ontology (BFO), OBI ontology                                      shared with me, I still need more samples to address the
                                                                                        requirements of my study. I then collect additional samples
                           I.     INTRODUCTION                                          using a consent form specific  to  my  study.”
    Research in the post-genomic era requires access to high
quality biospeciments, often annotated with or linked to                                In this example there are three informed consents forms to
clinical data. Many groups at varying levels of institutional                       account for – a blanket consent, a tiered consent, and the
complexity, ranging from small scale individual laboratories to                     investigator’s single study consent. In an effort to support the
distributed international collaboratives, have established and                      expressed wishes of the donors, informed consent documents
operate biorepositories (also refered to by various names such                      impose a series of legal and ethical restrictions, obligations,
as biobanks, biolibraries, and even collections). Often, there                      and permissions to biobank operators and research teams using
are needs to share data and specimens among multiple                                the specimens and data collected in these banks. Often these
biobanks [1-3]. The act of requesting specimens from a                              rights, obligations, and permissions accrue from multiple
biorespository may demand a complex series of transactions,                         sources of authority and are represented in multiple legal
each of which in turn may convey a series of rights, obligations,                   documents. Consequently, the biobanking domain presents a
and permissions for access to specimens and data. Despite over                      series of modeling challenges, including:
a decade of experience incorporating biospecimens in the                                The operational model of the biobank. A biobank can be a
research process, formal models that describe the use of                            single, dedicated resource that provides samples to single or
biorepositories in human research are a relatively recent                           closely allied groups of studies using a common consent
development. Without a common formal model of consent and                           model. It might be a virtual or distributed biorepository using
the associated permissions on collection and distribution of                        precoordinated consent models. Another organization structure
specimens and data, integration of data across the translational                    might be that of a shared biobank facility containing multiple
spectrum, or from multiple banks and institutions will remain a                     sets of tissues from multiple projects and attempting to
difficult, manually intensive problem.                                              maximize use of these tissue resources by making them
    In this paper we briefly review current efforts toward such                     available to requestors.
models, describe our own work toward a formal model for                                The consent model used for the biobank. This can be opt-in
informed consent, and describe what we consider challenges                          or opt-out. In the case of an opt-in consent model, a tiered
and opportunities for supporting biorespository-based research                      consent may be used to present the participant or volunteer
with ontologies. A simple example that provides a motivation                        with choices of the type of data the participant may want
for this effort follows.                                                            shared, and for what types of research or other constraints.
                  II.    EXAMPLE OF THE CHALLENGE                                       The protocol model the bank operates under. Typically a
                                                                                    biobank serving more than one project would operate under
    Clinical or translational research often involves the
                                                                                    one or more Institutional Review Board (IRB)-approved
extraction and usage of biospecimen from humans. Different
                                                                                    collection protocols and Health Insurance Portability and
biospecimens may be stored and processed differently, and
                                                                                    Accountability Act (HIPAA) authorizations. Researchers
   * Corresponding author.




                                                                           	
  61	
  
subsequently requesting specimens and data would operate                                IV.   THE INFORMED CONSENT ONTOLOGY (ICO)
under separate IRB-approved protocols, and depending on this                         Development of ICO, a BFO-based ontology represented in
protocol, separate consent and HIPAA authorization may be                        the Web Ontology Language (OWL2) [12], follows OBO
required for use of a previously banked specimen. Such a                         Foundry principles of openness and collaboration. ICO is
model is sometimes called a two-protocol model [4].                              aligned with the BFO [13], making it possible to align and
    Rights, obligations, and permissions accrue from multiple                    integrate with other BFO-based ontologies. The initial release
sources and must be consistent across time. Properly modeling                    of the ontology focuses on modeling informed consent
the decisions typically made by human review boards and                          documents. As for Aug. 14, 2014, ICO contains 471 terms
regulatory personnel considering sample and data distribution                    including 137 ICO-specific terms and other terms imported
for research requires modeling not just the consent documents,                   from other BFO-aligned ontologies. Detailed ICO statistics can
but the protocols, data use agreements, and possibly other                       be     found    on    the    Ontobee    ICO    web       page:
information artifacts used in both depositing samples into a                     http://www.ontobee.org/ontostat.php?ontology=ICO. ICO is
biobank, and withdrawing them for subsequent research.                           released under an open Creative Commons 3.0 License.
   In a research oriented university such as the University of                       The ontology was developed using a combination of top-
Michigan, thousands of informed consent forms have been                          down and bottom-up approaches. Protégé-OWL 4.2 was used
generated, and there are over 100 biobanks in the Medical                        for the ontology authoring and editing. To build the OBI-based
School alone. Queries supporting appropriate use of banked                       framework of ICO we manually identified informed consent
biospecimens and data must be linked to the signed informed                      concepts from existing OBO Foundry library ontologies. These
consent agremments with the biospecimen donor.                                   were imported to ICO using Ontodog [14] and OntoFox [15]
                                                                                 which allowed for recursive inclusion of all defined axioms
                      III.   EXISTING EFFORTS                                    and related terms. The results were then manually reviewed for
    Several current efforts are evident, focused on modeling                     final approval before inclusion in the ICO framework.
aspects of the biobanking domain. At least two BFO-aligned                           Bottom up construction proceeded by manually identifying
ontologies relate to biobanking. The Ontologized Minimum                         and extracting a list of candidate terms from two informed
Information About BIobank data Sharing (OMIABIS)                                 consent templates used at the University of Michigan (one
expresses data concepts in an ontology of biobank                                from the Medical School Institutional Review Board, another
administration [5]. OMIABIS is based on work by Norlin and                       from the Health Sciences and Behavioral Sciences Institutional
colleagues [6] to develop a minimum data set for eight                           Review Board). We also identified terms from a consent form
countries participating in the EU Biobanking and Biomolecular                    used for the University of Michigan Medical School
Resources Research Infrastructure project. Limitations of this                   biorepository, and from World Health Organization (WHO)
effort are that it is intended to serve only as a description of a               informed consent templates. The candidate terms identified
biobank contents, and does not describe collection critera,                      from these templates were then enriched with metadata
consenting, and protocol provenance of individual specimens.                     including definitions, concept identifiers, preferred terms,
A group at the University of Pennsylvania is developing an                       synonyms, and URIs extracted from three ontology
ontology for the representation of biobanks, although the work                   repositories: the National   Library   of   Medicine’s   Unified
is in early stages [7]. Similarly, we are aware that a group at                  Medical Language System (UMLS®) Metathesaurus [16]; the
Duke University is working on a collaborative effort to develop                  National Center for Biomedical Ontology (NCBO) BioPortal
a normative set of data elements and terms to recommend as                       [17]; and Ontobee [18]. When textual definitions were not
best practice to the International Society for Biological and                    provided, other sources such as clinical research glossaries or
Environmental Repositories (ISBER), although this work is not                    the current literature were used. These enriched candidate
yet published [8, 9].                                                            terms were manually mapped to several pre-identified
    There are also non-BFO aligned ontologies in related areas,                  resources containing terms and definitions developed and
including a Permission Ontology used for development and                         vetted by the United States regulatory community. This process
evaluation of software tools for reasoning about consent                         yielded candidate preferred terms contining definitions
permission, published by a group at the University of                            accepted as robust and well defined by that community.
California San Diego (UCSD) [10]. Related work to build a                        Resources used in this step included the National Cancer
Research Permission Management System was done at the                            Institute Thesaurus (NCIt), the Biomedical Research Integrated
Medical University of South Carolina (MUSC) to support a                         Domain Group (BRIDG) [19], the Ontology of Clinical
statewide research network [11]. A search of the term                            Research (OCRe) [20], the Consumer Health Vocabulary
“consent”   in   the   NCBO   biportal   identified   the   notion   of          (CHV) and the University of California San Diego permission
informed consent at the class level in 19 different systems                      ontology [10].
(http://bioportal.bioontology.org/search).                                           The pool of enriched candidate terms was organized into
   Our efforts to develop a BFO-aligned informed consent                         categories of like terms according to their definitions. For
ontology (ICO) emphasizes the broad domain of informed                           example, the category   ‘authorization’   included the terms
consent. Although motivated by a biobanking use case, initial                    ‘authorization for medical records release’,   ‘authorization
development reported here is not restricted to that domain.                      documentation’   or   ‘authorization’. Enriched candidate terms
                                                                                 grouped by categories formed to-be-included terms in ICO.
                                                                                 The final set of categories (or modeling units) was then




                                                                        	
  62	
  
mapped to branches of BFO. For example, terms categorized                                                   REFERENCES
under   ‘authorization’   were considered to be subclasses of           [1]  A. Cambon-Thomsen, E. Rial-Sebbag, and B. M. Knoppers, "Trends in
BFO:process. Informed consent workflows in a typical clinical                ethical and legal frameworks for the use of human biobanks," Eur
research study were modeled as three processes: (i) pre-                     Respir J, vol. 30, pp. 373-82, Aug 2007.
informed consent processes, (ii) obtaining informed consent             [2] R. E. Hewitt, "Biobanking: the foundation of personalized medicine,"
                                                                             Curr Opin Oncol, vol. 23, pp. 112-9, Jan 2011.
processes, and (iii) processes after signing informed consent           [3] G. E. Henderson, R. J. Cadigan, T. P. Edwards, I. Conlon, A. G. Nelson,
documents. Relations between entities involved in the above                  J.  P.  Evans,  A.  M.  Davis,  C.  Zimmer,  and  B.  J.  Weiner,  “Characterizing  
processes were defined. Finally, all terms and relations were                biobank   organizations   in   the   U.S.:   results   from   a   national   survey,”  
aligned with BFO.                                                            Genome Medicine, vol. 5, no. 1, p. 3, Jan. 2013.
                                                                        [4] F. J. Manion, R. J. Robbins, W. A. Weems, and R. S. Crowley,
             V.    DISCUSSION AND CONCLUSIONS                                "Security and privacy requirements for a multi-institutional cancer
                                                                             research data grid: an interview-based study," BMC Med Inform Decis
    Modeling informed consent is a necessary but not sufficient              Mak, vol. 9, p. 31, 2009.
part of the modeling needed to support responsible use of               [5] M. Brochhausen, M. N. Fransson, N. V. Kanaskar, M. Eriksson, R.
biospecimens and data in research. Biospecimen and data                      Merino-Martinez, R. A. Hall, L. Norlin, S. Kjellqvist, M. Hortlund, U.
                                                                             Topaloglu, W. R. Hogan, and J.-E. Litton, "Developing a semantically
release is complex, and informed consent plays a major role in               rich ontology for the biobank-administration domain," J Biomed
the regulatory and scientific governance used by                             Semantics, vol. 4, p. 23, 2013.
biorepositories to release specimens and data. In follow on             [6] Norlin, M. N. Fransson, M. Eriksson, R. Merino-Martinez, M.
work we plan to examine the specific area of specimen and                    Anderberg, S. Kurtovic, and J.-E.   Litton,   “A   Minimum   Data   Set   for  
                                                                             Sharing   Biobank   Samples,   Information,   and   Data:   MIABIS,”  
data release involving the longitudinal agreements of rights,                Biopreservation and Biobanking, vol. 10, no. 4, pp. 343–348, Aug.
permissions, and obligations. Other work is needed in the                    2012.
complex areas of protocol representation, data use agreements           [7] J. Zheng, Personal Communication to Yongqun He, 2014.
and material transfer agreements.                                       [8] H. Ellis, Personal Communication to Frank J. Manion, 2014.
                                                                        [9] ISBER. (2014). ISBER - International Society for Biological and
    Limitations of our preliminary work will inform further                  Environmental Repositories. Available: http://www.isber.org/
development efforts toward a robust Informed Consent                    [10] A. Grando and R. Schwab, "Building and evaluating an ontology-based
Ontology. First, the ICO is admittedly preliminary work and is               tool for reasoning about consent permission," AMIA Annu Symp Proc,
                                                                             vol. 2013, pp. 514-23, 2013.
currently focused on informed consent documents and                     [11] J. S. Obeid, K. Gerken, K. C. Madathil, D. Rugg, C. E. Alstad, K. Fryar,
processes. More work is needed to validate the coverage and                  R. Alexander, A. K. Gramopadhye, J. Moskowitz, and I. C. Sanderson,
completeness in the domain. Concepts from the US Common                      “Development   of   an   Electronic   Research   Permissions   Management  
Rule and the EU Prior Informed Consent legislation need to be                System to Enhance Informed Consents and Capture Research
                                                                             Authorizations   Data,”   AMIA Summits Transl Sci Proc, vol. 2013, pp.
included. Our current models of informed consent processes                   189–193, 18 2013.
likely lack the richness and complexity of real-life informed           [12] W3C, "OWL 2 Web Ontology Language document overview," pp.
consent processes, and they need validation with research study              http://www.w3.org/TR/2009/REC-owl2-overview-20091027/. Accessed
teams from a variety of domain areas. Aspects of rights,                     on March 1, 2014, 2009.
obligations, permissions, and ethics must be modeled and used           [13] P. Grenon and B. Smith, "SNAP and SPAN: Towards dynamic spatial
                                                                             ontology," Spatial Cognition and Computation, vol. 4, pp. 69–103,
to extend the ontology. Finally, axioms must be developed and                2004.
competency validation of the ICO must be conducted using a              [14] J.  Zheng,  Z.  Xiang,  C.  J.  Stoeckert,  and  Y.  He,  “Ontodog:  a  web-based
series of still to be defined use case derived competency                    ontology  community  view  generation  tool,”  Bioinformatics, vol. 30, no.
questions.                                                                   9, pp. 1340–1342, May 2014.
                                                                        [15] Z. Xiang, M. Courtot, R. R. Brinkman, A. Ruttenberg, and Y. He,
    We have described our work on ICO, a preliminary                         “OntoFox:  web-based  support  for  ontology  reuse,”  BMC Res Notes, vol.
ontology of informed consent that provides general                           3, p. 175, Jun. 2010.
                                                                        [16] NLM, "U.S. National Library of Medicine (NLM). Unified Medical
classification of content contained in general informed consent              Language           System           (UMLS®)             Metathesaurus           [version
documents. It requires expansion, revisions and collaboration                2013AA][Internet]. Bethesda (MD): National Library of Medicine,"
to build a robust model, and to move toward a representation of              2013.
the complex area of biobank data sharing and specimen                   [17] P. L. Whetzel, N. F. Noy, N. H. Shah, P. R. Alexander, C. Nyulas, T.
release. We hope to collaborate with the broader community in                Tudorache,   and   M.   A.   Musen,   “BioPortal:   enhanced   functionality   via  
                                                                             new Web services from the National Center for Biomedical Ontology to
this effort.                                                                 access  and  use  ontologies  in  software  applications,”   Nucleic Acids Res,
                                                                             vol. 39, no. Web Server issue, pp. W541–W545, Jul. 2011.
                     ACKNOWLEDGMENT                                     [18] Z.   Xiang,  C.  Mungall,   A.   Ruttenberg,   and   Y.   He,   “Ontobee:   A   Linked  
    This research was supported by a University of Michigan                  Data   Server   and   Browser   for   Ontology   Terms.,”   in   Proceedings of the
                                                                             2nd International Conference on Biomedical Ontologies (ICBO),
interdisciplinary research award (MCubed) and by the National                Buffalo, NY, USA, 2011, pp. 279–281.
Center for Advancing Translational Sciences of the National             [19] D.   B.   Fridsma,   J.   Evans,   and   C.   N.   Mead,   “The   BRIDG   Project:   A  
Institutes of Health under Award Number 2UL1TR000433-06.                     Technical   Report,”   Journal of the American Medical Informatics
The content is solely the responsibility of the authors and does             Association, vol. 15, no. 2, p. 130, 2008.
                                                                        [20] I. Sim, S. W. Tu, S. Carini, H. P. Lehmann, B. H. Pollock, M. Peleg, and
not necessarily represent the official views of the National                 K.   M.   Wittkowski,   “The   Ontology   of   Clinical   Research   (OCRe):   An  
Institutes of Health.                                                        informatics   foundation   for   the   science   of   clinical   research,”   Journal of
                                                                             Biomedical Informatics, Nov 13 2013.




                                                               	
  63