=Paper= {{Paper |id=Vol-2969/paper6-s4biodiv |storemode=property |title=EcoPortal: An Environment for FAIR Semantic Resources in the Ecological Domain |pdfUrl=https://ceur-ws.org/Vol-2969/paper6-s4biodiv.pdf |volume=Vol-2969 |authors=Xeni Kechagioglou,Lucia Vaira,Pierfrancesco Tomassino,Nicola Fiore,Alberto Basset,Ilaria Rosati |dblpUrl=https://dblp.org/rec/conf/jowo/KechagioglouVTF21 }} ==EcoPortal: An Environment for FAIR Semantic Resources in the Ecological Domain== https://ceur-ws.org/Vol-2969/paper6-s4biodiv.pdf
EcoPortal: An Environment for FAIR Semantic Resources in the
Ecological Domain
Xeni Kechagioglou 1, Lucia Vaira 1, Pierfrancesco Tomassino 2, Nicola Fiore 1, Alberto Basset
1,2,3
      and Ilaria Rosati 2
1
  LifeWatch ERIC, Service Centre, S.P. Lecce-Monteroni – Ecotekne, Lecce, 73100, Italy
2
  Institute of Research on Terrestrial Ecosystems (IRET), National Research Council (CNR), Via Salaria Km
29.300, 00015 Monterotondo Stazione, Rome, Italy
3
  University of Salento, Department of Biological and Environmental Sciences and Technologies, S.P. Lecce-
Monteroni – Ecotekne, Lecce, 73100, Italy


                                  Abstract
                                  Ecological semantic resources are being increasingly required by researchers. To improve their
                                  FAIRness, LifeWatch ERIC launched a repository to collect, describe, display, manage and
                                  facilitate semantic resource discovery, called EcoPortal. Based on OntoPortal, EcoPortal has
                                  achieved the key functionality targets commonly expected by knowledge organisation system
                                  repositories. In addition, it allows for resource updating through an integrated editor,
                                  VocBench, while published resources may also obtain a Digital Object Identification for citing.

                                  Keywords 1
                                  Ecology, biodiversity, FAIR, semantic resource, knowledge organisation system, EcoPortal

1. Introduction
   Perceiving the necessity for collaboration and integration to overcome fragmented and siloed
ecological data [1], research has been investing in creation and exploitation of semantic resources.
Formalised knowledge organisation systems (KOS) of a range of intricacy may now be encountered,
from simple lists of local species to domain thesauri, all aiming at systematising knowledge and aligning
information. With advancements in data collection and the accumulation of vast amounts of data to be
consumed, this need for homogeneity has nowadays become even more pressing [2].
   To serve the purpose of knowledge integration, targets for data producers and publishers have been
summarised in the four main principles of FAIRness: Findability, Accessibility, Interoperability, and
Reusability [3]. For semantic resources, these principles have been translated into ten rules for accurate
communication across the users and producers of ecological information [4]. Among others, the rules
make it explicit that FAIR KOS be available from at least one repository recognised by the target
community, with adequate metadata, resource mappings, feedback mechanisms and proper citation.
   This paper presents EcoPortal, a repository implemented to host KOS, promote discoverability,
render content and metadata accessible, increase interoperability, and facilitate reuse.

2. The EcoPortal environment



S4BioDiv 2021: 3rd International Workshop on Semantics for Biodiversity, held at JOWO 2021: Episode VII The Bolzano Summer of
Knowledge, September 11-18, 2021, Bolzano, Italy
EMAIL:       xeni.kechagioglou@lifewatch.eu       (X.    Kechagioglou);      pierfrancesco.tommasino@iret.cnr.it      (P.     Tommasino);
lucia.vaira@lifewatch.eu (L. Vaira); nicola.fiore@lifewatch.eu (N. Fiore); alberto.basset@lifewatch.eu (A. Basset); ilaria.rosati@cnr.it (I.
Rosati)
ORCID: 0000-0002-5935-6074 (L. Vaira); 0000-0002-9538-2966 (N. Fiore); 0000-0002-3603-9316 (A. Basset); 0000-0003-3422-7230 (I.
Rosati)
                               © 2021 Copyright for this paper by its authors.
                               Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Wor
    Pr
       ks
        hop
     oceedi
          ngs
                ht
                I
                 tp:
                   //
                    ceur
                       -
                SSN1613-
                        ws
                         .or
                       0073
                           g

                               CEUR Workshop Proceedings (CEUR-WS.org)
    The initiative to implement a repository for ecological semantic resources was taken by LifeWatch
Italy and the Long-Term Ecosystem Research Network in Europe (LTER-Europe) [5]. The physical
realization of EcoPortal (ecoportal.lifewatch.eu) was launched by the European consortium for e-
science infrastructure for biodiversity and ecosystem research (LifeWatch ERIC). The domain niche
specified for the repository is ecology, particularly biodiversity and ecosystem research, as well as
biological and physical environmental data observations. It pertains all fields relevant to the study of
ecosystems, including both their biotic and abiotic components and their interactions.
    Biodiversity is also a theme of interest to AgroPortal [6], although there it is intended as auxiliary
to the main domains: agronomy and plants. Similarly, the GFBio Terminology Service [7] lists largely
relevant KOS, although its thematic scope goes beyond that of EcoPortal, to include e.g., geosciences
and chemistry. The EcoPortal niche laps slightly over also with the repository of biomedical ontologies
BioPortal [8], whose collection includes a few KOS on phenotype, biotic and abiotic stress, and
organism diversity. This overlap of fields of interest is not a drawback. On the contrary, it may serve as
incentive for resource harmonization, development of cross-portal services and even detection of novel
affinities. Nor may it be foreseen to shrink; if anything, one may expect niche expansion for EcoPortal
to encompass new directions, as demonstrate indicative cases such as species conservation [9].
    To meet the functional scope commonly expected by KOS repositories, and thus differentiate from
KOS registries that merely store and retrieve metadata about semantic assets [10], EcoPortal adheres to
a set of key functionalities specified in [11]. The main components of the EcoPortal environment are
the repository deployment infrastructure OntoPortal and the VocBench platform for semantic resource
development, both coming with compatibility with a Resource Description Framework (RDF) store.

2.1.    Basic Functionalities
   The basic functionalities of EcoPortal are based on OntoPortal (deployed version 2.5), a VMware
Virtual Appliance infrastructure [12] currently maintained by the Stanford Center for Biomedical
Informatics Research (BMIR). It is advanced by and for the collaborative OntoPortal Alliance [13], of
which LifeWatch ERIC is an active member. Its basic tools cater for browsing, searching, publishing,
annotating, mapping, and receiving recommendations for semantic assets.
Figure 1: The browsing interface of EcoPortal. Central to the view is the list of published KOS with an
indication of their size. Users may restrict their search by ecological categories (among others), and
they may also query the list by text.

   Browse
    Browsing (Figure 1) is assisted by matching user-inserted text to the title or the description of the
KOS. Filtering is accomplished by means of EcoPortal-specific categories (Table 1), currently mostly
coinciding with GEMET (GEneral Multilingual Environmental Thesaurus) concepts [14] and intended
to be updated according to user needs. Upon location of a KOS of interest, users may view its metadata,
as well as visualise its hierarchical and overall structure and content.

Table 1
EcoPortal categories for filtering the resources during browsing
  Category Name       Source         Category Name        Source        Category Name          Source
       Ecology        GEMET            Biodiversity       GEMET         Earth sciences        EcoPortal
  Applied ecology     GEMET            Biodiversity       GEMET         Environmental         EcoPortal
                                      conservation                         sciences
   Forest ecology     GEMET             Terrestrial       GEMET         Oceanography          EcoPortal
                                       biodiversity
     Landscape        GEMET       Aquatic biodiversity    GEMET
       ecology
    Plant ecology     GEMET        Urban biodiversity     GEMET
     Population       GEMET         Soil biodiversity     GEMET
       ecology
     Synecology       GEMET        Forest biodiversity    GEMET
   Urban ecology      GEMET         Species diversity    EcoPortal
  Human ecology       GEMET       Ecosystem diversity    EcoPortal
  Animal ecology      GEMET       Functional diversity   EcoPortal
  Aquatic ecology     GEMET         Genetic diversity     GEMET
   Palaeoecology      GEMET
  Trophic ecology     GEMET

   Search, Mappings, Annotator, Recommender
    Searching for specific concepts in the entire list of KOS also uses the above ecological categories to
filter results. Mapping of similar concepts across EcoPortal resources is based on text-matching concept
labels, while text analysis for annotation operates by string matching and by leveraging the KOS
hierarchical structure and mappings [15]. Equally important for resource reuse, the Recommender
service evaluates relevance of each EcoPortal KOS with regard to keywords or free text inserted by the
user. The tool builds on the annotation service, as well as on statistical and quality metrics of the
resources and of the annotation output, before assigning criteria weights and computing scores [16].

   Publish
    Publishing a new KOS on EcoPortal is at the heart of the repository’s existence. The resources have
metadata associated with them, submitted to the system through an online form upon uploading. The
required elements follow a schema (Table 2) that, having started with the BioPortal Metadata ontology
[17], was adjusted with elements from the extended AgroPortal schema [18] and was completed with
fields from the DataCite Metadata Schema (v.4.3) [19]. Albeit all elements were selected to serve
FAIRness, the DataCite ones in particular are meant for Digital Object Identifier (DOI) assignment to
the uploaded KOS.
    Uploading of a KOS is not a prerequisite to its indexing in EcoPortal. In fact, the cataloguing aspect
of the repository is independent of whether the resource resides in it or not, as long as sufficient
information for successful redirection is provided. Uploaded resources may further be marked as public
or private, imposing restrictions on their accessibility, while persons indicated as contacts may
optionally receive feedback in the form of comments and proposals from users of their resource.


Table 2
The main metadata elements for KOS description in EcoPortal
              Metadata element                  Description                      Source
                      Name                   Name of the KOS                    BioPortal
                    Acronym                 Acronym of the KOS                  BioPortal
                   Description             Description of the KOS               BioPortal
                      Status             Release status, e.g., alpha            BioPortal
                     Version                    KOS version                    AgroPortal
                     Format                     KOS format                     AgroPortal
                     Visibility     Viewing restrictions (public/private)      AgroPortal
                     Contact        Name and e-mail of contact person          AgroPortal
                   Homepage                    Homepage URL                    AgroPortal
                  Publications        URL for bibliographic reference          AgroPortal
                Documentation         URL for further documentation            AgroPortal
                   Categories          Controlled list with categories         AgroPortal
                     Groups             Controlled list with groups            AgroPortal
                  Release date        Date of upload to the EcoPortal          AgroPortal
                    Identifier          Unique string ID (e.g., DOI)            DataCite
                 Identifier type             Type of identifier                 DataCite
                     Creators         Actors involved in KOS creation           DataCite
                      Titles           Alternative names of the KOS             DataCite
                    Publisher                 Publishing entity                 DataCite
                 Resource type            KOS type, e.g., ontology              DataCite
             Resource type general            Dataset or other                  DataCite


2.2.    EcoPortal extended functionalities
   VocBench
   Recognising the need for editing and updating EcoPortal resources with as few steps as possible, the
VocBench 3 (VB3) development platform has been integrated. VB3 - deployed version 8.02 - is an
open-source editing environment for ontologies, thesauri and RDF datasets. Initially created to support
the needs of the AGROVOC thesaurus, it offers itself as a web-based platform [20].
   Connection between VocBench and EcoPortal user accounts is established via a personal API key
assigned to the user upon registration to EcoPortal. Users may update the resource in VocBench when
and as deemed suitable; once the new version is ready to be submitted in EcoPortal, users may do so
from VocBench, through a form (Figure 2) that contains the same metadata fields as the EcoPortal
Publish tool does; in this way, KOS deployment through VocBench automatically updates EcoPortal.
   Beyond employing a straightforward process for new KOS versions, advantages to using VocBench
include better development control. Users assigned to a collaborative KOS creation project may be
delegated roles and get authorisation to perform specific tasks only, while all edits undergo a validation
process. Last, VocBench comes with a SPARQL endpoint for querying KOS structures and linked data.

   Digital Object Identifier
   LifeWatch ERIC is a direct member of DataCite and has the authorisation to provide a DOI to
ecological semantic resources that fulfil certain quality criteria, in order to be cited in a reliable and
constant way. KOS owners may request this upon publication of their resource on the EcoPortal
repository. DOI assignment is free of charge and is deemed fundamental to the FAIRness of resources.




Figure 2: The EcoPortal metadata schema for KOS description is respected whether resource
submission occurs through EcoPortal or through VocBench.


3. Conclusion and outlook
   The EcoPortal environment for FAIR semantic resources in the ecological domain currently fulfils
functionalities commonly required by KOS repositories, while it also offers unique capabilities that
increase reuse of the hosted assets. Its realisation is incorporated in the broader mission of LifeWatch
ERIC to support with e-science infrastructure the scientific community in biodiversity and ecosystem
research. Next steps in its establishment involve cases of utilisation by the end users to guide
implementation of new repository features, as well as component upgrade and consolidation in a new
LifeWatch ERIC platform.
   End users play in fact a key role in EcoPortal advancement, as identification of needs and design for
improvement may only be achieved through stepping-up the engagement of the ecological community
and the exchange of feedback. In this way, the EcoPortal environment may serve its purpose as the
entry point for KOS discovery, with the aim to use them in annotation of Digital Objects, in enrichment
of data, in reuse of information and services, and in FAIR maintenance of resources and components.

4. References

[1] J. N. Thompson, O. J. Reichman, P. J. Morin, G. A. Polis, M. E. Power, R. W.Sterner, C. A. Couch,
     L. Gough, R. Holt, D. U. Hooper, F. Keesing, C. R.Lovell, B. T. Milne, M. C. Molles, D. W.
     Roberts, S. Y. Strauss, Frontiers of Ecology, Bioscience (1999) 15(1) 15-24. doi:10.1641/0006-
     3568(2001)051[0015:FOE]2.0.CO;2.
[2] C. Barba-González, J. García-Nieto, M.M. Roldán-García, I. Navas-Delgado, A.J. Nebro, J.F.
     Aldana-Montes, BIGOWL: Knowledge centered Big Data analytics, Expert Systems With
     Applications 115 (2019) 543-556. doi:10.1016/j.eswa.2018.08.026.
[3] M. D. Wilkinson, M. Dumontier, I. J. Aalbersberg, G. Appleton, M. Axton, A. Baak, N. Blomberg,
     et al., The FAIR Guiding Principles for scientific data management and stewardship, Scientific
     Data (2016) 3(1):160018. doi:doi.org/10.1038/sdata.2016.18.
[4] S. J. D. Cox, A. N. Gonzalez-Beltran, B. Magagna, M.-C. Marinescu, Ten simple rules for making
     a vocabulary FAIR. PLoS Computational Biology (2021) 17(6):e1009041. doi:
     10.1371/journal.pcbi.1009041.
[5] N. Fiore, B. Magagna, D. Goldfarb, EcoPortal: A proposition for a semantic repository dedicated
     to ecology and biodiversity, in: Proceedings of the 2nd International Workshop on Semantics for
     Biodiversity, S4BioDiv ’17, CEUR Workshop Proceedings Vol-1933 (CEUR-WS.org), Vienna,
     Austria, 2017.
[6] C. Jonquet, A. Toulet, E. Arnaud, S. Aubind, E. Dzalé Yeumod, V. Emoneta, J. Graybealf, et al.,
     AgroPortal: A vocabulary and ontology repository for agronomy, Computers and Electronics in
     Agriculture 144 (2018) 126–143. doi:10.1016/j.compag.2017.10.012.
[7] N. Karam, C. Müller-Birn, M. Gleisberg, D. Fichtmüller, R. Tolksdorf, A. Güntsch, A
     Terminology Service supporting semantic annotation, integration, discovery and analysis of
     interdisciplinary research data, Datenbank Spektrum 16 (2016) 195–205. doi: 10.1007/s13222-
     016-0231-8.
[8] P. L. Whetzel, N. F. Noy, N. H. Shah, P. R. Alexander, C. Nyulas, T. Tudorache, M. A. Musen,
     BioPortal: Enhanced functionality via new Web services from the National Center for Biomedical
     Ontology to access and use ontologies in software applications, Nucleic acids research, 39 (Web
     Server issue) (2011) W541–W545. doi:10.1093/nar/gkr469.
[9] R. E. Hintzen, M. Papadopoulou, R. Mounce, C. Banks-Leite, R. D. Holt, M. Mills, et al.,
     Relationship between conservation biology and ecology shown through machine reading of 32,000
     articles, Conservation Biology, 34(3) (2019) 721–732. doi:10.1111/cobi.13435.
[10] J. Voß, J. M. Agne, U. Balakrishnan, M. Akter, Terminology registries and services, Poster, The
     Research      Data    Alliance    Deutschland     Treffen,    RDA-DE-16,      Potsdam,     2016.
     doi:10.5281/zenodo.166717.
[11] M. d'Aquin, N. F. Noy, Where to publish and find ontologies? A survey of ontology libraries, Web
     Semantics: Science, Services and Agents on the World Wide Web 11 (2012) 96–111.
     doi:10.1016/j.websem.2011.08.005.
[12] J. Graybeal, A. Skrenchuk, J. Vendetti, M. Dorf, C. Jonquet, N. Fiore, X. Yang, M. A. Musen,
     OntoPortal Virtual Appliance and Community Alliance, Poster n.74, Research Data Alliance
     Plenary 14, 2020. URL: https://ontoportal.org/ontoportal-alliance-at-rda-plenary-14/.
[13] J. Graybeal, C. Jonquet, N. Fiore, M. A. Musen, OntoPortal software: Adoption increasing, Poster,
     Research Data Alliance Plenary 13, 2019. URL: https://ontoportal.org/ontoportal-alliance-at-rda-
     plenary-13/.
[14] European Environment Information and Observation Network – EIONET, General Multilingual
     Environmental Thesaurus (GEMET) v.4.2, 2021. URL: https://www.eionet.europa.eu/gemet/en/
     about/.
[15] N. H. Shah, N. Bhatia, C. Jonquet, D. Rubin, A. P. Chiang, M. A. Musen, Comparison of concept
     recognizers for building the Open Biomedical Annotator, BMC Bioinformatics (2009) 10(Suppl
     9):S14. doi: 10.1186/1471-2105-10-S9-S14.
[16] M. Martínez-Romero, C. Jonquet, M. J. O’Connor, J. Graybeal, A. Pazos, M. A. Musen, NCBO
     Ontology Recommender 2.0: An enhanced approach for biomedical ontology recommendation,
     Journal of Biomedical Semantics (2017) 8(21) 1-22. doi: 10.1186/s13326-017-0128-y.
[17] National Center for Biomedical Ontology (NCBO) public wiki, Architecture, 2010. URL:
     https://www.bioontology.org/wiki/Architecture.
[18] C. Jonquet, A. Toulet, B. Dutta, V. Emonet, Harnessing the power of unified metadata in an
     ontology repository: The case of AgroPortal. Journal on Data Semantics 7 (2018) 191–221. doi:
     10.1007/s13740-018-0091-5.
[19] DataCite Metadata Working Group, DataCite Metadata Schema documentation for the publication
     and citation of research data, Version 4.3, DataCite e.V., 2019. doi:10.14454/7xq3-zf69.
[20] A. Stellato, M. Fiorelli, A. Turbati, T. Lorenzetti, W. Gemert, D. Dechandon, et al., VocBench 3:
     A collaborative Semantic Web editor for ontologies, thesauri and lexicons, Semantic Web 5 (2020)
     1-27. doi:10.3233/SW-200370.