<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>IWSG</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Enacting Open Science by gCube</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Massimiliano Assante</institution>
          ,
          <addr-line>Leonardo Candela, Donatella Castelli, Gianpaolo Coro, Francesco Mangiacrapa, Pasquale Pagano</addr-line>
          ,
          <institution>Costantino Perciante Istituto di Scienza e Tecnologie dell'Informazione “A. Faedo” - Consiglio Nazionale delle Ricerche via G. Moruzzi</institution>
          ,
          <addr-line>1 - 56124, Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <volume>19</volume>
      <fpage>19</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>-The Open Science movement is promising to revolutionise the way science is conducted with the goal to make it more fair, solid and democratic. This revolution is destined to remain just a wish if it is not supported by changes in culture and practices as well as in enabling technologies. This paper describes the gCube offering to enact Open Sciencefriendly Virtual Research Environments. In particular, the paper describes how a complete solution suitable for realising Open Science practices is achieved by a social networking collaborative environment in conjunction with a shared workspace, an open data analytics platform, and a catalogue enabling FAIR principles on every research artefact.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        Open Science is promising to revolutionise the entire
science enterprise by envisioning and proposing new practices
aiming at making science better [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]–[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The promised
benefits include (i) better interpretation, understanding and
reproducibility of research activities and results; (ii) enhanced
transparency in science lifecycle and improvement in
“scientific fraud” detection; (iii) reduction of the overall cost of
research, by promoting re-use of results; (iv) introduction of
comprehensive and fair scientific reward criteria capturing all
facets and contributions in research life-cycle; and (v) better
identification and assessment of scientific results within the
“tsunami of scientific literature” witnessed by scientists.
      </p>
      <p>These benefits are destined to remain a wish if the research
community as a whole (funding agencies, research performing
organizations, publishers, research infrastructures, scientists,
as well as citizens) does not fully embrace Open Science
and put in place efforts and initiatives aiming at making it
the norm. The good news is that the movement is gaining
momentum and consensus. In fact, funding agencies are
developing policies supporting its implementation as well as are
supporting the development of infrastructures and services.
Moreover, research infrastructures start offering services and
facilities going in the direction of Open Science. Scientists try
to overcome the limitations of scholarly communication
practices by relying on services and technologies to “publish”
nontraditional research artefacts (datasets, workflows, software).
The bad news is that the implementation of this movement is
confronting with a number of barriers including: (a) cultural
factors, e.g. the fear to lose the control and exploitation of
datasets; (b) cost-based factors, e.g. the extra-effort needed
to make a research-artefact exploitable by a user other than
the initial owner; and (c) disincentive factors, e.g. the effort
spent in “publishing” research artefacts going beyond papers
is receiving little or no value in researchers assessment and
career success.</p>
      <p>
        This paper describes the solution proposed by gCube [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to
overcome some of the above mentioned barriers – in particular
those tied to cost-based arguments – by providing researchers
and practitioners with a working environment where Open
Science practices are transparently promoted. gCube is a
software system specifically conceived to enable the
construction and development of Virtual Research Environments
(VREs) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], i.e. web-based working environments tailored to
support the needs of their designated community working on
a research question. Beside providing their users with the
domain-specific facilities, i.e. datasets and services suitable
for the research question, each VRE is equipped with basic
services supporting collaboration and cooperation among its
users, namely: (i) a shared workspace to store and organise
any version of a research artefact; (ii) a social networking area
to have discussions on any topic (including working version
and released artefacts) and be informed on happenings; (iii)
a data analytics platform to execute processing tasks either
provided by a user or provided by others to be applied to
users’ cases and datasets; and (iv) a catalogue-based publishing
platform to make the existence of a certain artefact public and
disseminated. These facilities are at the fingerprint of VRE
users. They continuously and transparently capture research
activities, authors and contributors, as well as every by-product
resulting from every phase of a typical research lifecycle
thus reducing the issues related with Open Science and its
communication [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>II. RELATED WORKS</title>
      <p>
        There are plenty of tools and approaches supporting Open
Science [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. They include (a) repositories maintaining different
versions of datasets and software to promote their citation
and reuse, e.g. Dryad, GitHub, Zenodo, figshare; (b) tools
aiming at promoting and enacting the production of new
forms of publications to make the release of research results
more effective and comprehensive, e.g. interactive notebooks
and enhanced publications [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]; (c) tools aiming at making
more open, transparent, holistic and participative the research
assessment process, e.g. open peer review, post-publication
review, annotation and commenting tools, social networks for
scientists like ResearchGate [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. One of the major barrier
preventing the systematic exploitation and uptake of these
tools by scientific communities and application contexts is
related to their “fragmentation”, scientists have to jump across
several platforms to get a complete picture of a research
activity and its current and future results. Whenever possible,
the “pieces” resulting from a research activity are linking each
other, either by explicit links or by derived links, e.g. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
However this link-based mechanism is quite fragile and costly
to keep healthy and up to date, and this is leading to research
packaging formats [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        Scientific workflows technologies [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] are adopted by an
increasing number of communities to automate scientific
methods and procedures. Environments enabling to publish and
share workflows exist, yet the guarantees that the method
captured by the workflow seamlessly works in settings other
then the originator ones are limited. Moreover, the act of
publishing workflows is not systematic across communities
and contexts.
      </p>
      <p>The Open Science Framework is a web-based service
enabling to keep files, data, and protocols pertaining to any
user defined project in a single, shared place. It provides for
credits, citation and versioning as well as for carefully deciding
what is going to be shared with whom. This platform share
commonalities with the gCube based set of facilities described
in this paper, e.g. a project is like a VRE, yet the VRE based
approach make available to its users in one place the entire set
of facilities and services they need to perform their research,
thus going beyond the pure sharing of material.</p>
    </sec>
    <sec id="sec-3">
      <title>III. VIRTUAL RESEARCH ENVIRONMENTS BASED</title>
      <p>SCIENTIFIC WORKFLOWS</p>
      <p>Figure 1 depicts the main components and facilities offered
by gCube to support collaborative activities and enable Open
Science practices, namely a shared workspace (cf. Sec. III-A),
a social networking area (cf. Sec. III-B), a data analytics
platform (cf. Sec. III-C), and a publishing platform (cf. Sec.
III-D). These components are all correlated each other and
realize a “system” where (i) research artefacts seamlessly flow
across the various components to be “managed” according to
the components’ purpose, e.g. being openly discussed by social
networking practices, (ii) research artefacts are continuously
enriched and enhanced with metadata capturing their entire
lifecycle, their versions, and the detailed list of authors and
tasks performed leading to the current development status and
shapes.</p>
      <p>
        All these components (a) are conceived to operate in a
well defined application context corresponding to the Virtual
Research Environment they are serving, i.e. the VRE members
are the primary researchers and practitioners expected to have
fully-fledged access to the artefact shared by a VRE; (b)
are conceived to open up research artefacts, independently of
their maturity level, beyond VRE boundaries (no lock-in) yet
according to artefact owner policies, i.e. it is up to artefact
owners to decide when a certain item resulting from a research
activity is “ready” to be released and how (only metadata,
rolebased access to payloads, usage license); (c) are operated by
relying on an infrastructure that guarantees a known quality of
service thus promoting community uptake, i.e. scientists might
be reluctant to migrate their working environment towards
innovative and “cloud”-based ones [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], the proposed facilities
should be as much as possible easy to use, protect consolidated
practices, and guarantee that scientists continue to get on with
their daily job.
      </p>
      <p>Fig. 2. gCube-based Open Science Workflow</p>
      <p>A prototypical and simplified scientific workflow enacted
by these components is (cf. Fig. 2): (i) Dr. Smith is willing to
investigate the impact of a certain alien species in the
Mediterranean sea and announces this willingness by a post (social
networking); (ii) Dr. Green and Dr. Rossi start collaborating
with Dr. Smith by organising and populating a shared folder
with suitable material, e.g. datasets, notes, papers (workspace);
(iii) Dr. Smith and Dr. Rossi propose two diverse models
aiming at capturing the effects of the selected species on
Mediterranean sea ecosystem, they implement and make them
available (data analytics); (iv) the availability of these
earlyresults suggests Dr. Bahl to start a study on another species he
developed a model for in the past and leads Dr. Bahl to create
another workspace folder with specific material and produce
another version of Dr. Rossi’s model; (vi) Dr. Smith, Dr. Green
and Dr. Rossi execute a large set of concurrent experiments,
make available every dataset resulting from them (workspace,
publishing), and announce their findings by also preparing
a paper. Meanwhile, Dr. Wang start re-using the model(s)
produced by Dr. Smith et al. as well as Dr. Bahl’s one to
analyse certain datasets she owns, spot a potential
implementation issue affecting all of the models, produces and publishes
corrected versions, and “annotate” the initial models with her
findings; (vii) being alerted by Dr. Wang annotation, Dr. Smith
et al. decide to re-execute their experiments on other datasets
by using both their version of the model and Dr. Wang’s one
to realise that Dr. Wang model better suits with their initial
hypothesis (all of this happen well before their paper being
published). This representative workflow can be easily and
effectively implemented only by relying on a suite of facilities
like those offered by gCube where the “place” where research
activity is conducted and the “place” the activity is published
and immediately communicated are the same. In other settings
where there is a decoupling of the “place” where research is
performed (the scientists workbench) from the place where
research is communicated, e.g. papers containing links to
supporting material, the implementation of this scenario is
more challenging and expensive, if feasible at all.</p>
      <sec id="sec-3-1">
        <title>A. The Workspace Platform</title>
        <p>
          Figure 3 depicts the user interface of the workspace facility,
i.e. the area VRE users rely on to organise their material and
have access to the material shared with others. It resembles
a typical file system with files organised in folders, yet it
supports an open-ended set of items that are (a) equipped with
rich and extensible metadata and (b) actually stored by an array
of storage solutions [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          Figure 4 depicts the software architecture characterising the
workspace facility. This facility relies on Apache Jackrabbit
for storing and managing workspace items – actually their
metadata – by means of specific node types and attributes as
“key, value” pairs. Items payload is stored by relying on a
hybrid storage solution [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] that, by means of ad-hoc plugins,
exploits various storage solutions suitable for diverse
typologies of content, e.g. MongoDB for binary files, GeoServer
and THREDDS Data Servers for geospatial data, RDB for
tabular data. In addition to the portlet previously discussed,
the workspace facility is offered by (i) a widget suitable for
integrating the workspace facility in other applications (e.g. it
is exploited by the Analytics Platform Portlet to give seamless
access to workspace items), and (ii) a RESTful API suitable
for any web-based programmatic access.
        </p>
        <p>Fig. 4. gCube Workspace Platform Architecture</p>
        <p>The distinguishing features of this platform for Open
Science are the following: (i) every workspace item is equipped
with an actionable unique identifier that can be used for
citation and access purposes; (ii) every workspace item is
versioned and a new version is automatically produced
whenever the item is explicitly changed by the user or any
application/service of the VRE on behalf of an authorised user;
(iii) every item, be it a single item or a folder, is equipped
with rich and extensible metadata (“key, value” pairs) that
capture descriptive features as well as lineage features; (iv)
three typologies of folders are supported: private, content is
available for the owner only; shared, content is available for
selected users decided by the owner; and VRE folder, content
is available to VRE members; (v) the workspace is tightly
integrated with both the social networking and catalogue for
easing the dissemination of its artefact (either single items or
groups of items).</p>
      </sec>
      <sec id="sec-3-2">
        <title>B. The Social Networking Collaborative Platform</title>
        <p>Figure 5 depicts the user interface of the social networking
area, i.e. the area VRE users rely on to communicate with
their VRE co-workers and be informed on others
achievements, discussions and opinions. It resembles a social network
with posts, tags, mentions, comments and reactions, yet its
integration with the rest makes it a powerful and flexible
communication channel for scientists.</p>
        <p>
          Figure 6 depicts the software architecture characterising the
social networking collaborative platform. This facility relies
on the Social Networking Engine, a Cassandra database [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]
for storing social networking related data and Elasticsearch
[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] for the retrieval of social networking data. The Engine
exposes its facilities by an HTTP REST Interface and
comprises two services: (i) the Social Networking Service that
efficiently store and accesses social networking data (Posts,
Comments, Notifications, etc.) in the underlying Cassandra
Cluster. and (ii) the Social Networking Indexer Service that
builds Elasticsearch indices to perform search operations over
the social networking data.
The distinguishing features of this platform for Open
Science are the following: (i) every item is equipped with an
actionable unique identifier that can be used for citation and
access purposes; (ii) the discussion patterns enabled are really
transparent and open; every (re)action performed by a user –
be it a new post, a reply to a post, or the rating of a certain post
or post reply – is carefully captured and documented; (iii) there
is no pre-defined way to structure a discussion; users can start
new discussion threads, annotate them with tags for easing the
cataloguing and discovery, refer to other threads and material
both internally stored and available on the web.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>C. The Data Analytics Platform</title>
        <p>Fig. 7. gCube Data Analytics Platform screenshot</p>
        <p>
          Figure 8 depicts the software architecture characterising the
analytics platform. The DataMiner Master is a web service
in charge to accept requests for executing processes and
executing them, either locally or by relying on the DataMiner
Worker(s) depending from the specific process. The service is
conceived to work in a cluster of replica services operating
behind a proxy acting as load balancer. It is offered by a
standard web-based protocol, i.e. OGC WPS1; The DataMiner
Worker is a web service in charge to execute the processes it
is assigned to by a Master. The service is conceived to work in
a cluster of replica services and is offered by a standard
webbased protocol, i.e. OGC WPS. Both the services are conceived
to be deployed and operated by relying on various providers,
e.g. Master and Worker instances can be deployed on private
or public cloud providers. DataMiner Master and Worker
instances execute processes based on an open set of algorithms
hosted by a dedicated repository, the DataMiner Algorithms
Repository. Two kinds of algorithms are hosted: “local” and
“distributed” algorithms. Local algorithms are directly
executed on a DataMiner Master instance and possibly use parallel
processing on several cores and a large amount of memory.
Distributed algorithms use distributed computing with a
MapReduce approach and rely on the DataMiner Worker instances
in the Worker cluster. The Algorithm Importer portlet and
the Algorithm Publisher service enable users to inject new
algorithms into the platform by using various programming
languages [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
        <p>The distinguishing features of this platform for Open
Science are the following: (i) every process hosted by the platform
is equipped with an actionable unique identifier that can be
used for citation and access purposes; (ii) the offering and
publication of user provided processes (e.g. scripts, compiled
programs) by an as-a-Service standard-based approach
(pro1OGC Web Processing Servicehttp://www.opengeospatial.org/standards/
wps
cesses are described and exposed by the OGC Web Processing
Service standard); (iii) the ability to manage and support
processes produced by using several programming languages
(e.g. R, Java, Fortran, Phyton); (iv) the automatic production of
a detailed provenance record for every analytics task executed
by the platform, i.e. the overall input/output data, parameters,
and metadata that would allow to reproduce and repeat the
task are stored into the workspace and documented by a
PROV-O-based accompanying record; (v) integration with the
shared workspace to implement collaborative experimental
spaces, e.g. users can easily share datasets, methods, code; (vi)
support for Cloud computing using a Map-Reduce approach
for computing and data intensive processing; (vii) extensibility
of the platform to quasi-transparently rely on and adapt to a
distributed, heterogeneous and elastically provided array of
workers to execute the processing tasks.</p>
      </sec>
      <sec id="sec-3-4">
        <title>D. The Publishing Platform</title>
        <p>Figure 9 depicts the user interface of the publishing
platform, i.e. the facility VRE users rely on to announce and be
informed on the availability of certain artefacts at diverse
maturity levels. It resembles a catalogue of artefacts with search
and browse, yet the openness with respect to the typologies
of products published, the metadata to document them as well
as the integration with the rest make it a flexible environment.
Every published item in the catalogue is characterised by (i) a
type, which highlights its features and allows an easier search,
(ii) an open ended set of metadata which carefully describe
the item, and (iii) optional resource(s) representing the actual
payload of the item.</p>
        <p>Figure 10 depicts the software architecture characterising
the publishing platform. This platform primarily relies on
CKAN technology, i.e. an open source software enabling to
build and operate open data portals / catalogues 2. This core
technology has been wrapped and extended by means of the
Catalogue Service, a component realising the business logic
of the publishing platform. The Catalogue Service enact the
management of Catalogue Item Types, i.e. specifications of
diverse typologies of items supported. Each catalogue item
type carefully defines the metadata elements characterising
the item typology by specifying the names of the attributes,
the possible values, whether an attribute is single instance
or repeatable. In addition to that, each item type contains
2CKAN technology website https://ckan.org/</p>
        <p>Fig. 9. gCube Data Publishing Platform screenshot
directives on how to exploit attributes for items
organisational purposes, e.g. automatically transform values in tags
or exploit the values for creating collections or groups of
items. On top of this Catalogue Service, gCube offers several
components to make publication of items easier for VRE users
and services. A Catalogue Portlet, accessible in each VRE,
allows to navigate the catalogue content as well as to publish
content by exploiting the Publishing Widget. This widget is
also embedded into the Workspace portlet, so users can publish
folders and/or files directly from there. External services can
access the catalogue content and publish new items via the
gCube Catalogue RESTful APIs. The Catalogue Service relies
on the Workspace and Storage Hub (cf. Sec. III-A) for storing
the payload of the published items.</p>
        <p>Fig. 10. gCube Publishing Platform Architecture</p>
        <p>The distinguishing features of this platform for Open
Science are the following: (i) every catalogue item is equipped
with an actionable, persistent, unique identifier that can be
used for citation and access purposes; (ii) whenever a
catalogue item is published, the associated payload(s) is stored in
a persistent storage area to guarantee its long-term availability;
(iii) every catalogue item is equipped with a license carefully
characterising the possible (re-)uses; (iv) every publication of
an item leads to the automatic production of a post in the social
networking area of the VRE to inform its members; (v) every
catalogue item is equipped with rich and open metadata, i.e.
it is possible to carefully customise the typologies of products
and the accompanying metadata to the community needs.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>IV. CONCLUSION AND FUTURE WORK</title>
      <p>This paper described a suite of tools overall realising Open
Science-friendly working environments. These tools support
all the phases of typical research lifecycles and transparently
inject practices aiming at making the entire process leading
to a certain version of a research artefact more transparent
and repeatable without posing additional requirements for
scientists. They are conceived to make the “publishing” act an
easy, dynamic, comprehensive, lossless and holistic task where
owners retain the control of and credit for every published
artefact that, being interlinked with other artefacts and the
working environment exploited for their production, cater for
their effective understanding and reuse. Open publishing is the
beginning of a research task rather than the concluding ones.</p>
      <p>
        These tools are an essential part of the gCube Open Source
technology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. They are offered as-a-Service by means of the
D4Science.org infrastructure [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Concrete exploitation cases
and experiences demonstrate their effectiveness, e.g. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]–[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>
        Future work include the integration with recommender
systems [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], scientific workflows [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], and research
objects [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to enlarge the possible exploitation cases and
scenarios.
      </p>
    </sec>
    <sec id="sec-5">
      <title>ACKNOWLEDGMENT</title>
      <p>This work has received funding from the European Union’s
Horizon 2020 research and innovation programme under the
AGINFRA PLUS project (Grant agreement No. 731001),
the BlueBRIDGE project (Grant agreement No. 675680), the
ENVRI PLUS project (Grant agreement No. 654182), and the
EOSCpilot project (Grant No. 739563).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Fecher</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Friesike</surname>
          </string-name>
          , “
          <article-title>Open science: One term, five schools of thought,” in Opening Science</article-title>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bartling</surname>
          </string-name>
          and S. Friesike, Eds. Springer International Publishing,
          <year>2014</year>
          , pp.
          <fpage>17</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bartling</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Friesike</surname>
          </string-name>
          , “
          <article-title>Towards another scientific revolution</article-title>
          ,” in Opening Science. Springer International Publishing,
          <year>2014</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Nosek</surname>
          </string-name>
          , G. Alter,
          <string-name>
            <given-names>G. C.</given-names>
            <surname>Banks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Borsboom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. D.</given-names>
            <surname>Bowman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Breckler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Buck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Chambers</surname>
          </string-name>
          , G. Chin, G. Christensen,
          <string-name>
            <given-names>M.</given-names>
            <surname>Contestabile</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dafoe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Eich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Freese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Glennerster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Goroff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Green</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hesse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Humphreys</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ishiyama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Karlan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kraut</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lupia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Madon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Malhotra</surname>
          </string-name>
          , E. Mayo-Wilson,
          <string-name>
            <given-names>M.</given-names>
            <surname>McNutt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Miguel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Levy</given-names>
            <surname>Paluck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Simonsohn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Soderberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Spellman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Turitto</surname>
          </string-name>
          , G. VandenBos,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vazire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Wagenmakers</surname>
          </string-name>
          , R. Wilson, and
          <string-name>
            <given-names>T.</given-names>
            <surname>Yarkoni</surname>
          </string-name>
          , “
          <article-title>Promoting an open research culture</article-title>
          ,” Science, vol.
          <volume>348</volume>
          , no.
          <issue>6242</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Assante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Castelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lelii</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Virtual research environments as-a-service by gcube</article-title>
          ,
          <source>” PeerJ Preprints</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Castelli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Virtual research environments: an overview and a research agenda,” Data Science Journal</article-title>
          , vol.
          <volume>12</volume>
          , pp.
          <fpage>GRDI75</fpage>
          -
          <lpage>GRDI81</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Nosek</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bar-Anan</surname>
          </string-name>
          , “
          <article-title>Scientific utopia: I. opening scientific communication,” Psychological Inquiry</article-title>
          , vol.
          <volume>23</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>217</fpage>
          -
          <lpage>243</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Assante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Castelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Manghi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Science 2.0 repositories: Time for a change in scholarly communication</article-title>
          ,”
          <string-name>
            <surname>D-Lib</surname>
            <given-names>Magazine</given-names>
          </string-name>
          , vol.
          <volume>21</volume>
          , no.
          <issue>1</issue>
          /2,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>B.</given-names>
            <surname>Kramer</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Bosman</surname>
          </string-name>
          , “
          <article-title>Innovations in scholarly communication - global survey on research tool usage [version 1; referees: 2 approved]</article-title>
          ,
          <source>” F1000Research</source>
          , vol.
          <volume>5</volume>
          , no.
          <issue>692</issue>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bardi</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Manghi</surname>
          </string-name>
          , “
          <article-title>Enhanced publications: Data models and information systems</article-title>
          ,
          <source>” LIBER Quarterly</source>
          , vol.
          <volume>23</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>240</fpage>
          -
          <lpage>273</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Thelwall</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Kousha</surname>
          </string-name>
          , “ResearchGate: Disseminating, communicating, and measuring scholarship?
          <source>” Journal of the Association for Information Science and Technology</source>
          , vol.
          <volume>66</volume>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>876</fpage>
          -
          <lpage>889</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Burton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Koers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Manghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fenner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Aryani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. La</given-names>
            <surname>Bruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Diepenbroek</surname>
          </string-name>
          , and U. Schindler, “
          <article-title>The scholix framework for interoperability in data-literature information exchange</article-title>
          ,”
          <string-name>
            <surname>D-Lib</surname>
            <given-names>Magazine</given-names>
          </string-name>
          , vol.
          <volume>23</volume>
          , no.
          <issue>1</issue>
          /2,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bechhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Buchan</surname>
          </string-name>
          , D. De Roure,
          <string-name>
            <given-names>P.</given-names>
            <surname>Missier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ainsworth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bhagat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Couch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cruickshank</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Delderfield</surname>
          </string-name>
          , I. Dunlop,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gamble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Michaelides</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Owen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sufi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          , “
          <article-title>Why linked data is not enough for scientists,” Future Generation Computer Systems</article-title>
          , vol.
          <volume>29</volume>
          , no.
          <issue>2</issue>
          , pp.
          <fpage>599</fpage>
          -
          <lpage>611</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Liew</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Atkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Galea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. F.</given-names>
            <surname>Ang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and J. I. V.</given-names>
            <surname>Hemert</surname>
          </string-name>
          , “
          <article-title>Scientific workflows: Moving across paradigms,” ACM Computing Surveys</article-title>
          , vol.
          <volume>49</volume>
          , no.
          <issue>4</issue>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Armbrust</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Griffith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Joseph</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Katz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Konwinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patterson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rabkin</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Stoica</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaharia</surname>
          </string-name>
          , “
          <article-title>A view of cloud computing,” Communications of the ACM</article-title>
          , vol.
          <volume>53</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>50</fpage>
          -
          <lpage>58</lpage>
          , Apr.
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Carpenter</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Hewitt</surname>
          </string-name>
          , Cassandra: The Definitive Guide, 2nd
          <string-name>
            <surname>Edition. O'Reilly Media</surname>
          </string-name>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Gormley</surname>
          </string-name>
          and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tong</surname>
          </string-name>
          ,
          <string-name>
            <surname>Elasticsearch: The Definitive Guide. O'Reilly Media</surname>
          </string-name>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Italiano</surname>
          </string-name>
          , and L. Liccardo, “
          <article-title>Parallelizing the execution of native data mining algorithms for computational biology</article-title>
          ,
          <source>” Concurrency and Computation: Practice and Experience</source>
          , vol.
          <volume>27</volume>
          , no.
          <issue>17</issue>
          , pp.
          <fpage>4630</fpage>
          -
          <lpage>4644</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Castelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Manzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Realising Virtual Research Environments by Hybrid Data Infrastructures: the D4Science Experience,” in International Symposium on Grids and Clouds</article-title>
          (ISGC)
          <year>2014</year>
          23-
          <fpage>28</fpage>
          March 2014,
          <string-name>
            <given-names>Academia</given-names>
            <surname>Sinica</surname>
          </string-name>
          , Taipei,
          <string-name>
            <surname>Taiwan,</surname>
          </string-name>
          <article-title>PoS(ISGC2014)022, ser</article-title>
          .
          <source>Proceedings of Science</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>R.</given-names>
            <surname>Froese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Thorson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R. B. J.</given-names>
            <surname>Reyes</surname>
          </string-name>
          , “
          <article-title>A bayesian approach for estimating length-weight relationships in fishes</article-title>
          ,
          <source>” Journal of Applied Ichthiology</source>
          , vol.
          <volume>30</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>78</fpage>
          -
          <lpage>85</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Magliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ellenbroek</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Improving data quality to build a robust distribution model for architeuthis dux</article-title>
          ,
          <source>” Ecological Modelling</source>
          , vol.
          <volume>305</volume>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>39</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Large</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Magliozzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Analysing and forecasting fisheries time series: purse seine in indian ocean as a case study,”</article-title>
          <source>ICES Journal of Marine Science: Journal du Conseil</source>
          , p.
          <fpage>fsw131</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Ellenbroek</surname>
          </string-name>
          , “
          <article-title>Combining simulated expert knowledge with neural networks to produce ecological niche models for latimeria chalumnae,” Ecological modelling</article-title>
          , vol.
          <volume>268</volume>
          , pp.
          <fpage>55</fpage>
          -
          <lpage>63</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>G.</given-names>
            <surname>Coro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Webb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Appeltans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bailly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cattrijsse</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , “
          <article-title>Classifying degrees of species commonness: North sea fish as a case study,” Ecological Modelling</article-title>
          , vol.
          <volume>312</volume>
          , pp.
          <fpage>272</fpage>
          -
          <lpage>280</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>H.</given-names>
            <surname>Avancini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Candela</surname>
          </string-name>
          , and U. Straccia, “
          <article-title>Recommenders in a Personalized, Collaborative Digital Library Environment</article-title>
          ,
          <source>” Journal of Intelligent Information Systems</source>
          , vol.
          <volume>28</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>253</fpage>
          -
          <lpage>283</lpage>
          ,
          <year>June 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bobadilla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ortega</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hernando</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Gutie</surname>
          </string-name>
          <article-title>´rrez, “Recommender systems survey,” Knowledge-Based Systems</article-title>
          , vol.
          <volume>46</volume>
          , pp.
          <fpage>109</fpage>
          -
          <lpage>132</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>