<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>M. Haris); auer@tib.eu (S. Auer); markus.stocker@tib.eu (M. Stocker)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Federated Querying of Scholarly Communication Infrastructures</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Muhammad Haris</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sören Auer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Markus Stocker</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>L3S Research Center, Leibniz University Hannover</institution>
          ,
          <addr-line>30167</addr-line>
          ,
          <country>Hannover Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>TIB-Leibniz Information Centre for Science and Technology</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Exponentially increasing inter-related scholarly knowledge is being published on multiple scholarly communication infrastructures. Retrieving data from a single scholarly communication infrastructure is not suficient to meet users complex requirements. Moreover, the manual linking of scholarly knowledge to produce inter-related outputs is a cumbersome task. Required are flexible and user-friendly mechanisms that retrieve inter-related data from distributed scholarly infrastructures. In the proposal presented here, we leverage a federated interface to access data from multiple scholarly communication infrastructures to answer complex user queries. Specifically, we use ORKG (Open Research Knowledge Graph), ORKG Ask, DataCite, OpenAIRE Graph and Semantic Scholar endpoints to access data from these infrastructures in a federated manner. We present the work for the information needs of diverse stakeholders to demonstrate the practicability of the federation, the straightforward implementation and the added value. The code of our service is publicly available on Gitlab1,2.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Federated Query</kwd>
        <kwd>Machine Actionability</kwd>
        <kwd>Open Research Knowledge Graph</kwd>
        <kwd>(Meta)data-based Search</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>structured and semantic manner and enables retrieval of related scholarly content or
derivatives thereof (e.g., comparisons). We propose that the (machine-readable) scholarly knowledge
published in ORKG and other scholarly communication infrastructures can be leveraged to
answer complex user queries (i.e., (meta)data-driven analysis). However, this federated system
currently has some limitations: the scope of querying within the federated service is limited;
it does not support a wide range of queries, which restricts the ability to retrieve diverse data
types or conduct complex data searches across the integrated infrastructures. Furthermore, the
current capabilities for filtering content within the federated query service is limited. While
scholarly artefacts can be retrieved simultaneously from multiple scholarly infrastructures, the
options for filtering these results based on specific criteria are not suficiently comprehensive
or user-friendly. Additionally, after retrieving the data, post-processing steps are required to
refine the results according to the user’s specific requirements.</p>
      <p>
        To address these limitations, we extend the GraphQL-based federated system and integrate
other scholarly communication infrastructures, namely, ORKG Ask3, OpenAIRE Graph [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ],
and Semantic Scholar [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The extended federated system facilitates federated query execution
and supports the integrated retrieval of scholarly information and knowledge, through the
integration with ORKG Ask also generative content. Its primary objective is to enable the
interconnection of scholarly knowledge and contextual information, and to allow filtering at
the (meta)data level. We suggest that the machine-actionable scholarly knowledge published
in ORKG could be utilized together with the scholarly content from other infrastructures to
address complex user queries concerning bibliographic metadata, article content, or both.
      </p>
      <p>Our contributions are as follows:
1. We extend the GraphQL-based federated system to include the ORKG Ask, OpenAIRE
Graph, and Semantic Scholar infrastructures, and to retrieve the fragmented scholarly
content via a single endpoint to answer complex user queries.
2. We define a clear and straightforward mechanism for filtering results in a user-friendly
manner. This enhancement includes filtering capabilities for data obtained from
infrastructures beyond ORKG.</p>
      <p>In order to demonstrate the practicability of federated scholarly infrastructures, the
straightforward implementation and the added value, we propose the following scenarios reflecting
diverse information needs of diferent stakeholder groups:
• Researchers: Filtering papers based on the specified criteria e.g., find all COVID-19 papers
reported 0 estimate less than a threshold.
• Funders: Retrieve the most significant papers (both in terms of statistical impact and
academic impact) published under a specific grant. This entails identifying research
outputs that have advanced the field as well as demonstrated solid results for a particular
problem.
• Bibliometricians: (1) Create a network of co-authors for a given researcher, focusing
on those collaborations that have produced the most significant papers. This involves
analyzing the content of researchers articles and identifying those that have reported
statistically significant results. (2) Uncover the research focusing on novel problems that
have received a high number of citations. This involves identifying such papers and
assessing their citation counts.</p>
      <p>In the context of above discussion, we thus focus on addressing the following research
question:</p>
      <p>How can we reuse interoperable scholarly knowledge to support complex (meta)data-driven
analysis?</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Several services have been developed to process federated queries across diferent databases or
scholarly communication infrastructures. BioThings Explorer [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] is an application designed to
query a federated knowledge graph, which is formed by merging data from various biomedical
web services. This tool uses detailed semantic annotations to understand the inputs and outputs
of each source, enabling it to automatically link multiple web service calls to execute complex
graph queries. Since there is no centralized knowledge graph to maintain, the proposed data
explorer is a streamlined application that fetches data dynamically.
      </p>
      <p>
        To access biological data, ROBOKOP (Reasoning Over Biomedical Objects linked in
Knowledge-Oriented Pathways) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] was proposed. ROBOKOP is a KG that supports the
open biomedical question-answering application. Additionally, the ROBOKOP Knowledge
Graph Builder (KGB) was presented, which constructs the KG and provides a rich framework to
execute graph query on federated data sources. A biomedical query system named Translator
Query Language (TranQL) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] was proposed to search and analyze semantically connected
knowledge graphs. Utilizing the Biolink data model, TranQL structures queries into a graph of
Biolink elements. These queries are executed on federated knowledge sources, and the retrieved
results are integrated into one knowledge graph. Similarly, BioCarian [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]— an user-friendly
interface was proposed to enable querying heterogeneous biological databases in an exploratory
manner. The interface is enriched with facets that enable better query construction, thus making
it easier for users to filter data. BioCarian also provides a SPARQL endpoint where users can
directly execute the federated queries to explore the disparate databases.
      </p>
      <p>
        Tong et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] proposed Hu-Fu system for efective and secure spatial query processing
on federated data. Hu-Fu supports querying native SQL as well as various spatial databases,
including PostGIS, Simba, GeoMesa, and SpatialHadoop. Comprehensive tests indicate that
Hu-Fu outperforms state-of-the-art systems in execution speed and communication eficiency
while ensuring robust security. Ontario [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] is a specialized federated query processing method
designed for vast and diverse data systems. It ofers eficient query processing across a collection
of data sources within a data lake. Ontario utilizes RDF Molecule Templates which are abstract
overviews of entity properties in a unified schema and their implementation in a data lake.
3. Federated Access to Scholarly Infrastructure
In this section, we present the approach to extend our federated query service. We integrate
Open Research Knowledge Graph (ORKG), ORKG Ask, OpenAIRE Graph, DataCite and Semantic
Scholar infrastructures in a federated GraphQL service to retrieve and integrate the fragmented
scholarly content to enable complex data-driven analysis.
      </p>
      <p>DataCite ofers a GraphQL endpoint for the PID Graph, connecting persistently identified
resources from DataCite, ORCID, ROR, and other sources, and providing standardized metadata
for these resources. Similarly, we implement a GraphQL endpoint to enable access to ORKG
content. Additionally, we also integrate the ORKG Ask, OpenAIRE Graph, and Semantic Scholar
REST APIs to fetch diverse information regarding diferent scholarly artefacts. ORKG Ask is
an advanced search system that helps to find and extract valuable information from a vast
corpus of research articles. It revolutionizes the way researchers navigate scholarly literature
by leveraging cutting-edge Large Language Model (LLM) technologies to deliver precise and
relevant answers according to research queries. By federating also ORKG Ask, the system obtains
the ability to leverage generative content, which provides a possibility to address missing data.
When certain pieces of information are not available in the ORKG papers, the system can use
generative models to predict the missing details, thereby enhancing the comprehensiveness
and utility of the search results. OpenAIRE Graph provides metadata about various scholarly
artefacts (articles, datasets, software) and other objects, including projects, organizations, and
researchers. Semantic Scholar is an academic search engine developed by the Allen Institute
for AI. It uses artificial intelligence and machine learning to help researchers find relevant
information eficiently.</p>
      <p>Our federated query service facilitates cross-walking between metadata about artefacts (such
as articles and datasets) with their context (such as people and organizations) and the content
of the articles, thereby supporting complex (meta)data-driven analysis. Figure 1 illustrates the
enhanced federated architecture. The proposed federated graph plays a crucial role in enabling
complex (meta)data-driven analysis. In addition to metadata analysis, infrastructures can utilize
article’s content from the ORKG to perform new kind of analysis on data. For example, a
researcher may discover all research outputs (papers) published under a particular grant that
have impact (high number of citations) as well as have significant results (in a statistical sense
of  &lt; .001).</p>
    </sec>
    <sec id="sec-3">
      <title>4. (Meta)data-driven Analysis</title>
      <sec id="sec-3-1">
        <title>4.1. Researcher</title>
        <p>A researcher discovered an ORKG COVID-19 comparison for the virus’ basic reproduction (0)
estimates. This comparison can be accessed through the federated GraphQL endpoint using its
DOI: https://doi.org/10.48366/r44930. The researcher is interested in filtering studies
that report 0 estimate less than a specified threshold and utilize the method generalized
growth model. While the comparison includes details about methods used to conduct
experiments, not all papers include them. We thus leverage ORKG Ask to complete this missing data
in a generative manner. The federated endpoint is designed to facilitate such complex queries
by executing the relevant components at designated endpoints, thereby enabling the essential
metadata-driven analysis for the research. Listing 1 shows the query executed on ORKG and
ORKG Ask in a federated manner. ORKG provides the details of a comparison whereas semantic
details (methods, and results) of compared papers are retrievable from ORKG Ask. We can also
include additional filters to refine the search results, for example search only those studies that
have reported results for the European region. For such a scenario, the GeoNames endpoint
within the federated service will be queried to retrieve all countries included in the EU region.
The results can be filtered by adding the location property to the where clause: property:
"location", value: "EU".</p>
      </sec>
      <sec id="sec-3-2">
        <title>4.2. Funders</title>
        <p>A funder seeks to collect all articles that have employed Named Entity Recognition across
various domains, particularly those reporting an accuracy exceeding, for example, 65%. This
process will allow the funder to understand the domains where NER has been applied, the
methodologies used, and their corresponding accuracies. Additionally, the funder aims to
determine if a retrieved artefact is peer-reviewed or not. Such a complex scenario can be
addressed by querying OpenAIRE Graph, DataCite, and ORKG in a federated manner. The
federated query outlined in Listing 2 is first executed on OpenAIRE Graph to gather all papers
acknowledging a specific grant (859136 4), then on DataCite to retrieve peer review information,
and finally on ORKG to obtain the corresponding machine-actionable contribution descriptions.
These papers are then filtered based on the research problem addressed and the reported
accuracy. Additionally, the funder can generate a comparison of all research contributions
and analyze the significance of the funded work, considering both its impact and statistical
significance.</p>
      </sec>
      <sec id="sec-3-3">
        <title>4.3. Bibliometricians</title>
        <sec id="sec-3-3-1">
          <title>4.3.1. Co-author Network Analysis</title>
          <p>A bibliometrician wants to find all co-authors of a particular researcher who is working on the
estimation of COVID-19 reproduction and published highly significant papers i.e., reported
average 95% Confidence Interval (CI) less than some threshold. Thus, she performs a network
analysis of the researcher’s collaborations and also determines highly significant studies in
the epidemiology research. This task relies on the DOI of articles that reported 95% CI values,
retrievable from ORKG thanks to machine actionable content representation. The set of articles
of a researcher can be retrieved using ORCID ID by leveraging the DataCite services. Finally,
the most important articles meeting the criteria (COVID-19 articles reported CI 95% &lt; 4) can be
fetched from ORKG. Listing 3 shows the query executed on DataCite and ORKG in a federated
manner. Figure 2 shows the co-authors network of a researcher with whom he has published
COVID-19 articles.</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>4.3.2. Citation Analysis</title>
          <p>A bibliometrician is exploring studies that address fairness in machine learning models, with a
specific focus on identifying articles that have reported significant outcomes in fairness metrics
and are also highly cited. This task involves accessing articles that have reported fairness
metrics, available through ORKG. With a relevant set of articles identified, the number of
citations for each can be retrieved from Semantic Scholar. Listing 4 outlines the query executed
on ORKG and Semantic Scholar in a federated manner. Lines 3-9 in the query are executed on
the ORKG API to fetch the papers’ DOIs, and line 12 is executed on Semantic Scholar to retrieve
the citations of each paper.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Discussion</title>
      <p>Federated search is a widely used approach to retrieve data from distributed sources. To address
our research question, we presented a federated architecture that crosswalks the metadata
and data from DataCite, ORKG, ORKG Ask, OpenAIRE Graph, and Semantic Scholar
infrastructures. The proposed federated architecture leverages the machine-actionable scholarly
knowledge published in the ORKG, the generative content capabilities by ORKG Ask, and
classical (bibliographic) metadata to answer complex user queries. Our federated endpoint
enables diferent stakeholders to pose queries to meet their complex information needs. We
have shown the practicability of federated search by presenting diferent federated queries:
(i) retrieving statistically significant research articles (ii) retrieving highly impactful research
articles published under a specific grant (iii) supporting bibliometricians in creating network of
co-authors who have published articles with significant results.</p>
      <sec id="sec-4-1">
        <title>5.1. Future directions</title>
        <p>We aim to develop ORKG Commons (Figure 3), a web-based platform that will enable interactive
exploration of data served by the federated GraphQL service presented here. This platform will
allow users to seamlessly navigate and analyze interconnected artefacts retrieved from various
scholarly communication infrastructures in a federated setup. A key feature of ORKG Commons
will be to allow users to write queries in natural language, which will be automatically converted
into GraphQL queries to facilitate broader accessibility and ease of use. Additionally, ORKG
Commons will feature dynamic facets for content filtering. These facets will be generated based
on the metadata and the content of scholarly artefacts —providing users with enriched options
to refine their search results. This interface will provide users with appropriate options to refine
their search results and facilitate navigation through the vast and interconnected scholarly
knowledge available through various scholarly communication infrastructures.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>6. Conclusion</title>
      <p>Federated search is a well-established method for retrieving data from heterogeneous and
distributed sources. This paper introduces a federated architecture for a system designed to support
cross-walks between scholarly metadata and data. The federated system described has the
potential to help users in conducting advanced scientific inquiries. By enabling the formulation
of complex information needs, this system supports the exploration and analysis of scholarly
knowledge published in scholarly articles together with contextual research information. This,
in turn, can lead to more insightful, data-driven discoveries across various scientific domains.</p>
      <sec id="sec-5-1">
        <title>Acknowledgments</title>
        <p>This work was co-funded by the European Research Council for the project ScienceGRAPH
(Grant agreement ID: 819536) and the German Research Foundation (DFG) project NFDI4DS
(PN: 460234259).</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>A. Federated GraphQL Queries</title>
      <p>Listing 1: Retreiving COVID-19 comparison from ORKG and other semantic details from ORKG</p>
      <p>Ask as well as filtering the results according to the specified condition.
comparison( doi: "10.48366/r44930"
where: [{property: "METHOD", value: "generalized growth model" }]) {
doi label
contributions {
id label
paper { doi label }
ORKG Askdetails { method }
data { propertyLabel label}
},
"ORKG Askdetails": {
"method": "The study statistically estimated the cCFR and the basic reproduction number
using the exponential growth rate of the incidence."
Listing 2: GraphQL query executed on OpenAIRE Graph, DataCite and ORKG to obtain the
details of papers published under ScienceGraph project (Grant ID: 819536).
24
25
26
27 } } }
},
"data": [...]
}]
id: "819536", where: [
1 { # OpenAIRE Graph query
2 project(
3
4
5
6
7 ]) {
8 #DataCite Query
9 peerReview { type }
10 papers { # ORKG query
11 label doi
12 contributions {
13 id label
14
15 }}
16 }}
17
18 Result (shortened):
19 { "data": { "project": {
20 "papers": [{</p>
      <p>data { propertyId label }
{ property: "P32", value: "named entity recognition" }</p>
      <p># search for numeric value
{ property: "P45075", _GTE: 0.65 }
Listing 3: Results obtained by querying the PID Graph and ORKG federation about COVID-19
1 Query:
2 { # DataCite query
3 person(id: "https://orcid.org/0000-0001-8722-6149") { id name
4 # ORKG query
5 papers( where: [
6 {
7 property: "P32" #research problem
8 value: "Determination of the COVID-19 basic reproduction number"
9 }
10 { property: "HAS_VALUE", _LT: 4 }
11 ]) { label id} # showing paper title and doi
12 } }
13
14 Result (shortened):
15 { "data": { "person": { "id": "https://orcid.org/0000-0001-8722-6149", "name": "Shi Zhao",
16 "papers": [{
17 "id": "R44910",
18 "label": "Estimating the Unreported Number of Novel Coronavirus
19 (2019-nCoV) Cases in China in the First Half of January 2020:
20 A Data-Driven Modelling Analysis of the Early Outbreak"
21 }]
22 } } }
Listing 4: Results obtained by querying the Semantic Scholar and ORKG federation about the
study Fairness in machine learning and has reported Balanced accuracy
&gt; (75%).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kuhn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chichester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krauthammer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Queralt-Rosinach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Giannakopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-C. N.</given-names>
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Viglianti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <article-title>Decentralized provenance-aware publishing with nanopublications</article-title>
          ,
          <source>PeerJ Computer Science</source>
          <volume>2</volume>
          (
          <year>2016</year>
          )
          <article-title>e78</article-title>
          . doi:
          <volume>10</volume>
          .7717/ peerj-cs.
          <volume>78</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          ,
          <article-title>Data integration for heterogenous datasets</article-title>
          ,
          <source>Big data 2</source>
          (
          <year>2014</year>
          )
          <fpage>205</fpage>
          -
          <lpage>215</lpage>
          . doi:
          <volume>10</volume>
          . 1089/big.
          <year>2014</year>
          .
          <volume>0068</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , S. De,
          <string-name>
            <given-names>K.</given-names>
            <surname>Moessner</surname>
          </string-name>
          ,
          <article-title>Implementation of federated query processing on linked data</article-title>
          ,
          <source>in: 2013 IEEE 24th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC)</source>
          ,
          <year>2013</year>
          , pp.
          <fpage>3553</fpage>
          -
          <lpage>3557</lpage>
          . doi:
          <volume>10</volume>
          .1109/PIMRC.
          <year>2013</year>
          .
          <volume>6666765</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schwarte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schenkel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          , Fedx:
          <article-title>Optimization techniques for federated query processing on linked data</article-title>
          , in: International Semantic Web Conference,
          <year>2011</year>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -25073-6_
          <fpage>38</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Haris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Farfar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <article-title>Federating scholarly infrastructures with graphql</article-title>
          , in: H.
          <string-name>
            <surname>-R. Ke</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          Sugiyama (Eds.),
          <source>Towards Open and Trustworthy Digital Societies</source>
          , Springer International Publishing, Cham,
          <year>2021</year>
          , pp.
          <fpage>308</fpage>
          -
          <lpage>324</lpage>
          . doi:
          <volume>10</volume>
          . 1007/978-3-
          <fpage>030</fpage>
          -91669-5_
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Stocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oelen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Jaradeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Haris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A.</given-names>
            <surname>Oghli</surname>
          </string-name>
          , G. Heidari,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hussein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-L.</given-names>
            <surname>Lorenz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kabenamualu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Farfar</surname>
          </string-name>
          , et al.,
          <article-title>Fair scientific information with the open research knowledge graph</article-title>
          ,
          <source>FAIR Connect 1</source>
          (
          <year>2023</year>
          )
          <fpage>19</fpage>
          -
          <lpage>21</lpage>
          . doi:
          <volume>10</volume>
          .3233/FC-221513.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>P.</given-names>
            <surname>Manghi</surname>
          </string-name>
          , Bolikowski,
          <string-name>
            <given-names>N.</given-names>
            <surname>Manola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schirrwagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <surname>Openaireplus:</surname>
          </string-name>
          <article-title>the european scholarly communication data infrastructure, D-Lib Magazine 18 (</article-title>
          <year>2012</year>
          ). doi:
          <volume>10</volume>
          .1045/ september2012-manghi.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Manghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Houssos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mikulicic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Jörg</surname>
          </string-name>
          ,
          <article-title>The data model of the openaire scientific communication e-infrastructure</article-title>
          , in: J.
          <string-name>
            <surname>M. Dodero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Palomo-Duarte</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          Karampiperis (Eds.),
          <source>Metadata and Semantics Research</source>
          , Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2012</year>
          , pp.
          <fpage>168</fpage>
          -
          <lpage>180</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -35233-1_
          <fpage>18</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Fricke</surname>
          </string-name>
          ,
          <article-title>Semantic scholar</article-title>
          ,
          <source>Journal of the Medical Library Association: JMLA</source>
          <volume>106</volume>
          (
          <year>2018</year>
          )
          <article-title>145</article-title>
          . doi:
          <volume>10</volume>
          .5195/jmla.
          <year>2018</year>
          .
          <volume>280</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Callaghan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Cano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Riutta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Juneja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Narayan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hanspers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Pico</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. I. Su</surname>
          </string-name>
          , BioThings Explorer:
          <article-title>a query engine for a federated knowledge graph of biomedical APIs</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>39</volume>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .1093/bioinformatics/btad570.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Balhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kebede</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Morton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Fecho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tropsha</surname>
          </string-name>
          ,
          <article-title>Robokop kg and kgb: integrated knowledge graphs from federated sources</article-title>
          ,
          <source>Journal of chemical information and modeling</source>
          <volume>59</volume>
          (
          <year>2019</year>
          )
          <fpage>4968</fpage>
          -
          <lpage>4973</lpage>
          . doi:
          <volume>10</volume>
          .1021/acs.jcim.9b00683.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. C.</given-names>
            <surname>Ahalt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Balhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Fecho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kebede</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Morton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tropsha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          , et al.,
          <article-title>Visualization environment for federated knowledge graphs: development of an interactive biomedical query language and web application interface</article-title>
          ,
          <source>JMIR Medical Informatics</source>
          <volume>8</volume>
          (
          <year>2020</year>
          )
          <article-title>e17964</article-title>
          . doi:
          <volume>10</volume>
          .2196/17964.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Zaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tennakoon</surname>
          </string-name>
          ,
          <article-title>Biocarian: Search engine for exploratory searches in heterogeneous biological databases</article-title>
          ,
          <source>BMC Bioinformatics 18</source>
          (
          <year>2017</year>
          )
          <article-title>435</article-title>
          . doi:
          <volume>10</volume>
          .1186/ s12859-017-1840-4.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , L. Chen,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Xu</surname>
          </string-name>
          , et al.,
          <article-title>Hu-fu: Eficient and secure spatial queries over data federation</article-title>
          ,
          <source>Proceedings of the VLDB Endowment</source>
          <volume>15</volume>
          (
          <year>2022</year>
          )
          <article-title>1159</article-title>
          . doi:
          <volume>10</volume>
          .14778/3514061.3514064.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>K. M. Endris</surname>
            ,
            <given-names>P. D.</given-names>
          </string-name>
          <string-name>
            <surname>Rohde</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-E. Vidal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Auer</surname>
          </string-name>
          , Ontario:
          <article-title>Federated query processing against a semantic data lake</article-title>
          ,
          <source>in: Database and Expert Systems Applications: 30th International Conference, DEXA</source>
          <year>2019</year>
          , Linz, Austria,
          <source>August 26-29</source>
          ,
          <year>2019</year>
          , Proceedings,
          <source>Part I 30</source>
          , Springer,
          <year>2019</year>
          , pp.
          <fpage>379</fpage>
          -
          <lpage>395</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -27615-7_
          <fpage>29</fpage>
          . {
          <article-title>"type": "PeerReview" } "label": "Agriculture Named Entity Recognition-Towards FAIR, Reusable Scholarly Contributions in Agriculture", "contributions": [</article-title>
          ...]
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>