<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Dileepa Jayakodya, Nirojan Selvanathana and Violeta Damjanovic-Behrendta</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Salzburg Research</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jakob Haringer Strasse</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Salzburg</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Austria</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>This paper presents an index-time merging approach to federated search designed for a European Connected Factory Platform for Agile Manufacturing (EFPF) project and its federated digital ecosystem. The presented approach supports search and recommendations for products, services as well as suitable business partners for the creation of networks within the business ecosystem. We discuss the overall architecture of the search and recommendation system, major challenges and the current state of development. Federated search, recommendation, digital platforms, platform search Federated search systems emerged in the late 1990s in order to support library users to simultaneously search multiple library sources and catalogues [1]. Early federated search systems were designed to collect search criteria from users, submit search queries to sources to be retrieved, wait for a response to arrive, and filter and organize the query results before sending the results back to the users. For example, Windows 7 services differentiate between two types of search, Windows Search and Federated Search [2]. Windows Search enables searching of the entire local system and a limited amount of external resources such as shared drives. Likewise, the selection-based search in MacOS Spotlight requires an index of all items and files that exist in the system to be created.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>2020 Copyright for this paper by its authors.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature review</title>
      <p>The authors in [3] differentiate between federated search and metasearch. Metasearch engines crawl
open websites and return results of all kinds, such as apps, webpages, contacts and documents that are
drawn from different sources. In contrast to metasearch, federated search systems focus on subscribed
databases that are processed through an indexing mechanism that enables structured, in-depth,
contentoriented retrieval [4]. A number of attempts to implement federated search arose from the 1990s
onwards, e.g. MetaLib and SFX implemented at Boston College [5], MetaLib and SFX with Ex Libris
at the Five Colleges Libraries in Massachusetts [6], Georgia Library Learning Online (GALILEO) with
its hybrid approach to federated search products [7], and more. Many interviews showed that the users
were dissatisfied with federated search performance of that time, due to limitations in database
availability, search speed, retrieval precision, and result comprehensiveness [8]. Users preferred simple
interfaces over federated search and had little interest in advanced searching techniques. In recent years,
several attempts to improve federated search were conducted. In 2004, the University of Nevada at
Reno experimented with the Google Search Appliance to be used as a federated search utility. This
required partnership between Google, the library and a test vendor, and it still experienced technical
and interoperability challenges [9].</p>
    </sec>
    <sec id="sec-3">
      <title>3. Problem statement</title>
      <p>Federated search is a technique for simultaneous search of multiple data sources using a single query
and search interface. The examples of federated search can be found in many sectors, facilitating search
of thousands of products, each sorted into different categories, e.g. search of numerous websites serving
various purposes and stakeholders [11]. The majority of the research on federated search is focused on
system performance and technical development, system usability testing, user interaction, interface
customization, authentication, design, database communication protocol, system platform [10].</p>
      <p>The two fundamental components of federated search systems comprise of an index that is a
reference to the data to be searched and a search algorithm. These two components interact using either
search-time, index-time or a federated search interface [11]:
 Search-time merging runs separate search algorithms on each data source that is searched. It
uses multiple indices and aggregates the returned search results into a final list, which is then
presented to the user. Obtaining the results can be slow, if the central search engine needs to wait
until all of the local search engines have responded.
 Index-time merging requires a central index to be built for all of the data that need to be included
in the search results. It requires one search engine and one index, and is faster than search-time
merging. To aggregate data from multiple sources and formats into a single index, interoperability
issues need to be addressed.
 Federated search indexing is an extension of the search-time merging method that requires a
robust search solution in order to index different types of content in different indices and create the
unified federated search interface.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Federated search and recommendation in EFPF</title>
      <p>Our solution to a federated search and recommendation system, designed in EFPF, includes the
following tree steps: firstly, in order to achieve a technical federation of digital platforms, we design a
common federated search capability. Secondly, we collect the results of federated search to support
federated recommendations. Finally, both federated search and recommendations support advanced
matchmaking and agile network creation mechanisms for business transactions across the digital
platform ecosystem. Federated search in EFPF supports queries for products, services and business
partners. The search criteria and the results are collected to support the recommendation processes,
based on different techniques of information pattern matching, e.g. information retrieval and similarity
matching techniques, which are both based on Machine Learning (ML) and data analysis. After the
recommendation algorithm has identified the most suitable products, services and/or partners, the user
evaluates the results based on selected indicators, e.g. cost, reliability, quality, etc. In the final step, the
user makes a decision and initiates a business transaction.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Design of a federated search and recommendation system</title>
      <p>In EFPF, federated search is based on the index-time merge architectural approach (Figure 1). This
approach requires content from all base platforms participating in the federation, to be channeled into
a central index of the EFPF platform. The four base platforms in EFPS are NIMBLE
(https://www.nimble-project.org/), COMPOSITION (https://www.composition-project.eu/), vf-OS
(https://www.vf-os.eu/) and DIGICOR (https://www.digicor-project.eu/). The EFPF Data Spine is
designed as an integration flow engine, enabling toolsets and services from the base platforms to
connect with each other and offers their capabilities to the users of the different platforms. For the
implementation of the Federated Search Index, we use Apache Solr (https://lucene.apache.org/solr/)
that enables distributed indexing, replication and load-balanced querying, automated failover and
recovery, configuration, etc.</p>
      <p>One major disadvantage of the index-time merge architecture concerns the data connectors that need
to be implemented to integrate different types of data sources. In addition, acquiring the content from
different repositories and data sources requires considerable efforts; e.g., it needs to be done using
scheduled read-only processes that need to be designed and implemented at the data integration layer.
This also requires a decision about the frequency of the data ingestion into the central index. Data
ingestion frequency needs to be configured e.g. hourly, daily or weekly, depending on the data velocity
of the base platforms.</p>
      <p>The user interaction data in EFPF is stored using the User Activity Log Service, which listens to all
user interaction events generated from the EFPF portal (https://efpf-portal.ascora.eu/), e.g. purchased
items, etc. The user interaction data are also stored using Solr Index, in order to support ML model
creation. The outcomes of the matchmaking are presented to the EFPF users through an intuitive UI
that is integrated in the portal.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>Federated search, ML and recommendation techniques can be combined in an innovative way to
deliver better results for businesses, improving user experience, increasing user engagement through
incentives and rewards mechanisms, boosting conversion rates and fully operationalizing businesses
using digital platform solutions. However, there are still many questions to be explored in the
intersection of these fields, e.g.: Which recommendation algorithms are more suitable for federated
platform ecosystems? What is the minimum amount of data to run an effective recommendation
system? How symbolic AI methods can compensate for the missing data and better support
recommendation? This paper has presented the main challenges and the initial design of the federated
search architecture. The above questions will be further answered over the course of the project
implementation. In 2021, an open call for third-party use of the platform infrastructure will offer the
opportunity for large-scale validation of the search and recommendation system.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Acknowledgements</title>
    </sec>
    <sec id="sec-8">
      <title>8. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>This research has been co-funded by the European Commission within the H2020 project NIMBLE,</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>No.</surname>
          </string-name>
          <article-title>723810 and the H2020 DT-</article-title>
          <string-name>
            <surname>ICT-</surname>
          </string-name>
          07-2018-2019
          <string-name>
            <surname>project</surname>
            <given-names>EFPF</given-names>
          </string-name>
          , No.
          <volume>825075</volume>
          . [1]
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Elguindi</surname>
          </string-name>
          , K. Schmidt, “
          <article-title>Discovery systems, layers and tools, and the role of the electronic</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>resources librarian”</article-title>
          .
          <source>Electronic Resource Management</source>
          ,
          <year>2012</year>
          , p.
          <fpage>109</fpage>
          -
          <lpage>139</lpage>
          . [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Orchilles</surname>
          </string-name>
          , “Introduction to Windows 7”,
          <source>Microsoft Windows 7 Administrator's Reference</source>
          , p.
          <fpage>1</fpage>
          -
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          91, doi:10.1016/b978-1
          <source>-59749-561-5</source>
          .
          <fpage>00001</fpage>
          -
          <lpage>2</lpage>
          ,
          <year>2010</year>
          . [3]
          <string-name>
            <given-names>X.</given-names>
            <surname>Fei</surname>
          </string-name>
          , “
          <article-title>Implementation of a Federated Search System: Resource Accessibility Issues“</article-title>
          , Serials
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Review - SER REV</surname>
          </string-name>
          .
          <volume>35</volume>
          ,
          <year>2009</year>
          , p.
          <fpage>235</fpage>
          -
          <lpage>241</lpage>
          . [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Hsieh-Yee</surname>
          </string-name>
          , S. Zhang, “
          <article-title>User Percep</article-title>
          .
          <article-title>of MetaLib Comb. Search: An Inv</article-title>
          . of How Users
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>Make Sense of Federated Search,”</article-title>
          <string-name>
            <surname>Intern. Ref. Serv. Q.</surname>
          </string-name>
          <year>12</year>
          ,
          <year>2007</year>
          , p.
          <fpage>211</fpage>
          -
          <lpage>236</lpage>
          . [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Geritty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lyman</surname>
          </string-name>
          , E. Tallent, “Blurring Services and Resources: Boston College's
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <article-title>Implementation of MetaLib</article-title>
          and SFX”,
          <source>Ref. Serv. Rev</source>
          .
          <volume>30</volume>
          , no.
          <issue>3</issue>
          ,
          <issue>2002</issue>
          , p.
          <fpage>229</fpage>
          -
          <lpage>241</lpage>
          . [6]
          <string-name>
            <given-names>L.S. Mestre C.</given-names>
            <surname>Turner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Lang</surname>
          </string-name>
          , B. Morgan, “
          <article-title>Do We Step Together, in the Same Direction, at the</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>Reference Services Quarterly</source>
          <volume>12</volume>
          , no.
          <issue>1</issue>
          /2, 2007, p.
          <fpage>111</fpage>
          -
          <lpage>132</lpage>
          . [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Fancher</surname>
          </string-name>
          , “Wanted,
          <article-title>Dead or Alive: Federated Searching for a Statewide Virtual Library”,</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>Internet Reference Services Quarterly</source>
          <volume>12</volume>
          , no.
          <issue>1</issue>
          /2, 2007, p.
          <fpage>133</fpage>
          -
          <lpage>158</lpage>
          . [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Calhoun</surname>
          </string-name>
          , “
          <article-title>An Integrated Framework for Discovering Digital Library Collections”</article-title>
          ,
          <source>Journal of</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          Zhejiang University Science 6A, no.
          <issue>11</issue>
          ,
          <year>2005</year>
          , p.
          <fpage>1318</fpage>
          -
          <lpage>1326</lpage>
          . [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Taylor</surname>
          </string-name>
          , “
          <article-title>Using the Google Search Appliance for federated searching”</article-title>
          , Internet Reference
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Services</given-names>
            <surname>Quarterly</surname>
          </string-name>
          ,
          <volume>10</volume>
          (
          <issue>3</issue>
          ),
          <year>2006</year>
          , p.
          <fpage>45</fpage>
          -
          <lpage>55</lpage>
          . [10]
          <string-name>
            <given-names>D.G.</given-names>
            <surname>Dorner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            <surname>Curtis</surname>
          </string-name>
          , “
          <article-title>A Comparative Review of Common User Interface Products,” Library</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>Hi Tech 22, no. 2</source>
          ,
          <issue>2004</issue>
          , p.
          <fpage>182</fpage>
          -
          <lpage>197</lpage>
          . [11]
          <string-name>
            <surname>Algolia</surname>
          </string-name>
          , “What is Federated Search?“, URL: https://blog.algolia.
          <article-title>com/what-is-federated-search/,</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>