<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Smart Book Recommender: A Semantic Recommendation Engine for Editorial Products</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Francesco Osborne</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thiviyan Thanapalasingam</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Angelo Salatino</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aliaksandr Birukou</string-name>
          <email>aliaksandr.birukou@springer.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enrico Motta</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Knowledge Media Institute, The Open University</institution>
          ,
          <addr-line>MK7 6AA, Milton Keynes</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Springer-Verlag GmbH</institution>
          ,
          <addr-line>Tiergartenstrasse 17, 69121 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Academic publishers, such as Springer Nature, need to constantly make informed decisions about how and where to market their editorial products. In the field of Computer Science (CS), it is particularly critical to assess which books will be of interest to the attendees of a conference. Typically, these items are manually chosen by publishing editors, on the basis of their personal experience. To make this process both faster and more robust we have developed the Smart Book Recommender (SBR), a semantic application designed to support the Springer Nature editorial team in promoting their publications at CS venues. SBR takes as input the proceedings of a conference and suggests books, journals, and other conference proceedings which are likely to be relevant to the attendees of the conference in question. It does so by taking advantage of a semantic representation of topics, which builds on a very large ontology of Computer Science topics; characterizing Springer Nature books as distributions of semantic topics; and approaching the problem as one of semantic matching between such distributions of semantic topics.</p>
      </abstract>
      <kwd-group>
        <kwd>Scholarly Data</kwd>
        <kwd>Recommendation Systems</kwd>
        <kwd>Ontology</kwd>
        <kwd>Bibliographic Data</kwd>
        <kwd>Scholarly Ontologies</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Academic publishers need to constantly make timely and data-driven decisions to ensure
that they are showcasing their editorial products to their target market. In the field of
Computer Science, it is particularly critical to assess which books, journals, or proceedings
will be of interest to the attendees of a conference. Typically, these items are manually
chosen by publishing editors, on the basis of their personal experience. As the number of
publications grows, there is an increasing need for automated and data-driven methods
that can support this complex and time-consuming task by analysing large-scale data
about editorial products.</p>
      <p>
        In what follows we present the Smart Book Recommender (SBR), a web application
developed in collaboration with Springer Nature (SN), which recommends books, journals and
conference proceedings that are likely to be relevant to the attendees of a given
conference. This work stems from the ongoing collaboration between Springer Nature and
the Knowledge Media Institute (KMi) of the Open University, which has produced a
number of other innovative solutions, including Smart Topic Miner (STM) [<xref ref-type="bibr" rid="ref1">1</xref>], a semantic
framework for classifying academic documents, and its API, the Smart Topic API.
Since January 2017, STM has been routinely used by the SN Computer Science editorial
team, halving the time needed to classify conference proceedings. Similarly, SBR is in line to
be adopted to support SN editors in selecting the best set of books to market to the
participants of a conference. A demo of the SBR prototype is available at
http://rexplore.kmi.open.ac.uk/SBR-demo.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2 Smart Book Recommender</title>
      <p>SBR takes as input the title of a SN book, usually the proceedings of a conference, and
returns a list of books, journals, and proceedings which address topics that are likely to be
relevant to the participants of the conference in question. To do so, it represents SN books
in Computer Science as distributions of semantically characterized topics, which are
drawn from a large-scale ontology of Computer Science, and then computes their pairwise
similarity. SBR offers a simple web interface, shown in Figure 1, to allow editors to filter
the results and share their feedback. It is thus complementary, but very different in scope,
to Recommended<sup>1</sup>, the SN recommender system, which suggests books to users on the basis
of the last 100 papers they have read on online platforms.</p>
      <p>SBR relies on the following background knowledge: a large database of SN book
metadata and the Computer Science Ontology (CSO). The database of metadata contains
titles, abstracts, keywords and other information describing the chapters of about 27K
books and 270 journals in the field of Computer Science. In the case of conference
proceedings and journals, each chapter is usually a research paper.</p>
      <p>
        CSO is a large-scale, granular ontology of research topics that has been created
automatically by running the Klink-2 algorithm [<xref ref-type="bibr" rid="ref2">2</xref>] on the Rexplore dataset [<xref ref-type="bibr" rid="ref3">3</xref>], which
consists of about 16 million publications in the field of Computer Science. It is an
extension<sup>2</sup> of the BIBO ontology<sup>3</sup>, which in turn builds on SKOS. The current version of
CSO includes about 17K topics, which are linked by 70K semantic relationships.
      </p>
      <sec id="sec-2-1">
        <p><sup>1</sup> http://recommended.springernature.com/recommended/</p>
        <p><sup>2</sup> http://kmi.open.ac.uk/technologies/rexplore/ontologies/BiboExtension.owl</p>
        <sec id="sec-2-1-1">
          <title>2.1 Architecture</title>
          <p>Since computing the cosine similarity between all pairs of books in the dataset is
computationally heavy, we consider only promising pairs which obtain a Jaccard similarity of at least
0.125. A data analysis revealed that this heuristic halves the number of candidate pairs
while still producing very good results.</p>
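          <p>The pre-filtering step can be illustrated with a minimal Python sketch. It assumes that each book is represented as a dictionary mapping topic labels to weights, and that the Jaccard similarity is computed over the sets of topics of the two books. The function names, the toy data and the weighting are illustrative assumptions rather than the actual SBR implementation; only the 0.125 threshold is taken from the text.</p>
          <preformat preformat-type="code"><![CDATA[
```python
import math
from itertools import combinations

JACCARD_THRESHOLD = 0.125  # threshold stated in the text


def jaccard(a, b):
    """Jaccard similarity of two sets of topic labels."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity of two sparse topic-weight vectors (dicts)."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similar_pairs(books, threshold=JACCARD_THRESHOLD):
    """Yield (id_a, id_b, cosine) only for pairs passing the Jaccard pre-filter."""
    for (id_a, u), (id_b, v) in combinations(books.items(), 2):
        if jaccard(set(u), set(v)) >= threshold:
            yield id_a, id_b, cosine(u, v)


# Toy topic distributions (hypothetical data, for illustration only).
books = {
    "A": {"semantic web": 1.0, "ontology": 1.0},
    "B": {"semantic web": 1.0, "linked data": 1.0},
    "C": {"robotics": 2.0},
}

for id_a, id_b, score in similar_pairs(books):
    print(id_a, id_b, round(score, 3))
```
]]></preformat>
          <p>In this toy example only the pair (A, B) shares a topic, so it is the only pair for which the more expensive cosine similarity is computed.</p>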
          <p>When the user submits an input book to the SBR web interface, the relevant ID and user
settings are sent as JSON to the background API via a GET query. The API queries the
database to select the most similar books and returns their descriptions. Both the API
and the recommendation engine are realized in Python.</p>
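          <p>As a rough illustration of this exchange, the following Python sketch encodes a book ID and JSON-serialized user settings in a GET query string and decodes them again on the receiving side. The endpoint URL, the parameter names (book_id, settings) and the settings keys are hypothetical, since the text does not specify the actual SBR API.</p>
          <preformat preformat-type="code"><![CDATA[
```python
import json
from urllib.parse import parse_qs, urlencode, urlparse

API_BASE = "http://localhost:8000/recommend"  # hypothetical endpoint


def build_query(book_id, settings):
    """Encode the book ID and the user settings (as JSON) in a GET URL."""
    params = {"book_id": book_id, "settings": json.dumps(settings, sort_keys=True)}
    return API_BASE + "?" + urlencode(params)


def parse_query(url):
    """Receiving side: recover the book ID and the settings from the URL."""
    query = parse_qs(urlparse(url).query)
    return query["book_id"][0], json.loads(query["settings"][0])


# Hypothetical identifier and settings, for illustration only.
url = build_query(
    "book-123",
    {"types": ["book", "journal"], "year_from": 2010, "year_to": 2017},
)
book_id, settings = parse_query(url)
print(book_id, settings)
```
]]></preformat>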
        </sec>
        <sec id="sec-2-1-2">
          <title>2.2 The Web Interface</title>
          <p>The user can select specific conference proceedings by typing their name in an
autocomplete field. The recommendation results are displayed in order of descending
similarity scores and they can be filtered according to document types (journals, books,
conference proceedings) and year range.</p>
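          <p>The ordering and filtering described above can be sketched as follows. This is a minimal Python example under stated assumptions: the Recommendation record, its field names and the toy titles and scores are hypothetical and do not reflect the actual SBR data model.</p>
          <preformat preformat-type="code"><![CDATA[
```python
from dataclasses import dataclass


@dataclass
class Recommendation:
    title: str
    doc_type: str      # "journal", "book" or "proceedings"
    year: int
    similarity: float  # similarity score against the input book


def filter_and_rank(items, doc_types=None, year_from=None, year_to=None):
    """Keep the items matching the filters, sorted by descending similarity."""
    kept = [
        r for r in items
        if (doc_types is None or r.doc_type in doc_types)
        and (year_from is None or r.year >= year_from)
        and (year_to is None or r.year <= year_to)
    ]
    return sorted(kept, key=lambda r: r.similarity, reverse=True)


# Toy results (hypothetical titles and scores).
results = [
    Recommendation("Proceedings X", "proceedings", 2016, 0.61),
    Recommendation("Book Y", "book", 2009, 0.83),
    Recommendation("Book Z", "book", 2015, 0.74),
    Recommendation("Journal W", "journal", 2017, 0.58),
]

shown = filter_and_rank(
    results, doc_types={"book", "proceedings"}, year_from=2010, year_to=2017
)
for r in shown:
    print(r.title, r.similarity)
```
]]></preformat>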
          <p>Figure 3 shows an example of proceedings suggested by SBR. For simplicity,
proceedings of different editions of the same conference are grouped together. Each
proceedings volume is described by its title, year and top fifteen topics. SBR
also highlights in blue the topics which are among the top fifty topics of the input book.
The user has the option to provide feedback about each item using emoticon buttons.
When the user clicks either button, the feedback is sent to the API and recorded in the
database. These data will be used to determine the quality of different similarity metrics
and further enhance the recommendation process.</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <p><sup>3</sup> http://purl.org/ontology/bibo/</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Conclusions</title>
      <p>In this demo paper, we presented the prototype of SBR, a novel system for identifying
related editorial products and facilitating the marketing process at SN.</p>
      <p>As next steps, we intend to improve the recommendation process using other features
(e.g., sales figures) and to conduct a formal evaluation with a group of SN editors. We are
also planning to design a more advanced user interface for comparing the topics of
different books and to implement a new version of the system for assisting researchers in
identifying books and conferences which are relevant to their work.</p>
    </sec>
    <sec id="sec-4">
      <title>4 References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salatino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birukou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Automatic Classification of Springer Nature Proceedings with Smart Topic Miner</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2016</year>
          (pp.
          <fpage>383</fpage>
          -
          <lpage>399</lpage>
          ). Springer. (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Klink-2: integrating multiple web sources to generate semantic topic networks</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2015</year>
          (pp.
          <fpage>408</fpage>
          -
          <lpage>424</lpage>
          ). Springer. (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Mulholland</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Exploring scholarly data with Rexplore</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2013</year>
          (pp.
          <fpage>460</fpage>
          -
          <lpage>477</lpage>
          ). Springer. (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>