<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Supporting Springer Nature Editors by means of Semantic Technologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Francesco Osborne</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Angelo Salatino</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aliaksandr Birukou</string-name>
          <email>aliaksandr.birukou@springer.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thiviyan Thanapalasingam</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enrico Motta</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Knowledge Media Institute, The Open University</institution>
          ,
          <addr-line>MK7 6AA, Milton Keynes</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Springer-Verlag GmbH</institution>
          ,
          <addr-line>Tiergartenstrasse 17, 69121 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The Open University and Springer Nature have been collaborating since 2015 in the development of an array of semantically-enhanced solutions supporting editors in i) classifying proceedings and other editorial products with respect to the relevant research areas and ii) taking informed decisions about their marketing strategy. These solutions include i) the Smart Topic API, which automatically maps</p>
      </abstract>
      <kwd-group>
        <kwd>Scholarly Data</kwd>
        <kwd>Ontology Learning</kwd>
        <kwd>Bibliographic Data</kwd>
        <kwd>Scholarly Ontologies</kwd>
        <kwd>Data Mining</kwd>
        <kwd>Conference Proceedings</kwd>
        <kwd>Metadata</kwd>
        <kwd>Classification</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Correctly classifying proceedings and other editorial products in terms of the relevant
research areas is critical to facilitate their discovery and to allow editors to take informed
decisions about where to market them. Traditionally, this process has been handled
manually by experienced editors, leading to high costs and slow throughput. In this short
paper, we present a number of solutions informed by Semantic Technologies, which we
have developed to address this issue in the context of a collaboration between Springer
Nature (SN) and the Knowledge Media Institute (KMi) of The Open University. These
solutions include: i) the Smart Topic API, ii) the Smart Topic Miner, and iii) the Smart
Book Recommender.</p>
      <p>
        Purely syntactic solutions, which extract frequent keyphrases from a set of documents,
have shown to be limited for classifying conference proceedings [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], since they typically
return a very large and unwieldy distribution of terms. The Smart Topic API addresses
this issue by mapping these terms to concepts in the Computer Science Ontology (CSO)
and returning a human-friendly number of structured topic descriptors. CSO is a large
scale and granular ontology of research topics that has been created automatically, by
running the Klink-2 algorithm [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] on the Rexplore dataset [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This consists of about 16
million publications in the field of Computer Science. CSO includes about 17K topics,
which are linked by 70K semantic relationships. The mappings from keywords to
concepts take into account both synonyms and sub-areas of a topic – e.g., all documents
associated to terms such as “semantic technologies”, “linked data”, “RDF”, “OWL” will
be also tagged with the topic “Semantic Web”. The result is a balanced topic distribution,
which is both accurate in terms of topic annotation and also easy to understand and edit
for the user.
      </p>
      <p>
        The Smart Topic API, which is also available for research purposes, currently supports
two web applications. The first is the Smart Topic Miner (STM) 1 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], which helps editors
in understanding and classifying proceedings i) by suggesting, for each proceeding, both a
structured set of relevant research topics drawn from CSO, as well as a set of codes drawn
from the SN Classification for Computer Science, and ii) by providing editors with an
environment to make sense of the proposed annotations and edit them if necessary. The
second application is the Smart Book Recommender 2 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which takes as input the
proceedings of a conference and returns books, journals and other proceedings that are
likely be of interest for its attendees. It does so by computing the similarity of SN editorial
products over the vectors of semantic topics returned by the Smart Topic API.
2 Business Value
The STM tool has been routinely used by the SN Computer Science editorial team for
classifying conference proceedings since January 2017. It is being applied to the Lecture
Notes in Computer Science (LNCS) and other computer science series (LNBIP, CCIS,
IFIP-AICT, LNICST), which publish about 780 volumes each year.
      </p>
      <p>STM halves the time needed for classifying proceedings from 20-30 to 10-15 minutes.
The benefits are especially evident when classifying complex multi-volume conferences,
such as the European Conference on Computer Vision (ECCV3). The perceived value of
STM can be roughly described as “it helps one to get 80-90% of topics correct very
quickly”. Indeed, while the classification of proceedings has traditionally been performed
only by very experienced editors, thanks to STM it is now possible for assistant editors to
perform the task, thus distributing the load and reducing costs. In addition, the adoption of
a controlled vocabulary (in terms of the CSO ontology) makes the process more robust
and facilitates the identification of related editorial products.</p>
      <p>
        In the future, we plan to integrate the STM tool with the SN Linked Open Data portal4,
which describes Springer Nature conferences [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This will allow users to formulate
complex queries that take advantage of the granular topic taxonomy provided by CSO.
      </p>
      <p>In conclusion, we believe that this project offers an excellent example of how the use
of ontologies and other semantic technologies can be effectively deployed in an
organization to make workflows more robust and reduce costs.
1 Demo available at http://rexplore.kmi.open.ac.uk/STM2_demo/
2 Demo available at http://rexplore.kmi.open.ac.uk/SBR-demo
3 https://link.springer.com/conference/eccv
4 http://lod.springer.com/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salatino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birukou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Automatic Classification of Springer Nature Proceedings with Smart Topic Miner</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2016</year>
          (pp.
          <fpage>383</fpage>
          -
          <lpage>399</lpage>
          ). Springer. (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Klink-2: integrating multiple web sources to generate semantic topic networks</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2015</year>
          (pp.
          <fpage>408</fpage>
          -
          <lpage>424</lpage>
          ). Springer. (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Mulholland</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Exploring scholarly data with Rexplore</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2013</year>
          (pp.
          <fpage>460</fpage>
          -
          <lpage>477</lpage>
          ). Springer. (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thanapalasingam</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salatino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birukou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Smart Book Recommender: A Semantic Recommendation Engine for Editorial Products</article-title>
          .
          <source>In International Semantic Web Conference</source>
          <year>2017</year>
          (
          <article-title>Poster Track)</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bryl</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birukou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eckert</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kessler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>What's in the proceedings? Combining publisher's and researcher's perspectives</article-title>
          .
          <source>In SePublica</source>
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>