<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploiting FrameNet for Content-Based Book Recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Orphée De Clercq Michael Schuhmacher</string-name>
          <email>michael@informatik.uni-</email>
          <email>orphee.declercq@ugent.be</email>
          <email>orphee.declercq@ugent.be michael@informatik.unimannheim.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simone Paolo Ponzetto</string-name>
          <email>simone@informatik.uni-</email>
          <email>simone@informatik.unimannheim.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Véronique Hoste</string-name>
          <email>veronique.hoste@ugent.be</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>LT3, Language and Research Group Data and, Translation Technology Team Web Science, Ghent University University of Mannheim</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LT3, Language and, Translation Technology Team, Ghent University</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Research Group Data and</institution>
          ,
          <addr-line>Web Science</addr-line>
          ,
          <institution>University of Mannheim</institution>
        </aff>
      </contrib-group>
      <fpage>13</fpage>
      <lpage>20</lpage>
      <abstract>
        <p />
      </abstract>
      <kwd-group>
        <kwd>Content-Based Recommender Systems</kwd>
        <kwd>Semantic Frame</kwd>
        <kwd>Linked Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Adding semantic knowledge to a content-based recommender
helps to better understand the items and user
representations. Most recent research has focused on examining the
added value of adding semantic features based on structured
web data, in particular Linked Open Data (LOD). In this
paper, we focus in contrast on semantic feature
construction from text, by incorporating features based on semantic
frames into a book recommendation classifier. To this
purpose we leverage the semantic frames based on parsing the
plots of the items under consideration with a
state-of-theart semantic parser. By investigating this type of
semantic information, we show that these frames are also able to
represent information about a particular book, but without
the need of having explicitly structured data describing the
books available. We reveal that exploiting frame
information outperforms a basic bag-of-words approach and that
especially the words relating to those frames are beneficial
for classification. In a final step we compare and combine
our system with the LOD features from a system
leveraging DBpedia as knowledge resource. We show that both
approaches yield similar results and reveal that combining
semantic information from these two di↵ erent sources might
even be beneficial.</p>
    </sec>
    <sec id="sec-2">
      <title>Categories and Subject Descriptors</title>
      <p>Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwaisuet,htoors.
Copyright 2014 for the individual papers by the paper’s
republish, to post on servers or to redistribute to lists, requires prior specific
Copying permitted for private and academic purposes. This volume is
permission and/or a fee.</p>
      <p>CpuBbRlieschSeyds 2an01d4c,oOpcytroibgehrte6d, 2b0y1i4t,sSeidliictoonrsV.alley, CA, USA.</p>
      <p>CoBpRyercigShyts22001144b,yOtchteoabuetrh6o,r(2s0).14, Silicon Valley, CA, USA.
1.</p>
    </sec>
    <sec id="sec-3">
      <title>INTRODUCTION</title>
      <p>Recommender systems are omnipresent online and
constitute a significant part of the marketing strategy of various
companies. In recent years, a lot of advances have been made
in constructing collaborative filtering systems, whereas the
research on content-based recommenders had lagged
somewhat behind. Similar to evolutions in information retrieval
research, the focus has been more on optimizing tools and
finding more sophisticated techniques leveraging for
example big data than on the actual understanding or processing
of the items or text at hand.</p>
      <p>In Natural Language Processing (NLP), on the other hand,
huge advances have been made in processing text both from
a lexical and semantic perspective. In this respect, we
believe it is important to test whether a content-based
recommender system might actually benefit from plugging in more
semantically enriched text features, which is the purpose of
the current research. In this paper we wish to investigate
to what extent leveraging semantic frame information can
help in recommending books to users. We chose to work
with books, since these typically contain a chronological
description of certain actions or events which might be
indicative for the interests of a particular reader. Someone
might enjoy reading historical novels, for example, but is
more prone to those novels where a love history is explained
in closer detail than those where a typical revenge story is
portrayed. We hypothesize that the semantic frames and or
events in these two types of historical novels will be di↵ erent.
In other words, we wish to investigate to what extent deep
semantic parsing of the plots describing a book following the
FrameNet paradigm can help for recommendation.</p>
      <p>In order to validate these claims we performed an
extensive analysis on a book recommendation dataset which was
provided in the framework of the 2014 ESWC challenge.
What is particularly interesting about this dataset is that
all the books have been mapped to their corresponding
DBpedia URIs which allows us to directly compare externally
gained semantic information as available in the Linked Open
Data cloud (LOD) with internal semantic information based
on the plots themselves. Our analysis reveals that although
some frames and events are good indicators of genres derived
from external DBpedia information, they do represent some
additional information which might help the
recommendation process.</p>
      <p>To actually verify this finding we test the added value
of incorporating frame information as semantic features in
a basic recommender system. We see that exploiting this
kind of semantic information outperforms a standard
bagof-words unigram baseline and that incorporating frame
elements and lexical units evoking the frames allows for the
best overall performance. If we compare our best system
to a system levering semantic LOD information, we observe
that our frames approach is not able to outperform this
system. We do find, however, that if we combine these two
semantic information sources into one system we get the
best overall performance. This might indicate that
combining semantic information from die↵rent sources, i.e. from
the linguistically grounded implicit frame features and the
explicit, ontology grounded DBpedia features, is beneficial.</p>
      <p>The remainder of this paper is structured as follows. In
Section 2 we describe some related work with an explicit
focus on the added value of semantic information for
recommender systems. In Section 3 we then explain in closer detail
the construction and reasoning behind the semantic
frameenhancement. We then continue by describing the actual
experimental setup (Section 4) and have a closer analysis
of the results (Section 5). We finish with some concluding
remarks and ideas for future work (Section 6).
2.</p>
      <sec id="sec-3-1">
        <title>RELATED WORK</title>
        <p>
          In content-based recommender systems, the items to be
recommended are represented by a set of features based on
their content, whereas a user is represented by his profile.
To build a recommender both information sources are
compared. Most content-based recommenders use quite simple
retrieval models, such as keyword matching or the vector
space model with basic TF-IDF weighting [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. A
problem with these models is that they tend to ignore semantic
information. To overcome this one can use Explicit
Semantic Analysis (ESA) [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] instead of TF-IDF weighting which
allows to represent a document as a weighted vector of
concepts. Another way to add more linguistic knowledge is to
use for example information from Wordnet as done by [
          <xref ref-type="bibr" rid="ref3 ref6">6, 3</xref>
          ].
An alternative is to use language models to represent
documents. This was done for example by [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] when exploring
content-based filtering of calls for papers. Besides retrieval
models, machine learning techniques where a system learns
the user profile and classifies items as interesting or not are
also used for content-based recommenders. One of the first
to do this was [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] using a Na¨ıve Bayes classifier.
        </p>
        <p>
          When it comes to adding semantic information to
recommender systems we see that currently leveraging Linked
Open Data (LOD) is a popular research strand. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] and
[
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] were among the first to use LOD for recommendation.
The former use this information to build open recommender
systems whereas the latter built a music recommender using
collaborative filtering techniques. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] was the first to really
leverage LOD to build a content-based recommender and
the first to exploit the semantics of the relations in the link
hierarchy. They use LOD information from DBpedia,
Freebase and LinkedMDB as the only background knowledge for a
movie recommender system and show that thanks to this
ontological information the quality of a standard content-based
system can be improved. In more recent work, the
semantic item descriptions based on LOD have been merged with
positive implicit feedback in a graph-based representation
to produce a hybrid top-N item recommendation algorithm,
SPrank [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], which further underlines the added value of this
kind of data. Moreover, in 2014 in order to spark research
on LOD and content-based recommender systems, a shared
task was organized by the same authors, i.e. the ESWC-14
Challenge1.
        </p>
        <p>
          In content-based recommendation, the advances that have
been made were made possible thanks to the availability of
designated datasets. These include data for predicting
music, Last.FM2, and or movies, MovieLens3. Up till now little
research has been performed on other genres, such as books.
The ESWC challenge, however, made a book
recommendation dataset available which is mapped to DBpedia.
DBpedia is a crowd-sourced community eo↵rt to extract
structured information from Wikipedia and makes them available
as linked RDF data [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. This dataset will be used as our
main data source. In this paper, we focus on the feature
construction for a classifier in that we also incorporate
semantic features based on the semantic frames present within
the items to be recommended. This is, to our knowledge,
the first approach that tries to leverage this kind of data and
is one way of tackling the issue of Limited Content Analysis
within recommender systems [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. In order to validate these
claims we will compare and combine our best system with a
system exploiting LOD.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3. FRAME-ENHANCEMENT</title>
        <p>In this section we give some more information about why
we believe exploiting frame information might help with
recommendations. First, we introduce some basic concepts and
theory after which we explain how we apply a
state-of-theart semantic frame parser to our dataset and provide a first
analysis. We hypothesize that a plot description tells more
about a book than using more global semantic classification
based on external semantic information as provided by the
LOD cloud. This reasoning can be transferred to other data
sources having a large number of textual information.
1http://challenges.2014.eswc-conferences.org/
index.php/RecSys
2http://labrosa.ee.columbia.edu/millionsong/
3http://grouplens.org/datasets/movielens/</p>
      </sec>
      <sec id="sec-3-3">
        <title>Frame semantics and FrameNet</title>
        <p>
          Following the basic assumption that the meanings of most
words can best be understood on the basis of a
semantic frame, FrameNet [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] was developed as a linguistic
resource storing considerable information about lexical and
predicate-arguments semantics in English.
        </p>
        <p>
          FrameNet is grounded in the theory of frame semantics [
          <xref ref-type="bibr" rid="ref7 ref8">7,
8</xref>
          ]. This theory tries to describe the meaning of a sentence
by characterizing the background knowledge required to
understand this sentence. This knowledge is presented in an
idealized, i.e. prototypical, form. A frame is thus a
structured representation of a concept. It can be a description
of a type of event, relation or entity, and the participants
in it. In Table 1 we present an example of such a frame,
KILLING. We see it is a semantic class containing various
predicates, also known as lexical units (LUs), evoking the
described situation, e.g. killer, murder, lethal. Moreover,
it illustrates that within FrameNet each frame comes with
a set of semantic roles, i.e. frame elements (FEs), which
can be perceived as the participants and/or properties of a
frame which are of course also lexicalized in the text itself,
e.g. Killer: John, Instrument: with only a pocketknife.
        </p>
        <p>
          FrameNet’s latest release (1.5) contains 877 frames and
about 155K exemplar sentences.4 An interesting aspect of
the FrameNet lexicon is that asymmetric frame relations can
relate two frames, thus forming a complex hierarchy
containing both is-a like and non-hierarchical relations [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. In this
work, we are particularly interested in the former type, also
known as Inheritance relations. This type of relation entails
that the child frame is a subtype of the parent frame. If we
look for instance at our Killing example, of which the
taxonomy is visualized in Figure 15, we are able to find out that
this frame is a child of the frame Transitive action, which is
in turn a child of both the frame Objective Influence and,
more interestingly, the frame Event. This taxonomy thus
enables us to find even more semantic properties about
specific frames.
3.2
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>Exploiting FrameNet</title>
        <p>3.2.1</p>
        <p>Book dataset</p>
        <p>For the research described in this paper, we worked with
the dataset of the ESWC challenge which is in fact a
re4This release is available at http://framenet.icsi.
berkeley.edu
5This graph was produced using the FrameGrapher
tool, https://framenet.icsi.berkeley.edu/fndrupal/
FrameGrapher</p>
        <p>The [Prince], the protagonist, is [named] Alexander.</p>
        <p>His [father], [Prince] Baudouin, is [murdered] by
T the [King] of Cornwall, [King] [March]. [When]
L Alexander [comes] of [age], he [sets out] to Camelot
O
P to [seek] justice from [King] Arthur and to [avenge]
the [death] of his [father]....</p>
        <p>
          Leadership, Appointing, Kinship, Leadership, Killing,
SE Leadership, Leadership, Calendric unit,
M Temporal collocation,Arriving, Calendric unit,
RA Departing,Seeking to achieve, Leadership, Revenge,
F Death, Kinship.
elaborated version of the LibraryThing dataset6. This dataset
contains books that are part of a particular user’s online
catalog containing the books he/she has read or owns. For
the challenge, the books available in the dataset have been
mapped to their corresponding DBpedia URIs [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Based
on the available information we were able to download the
plot description of each book from its corresponding Wikipedia
page (this plot information is lacking in DBpedia). In this
way we envisaged to investigate whether knowing more about
what is actually happening in a book can enhance the
recommendation. We worked with a subset by only including
books of which a uniform and unambiguous DBpedia link
was available and that actually contained plot information
on Wikipedia. In total our final dataset contains 5,063 books
with an average plot length of 312 words7.
        </p>
        <p>
          In order to annotate the semantic frames, each plot was
parsed using the state-of-the-art frame-semantic parser
SEMAFOR [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. This parser extracts semantic
predicateargument structures from text using a statistical model and
is trained on the FrameNet 1.5 release. It takes as input the
text as such, performs some preprocessing steps and outputs
on a sentence-per-sentence basis all frames that are present
within a text. These frames are represented by one of the 877
possible frame names and also the lexical units and frame
elements (both generic and lexicalized form) are output. An
example is presented in Table 2. This is the plot description
of the book The Prince and the Pilgrim. In the text itself,
the lexical units evoking the frames are indicated in square
brackets. The frames and LUs which are represented in bold
are those frames which actually constitute an Event.
Finding out which books are events can be done by exploiting
the taxonomy (cfr. supra) which enables us in a way to find
out more semantic properties of specific frames. Intuitively,
we can state that especially those Event frames give most
information about what is happening within a book: the
above-mentioned book is clearly a revenge story. However,
the other frames might also pinpoint important aspects, e.g.
the repetition of the Leadership and Kinship frames could
inform us that this novel is about royalty and family.
        </p>
        <p>What this example also illustrates is that the SEMAFOR
parser is not 100% accurate. For example, the name of a
particular king – King March – is interpreted by the parser
as evoking the frame Calendric unit. We should thus keep
6http://www.macle.nl/tud/LT/
7This dataset will also be made available to the research
community in due time
in mind that a certain amount of noise is also introduced
into our dataset. Moreover, some frames such as Arriving
or Temporal collocation, are correctly labeled but do not
really contribute interesting semantic information.</p>
        <p>For all books in our dataset we parsed the plots using
SEMAFOR, after which we also filtered out those frames
which can have the Event frame as a parent. Some data
statistics regarding these annotations are presented in
Table 3, which reveal that the information we have available is
rather skewed.</p>
        <p>As previously mentioned, we hypothesize that using frames
might represent di↵erent information than using semantic
information represented in the LOD-cloud. The books dataset
we have at hand is particularly useful to verify this claim
since all books have been mapped to their DBpedia URIs.</p>
        <p>
          In order to do so, we relied on a manual subdivision of
all books in genres based on LOD. This classification was
made by [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] by parsing the abstract (dbo: abstract),
the genre ( dbo:literaryGenre, dbp:genre) and the subject
(dcterms: subject) of each book against a regular
expression pattern of thirty distinct genres. The authors performed
this step to allow for more data coverage. However, by doing
so they also made a combination of various LOD
information categories which enables us to directly compare these
with our semantic frames. If we have a look at our running
example, The Prince and the Pilgrim,8, we notice that this
book is classified under the Fantasy genre.
        </p>
        <p>
          Based on this genre mapping, we calculated the gain
ratio [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] of our semantic frames representation with relation
to the genres, thus considering the frames as features
allowing to do genre classification. These gain ratios can then
be observed as feature weights, and ranked according to the
amount of information they add to discriminating between
the thirty possible genres. We start our analysis by first
only considering the semantic frame annotations. It became
apparent, however, that it might be more interesting to also
closer inspect those frames which are Events since these
intuitively better represent what is actually happening.
        </p>
        <p>The result of these analyses in presented in Table 4.
Because of space constraints, we only represent the five genres
representing most books of our dataset. This table each time
contains the ten top features (frames and events), i.e. those
with the highest gain ratio. The cell colour represents the
manual analysis, indicated in light grey are those frames and
events occurring only within one particular genre. In darker
grey the frames and events which are representative for a
specific genre are indicated. Regarding the frames, we see
that it is more dicult to find distinctive features correlating
with the genre (light grey). In the upper part, only the
Science Fiction and Crime genre contain truly representative
8http://dbpedia.org/resource/The Prince and the Pilgrim
frames based on our manual analysis (dark grey). If we go
to the level of the Events, we see that this already allows for
finding more unique events per genre. Again, the Science
Fiction and Crime genre are best represented. When we
had a closer look at other discriminating features we found
the same tendency. In the Crime genre, for example, other
Events such as Verdict, Revenge, Execution, Robbery all
appeared within the top twenty features.</p>
        <p>From this analysis we could deduce that both the frames
and events might deliver the same type of information as the
LOD, with the events being more representative. However,
what becomes clear is that the frames also contribute more
information. They can represent what is happening within
a book. If we again consider our running example (cfr.
Table 2), which is classified as Fantasy, we feel that enriching
a recommender with semantic frame, and especially with
event information, might account for a better
recommendation. This brings us to the actual experiments.
4.</p>
      </sec>
      <sec id="sec-3-5">
        <title>EXPERIMENTS</title>
        <p>For our experiments we focus on the generation of new,
semantic features. In our experimental setting we aim to
evaluate the contribution of those features and thus do not
explicitly focus on engineering towards a top
recommendation performance.
4.1</p>
      </sec>
      <sec id="sec-3-6">
        <title>Experimental Set-Up and Evaluation</title>
        <p>
          We opt to add our semantic features to an existing
recommender system [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], which participated, and performed well,
in the ESWC’14 Challenge. Though we do apply feature
weighting and feature selection as described below, the
overall item classification and collaborative-filtering elements of
the base system remain unchanged. This allows us to
directly compare the predictive power of the frame-based
features with the DBpedia-based features used by the original
system, in particular as both approaches are die↵rent
utilizations of the same information source, i.e. Wikipedia,
and dataset, i.e. the ESWC RecSys Challenge data.
        </p>
        <p>We use a reduced version of the dataset, based on a
filtering of the 5,063 books that were retained as having sucient
plot information available (Section 3.2). This dataset has
binary ratings and consists of 53,665 user-item-rating triples
(6,162 users, 4,251 items) in the training data and 50,654
triples (6,180 users, 4,311 items) in the evaluation dataset.</p>
        <p>Even though this is a binary classification task, we opt to
output the positive class likelihood and not the final binary
classification in order to avoid making a decision about the
cut-o↵ for the likelihood values. Consequently, we evaluate
with root-mean-squared error (RMSE) to capture also the
degree of confidence between the classification and the
goldstandard test dataset9. RMSE is calculated as:
v</p>
        <p>m
RM SE = tuu m1 Xi=1(Xi
xi)2
in which Xi is the prediction and xi the response value,
i.e. the correct value for the task at hand, and m is the
number of items for which a prediction is made. Speaking
in practical terms, the lower the RMSE value the better,
9Obtained from the ESWC’14 Challenge Chairs upon
request.
because the closer the prediction confidence to the actual
gold standard.</p>
        <p>
          In addition, again motivated by wanting to avoid to choose
a cut-o↵ point for the class assignment, we follow [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] and
evaluate with a receiver operating characteristic (ROC) curve
and also compute the area under the curve (AUC) for it.
While in contrast to RMSE, the ROC curve is computed
only on the relative ordering of the predictions sorted by
confidence values, it oe↵rs the advantage of understanding how
a classifier would perform given die↵rent cut-o↵ values. In
addition, with ROC we can compare against recommender
systems that output only an (implicit) ranking and no class
confidence values.
        </p>
        <p>
          The base system by [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] we extend is a simple
contentbased recommender which trains two Na¨ıve Bayes
classifiers10 on book features acquired from DBpedia, one global
classifier as background model and one per-user classifier to
capture individual preferences, trained on a user-neighborhood
of variable size. In our experiments, we leave this setting
unchanged but only vary the die↵rent features for item
representation. We experimented with five di↵erent feature
representations, which is explained in closer detail in the next
section.
4.2
        </p>
      </sec>
      <sec id="sec-3-7">
        <title>Feature Representation</title>
        <sec id="sec-3-7-1">
          <title>1. Baselines</title>
          <p>First, we established two baselines: the first baseline was
constructed by including the majority class based on the
training data, in our case the majority class is ‘0’. As a
second baseline we decided to include a bag-of-words approach
containing token unigrams from all the di↵erent plots.</p>
          <p>The next three groups of features all relate to the frame
representation of the plots based on the SEMAFOR output
(cfr. Section 3.2)
10Even though being a simple approach, in a preliminary
experiment Na¨ıve Bayes outperformed an SVM, motivating
us to not compare di↵erent classifiers but focus on feature
selection. In addition, Na¨ıve Bayes was – as expected –
significantly faster compared to other classifiers.</p>
        </sec>
        <sec id="sec-3-7-2">
          <title>2. Frames</title>
          <p>For the frames as such, we decided to include the resulting
frame names (e.g. Killing, Kinship, Leadership) as a
separate setting. In total this can lead to a maximum of 877
discriminating features, which is a large feature space shrinkage
compared to the bag-of-words representation. This is why
we decided to also take into consideration those particular
words evoking the frames, the Lexical Units (e.g. murdered,
father, Prince) on the one hand, and the lexical
representations of the Frame Elements – the semantic roles – evoked
by this frame on the other hand (e.g. Prince Baudouin, by
the King of Cronwall, King March). In a final setting, we
incrementally combine these various elements of data, thus
giving more information to our classifier.</p>
        </sec>
        <sec id="sec-3-7-3">
          <title>3. Events</title>
          <p>As was illustrated in Section 3.2 the Events occurring
within a book seem to intuitively represent important
information of what is actually happening. This is why we
also decided to perform the same experiments as with the
frames but, this time only incorporating those frames which
have a possible Event parent somewhere in the FrameNet
hierarchy. Looking only at the Events further reduced our
feature space to a maximum of 234 features. We therefore
also made the same combinations as mentioned above with
all possible LUs and FEs relating only to Events.</p>
        </sec>
        <sec id="sec-3-7-4">
          <title>4. Taxonomy</title>
          <p>
            In order to exploit the hierarchical structure of
FrameNet even further, we decided to also investigate three other
settings. First we explored whether including besides a
frame also its direct parent, thus going one level up in the
graph, might help. We did the same in the other direction,
by only including the children which are at the bottom of
our taxonomy (the leafs). Another way of incorporating this
graph information was to calculate for each possible frame
pair that was found in a plot its least common subsumer [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ]
(LCS), i.e. the parent both frames have in common resulting
in the shortest path. Since the FrameNet taxonomy as such
is not hypercomplex, i.e. the maximum distance between
two frames is twelve, we decided to filter out those parents
which are too generic by manually inspecting the LCS.11
          </p>
          <p>For the four above-mentioned setups, the same feature
selection methods were employed. Of course in order to
allow for a good representation, all word-based features (bow,
LUs and FEs) were first tokenized, stemmed and filtered on
stop words. For the automatic feature selection, we first use
unsupervised feature attribute weighting by computing the
standard TF-IDF weights since all our features are in the
end derived from text (book plots).
Next, we use attribute selection by computing the gain ratio
with relation to the binary class label in the training data:
RG(Attr, Class) = (H(Class)
H(Class|Attr))/H(Attr)
This should allow us to filter out noise or unimportant
features. We keep only those features with a gain ratio larger
than zero (RG &gt; 0).12</p>
        </sec>
        <sec id="sec-3-7-5">
          <title>5. Linked Open Data (LOD)</title>
          <p>In a final setup we compare our best setting with the LOD
features used by the base system, i.e. properties and values
from DBpedia, and apply the same feature weighting and
selection process. The features in the base system were
manually selected and contain explicit book attributes, as e.g.
dbo:author (db:Umberto_Eco), but also categorical
information as dbo:literaryGenre (db:Historical_novel),
dcterms:subject (category:Novels_set_in_Italy) or
rdf:type (yago:PhilosophicalNovels) and untyped Wikipedia
links in general.</p>
          <p>
            We use the same set of features as reported by [
            <xref ref-type="bibr" rid="ref23">23</xref>
            ], but,
to remain consistent across all experimental settings, apply
our feature selection and weighting approach and use our
reduced training and test dataset. In addition, we tested
the combination of the DBpedia features with our
bestperforming frame approach.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-8">
        <title>RESULTS</title>
        <p>We report experimental results for the di↵erent feature
settings in Table 5. Overall, the two best performing frame
features are the Frame elements and the Frames+LUs+FEs,
both achieve an RMSE of 0.6036. We see that the best
result is obtained when making the combination between
the Frame elements and the LOD system, RMSE of 0.5982.
Looking at the AUC for the ROC curve, both features still
perform very well, but not as good as the DBpedia features
alone, which achieve the best overall AUC of 0.5588.</p>
        <p>
          Considering the RMSE values, we observe that the
majority baseline is easily outperformed by all die↵rent
settings. Looking at the bag-of-words baseline, however,
illustrates that having the words of the plot available for
recommendation is already a quite dicult to beat baseline.
11We looked at the most frequent LCS nodes and excluded
the first 10 generic nodes such as Artifact, Relation,
Intentionally a↵ect, Gradable attributes, Transitive action.
12Preliminary experiments revealed that keeping all features
as well as doing classifier-based features selection with OneR
[
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] with 5-fold-cross-validation on the training data
constantly underperformed against this setting.
Contrary to our expectations, our settings with only frames
or events do not outperform this baseline. We do see that
the events as such, which constitute a much smaller feature
space, perform slightly better than the frames. The
bagof-words baseline is only outperformed when using features
actually presenting some sort of word filtering mechanism:
the Frame Elements are the lexical representation of words
which are evoked by certain frames in the form of semantic
roles. Even though these features are extracted from the
text, it performs better than the bag-of-words (Words as
such) baseline approach (0.6145) which does not make use
of any semantic information. Analyzing the RG-ranked
feature attributes revealed that also for the other best frame
approach Frames+LUs+FEs, the dominant attributes are
the Frame elements, these were ranked highest. What is
strange is that we do not find a similar trend when
performing the same combination with our Event frames. This is
probably because the feature space is too small to make a
well-informed decision.
        </p>
        <p>Figure 2 presents the ROC curves for our features, for
the sake of readability only the most interesting curves are
plotted. As to be expected from the AUC values, all curves
are very close together. Besides not being far away from the
diagonal, for no curve a clear cut-o↵ value is recognizable.
We observe that the DBpedia features are slightly better for
the left and partially the middle part of the curve, leading to
the interpretation that those features are superior for
recommender systems which focus on quality. Comparing the best
frames-based approach (FEs) with the bag-of-words baseline
(Words), we see that FEs are mostly better than just words,
with some exception around a false positive rate of around
0.23.</p>
        <p>
          We also compare our system with the hybrid recommender
system from Ristoski at. al. [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] (AUC 0.5848), which was
the second best system of the ESWC challenge and
performed essentially equally well as the winning system. That
system combined many di↵erent features, not only LOD,
but also user ratings and explicit collaborative filtering
approaches.13
13As that system only outputs scores for the purpose of
ranking, we transformed those into confidences by dividing each
e
t
a
r
e
v
iit
s
o
p
e
u
r
T
Looking at the ROC curve, it becomes clear that
incorporating more and diverse features is beneficial in this setting.
However, we have to note that this system combines di↵erent
recommenders using the Borda rank aggregation method,
which was not learned on the training data but manually
selected while having knowledge about the test dataset (see
also the comment below on our own combination model).
        </p>
        <p>If we compare our best semantic frame results with the
systems leveraging Linked Data, we see that we achieve a
better performance (RMSE of 0.6022) when using the
DBpedia features alone and that we get the best overall results
when combining both our best system with the Linked Data
(RMSE of 0.5982). In this way it appears that combining
semantic information from di↵erent sources, i.e. from the
linguistically grounded frames features and the explicit,
ontology grounded DBpedia features, is beneficial in this
setting. The AUC results, however, do not corroborate this
finding.</p>
        <p>
          Last, when not learning LOD+FEs together in one model,
but separately and combine results with a simple linear
combination (these results are presented in brackets in Table 5),
as also done by [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], with = 0.5, we achieve better results
(RMSE of 0.5664 and AUC of 0.5571). However, this
notable improvement depends in the end on our knowledge
of the test dataset, as it influenced our choice of a linear
combination, instead of learning the combination of the
different classifiers on the training data. Strictly speaking, this
is thus not a valid experimental result, nevertheless it
indicates there is most likely a better hybrid design with feature
combinations that will better utilize the semantic frame
features and should yield better results.
        </p>
      </sec>
      <sec id="sec-3-9">
        <title>CONCLUSION</title>
        <p>score by a constant.</p>
        <p>In this paper we have presented an alternative approach
to add semantic information to a content-based book
recommender system. We directly compared the addition of
text internal semantic frame information with text external
ontological information based on Linked Open Data (LOD),
a popular research strand.</p>
        <p>We have shown that parsing the book plots with a
state-ofthe-art semantic frame parser, SEMAFOR, delivers valuable
additional semantic information. This information could
enable a system to fully grasp what is happening within a book.
One of the added values of FrameNet is that all frames are
related in a taxonomy which allows you to pinpoint those
Events forming the key components of a book. Based on a
direct comparison between the frames and events and a list
of genres derived from DBpedia attributes, we have shown
that although these data sources show some similarities, the
semantic frames should be able to represent more specific
information about what is happening in a particular book.</p>
        <p>In order to test this claim in closer detail, we have
performed experiments where the focus was on generating new
semantic features and find out what these can contribute to
a book recommendation system using one global classifier
as background model and one per-user classifier. We see
that exploiting semantic frame information outperforms a
standard bag-of-words unigram baseline and that especially
incorporating frame elements and lexical units evoking the
frames allows for the best overall performance. If we
compare our best system to a system levering semantic LOD
information, we observe that our frames approach is not
able to outperform this system. We do find, however, that
if we combine these two semantic information sources into
one system we get the best overall performance. This might
indicate that combining semantic information from
di↵erent sources, i.e. from the linguistically grounded implicit
frame features and the explicit, ontology grounded DBpedia
features, is beneficial.</p>
        <p>This work has inspired many ideas for future work.
Considering the current setup, we are aware that we completely
relied on the output of one semantic frame parser, i.e.
SEMAFOR. We believe that using a filtering mechanism
beforehand, e.g. to filter out those frames and or events which
are less meaningful or noisy, or that by applying a di↵erent
parser or event extraction techniques new lights can be shed
on the added value of this type of information. Also, since we
now only relied on Wikipedia to extract book information,
we had to reduce an original larger book data. We realize
a lot of additional information about books can be found
online, for example on Google Books, Amazon, GoodReads,
etcetera. Also the same techniques can be used to extract
other types of information from both the items and users
under consideration for the recommendation task.</p>
        <p>As mentioned at the end of Section 5 we would like to
further investigate whether another hybrid design might yield
better results. In this respect, it would be interesting to
plug our semantic knowledge in a collaborative-filtering
approach to see whether this can actually help the overall
performance. Using our semantic frames we could also inspect
in closer detail typical problems recommender systems face
such as cold-start and data sparsity.</p>
      </sec>
      <sec id="sec-3-10">
        <title>Acknowledgments</title>
        <p>The work presented in this paper has been partly funded by
the PARIS project (IWT-SBO-Nr. 110067). Furthermore,
Orph´ee De Clercq was supported by an exchange grant from
the German Academic Exchange Service (DAAD STIBET
scholarship program). We would like to thank Christian
Meilicke for his help in providing the manually derived
genres and his help with building the original ESWC
recommender system.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Basu</surname>
          </string-name>
          , H. Hirsh,
          <string-name>
            <given-names>W.</given-names>
            <surname>Cohen</surname>
          </string-name>
          , et al.
          <article-title>Recommendation as classification: Using social and content-based information in recommendation</article-title>
          .
          <source>In AAAI'98</source>
          , pages
          <fpage>714</fpage>
          -
          <lpage>720</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Billsus</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          .
          <article-title>User modeling for adaptive news access. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>10</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>147</fpage>
          -
          <lpage>180</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Degemmis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <article-title>A content-collaborative recommender that exploits wordnet-based user profiles for neighborhood formation</article-title>
          .
          <source>User Modeling</source>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>17</volume>
          (
          <issue>3</issue>
          ):
          <fpage>217</fpage>
          -
          <lpage>255</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mirizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Romito</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Zanker</surname>
          </string-name>
          .
          <article-title>Linked open data to support content-based recommender systems</article-title>
          .
          <source>In I-SEMANTICS '12 Proceedings of the 8th International Conference on Semantic Systems</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Dipanjan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. F. T.</given-names>
            <surname>Martins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Schneider</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Smith.</surname>
          </string-name>
          <article-title>Frame-semantic parsing</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>40</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>56</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Eirinaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vazirgiannis</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Varlamis.</surname>
          </string-name>
          <article-title>Sewep: using site semantics and a taxonomy to enhance the web personalization process</article-title>
          .
          <source>In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Filmore</surname>
          </string-name>
          .
          <article-title>Frame semantics</article-title>
          .
          <source>Linguistics in the Morning Calm</source>
          , pages
          <fpage>111</fpage>
          -
          <lpage>137</lpage>
          ,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Filmore</surname>
          </string-name>
          .
          <article-title>Frames and the semantics of understanding</article-title>
          .
          <source>Quaderni di Semantica</source>
          ,
          <source>IV(2)</source>
          ,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Filmore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. R.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. R. L.</given-names>
            <surname>Petruck</surname>
          </string-name>
          . Background to framenet.
          <source>International Journal of Lexicography</source>
          ,
          <volume>16</volume>
          (
          <issue>3</issue>
          ):
          <fpage>235</fpage>
          -
          <lpage>250</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Gabrilovich</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Markovitch</surname>
          </string-name>
          .
          <article-title>Computing semantic relatedness using wikipedia-based explicit semantic analysis</article-title>
          .
          <source>In Proceedings of the 20th International Joint Conference on Artifical intelligence</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>B.</given-names>
            <surname>Heitmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Hayes</surname>
          </string-name>
          .
          <article-title>Using linked data to build open, collaborative recommender systems</article-title>
          .
          <source>In AAAI Spring Symposium: Linked Data Meets Artificial Intelligence</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Terveen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Evaluating collaborative filtering recommender systems</article-title>
          .
          <source>ACM Transactions on Information Systems</source>
          ,
          <volume>22</volume>
          (
          <issue>1</issue>
          ):
          <fpage>5</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Holte</surname>
          </string-name>
          .
          <article-title>Very simple classification rules perform well on most commonly used datasets</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>11</volume>
          (
          <issue>1</issue>
          ):
          <fpage>63</fpage>
          -
          <lpage>90</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Isele</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jakob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jentzsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kontokostas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Mendes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hellmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Morsey</surname>
          </string-name>
          , P. van Kleef,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer. DBpedia - A Large-scale</surname>
          </string-name>
          ,
          <article-title>Multilingual Knowledge Base Extracted from Wikipedia</article-title>
          .
          <source>Semantic Web Journal</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gemmis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <article-title>Content-based recommender systems: State of the art and trends</article-title>
          . In F. Ricci,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Shapira</surname>
          </string-name>
          , and P. B. Kantor, editors,
          <source>Recommender Systems Handbook</source>
          , pages
          <fpage>73</fpage>
          -
          <lpage>105</lpage>
          . Springer US,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>G.</given-names>
            <surname>Mart</surname>
          </string-name>
          <article-title>´ın</article-title>
          , S. Schockaert,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cornelis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Naessens</surname>
          </string-name>
          .
          <article-title>An exploratory study on content-based filtering of call for papers</article-title>
          .
          <source>Multidisciplinary Information Retrieval, Lecture Notes in Computer Science</source>
          ,
          <volume>8201</volume>
          :
          <fpage>58</fpage>
          -
          <lpage>69</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>V.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Mirizzi</surname>
          </string-name>
          .
          <article-title>Top-n recommendations from implicit feedback leveraging linked open data</article-title>
          .
          <source>In Proceedings of RecSys</source>
          <year>2013</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Passant</surname>
          </string-name>
          .
          <article-title>dbrec: music recommendations using dbpedia</article-title>
          .
          <source>In Proceedings of the 9th International Semantic Web Conference (ISWC'10)</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Quinlan</surname>
          </string-name>
          .
          <source>C4</source>
          .
          <article-title>5: Programs for Machine Learning</article-title>
          . Morgan Kaufmann, San Mateo, California,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>P.</given-names>
            <surname>Resnik</surname>
          </string-name>
          .
          <article-title>Using information content to evaluate semantic similarity in a taxonomy</article-title>
          .
          <source>In Proceedings of the 14th international joint conference on Artificial intelligence (IJCAI'95)</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ristoski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. L.</given-names>
            <surname>Mencia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Paulheim</surname>
          </string-name>
          .
          <article-title>A Hybrid Multi-Strategy Recommender System Using Linked Open Data</article-title>
          .
          <source>In LOD-enabled Recommender Systems Challenge (ESWC</source>
          <year>2014</year>
          ),
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>J.</given-names>
            <surname>Ruppenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ellsworth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R. L.</given-names>
            <surname>Petruck</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. R.</given-names>
            <surname>Johnson</surname>
          </string-name>
          .
          <article-title>Framenet ii: Extended theory and practice</article-title>
          .
          <source>Technical report</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M.</given-names>
            <surname>Schuhmacher</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Meilicke</surname>
          </string-name>
          .
          <article-title>Popular Books and Linked Data: Some Results for the ESWC-14 RecSys Challenge</article-title>
          .
          <source>In LOD-enabled Recommender Systems Challenge (ESWC</source>
          <year>2014</year>
          ),
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>