<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Analysis of User-generated Content for Improving YouTube Video Recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Michele Galli</string-name>
          <email>michele.galli@dia.uniroma3.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Davide Feltoni Gurini</string-name>
          <email>feltoni@dia.uniroma3.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabio Gasparetti</string-name>
          <email>gaspare@dia.uniroma3.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Micarelli</string-name>
          <email>micarel@dia.uniroma3.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giuseppe Sansonetti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Roma Tre University Via della Vasca Navale 79 - Rome</institution>
          ,
          <addr-line>00146</addr-line>
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <abstract>
        <p>Everyday video-sharing websites such as YouTube collect large amounts of new multimedia resources. Comments left by viewers often provide valuable information to describe sentiments, opinions and tastes of users. For this reason, we propose a novel re-ranking approach that takes into consideration that information in order to provide better recommendations of related videos. Early experiments indicate an improvement in the recommendation performance.</p>
      </abstract>
      <kwd-group>
        <kwd>Recommender systems</kwd>
        <kwd>Web 2</kwd>
        <kwd>0</kwd>
        <kwd>YouTube</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>YouTube is the world's most popular web video
community used by 1 billions unique users world wide each month1.
Four billions of videos are viewed per day, with 100 hours of
new ones uploaded every minute. Sifting through this large
repository of multimedia resources poses unique challenges
for the user.</p>
      <p>
        The YouTube user interface provides, given the current
video lid, a list of recommendations as shown in Fig. 1.
YouTube selects those recommendations based on an
algorithm that considers signals from a variety of sources
including the user's favorite, watched and liked videos [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
These signals are combined for ranking the list of related
videos compiled by monitoring what other people usually
watch next. By exploring this related-video graph, a
candidate list is built. Characteristics about the videos (e.g.,
(Acviews and ratings) and the similarities of the videos with
the history of videos watched by the user are combined to
rank the candidate resources. A trade-o between relevance
and diversity across categories builds up the related video
list Lid = (l1; l2; : : : ; ln). As a result, the user-generated
comments that are shown below the video are not taken
into consideration. Although these user interactions are
often short and noisy, they have the chance to represent
valuable information about user interests, tastes and, more in
general, debate topics about the videos.
      </p>
      <p>Related video lists can host a large number of suggestions,
i.e., up to 40. Our hypothesis is that two videos may be
related if they give rise to similar reactions and sentiments
from viewers. This sort of implicit relationship between
multimedia resources might improve the original YouTube
ranking in a way that better matches the user expectations. In
this paper we propose a re-ranking method that, for each
video, generates a new ordered list of videos proposed by
the YouTube traditional recommender.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>THE PROPOSED VIDEO RECOMMEN</title>
    </sec>
    <sec id="sec-3">
      <title>DATION</title>
      <p>Given the lid video, the YouTube Data API2 allows us to
retrieve up to 1000 comments Clid = fc1; c2; g.The API
provides us also the top 25 related videos. We lter too short
2https://developers.google.com/youtube/ (Accessed: 2 July
2015)
comments and the ones with obscene or profane language.
A Bayesian classi er trained on a subset of spam comments
help us to lter out the less relevant content.</p>
      <p>
        A keyword-based approach [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] identi es the words that
express a sentiment, assigning them a score in [0; 1] to each
of the following dimensions: positivity, negativity, and
objectivity. In particular, given a comment ci 2 Clid we sum
up all the positivity scores and then subtract the negativity
ones. The obtained normalized real value is encoded in a
categorical feature by linearly discretizing it to 5 intervals
so that each comment is assigned to one of the following
classes: very positive, positive, neutral, negative, very
negative. Those classes are also the ve dimensions of a vector
space model, where the sentiment vector:
vl(is!ds) = (v1;id; v2;id; v3;id; v4;id; v5;id)
(1)
is calculated by summing up the occurrences of the very
positive classes for the dimension v1;id, positive occurrences for
v2;id, neutral occurrences for v3;id, and so forth. The same
procedure is followed for each video lj 2 Lid by analyzing
the set of comments associated with lj . We obtain n vectors
v(s!s) that can be compared by means of a cosine similarity
lj
measure with vl(is!ds). The related video lj will thus have a
sentiment-based similarity ri(ds;sj) 2 [0; 1].
      </p>
      <p>A second step extracts named entities (e.g., persons,
locations) and nouns from each comment by means of the
Stanford Named-entity recognizer and Part-of-Speech
tagger, respectively. As with the previous procedure, two
vectors, vl(jn!e) and vl(jpo!s), are obtained for each video lj in Li(dy)
by summing up the contribution of the di erent comments.
The two vectors v(n!e) and v(po!s) are also computed. The
lid lid
dimensions of the vectors are distinct named entities and
nouns that appear in the analyzed user-generated data. A
cosine similarity measure assigns the scores ri(dn;ej) and ri(dp;ojs)
between lid and lj videos, respectively, for the named entity
and noun comparisons.</p>
      <p>The last step calculates the nal rank for the video j by
linearly combining the three measures:
rid;j = 1ri(ds;sj) + 2ri(dn;ej) + 3ri(dp;ojs)
(2)
where the three</p>
      <p>values are set to the 0 constant.</p>
    </sec>
    <sec id="sec-4">
      <title>EVALUATION</title>
      <p>A total of 8 persons were involved, mostly students of CS
courses, all usual users of the YouTube service. A Java
application has been developed to assist them during the
evaluation. We asked them to select 10 videos V = fv1; : : : ; v10g
from their watched history, the recommendations on the
YouTube homepage or the subscribed channels. For each
video vi 2 V the application obtains its related YouTube
videos Lvi . A new ordered list L0vi is built by downloading
the comments and running the proposed approach on them.
A randomized list is proposed to each user that was asked to
evaluate her interests in watching each single video with a
ve-level Likert scale. The Normalized discounted
cumulative gain (nDCG) is evaluated both for the YouTube list Lvi
and the new ranked one L0vi . After computing the measure
for each video we averaged them to obtain an overall
performance evaluation. The YouTube recommender obtains a
nDCG of 0.829 while the proposed approach reaches 0.858
with an improvement of 3.51% (p-value&lt;0.05).
4.</p>
    </sec>
    <sec id="sec-5">
      <title>RELATED WORKS</title>
      <p>
        To the best of our knowledge, our work makes the rst
attempt to analyze user comments in the video
recommendation domain. Shmueli et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] analyze users' co-commenting
patterns for predicting, for a given user, suitable news stories
that she likely comment on. A similar approach is focused
on the news recommendation by Messenger and Whittle [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Sergiu et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] explore the e ectiveness of comments and
other social signals for the video retrieval task, that is, when
a user query must be elaborated.
5.
      </p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSIONS AND FUTURE WORK</title>
      <p>Whereas the obtained bene ts in the re-rank of YouTube
related videos is limited, the statistical signi cance of
ndings let us think that a textual comment mining approach
should be considered for future investigations. Much of the
computation can be implemented o ine, while the basic
cosine similarity calculus has limited complexity.</p>
      <p>
        More experiments are undergoing to better understand
the relationship between the kinds of opinions and
sentiments expressed by the users and the categories of the videos.
By collecting a large training dataset, it is possible to
dynamically assign di erent weights to the three parameters
of Eq. 2. Temporal dimension is a further element to
consider [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. There are many videos for which YouTube is not
able to compute a reliable set of related videos due to the
scarcity of user activities. It is interesting to understand
if the proposed approach can be successfully implemented
even for new videos that have collected a right number of
comments, partially addressing the data-sparsity issue due
to the scarcity of user activity records.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Arru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Feltoni</given-names>
            <surname>Gurini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Micarelli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Sansonetti</surname>
          </string-name>
          .
          <article-title>Signal-based user recommendation on twitter</article-title>
          .
          <source>In Proc. of WWW '13</source>
          , pages
          <fpage>941</fpage>
          {
          <fpage>944</fpage>
          , Republic and Canton of Geneva, Switzerland,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Baccianella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Esuli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          .
          <article-title>Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining</article-title>
          .
          <source>In N. Calzolari</source>
          and et al., editors,
          <source>LREC. European Language Resources Association</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chelaru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Orellana-Rodriguez</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Altingovde.</surname>
          </string-name>
          <article-title>How useful is social feedback for learning to rank youtube videos</article-title>
          ? World Wide Web,
          <volume>17</volume>
          (
          <issue>5</issue>
          ):
          <volume>997</volume>
          {
          <fpage>1025</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Liebald</surname>
          </string-name>
          , J. Liu,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nandy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. V.</given-names>
            <surname>Vleet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Gargi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lambert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Livingston</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Sampath</surname>
          </string-name>
          .
          <article-title>The youtube video recommendation system</article-title>
          .
          <source>In Proc. of RecSys'10</source>
          , pages
          <fpage>293</fpage>
          {
          <fpage>296</fpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Messenger</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Whittle</surname>
          </string-name>
          .
          <article-title>Recommendations based on user-generated comments in social media</article-title>
          . In Privacy, Security,
          <article-title>Risk and Trust (PASSAT) and 2011 IEEE SocialCom</article-title>
          , pages
          <volume>505</volume>
          {
          <fpage>508</fpage>
          ,
          <string-name>
            <surname>Oct</surname>
          </string-name>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Shmueli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kagian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Lempel</surname>
          </string-name>
          .
          <article-title>Care to comment?: Recommendations for commenting on news stories</article-title>
          .
          <source>In Proc. of WWW '12</source>
          , pages
          <fpage>429</fpage>
          {
          <fpage>438</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>