1. INTRODUCTION

Analysis of User-generated Content for Improving YouTube Video Recommendation

Michele Galli

michele.galli@dia.uniroma3.it 0

Davide Feltoni Gurini

feltoni@dia.uniroma3.it 0

Fabio Gasparetti

gaspare@dia.uniroma3.it 0

Alessandro Micarelli

micarel@dia.uniroma3.it 0

Giuseppe Sansonetti

0 0 Roma Tre University Via della Vasca Navale 79 - Rome , 00146 Italy

2015

Everyday video-sharing websites such as YouTube collect large amounts of new multimedia resources. Comments left by viewers often provide valuable information to describe sentiments, opinions and tastes of users. For this reason, we propose a novel re-ranking approach that takes into consideration that information in order to provide better recommendations of related videos. Early experiments indicate an improvement in the recommendation performance.

Recommender systems Web 2 0 YouTube

1. INTRODUCTION

YouTube is the world's most popular web video community used by 1 billions unique users world wide each month1. Four billions of videos are viewed per day, with 100 hours of new ones uploaded every minute. Sifting through this large repository of multimedia resources poses unique challenges for the user.

The YouTube user interface provides, given the current video lid, a list of recommendations as shown in Fig. 1. YouTube selects those recommendations based on an algorithm that considers signals from a variety of sources including the user's favorite, watched and liked videos [ 4 ]. These signals are combined for ranking the list of related videos compiled by monitoring what other people usually watch next. By exploring this related-video graph, a candidate list is built. Characteristics about the videos (e.g., (Acviews and ratings) and the similarities of the videos with the history of videos watched by the user are combined to rank the candidate resources. A trade-o between relevance and diversity across categories builds up the related video list Lid = (l1; l2; : : : ; ln). As a result, the user-generated comments that are shown below the video are not taken into consideration. Although these user interactions are often short and noisy, they have the chance to represent valuable information about user interests, tastes and, more in general, debate topics about the videos.

Related video lists can host a large number of suggestions, i.e., up to 40. Our hypothesis is that two videos may be related if they give rise to similar reactions and sentiments from viewers. This sort of implicit relationship between multimedia resources might improve the original YouTube ranking in a way that better matches the user expectations. In this paper we propose a re-ranking method that, for each video, generates a new ordered list of videos proposed by the YouTube traditional recommender. 2.

THE PROPOSED VIDEO RECOMMEN DATION

Given the lid video, the YouTube Data API2 allows us to retrieve up to 1000 comments Clid = fc1; c2; g.The API provides us also the top 25 related videos. We lter too short 2https://developers.google.com/youtube/ (Accessed: 2 July 2015) comments and the ones with obscene or profane language. A Bayesian classi er trained on a subset of spam comments help us to lter out the less relevant content.

A keyword-based approach [ 2 ] identi es the words that express a sentiment, assigning them a score in [0; 1] to each of the following dimensions: positivity, negativity, and objectivity. In particular, given a comment ci 2 Clid we sum up all the positivity scores and then subtract the negativity ones. The obtained normalized real value is encoded in a categorical feature by linearly discretizing it to 5 intervals so that each comment is assigned to one of the following classes: very positive, positive, neutral, negative, very negative. Those classes are also the ve dimensions of a vector space model, where the sentiment vector: vl(is!ds) = (v1;id; v2;id; v3;id; v4;id; v5;id) (1) is calculated by summing up the occurrences of the very positive classes for the dimension v1;id, positive occurrences for v2;id, neutral occurrences for v3;id, and so forth. The same procedure is followed for each video lj 2 Lid by analyzing the set of comments associated with lj . We obtain n vectors v(s!s) that can be compared by means of a cosine similarity lj measure with vl(is!ds). The related video lj will thus have a sentiment-based similarity ri(ds;sj) 2 [0; 1].

A second step extracts named entities (e.g., persons, locations) and nouns from each comment by means of the Stanford Named-entity recognizer and Part-of-Speech tagger, respectively. As with the previous procedure, two vectors, vl(jn!e) and vl(jpo!s), are obtained for each video lj in Li(dy) by summing up the contribution of the di erent comments. The two vectors v(n!e) and v(po!s) are also computed. The lid lid dimensions of the vectors are distinct named entities and nouns that appear in the analyzed user-generated data. A cosine similarity measure assigns the scores ri(dn;ej) and ri(dp;ojs) between lid and lj videos, respectively, for the named entity and noun comparisons.

The last step calculates the nal rank for the video j by linearly combining the three measures: rid;j = 1ri(ds;sj) + 2ri(dn;ej) + 3ri(dp;ojs) (2) where the three

values are set to the 0 constant.

EVALUATION

A total of 8 persons were involved, mostly students of CS courses, all usual users of the YouTube service. A Java application has been developed to assist them during the evaluation. We asked them to select 10 videos V = fv1; : : : ; v10g from their watched history, the recommendations on the YouTube homepage or the subscribed channels. For each video vi 2 V the application obtains its related YouTube videos Lvi . A new ordered list L0vi is built by downloading the comments and running the proposed approach on them. A randomized list is proposed to each user that was asked to evaluate her interests in watching each single video with a ve-level Likert scale. The Normalized discounted cumulative gain (nDCG) is evaluated both for the YouTube list Lvi and the new ranked one L0vi . After computing the measure for each video we averaged them to obtain an overall performance evaluation. The YouTube recommender obtains a nDCG of 0.829 while the proposed approach reaches 0.858 with an improvement of 3.51% (p-value<0.05). 4.

RELATED WORKS

To the best of our knowledge, our work makes the rst attempt to analyze user comments in the video recommendation domain. Shmueli et al. [ 6 ] analyze users' co-commenting patterns for predicting, for a given user, suitable news stories that she likely comment on. A similar approach is focused on the news recommendation by Messenger and Whittle [ 5 ]. Sergiu et al. [ 3 ] explore the e ectiveness of comments and other social signals for the video retrieval task, that is, when a user query must be elaborated. 5.

CONCLUSIONS AND FUTURE WORK

Whereas the obtained bene ts in the re-rank of YouTube related videos is limited, the statistical signi cance of ndings let us think that a textual comment mining approach should be considered for future investigations. Much of the computation can be implemented o ine, while the basic cosine similarity calculus has limited complexity.

More experiments are undergoing to better understand the relationship between the kinds of opinions and sentiments expressed by the users and the categories of the videos. By collecting a large training dataset, it is possible to dynamically assign di erent weights to the three parameters of Eq. 2. Temporal dimension is a further element to consider [ 1 ]. There are many videos for which YouTube is not able to compute a reliable set of related videos due to the scarcity of user activities. It is interesting to understand if the proposed approach can be successfully implemented even for new videos that have collected a right number of comments, partially addressing the data-sparsity issue due to the scarcity of user activity records.

[1]

Arru ,

D. Feltoni

Gurini ,

Gasparetti ,

Micarelli , and

Sansonetti . Signal-based user recommendation on twitter . In Proc. of WWW '13 , pages 941 { 944 , Republic and Canton of Geneva, Switzerland, 2013 .

[2]

Baccianella ,

Esuli , and

Sebastiani . Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining . In N. Calzolari and et al., editors, LREC. European Language Resources Association , 2010 .

[3]

Chelaru ,

Orellana-Rodriguez , and I. Altingovde. How useful is social feedback for learning to rank youtube videos ? World Wide Web, 17 ( 5 ): 997 { 1025 , 2014 .

[4]

Davidson ,

Liebald , J. Liu,

Nandy ,

T. V.

Vleet ,

Gargi ,

Gupta ,

He ,

Lambert ,

Livingston , and

Sampath . The youtube video recommendation system . In Proc. of RecSys'10 , pages 293 { 296 , New York, NY, USA, 2010 . ACM.

[5]

Messenger and

Whittle . Recommendations based on user-generated comments in social media . In Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE SocialCom , pages 505 { 508 , Oct 2011 .

[6]

Shmueli ,

Kagian ,

Koren , and

Lempel . Care to comment?: Recommendations for commenting on news stories . In Proc. of WWW '12 , pages 429 { 438 , New York, NY, USA, 2012 . ACM.