<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Regression Approach to Movie Rating Prediction using Multimedia Content and Metadata</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Hossein</forename><forename type="middle">A</forename><surname>Rahmani</surname></persName>
							<email>srahmani@znu.ac.ir</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Zanjan</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yashar</forename><surname>Deldjoo</surname></persName>
							<email>deldjooy@acm.org</email>
							<affiliation key="aff1">
								<orgName type="institution">Politecnico di Bari</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Markus</forename><surname>Schedl</surname></persName>
							<email>markus.schedl@jku.at</email>
							<affiliation key="aff2">
								<orgName type="institution">Johannes Kepler University</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">A Regression Approach to Movie Rating Prediction using Multimedia Content and Metadata</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">853C5E55FF8E0BE5CC639D2A892C1ACA</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T20:15+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper presents the submission of the team MASlab-ZNU to the MMRecSys movie recommendation task, as part of MediaEval 2019. The task involved predicting average movie ratings, standard deviation of ratings, and the number of ratings by using audio and visual features extracted from trailers and the associated metadata. In the proposed work, we model the rating prediction problem as a regression problem and employ different learning models for the prediction task, including ridge regression (RR), support vector regression (SVR), shallow neural network (SNN) and deep neural network (DNN). The results of fairly large amount of experiments on various models and features indicate that combination of DNN+tag features produce the best results for prediction of avgRating and StdRating while for numRating (popularity) it is the combination of RR+tag that significantly outperforms the other competitors, with a large margin.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION AND RELATED WORK</head><p>The global revenue obtained from the media and entertainment (M&amp;E) market in 2017 was approximately 2 trillion US dollars. The four main verticals of the industry in the US include: film (40%), music (6%), book publishing (12%) and video games (8%) <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. The movie industry not only has a large cultural and sociological impact on people but also occupies an important part of the business market in the M&amp;E industry. Producing a new movie means that the company is betting on this movie's success. Being able to predict into the future whether a movie will be successful or not is therefore crucial and requires machine learning techniques. The results of the predictions will be used by producers and investors to decide whether or not to adopt the production of similar movies.</p><p>The paper at hand describes the solution by the team MASlab-ZNU for the 2019 Multimedia for Recommender System Task: MovieREC and NewsREEL at MediaEval <ref type="bibr" target="#b5">[6]</ref>, the movie recommendation subtask. The task involved using of audio, visual features extracted from trailers of the corresponding movies coming from MMTF-14K dataset <ref type="bibr" target="#b3">[4]</ref> and metadata to predict three scores: avgRating (representing user appreciation and dis-appreciation of the content), stdRating (characterizing agreement and disagreement between user opinions/ratings), and numRating (reflecting popularity of movies). The audio and visual features have been used to solve various tasks in movie recommendation, see e.g., <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b4">5]</ref>.</p><p>In the context of movie popularity prediction, Szabo et al. in <ref type="bibr" target="#b9">[10]</ref> predict the future popularity of a video based on the number of previous views on YouTube using predictive models based on linear regression. Pinto et al. in <ref type="bibr" target="#b8">[9]</ref> extend the work by Szabo et al. by proposing a multivariate linear model and sampling the number of views at regular time slots. Recently, Moghaddam et al. <ref type="bibr" target="#b7">[8]</ref> address the problem of movie popularity prediction using visual features of movie trailers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">EXPERIMENTS</head><p>This section presents the experiments carried out towards addressing the prediction task. For performance evaluation, we randomly split the original train set into 2 non-overlapping subsets, where we consider 80% train data and 20% validation data. We refer to these datasets as trainSet and validSet throughout the paper. In a prepossessing step, we normalize all features in the dataset, using min-max normalization. We perform a hyper parameter search and report all the results under the best setting. We model the task in question as regression and use the following regression models:</p><p>• Linear model using Ridge Regression (RR): We use linear regression to serve as a simple yet standard approach to model the relation between dependent (prediction scores) and independent variables (features) in a linear fashion <ref type="bibr" target="#b6">[7]</ref>. The model is given by y = θ T x where x and y represent feature vectors and scores, respectively, θ T contains the linear model coefficients estimated by RR minimize</p><formula xml:id="formula_0">θ T 1 2 ∥y − θ T x ∥ 2 2 + λ∥θ ∥ 2 2</formula><p>where ℓ 2 regularization is applied to avoid overfitting of the coefficients.</p><p>• Support Vector Regression (SVR): SVR is the regression type of Support Vector Machine (SVM). SVR makes use of a nonlinear transformation function to map the input features to a high-dimensional features space given by y</p><formula xml:id="formula_1">= n i=1 (a i − a * i ).K(x i , x) +b where K = exp(−||x − x ′ || 2 )</formula><p>is the Radial Basis Function (RBF) kernel, b is intercept, and a and a * are Lagrange multipliers. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">RESULTS AND ANALYSIS</head><p>The regression results using the proposed approaches are presented in Table   <ref type="table" target="#tab_2">1</ref>.</p><p>Predicting average ratings: As can be seen in Table <ref type="table" target="#tab_2">1</ref>, the performance of all audio and visual features, are closely similar to each other. These results with a close margin look similar to the performance of the genre descriptor, e.g. 0.1487 v.s. 0.1466. However, it can be noted that tag features is the best feature to predict the average ratings. These results show that user-generated tags assigned to movies contain semantic information that are well correlated with ratings given to movies by user. One can also observe that the DNN model is the best model to predict the average ratings, though it is only marginally better than the SNN model.</p><p>Predicting standard deviation of ratings: The results of predicting standard deviations of ratings show the audio-visual features and genre metadata have very similar results. Again we can see the best feature to predict the standard deviation of ratings is tag metadata using the DNN learning model. Average results indicate that the DNN model is the best learning model to predict the standard deviation of ratings.</p><p>Predicting number of ratings: As for predicting number of ratings, it can be seen that except for the tag feature with the best performance, the rest of audio-visual features and genre metadata yield very similar results. For the tag feature, we observe a substantial superiority in performance compared to all other features; it reduces the RMSE by about 55% (0.0232 v.s. 0.0515 for tag v.s. genre), compared to the second best feature (genre). More interestingly, we can see for predicting number of ratings, the best model is the simple RR learning model followed by SNN but not DNN.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">CONCLUSION</head><p>This paper reports the method used by team MASlab-ZNU for the Multimedia for Recommender Systems task at MediaEval 2019 <ref type="bibr" target="#b5">[6]</ref>.</p><p>Results of experiments using different regression approaches are promising and show the efficacy of audio and visual content in comparison with genre metadata but overall it is the tag feature that provides the best prediction quality in all experimental cases.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>• Shallow Neural Network (SNN): To model the relationship between features in a non-linear way, we also apply a neural network to predict the scores. Here, we consider a simple (shallow) neural network model. SNN has a hidden layer with 24 neurons and Rectified Linear Unit (ReLU ) as activation function. As for the output layer we use the Siдmoid activation function.</figDesc><table><row><cell>• Deep Neural Network (DNN): This method is similar to</cell></row><row><cell>the SNN but we have more hidden layers to consider deeper</cell></row><row><cell>relations between features. In our deep model, we have 3</cell></row><row><cell>hidden layers; the first layer has 128 neurons with ReLU as</cell></row><row><cell>activation function; the second layer has 64 neurons and</cell></row><row><cell>uses a Siдmoid activation function; the third one has 32</cell></row><row><cell>neurons and uses ReLU.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 1 :</head><label>1</label><figDesc>Results of regression in terms of RMSE, calculated on the validation set. The first and second best result for each score are shown in bold and italic respectively. RR are the best predictors, in most cases generating the best performance for each feature while SVR is the worst. The final submitted runs are selected based on the ones performing the best on the validSet, which are highlighted in bold in Table</figDesc><table><row><cell>MediaEval'19, 27-29 October 2019, Sophia Antipolis, France</cell></row></table><note>. Regarding the comparison of learning models, we can see</note></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="https://www.trade.gov/topmarkets/pdf/Top%20Markets%20Media%20and%20Entertinment%202017.pdf" />
		<title level="m">Markets Report Media and Entertainment</title>
				<imprint>
			<date type="published" when="2017">2018. 2017. 2018. 2018-12-27</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="https://investmentbank.com/media-and-entertainment-industry-overview/" />
		<title level="m">Media and Entertainment Industry Overview</title>
				<imprint>
			<date type="published" when="2018-12-27">2018. 2018. 2018-12-27</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Audio-visual encoding of multimedia content for enhancing movie recommendations</title>
		<author>
			<persName><forename type="first">Yashar</forename><surname>Deldjoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mihai</forename><surname>Gabriel Constantin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hamid</forename><surname>Eghbal-Zadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bogdan</forename><surname>Ionescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Schedl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Cremonesi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 12th ACM Conference on Recommender Systems, RecSys 2018</title>
				<meeting>of the 12th ACM Conference on Recommender Systems, RecSys 2018<address><addrLine>Vancouver, BC, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018-10-02">2018. October 2-7, 2018</date>
			<biblScope unit="page" from="455" to="459" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">MMTF-14K: a multifaceted movie trailer feature dataset for recommendation and retrieval</title>
		<author>
			<persName><forename type="first">Yashar</forename><surname>Deldjoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mihai</forename><surname>Gabriel Constantin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bogdan</forename><surname>Ionescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Schedl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Cremonesi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 9th ACM Multimedia Systems Conference, MMSys 2018</title>
				<meeting>the 9th ACM Multimedia Systems Conference, MMSys 2018<address><addrLine>Amsterdam, The Netherlands</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018-06-12">2018. June 12-15, 2018</date>
			<biblScope unit="page" from="450" to="455" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Movie genome: alleviating new item cold start in movie recommendation. User Model</title>
		<author>
			<persName><forename type="first">Yashar</forename><surname>Deldjoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maurizio</forename><surname>Ferrari Dacrema</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mihai</forename><surname>Gabriel Constantin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hamid</forename><surname>Eghbal-Zadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stefano</forename><surname>Cereda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Schedl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bogdan</forename><surname>Ionescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Cremonesi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">User-Adapt. Interact</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="291" to="343" />
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The 2019 Multimedia for Recommender System Task: MovieREC and NewsREEL at MediaEval</title>
		<author>
			<persName><forename type="first">Yashar</forename><surname>Deldjoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Benjamin</forename><surname>Kille</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Markus</forename><surname>Schedl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Lommatzsch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jialie</forename><surname>Shen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2019 Workshop</title>
				<meeting><address><addrLine>Sophia Antipolis, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-10-29">2019. 27-29 October 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Multi-feature fusion for predicting social media popularity</title>
		<author>
			<persName><forename type="first">Jinna</forename><surname>Lv</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wu</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Meng</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">He</forename><surname>Gong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bin</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Huadong</forename><surname>Ma</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 25th ACM international conference on Multimedia</title>
				<meeting>the 25th ACM international conference on Multimedia</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1883" to="1888" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Predicting Movie Popularity and Ratings with Visual Features</title>
		<author>
			<persName><forename type="first">Mehdi</forename><surname>Farshad B Moghaddam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Reza</forename><surname>Elahi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christoph</forename><surname>Hosseini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marko</forename><surname>Trattner</surname></persName>
		</author>
		<author>
			<persName><surname>Tkalcic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">14th International Workshop On Semantic And Social Media Adaptation And Personalization</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Using early view patterns to predict the popularity of youtube videos</title>
		<author>
			<persName><forename type="first">Henrique</forename><surname>Pinto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jussara</forename><forename type="middle">M</forename><surname>Almeida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marcos</forename><forename type="middle">A</forename><surname>Gonçalves</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the sixth ACM international conference on Web search and data mining</title>
				<meeting>the sixth ACM international conference on Web search and data mining</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="365" to="374" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Predicting the Popularity of Online Content</title>
		<author>
			<persName><forename type="first">Gabor</forename><surname>Szabo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bernardo</forename><forename type="middle">A</forename><surname>Huberman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Commun. ACM</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="80" to="88" />
			<date type="published" when="2010">2010. 2010</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
