<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Role of social media in propagating controversies: the case of cultural microblog feeds</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Adrian</forename><surname>Chifu</surname></persName>
							<email>adrian.chifu@lsis.org</email>
							<affiliation key="aff0">
								<orgName type="institution" key="instit1">LSIS</orgName>
								<orgName type="institution" key="instit2">AMU</orgName>
								<address>
									<country key="FR">FRANCE</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fidelia</forename><surname>Ibekwe-Sanjuan</surname></persName>
							<email>fidelia.ibekwe-sanjuan@univ-amu.fr</email>
							<affiliation key="aff1">
								<orgName type="institution" key="instit1">Aix Marseille Univ</orgName>
								<orgName type="institution" key="instit2">IRSIC</orgName>
								<address>
									<settlement>Marseille</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nathanala</forename><surname>Andrianasolo</surname></persName>
							<email>nathanaela.andria@gmail.com</email>
							<affiliation key="aff1">
								<orgName type="institution" key="instit1">Aix Marseille Univ</orgName>
								<orgName type="institution" key="instit2">IRSIC</orgName>
								<address>
									<settlement>Marseille</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Role of social media in propagating controversies: the case of cultural microblog feeds</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">44B125F2DAB68FF8F813254C7AF84F49</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:31+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Focus IR</term>
					<term>opinion mining</term>
					<term>information visualization</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The aim of this research is to investigate how social media mediate social controversies in the public arena. For that, we will use the CLEF MC2 corpus of microblogs [1] that captured long term political and cultural controversies in order to follow the birth and development of controversies across time and pinpoint the increasing role that social media play in their propagation, regulation and resolution.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Social media such as Facebook and Twitter have become the dominant platform for information and communication in the web and big data era to the extent that they have displaced the traditional media as news outlet. They have become the most used channels for disseminating content and negotiating social status. Contents surrounding individuals, celebrities, political figures are henceforth publicised via social media. In the last years, important controversies born outside the digital sphere were quickly propagated on social media, usually via twitter and Facebook, where they acquired a life of their own before being publicised elsewhere. In this research, we aim to study the increasing role social media play in publicising, mediating, regulating and in resolving social controversies (not scientific ones). To this end, we have chosen three social controversies situated at three levels: i) A local controversy surrounding the exploitation of the Bois Blanc quarry in the French island of La Reunion (henceforth the "Bois blanc" or "BB" controversy);</p><p>ii) an international controversy involving the 2001 Nobel Prize in Physiology or Medicine Sir Tim Hunt whose comments during a luncheon with women scientists in Korea in 2015 were judged sexist, triggering a controversy via twitter which led to his social downfall (loss of reputation, prestige and all his honorary appointments (Tim Hunt or "TH" controversy); iii) A national controversy surrounding Christiane Taubira, the embattled ex-minister of justice in the Hollande government during her visit at the Cannes Festival in 2015 (henceforth "Christiane Taubira" or "CT" controversy).</p><p>Owing to restrictions imposed by social media data platforms, we could not gather all the data linked to the first two controversies on social media as they had "passed" by the tie we decided to embark on this study. They will therefore be studied qualitatively using the traces we were able to gather online from some twitter accounts, newspapers, websites and blogs. This qualitative study will serve as a methodology design phase to identify further research questions for such studies. For instance, it will be interesting to identify who the main actors were in these controversies, the platforms used to launch and propagate the controversies, the role of the traditional media (newspapers, TVs, radio) in publicising the controversies; the thematic content around which the controversies were cristallised, a timeline of how they were propagated and the role of social media in bringing it to the attention of the larger public and the media. This prior qualitative study, on small corpora will enable us to better formalise our methodology of analysis and identify the parts of it can be automated and scaled up to work a larger corpus. The Taubira controversy for which a larger corpus made up of tweets was collected in the framework of the CLEF 2017 Microblog Cultural Contextualization Track (the MC2@CLEF2017 lab has released a collection of 70 000 000 microblogs over 18 months dealing with cultural events the Microblog) will serve as a testbed of the automation of our methodology, tested on the two previous controversies (BL and TH controversies respectively).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Example of controversy in the MCcorpus</head><p>The two keywords "taubira" and "cannes" reveal a large controversy. For instance, after a plain search by these two keywords, the retrieved results provide hints of controversial opinions. Due to the huge amount of social media information, one should be able to automatically identify and quantify controversies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Query related controversy indicator</head><p>We propose several measures to evaluate if query results are impacted by a controversy on Twitter. We summarize the automatic steps as follows: aspect identification, sentiment polarity identification and temporal distribution. We give more details in the next paragraphs.</p><p>The controversy occurs around entities, however it is not completely represented by them. For instance, around the entities "Taubira" and "Cannes", controversy can be subject to various aspects, such as "appearance", or "presence". Thus, a first step of aspect identification is required, since the the topic of controversy is not exhaustively characterised by the involved entities.</p><p>The words around the identified aspects will be features for the analysis. Thus, the words from a context window around the target aspect are used to model the sentiments expressed on that particular aspect. The context window size is empirically set to 10 words around the target aspect (5 before and 5 after), as in <ref type="bibr" target="#b1">[2]</ref>.</p><p>Based on the context around the aspects, automatic sentiment analysis can be carried out in order to identify the sentiment polarity. The polarity score is a real value in the interval [−1, 1], with -1 being very negative, 0 neutral and 1 very positive, respectively. The sentiment analysis module is inspired by the research of Pang et al. <ref type="bibr" target="#b2">[3]</ref> and it is based on a Naïve Bayes classifier trained on a set of 50,000 movie reviews with annotated sentiment polarities, from IMDb<ref type="foot" target="#foot_0">3</ref> . For a group of tweets, the dispersion will indicate the intensity of the controversy.</p><p>The topics of controversy generate reactions that are distributed in time. We can capture the temporal distributions of tweets. In this manner, we can identify when some topic is "fresh", or "hot", or recurrent ("comeback"). The temporal features also allow to cluster tweets by their period, in order to form some sort of "discussions".</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">http://ai.stanford.edu/ amaas/data/sentiment/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Cultural micro-blog contextualization 2016 workshop overview: data and pilot tasks</title>
		<author>
			<persName><forename type="first">L</forename><surname>Ermakova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Goeuriot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mothe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mulhem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Nie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sanjuan</surname></persName>
		</author>
		<idno>CEUR-WS</idno>
		<ptr target=".org" />
	</analytic>
	<monogr>
		<title level="m">Working Notes of CLEF 2016 -Conference and Labs of the Evaluation forum</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">K</forename><surname>Balog</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Cappellato</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Macdonald</surname></persName>
		</editor>
		<meeting><address><addrLine>Évora, Portugal</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016-09">September. 2016. 2016</date>
			<biblScope unit="volume">1609</biblScope>
			<biblScope unit="page" from="1197" to="1200" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity in Temporal-Related Reviews (to appear)</title>
		<author>
			<persName><forename type="first">I</forename><surname>Badache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fournier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chifu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">21th International Conference on Knowledge Based and Intelligent Information and Engineering Systems</title>
				<imprint>
			<date type="published" when="2017">KES2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Thumbs up?: sentiment classification using machine learning techniques</title>
		<author>
			<persName><forename type="first">B</forename><surname>Pang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Vaithyanathan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EMNLP</title>
				<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="page" from="79" to="86" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
