<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">ItaGLAM: A corpus of Cultural Communication on Twitter during the Pandemic</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Gennaro</forename><surname>Nolano</surname></persName>
							<email>gnolano@unior.it</email>
							<affiliation key="aff0">
								<orgName type="laboratory">UniOr NLP Research Group</orgName>
								<orgName type="institution">&quot;L&apos;Orientale&quot; University of Naples</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Carola</forename><surname>Carlino</surname></persName>
							<email>ccarlino@unior.it</email>
							<affiliation key="aff0">
								<orgName type="laboratory">UniOr NLP Research Group</orgName>
								<orgName type="institution">&quot;L&apos;Orientale&quot; University of Naples</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Maria</forename><surname>Pia</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">UniOr NLP Research Group</orgName>
								<orgName type="institution">&quot;L&apos;Orientale&quot; University of Naples</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Di</forename><surname>Buono</surname></persName>
							<email>mpdibuono@unior.it</email>
							<affiliation key="aff0">
								<orgName type="laboratory">UniOr NLP Research Group</orgName>
								<orgName type="institution">&quot;L&apos;Orientale&quot; University of Naples</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Johanna</forename><surname>Monti</surname></persName>
							<email>jmonti@unior.it</email>
							<affiliation key="aff0">
								<orgName type="laboratory">UniOr NLP Research Group</orgName>
								<orgName type="institution">&quot;L&apos;Orientale&quot; University of Naples</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">ItaGLAM: A corpus of Cultural Communication on Twitter during the Pandemic</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">D2451AC0A183E7856DFB6204E0E2ED9A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:39+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper describes the compilation and annotation of ItaGLAM, a corpus of tweets written by Italian Galleries, Libreries, Archives and Museums (GLAMs) during the lockdown period in Italy due to the COVID-19 pandemic. ItaGLAM has been annotated with a set of labels which may be useful to identify different types of communication. Furthermore, the collected data have been used to train a set of classifiers. The results are analyzed to evaluate the information flow between GLAM and users and to analyze cultural communication on the Web.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Over the last years, Social Networks have become one of the most popular platforms for sharing experiences and opinions through the use of simple strings of text <ref type="bibr" target="#b23">(Zhao and Rosson, 2009)</ref>. Indeed, this way of communicating has become an essential interaction tool, not only among private users, but also among companies to engage with their audience and to promote their brands <ref type="bibr" target="#b1">(Alturas and Oliveira, 2016)</ref>.</p><p>The use of social networks has also been adopted by museums, that, over time, have changed their way of communicating with their audience 2 . In particular, in regards to the GLAM sector, a new trend has been observed in recent years: the use of the web as a way to create and foster an online community <ref type="bibr" target="#b13">(Langa, 2014;</ref><ref type="bibr" target="#b0">Allen-Greil and MacArthur, 2010)</ref>.</p><p>While during the first decade of this century museum professionals considered the exhibition of collections on social networks <ref type="bibr" target="#b14">(Laws, 2015)</ref> as 'excessive', nowadays the use of these platforms has become the norm. As <ref type="bibr" target="#b2">Amanatidis et al. (2020)</ref> pointed out in their study about the use of social networks (and in particular Instagram) by museums in the Greek culture scene: 'social media has become a key factor in the way that cultural organizations communicate with their public in supporting the marketing of performing art organizations'.</p><p>Such centrality makes the Social network a potentially effective means that allows GLAMs to reach a wide and heterogeneous audience and to adapt to it. Therefore, we believe that the analysis of the cultural communication implies an analysis of how cultural corporations interact with the audience through social networks. After considering the most used social networks (namely Facebook, Instagram, Twitter) in the cultural sector, we have decided to focus our research on the use of Twitter, which has already been proven to be a solid basis to analyze institutional communication, as Preot ¸iuc-Pietro et al. <ref type="bibr">(2015)</ref> have highlighted.</p><p>Therefore, the main aim of our research is to investigate how Italian GLAMS have extraordinarily <ref type="bibr" target="#b11">(Giraud, 2020)</ref> interacted with their audience during the lockdown in Italy due to the COVID-19 pandemic (NEMO -Network of European Museum Organisations, 2020), i.e. in the period from the 8th of March to the 5th of May 2020 (as per DPCM March 11 2020).</p><p>Over this time many cultural initiatives have been launched with the aim of strengthening the dialogue with the audience and make sure that, despite the impossibility of any kind of physical access, the connection between GLAMs and their visitors would not be interrupted<ref type="foot" target="#foot_0">3</ref> .</p><p>It has been observed that during the aforementioned period, while GLAMs institutions have drastically increased their use of Facebook, Instagram and Twitter, the latter one was the only Social Network for which an increase in the interaction audience-institution has been registered <ref type="bibr">(Politecnico di Milano, 2020)</ref>.</p><p>The study of communicative intents by GLAMs through Social Networks in the Italian language is still novel and, as such, best practices and tools to use still need to be tested and honed. In particular, there is still the need for an annotated corpus and a classifier that can be used on large amounts of data.</p><p>Despite the time frame taken into account is relatively short (covering 58 days in total), we think that investigating how Italian GLAMs used the web when it was the only form of communication at their disposal, represents a good training ground to test our practices and to train and evaluate different kinds of classifiers useful also in future works.</p><p>The paper is organized as follows: Section 2 describes the related works in the analysis of communication on Twitter by cultural institutions. Section 3 introduces the methodology used in this analysis: namely, it describes the creation of the corpus, the creation and use of the annotation set, and the training and evaluation of different classifiers. Finally, in Section 4 we explain the results of the research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head><p>The large amount of data available on Twitter makes this platform ideal for several studies. As such, during the years tweets have been used in several research projects regarding disaster response <ref type="bibr" target="#b22">(Zahra et al., 2019)</ref>, content classification <ref type="bibr" target="#b8">(Dann, 2010;</ref><ref type="bibr" target="#b20">Stvilia and Gibradze, 2014)</ref> and, in particular, sentiment analysis <ref type="bibr" target="#b17">(O'Connor et al., 2010;</ref><ref type="bibr" target="#b10">Gamallo and Garcia, 2014;</ref><ref type="bibr" target="#b21">Talbot et al., 2015)</ref>. Despite these efforts, only a few studies have focused on the classification of communicative intents of organizations and institutions on Twitter, like Lovejoy and Saxton (2012) and <ref type="bibr" target="#b9">Foucault and Courtin (2016)</ref>, who focused on French tweets written during the MuseumWeek event. Similar kinds of study can be found in researches dealing with Italian tweets, with several contributions dealing with sentiment analysis <ref type="bibr" target="#b4">(Basile and Nissim, 2013;</ref><ref type="bibr" target="#b6">Cimino et al., 2014)</ref> and automatic misogyny identification <ref type="bibr" target="#b3">(Anzovino et al., 2018)</ref>. To the best of our knowledge, no work has been done so far on communicative intent classification for Italian tweets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Methodology</head><p>The task of tweet classification has turned out to be rather challenging for various reasons, many inherent to the platform itself. First and foremost, tweets are very short texts (with the maximum length of 280 characters), and with an average token count of 16.80 in our corpus. Secondly, it is not unusual to find tweets composed only of hashtags, or URLs. While URLs by themselves are rarely if ever useful in a classification task, hashtags could represent a source of information only if they are used according to their original communicative intent or to the initiative to which they are related. In the following subsections we describe how:</p><p>• the corpus was created;</p><p>• the annotation set has been chosen and then applied;</p><p>• the classifiers have been trained and tested.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Dataset</head><p>Because of the COVID-19 outbreak, the Italian Government (as many others around the world) imposed a lockdown policy, which lasted from the 8th March to the 5th May 2020 (58 days in total as per DPCM March 11 2020). During this period of time, museums and art galleries adopted several strategies to continue engaging with their audience in order to maintain the communication alive, and to grant access to digital cultural heritage media. As already mentioned in Section 1, they increased the scope of their communication on the main social platforms, i.e. Facebook, Twitter and Instagram. In this context, the focus of our analysis is the use of Twitter. The communication on Twitter is characterised by the use of certain hashtags, which have been used by GLAMs to propose several types of initiatives to their audience. Initially, the set of hashtags we used was made up of 33 hashtags promoted and used by Italian GLAMs and Italy's Ministry of Cultural Heritage and Activities (Italian: Ministero per i Bene e le Attività Culturali e per il Turismo -MiBACT), and selected on the basis of their popularity according to the Twitter trend topics (TT)<ref type="foot" target="#foot_1">4</ref> . Among these hashtags, #museitaliani (and its graphic variation #museiitaliani) is the only one already existing before the pandemic, and subsequently adapted by museums for the initiatives proposed during the pandemic; while others, such as #artyouready and #emptymuseum have been created ad hoc during the lockdown period to describe specific initiatives. By using these hashtags as a queue in the public Twitter API<ref type="foot" target="#foot_2">5</ref> we have created a corpus with a total of 23,716 tweets. To better focus on the tweets and their intents concerning cultural communication, we have decided to filter out of the corpus any hashtag with less than 1,000 occurrences. We have thus obtained a queue of six hashtags (#artyouready, #emptymuseum, #museitaliani, #museichiusimuseiaperti, #laculturanonsiferma, #laculturaincasa) and a corpus of 15,988 tweets. This corpus has been filtered once again so that only unique tweets (i.e. no retweets) written in Italian have been kept. By using a list of GLAMs manually extracted from the corpus, we have then extracted out of the remaining 8,038 tweets those written by a GLAM institution, thus ending up with our final corpus of 3,429 tweets published by 213 Italian cultural institutions. Table <ref type="table" target="#tab_0">1</ref> shows the occurrences of the hashtags in the final corpus. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Annotation Process</head><p>In order to define the intents of GLAMs towards the users, the corpus has been annotated with four communication categories first presented by <ref type="bibr" target="#b7">Courtin et al. (2015)</ref>, and then used by <ref type="bibr" target="#b9">Foucault and Courtin (2016)</ref>, and <ref type="bibr" target="#b12">Juanals and Minel (2018)</ref> to annotate the information flow on a social network during a cultural event. The annotation has been done at tweet level, using a set of labels composed as follows:</p><p>• characterized by square and cylindrical towers with loopholes and storm drains). A fifth category N/A has been included in order to classify tweets that do not fit in any of the aforementioned categories, like the ones composed of only hashtags. Following this set of categories and our guidelines, the tweets have been annotated using the open source platform INCEpTION<ref type="foot" target="#foot_3">6</ref> , and a first round of annotation has been carried on 400 tweets, double annotated by a domain expert and a non-expert in order to calculate the Inter-Annotator Agreement (IAA). The use of a non-expert was necessary so that the annotation would not have been influenced by any external knowledge (for example the original meaning behind the various hashtags). The resulting Fleiss' Kappa has revealed to be moderately good at 0.629, which is considered sufficient for the task at hand. As it can be seen from the confusion matrix in Figure <ref type="figure">1</ref>, the agreement is very strong on PI and ItC, moderately strong on SE, and very weak on PP. Furthermore, 89 tweets have been deemed unus-Figure <ref type="figure">1</ref>: Confusion matrix for the agreement on every label. able as they have been tagged with the label N/A, therefore, they have been removed from the corpus. Table <ref type="table" target="#tab_3">2</ref> presents the number of occurrences for each label for the remaining 3,340 tweets. These results show an issue regarding the label PP, that is severely underrepresented in the corpus. The effects of this underrepresentation on our classifiers will be explained in detail in Section 4, and the analysis of possible solutions will be the focus of future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Intent classification</head><p>In order to train the classifiers, the corpus has been preprocessed so that all tweets are lowercase, and all punctuation marks, URLs, numbers and   The results show that the methodology adopted in this work can be useful in better understanding how cultural institutions communicate on the Web. The tools used in this specific task are adequate in annotating and automatically classifying the way cultural institutions communicate on the Twitter platform. That being said, the results shown in Section 3 demonstrate that our experiments can still be improved. Firstly, the increase in the size of the dataset would surely enhance the performances of the classifiers. In particular, this should be done focusing on the label PP, that, as it can be observed in Table <ref type="table" target="#tab_5">4</ref>, is the less frequent among the four. Furthermore, while the precision for the label PP is usually higher than the average (note how it reaches 1.00 in our baseline), its recall is very low, even for our SVM classifier, which shows the best results overall. The intuition here is that, while it is usually easy for the classifiers to understand which tweet has the PP label, they are also very "picky", and cannot really learn all the features needed in order to classify this label against the others. Other possible solutions to this issue can be the use of techniques such as resampling and cost-based methods. Secondly, by focusing on the textual features of the tweets, we can further investigate where improvements can be made. In particular, looking at the top 5 tf-idf scores for each label (Table <ref type="table" target="#tab_6">5</ref>), we notice that the selected hashtags may occur in all types of tweets with a low difference among their scores. Such a low deviation does not contribute enough Those data could give us some insight on how museums communicate through the Twitter platform. Indeed, usually, GLAMs tend to use the same hashtags regardless of their communicative intents (even when the hashtag used was initially linked to certain initatives), which was already expected with some general hashtags, like #iorestoacasa.</p><p>The effects of possible removal or reweighting of these hashtags needs to be further explored.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion and Future Work</head><p>In this work, we have described our project for classifying communicative intents in tweets written by Italian GLAMs during the COVID-19 lockdown. Through the experiments and the following analysis we have shown how this task can be challenging. As future work we will focus on: increasing the size of the corpus, integrating statistical techniques to help dealing with imbalanced labels, and finally improving the selection and reweighting of the features (in particular concerning the hashtags). Another topic which needs further investigation concerns the use of different kinds of textual embeddings, which might improve the result. Once honed, the metholody and the tools we have used in this research could become an important asset in better understanding and analyzing cultural communication on the Web.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Number of occurrences for each hashtag.</figDesc><table><row><cell>Hashtag</cell><cell># Occ.</cell></row><row><cell>#artyouready</cell><cell>367</cell></row><row><cell>#emptymuseum</cell><cell>373</cell></row><row><cell>#museitaliani</cell><cell>906</cell></row><row><cell>#museichiusimuseiaperti</cell><cell>1560</cell></row><row><cell>#laculturanonsiferma</cell><cell>668</cell></row><row><cell>#laculturaincasa</cell><cell>283</cell></row><row><cell>Total</cell><cell>4,157</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head></head><label></label><figDesc>Sharing Experience -SE: tweets that share an experience, an opinion or one's feeling</figDesc><table><row><cell>Example: Eccoci qui oggi a ricordare e a rac-</cell></row><row><cell>contare come i musei chiusi non siano chiusi</cell></row><row><cell>e i musei vuoti non siano vuoti. Forza!</cell></row><row><cell>(Here we are today, reminding and telling how</cell></row><row><cell>closed museums are not actually closed and</cell></row><row><cell>empty museums are not actually empty. Come</cell></row><row><cell>on!);</cell></row><row><cell>• Promoting Participation -PP: tweets that</cell></row><row><cell>require some kind of activity from the users,</cell></row><row><cell>either in real life or on-line</cell></row><row><cell>Example: Art you ready? Domani partecipa</cell></row><row><cell>anche tu al contest di @ museitaliani con-</cell></row><row><cell>dividendo con noi le tue foto dei musei privi</cell></row><row><cell>di persone. Cerca fra i ricordi, seleziona la</cell></row><row><cell>foto, e condividi con # artyouready # Muse-</cell></row><row><cell>umFromHome # iorestoacasa. Ti aspettiamo!</cell></row><row><cell>(Art you ready? Take part in tomorrow's @</cell></row><row><cell>museitaliani contest by sharing with us your</cell></row><row><cell>photos of empty museums. Search through</cell></row><row><cell>your memories, choose the photo and share</cell></row><row><cell>it with # artyouready # MuseumFromHome #</cell></row><row><cell>iorestoacasa. We are waiting for you!);</cell></row><row><cell>• Interacting with the Community -ItC:</cell></row><row><cell>tweets through which Institutions create and</cell></row><row><cell>foster their communities by directly interact-</cell></row><row><cell>ing with the users</cell></row><row><cell>Example: Siete stati davvero tanti ad ac-</cell></row><row><cell>cogliere l'invito a partecipare al flashmob #</cell></row><row><cell>artyouready e tutti avete postato foto merav-</cell></row><row><cell>igliose! Ecco i tre scatti selezionati tra i più</cell></row><row><cell>belli</cell></row><row><cell>(So many of you accepted to take part in the</cell></row><row><cell># artyouready flashmob, and you all posted</cell></row><row><cell>great photos! Here are the three shots selected</cell></row><row><cell>among the most beautiful ones);</cell></row><row><cell>• Promoting-Informing -PI: tweets that pro-</cell></row><row><cell>mote or inform other users about activities,</cell></row><row><cell>exhibitions, or about any sort of information</cell></row><row><cell>on the museum.</cell></row><row><cell>Example: Il castello di Fénis si trova in</cell></row><row><cell>Valle d'Aosta circondato da una doppia cinta</cell></row><row><cell>di mura merlate è caratterizzato da torri</cell></row><row><cell>quadrate e cilindriche con feritoie e caditoie.</cell></row></table><note>(Fénis Castle is located in Aosta Valley, with its double crenellated surrounding walls, it is</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 2 :</head><label>2</label><figDesc>Number of occurrences and percentage in the corpus for each label.</figDesc><table><row><cell cols="4">stopwords 7 removed. The cleaning process has</cell></row><row><cell cols="4">been done via the NLTK package for Python 8 ,</cell></row><row><cell cols="4">which has also been used for tokenization.</cell></row><row><cell cols="4">The experiments have been conducted on six</cell></row><row><cell cols="4">classifiers: five more traditional classifiers trained</cell></row><row><cell cols="4">on a TF-IDF vectorized text (created using the</cell></row><row><cell cols="4">machine learning library for Python Scikit-learn 9 ),</cell></row><row><cell cols="4">and a Feed Forward Neural Network 10 created</cell></row><row><cell cols="4">with Keras 11 and trained on a 100-dimensions</cell></row><row><cell cols="2">GlOve 12 embedded text.</cell><cell></cell><cell></cell></row><row><cell cols="4">The set of classifiers is thus the following: a</cell></row><row><cell cols="4">Naive Bayes (NB, also used as baseline); a</cell></row><row><cell cols="4">Support Vector Classifier (SVC); a the K-Nearest</cell></row><row><cell cols="4">Neighbors classifier (KNN); a Decision Tree (DT);</cell></row><row><cell cols="4">a Multilayer Perceptron (MLP) and a Neural</cell></row><row><cell cols="2">Network classifier (NN).</cell><cell></cell><cell></cell></row><row><cell cols="4">The dataset was split using the train test split tool</cell></row><row><cell cols="4">found in the sklearn library for Python, which</cell></row><row><cell cols="4">splits the data into random train and test subsets</cell></row><row><cell cols="4">given a test set size. With test size set at 0.3, the</cell></row><row><cell cols="4">training set is composed of 2,338 tweets, and the</cell></row><row><cell cols="4">testing set is composed of the remaining 1,002</cell></row><row><cell>tweets.</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">In order to evaluate the classification task, the</cell></row><row><cell cols="4">values of precision, recall and F1 have been all</cell></row><row><cell cols="4">weighted by the number of samples of each label.</cell></row><row><cell cols="4">The final results are shown in Table 3.</cell></row><row><cell>Classifier</cell><cell>P</cell><cell>R</cell><cell>F1</cell></row><row><cell>NB</cell><cell cols="3">0.69 0.66 0.64</cell></row><row><cell>SVC</cell><cell cols="3">0.70 0.68 0.67</cell></row><row><cell>KNN</cell><cell cols="3">0.70 0.39 0.35</cell></row><row><cell>DT</cell><cell cols="3">0.56 0.55 0.55</cell></row><row><cell>MLP</cell><cell cols="3">0.66 0.66 0.66</cell></row><row><cell>NN</cell><cell cols="3">0.64 0.63 0.63</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 3 :</head><label>3</label><figDesc>System results.</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 4 :</head><label>4</label><figDesc>Precision (P) and Recall (R) for each label.</figDesc><table><row><cell>Classifier</cell><cell>SE</cell><cell>PP</cell><cell>ItC</cell><cell>PI</cell></row><row><cell>NB</cell><cell cols="4">P 0.66 1.00 0.58 0.61</cell></row><row><cell></cell><cell cols="4">R 0.37 0.02 0.74 0.76</cell></row><row><cell>SVC</cell><cell cols="4">P 0.66 0.88 0.72 0.60</cell></row><row><cell></cell><cell cols="4">R 0.48 0.52 0.65 0.81</cell></row><row><cell>KNC</cell><cell cols="4">P 0.36 0.69 0.74 0.68</cell></row><row><cell></cell><cell cols="4">R 0.79 0.32 0.43 0.39</cell></row><row><cell>DT</cell><cell cols="4">P 0.47 0.43 0.51 0.50</cell></row><row><cell></cell><cell cols="4">R 0.51 0.50 0.53 0.48</cell></row><row><cell>MLP</cell><cell cols="4">P 0.59 0.71 0.67 0.68</cell></row><row><cell></cell><cell cols="4">R 0.62 0.54 0.68 0.67</cell></row><row><cell>NN</cell><cell cols="4">P 0.61 0.77 0.60 0.71</cell></row><row><cell></cell><cell cols="4">R 0.64 0.47 0.71 0.61</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 5 :</head><label>5</label><figDesc>Top 5 word by their tf-idf score on each label.to the classification process, as shown by #museichiusimuseiaperti values which are seemingly strong enough as a feature to differentiate PP against the others, but does not do a good job differentiating the other labels against each other.</figDesc><table><row><cell>Token</cell><cell>SE</cell><cell>PP</cell><cell>ItC</cell><cell>PI</cell></row><row><cell cols="5">#museichiusimuseiaperti 1.55 2.47 2.12 1.99</cell></row><row><cell>#iorestoacasa</cell><cell cols="4">1.66 1.92 1.51 1.71</cell></row><row><cell>#museitaliani</cell><cell cols="4">2.43 2.35 2.05 2.33</cell></row><row><cell>#laculturanonsiferma</cell><cell cols="4">2.88 2.47 2.83 2.35</cell></row><row><cell>#emptymuseum</cell><cell cols="4">3.02 1.97 3.29 3.4</cell></row><row><cell>#artyouready</cell><cell>2.9</cell><cell cols="3">1.71 3.04 3.23</cell></row><row><cell>#laculturaincasa</cell><cell>4.1</cell><cell>-</cell><cell cols="2">3.47 2.96</cell></row><row><cell>flashmob</cell><cell>-</cell><cell>2.1</cell><cell>-</cell><cell>-</cell></row><row><cell>mibact</cell><cell cols="2">3.34 2.4</cell><cell>2.7</cell><cell>3.09</cell></row><row><cell>oggi</cell><cell cols="2">3.03 -</cell><cell>-</cell><cell>2.78</cell></row><row><cell>youtube</cell><cell>-</cell><cell>-</cell><cell cols="2">3.16 -</cell></row><row><cell>cultura</cell><cell cols="2">4.18 -</cell><cell cols="2">3.12 4.18</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">https://icom.museum/en/news/how-toreach-and-engage-your-public-remotely/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_1">This process is described in details in<ref type="bibr" target="#b5">Carlino et al. (2020)</ref> </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_2">https://developer.twitter.com/en</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_3">https://inception-project.github.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_4">The list of stopwords used is the default one for Italian found in the NLTK package. Furthermore, the term 'Twitter' has been added to this list after the first experiments.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_5">https://www.nltk.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_6">https://scikit-learn.org/stable/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_7"> Parameters: 4 layers, dropout=0.7, Adam Optimizer   </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_8">https://keras.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_9">https://nlp.stanford.edu/projects/ glove/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has been partially supported by Programma Operativo Nazionale Ricerca e Innovazione 2014-2020 -Fondo Sociale Europeo, Azione I.2 "Attrazione e Mobilità Internazionale dei Ricercatori" Avviso D.D. n 407 del 27/02/2018 and by PON Ricerca e Innovazione 2014-2020 "Dottorati innovativi con caratterizzazione industriale". Authorship Attribution is as follows: Gennaro Nolano is author of Section 3.3 and 4, Carola Carlino is author of Section 3, 3.1 and 5, Maria Pia di Buono is author of Section 2 and 3.2, and Johanna Monti is author of Section 1.</p></div>
			</div>


			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>2 https://www.osservatori.net/it/ ricerche/comunicati\-stampa/laumento\-del\-livello\-di\-interesse\-per\-le\-attivita\-online\-dei-musei\-incentivato\-dal\-covid\-19\-e\-gli\-investimenti\-per\-migliorare\-i\-servizi\-offerti</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Small towns and big cities: How museums foster community on-line</title>
		<author>
			<persName><forename type="first">Dana</forename><surname>Allen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">-Greil</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Matthew</forename><surname>Macarthur</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings. Toronto: Archives &amp; Museum Informatics</title>
				<meeting>Toronto: Archives &amp; Museum Informatics</meeting>
		<imprint>
			<date type="published" when="2010-03-31">2010. 2010. March, 31. 2010</date>
			<biblScope unit="page" from="84" to="96" />
		</imprint>
	</monogr>
	<note>Museums and the Web</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Consumers using social media: impact on companies&apos; reputation</title>
		<author>
			<persName><forename type="first">Bráulio</forename><surname>Alturas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Liliana</forename><surname>Oliveira</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Academy of Marketing Conference 2016: Radical Marketing</title>
				<meeting>the Academy of Marketing Conference 2016: Radical Marketing</meeting>
		<imprint>
			<publisher>Academy of Marketing</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Social media for cultural communication: A critical investigation of museums&apos; instagram practices</title>
		<author>
			<persName><forename type="first">Dimitrios</forename><surname>Amanatidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ifigeneia</forename><surname>Spyridon Mamalis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Irene</forename><forename type="middle">Eirini</forename><surname>Kamenidou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Tourism, Heritage &amp; Services Marketing</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="38" to="44" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Automatic identification and classification of misogynistic language on twitter</title>
		<author>
			<persName><forename type="first">Maria</forename><surname>Anzovino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elisabetta</forename><surname>Fersini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Natural Language Processing and Information Systems</title>
				<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="57" to="64" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Sentiment analysis on italian tweets</title>
		<author>
			<persName><forename type="first">Valerio</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</title>
				<meeting>the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">Carola</forename><surname>Carlino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gennaro</forename><surname>Nolano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maria</forename><forename type="middle">Pia</forename><surname>Di Buono</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Johanna</forename><surname>Monti</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2005.10527</idno>
		<title level="m">Laculturanonsifermareport su uso e la diffusione degli hashtag delle istituzioni culturali italiane durante il periodo di lockdown</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Linguistically-motivated and lexicon features for sentiment analysis of italian tweets</title>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Cimino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stefano</forename><surname>Cresci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Felice</forename><surname>Dell'orletta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maurizio</forename><surname>Tesconi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 4th Conference for Evaluation of NLP and Speech Tools for Italian (EVALITA)</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">A tool-based methodology to analyze social network interactions in cultural fields: The use case &quot;museumweek</title>
		<author>
			<persName><forename type="first">Antoine</forename><surname>Courtin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Brigitte</forename><surname>Juanals</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jean-Luc</forename><surname>Minel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mathilde De Saint</forename><surname>Leger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16)</title>
				<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16)</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Twitter content classifcation</title>
		<author>
			<persName><forename type="first">Stephen</forename><surname>Dann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">First Monday</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Automatic classification of tweets for analyzing communication behavior of museums</title>
		<author>
			<persName><forename type="first">Nicolas</forename><surname>Foucault</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Antoine</forename><surname>Courtin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16)</title>
				<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16)<address><addrLine>Portorož, Slovenia</addrLine></address></meeting>
		<imprint>
			<publisher>ELRA</publisher>
			<date type="published" when="2016-05">2016. May</date>
			<biblScope unit="page" from="3006" to="3013" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Citius: A naive-Bayes strategy for sentiment analysis on English tweets</title>
		<author>
			<persName><forename type="first">Pablo</forename><surname>Gamallo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marcos</forename><surname>Garcia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)</title>
				<meeting>the 8th International Workshop on Semantic Evaluation (SemEval 2014)<address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="171" to="175" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Il Ministero della Cultura presenta un report sulla reputazione online dei musei durante il Covid</title>
		<author>
			<persName><forename type="first">Claudia</forename><surname>Giraud</surname></persName>
		</author>
		<ptr target="https://www.artribune.com/professioni-e-professionisti/2020/07/il-ministero-della-cultura-presenta-un-report-sulla-reputazione-online-dei-musei-durante-il-covid/" />
		<imprint>
			<date type="published" when="2020-09">2020. 10/09/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Analysing cultural events on twitter</title>
		<author>
			<persName><forename type="first">Brigitte</forename><surname>Juanals</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jean-Luc</forename><surname>Minel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Current Trends in Web Engineering</title>
				<editor>
			<persName><forename type="first">Irene</forename><surname>Garrigós</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Manuel</forename><surname>Wimmer</surname></persName>
		</editor>
		<imprint>
			<publisher>Cham. Springer International Publishing</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="213" to="222" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Does Twitter help museums engage with visitors?</title>
		<author>
			<persName><forename type="first">Lesley</forename><forename type="middle">A</forename><surname>Langa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">iConference 2014 Proceedings</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">Ana</forename><surname>Sánchez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laws</forename></persName>
		</author>
		<title level="m">Museum websites and social media: issues of participation, sustainability, trust and diversity</title>
				<imprint>
			<publisher>Berghahn Books</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">8</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Information, community, and action: How nonprofit organizations use social media</title>
		<author>
			<persName><forename type="first">K</forename><surname>Lovejoy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gregory</forename><forename type="middle">D</forename><surname>Saxton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Economic &amp; Social Impacts of Innovation eJournal</title>
				<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">NEMO -Network of European Museum Organisations</title>
		<ptr target="https://www.ne-mo.org/fileadmin/Dateien/public/NEMOdocuments/NEMOCOVID19Report12.05.2020.pdf" />
	</analytic>
	<monogr>
		<title level="m">Survey on the impact of the COVID-19 situation on museums in Europe: Final Report</title>
				<imprint>
			<date type="published" when="2020-09-10">2020. last accessed 10/09/2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">From tweets to polls: Linking text sentiment to public opinion time series</title>
		<author>
			<persName><forename type="first">O'</forename><surname>Brendan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ramnath</forename><surname>Connor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bryan</forename><forename type="middle">R</forename><surname>Balasubramanyan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Noah</forename><forename type="middle">A</forename><surname>Routledge</surname></persName>
		</author>
		<author>
			<persName><surname>Smith</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ICWSM</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="1" to="2" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<ptr target="http://musei.beniculturali.it/notizie/notifiche/report-maggio-2020-sulla-reputazione-online-dei-musei" />
		<title level="m">La reputazione online di musei, parchi archeologici, istituti e luoghi della cultura italiani -Report relativo al mese di maggio</title>
				<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
		<respStmt>
			<orgName>Politecnico di Milano</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">An analysis of the user occupational class through Twitter content</title>
		<author>
			<persName><forename type="first">Daniel</forename><surname>Preot ¸iuc-Pietro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Vasileios</forename><surname>Lampos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nikolaos</forename><surname>Aletras</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing</title>
				<meeting>the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing<address><addrLine>Beijing, China</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1754" to="1764" />
		</imprint>
	</monogr>
	<note>: Long Papers)</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">What do academic libraries tweet about, and what makes a library tweet useful?</title>
		<author>
			<persName><forename type="first">Besiki</forename><surname>Stvilia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Leila</forename><surname>Gibradze</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Library &amp; Information Science Research</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Swash: A naive bayes classifier for tweet sentiment identification</title>
		<author>
			<persName><forename type="first">Ruth</forename><surname>Talbot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chloe</forename><surname>Acheampong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Richard</forename><forename type="middle">H</forename><surname>Wicentowski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SemEval@NAACL-HLT</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Automatic identification of eyewitness messages on twitter during disasters</title>
		<author>
			<persName><forename type="first">Kiran</forename><surname>Zahra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Muhammad</forename><surname>Imran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Frank</forename><surname>Ostermann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Processing and Management</title>
		<imprint>
			<biblScope unit="volume">57</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">How and why people twitter: the role that micro-blogging plays in informal communication at work</title>
		<author>
			<persName><forename type="first">Dejin</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mary</forename><surname>Beth Rosson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACM 2009 international conference on Supporting group work</title>
				<meeting>the ACM 2009 international conference on Supporting group work</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="243" to="252" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
