<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">ConteCorpus: An Analysis of People Response to Institutional Communications During the Pandemic</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Viviana</forename><surname>Ventura</surname></persName>
							<email>viviana.ventura01@universitadipavia.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Humanities</orgName>
								<orgName type="institution">University of Pavia</orgName>
								<address>
									<settlement>Pavia</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Elisabetta</forename><surname>Jezek</surname></persName>
							<email>jezek@unipv.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Humanities</orgName>
								<orgName type="institution">University of Pavia</orgName>
								<address>
									<settlement>Pavia</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">ConteCorpus: An Analysis of People Response to Institutional Communications During the Pandemic</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">AAF5ACB07A13F33B9722B4DEC85B5CC5</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T03:42+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Topic 1</term>
					<term>Economics 2</term>
					<term>Prime Minister 3</term>
					<term>Politics 4</term>
					<term>Pandemic 5</term>
					<term>Home Terms</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The study of institutional communication related to the pandemic, and to the population's response to it, is of great relevance today. The Italian spokesperson for communication regarding the pandemic has been, during the year 2020, the former Prime Minister Giuseppe Conte. We retrieved 4,860,395 comments from his Facebook official page and built the Con-teCorpus, a new Italian resource annotated in CoNLL-U format. A first aim of the research was to evaluate the performance of the model used to annotate the corpus. Models trained on social media texts are usually not very generalizable. Nevertheless, the results of the evaluation were good, especially in parsing metrics, and showed that a parser trained on Twitter data can be successfully applied to Facebook data. A second aim of the research was to provide an overall view of the content of such a large corpus; for this purpose, topic modeling was conducted, training an LDA model. The model generated 5 topics that cover different aspects linked to the pandemic emergency, from economic to political issues. Through the topic modeling we investigated which topics are prevalent on particular days.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>During the year 2020, the Prime Minister Giuseppe Conte has played a major role in institutional communication, particularly in communication regarding the policies undertaken to manage the health emergency. We assumed that inter-Copyright ©️ 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). esting content from the point of view of the response of the population to institutional communications regarding the pandemic would have been found on his social media profiles. Therefore, we created ConteCorpus,<ref type="foot" target="#foot_0">1</ref> retrieving more than 4 million comments from his Facebook page<ref type="foot" target="#foot_1">2</ref> starting from January 2020 until December 2020, and we annotated it in CoNLL-U format <ref type="bibr">3</ref> .</p><p>A first aim of the research was to evaluate the performance of the model used to annotate the dataset. Models trained on social media texts usually are poorly generalizable even on text retrieved from the same social media, therefore we wanted to test the performance on Facebook texts of a model trained on Twitter texts. In order to evaluate the model, we created a gold standard by extracting 1,000 sentences from the ConteCorpus and manually revising them.</p><p>A second aim of the research was to provide an overall view of this large corpus. For this purpose we performed a Topic Modeling. We trained a LDA model sampling 10% of the ConteCorpus. The LDA model generated 5 topics related to different aspects of the pandemic emergency. The model was used to see which topics were the most relevant before and after the announcement of the first and the second period of restrictions adopted to fight the pandemic in Italy.</p><p>The paper is structured as follows: we first review the relevant literature for our research (section 2), then we describe the data collection and the creation of the corpus (section 3). In section 4, we describe the evaluation we performed of the model we used to annotate the corpus in CoNLL-U format, and in section 5 we report the results of the topic modeling experiment. In section 6 we provide some concluding observations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>2</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>State of the Art</head><p>Since the beginning of the health emergency, there has been a proliferation of computational analyses that exploit data extracted from social media. These data are considered relevant as they allow us to generalize about human social and linguistic behavior, especially regarding the pandemic event. Among the tasks that have been conducted on data drawn from social media in this period, sentiment analysis, emotion profiling and topic modeling are the most common (Gagliardi  et al., 2020; Tamburini, 2020; Vitale et al., 2020;  Stella et al., 2020a; Stella et al., 2020b; Stella et  al., 2021; De Santis et al., 2020; Sciandra, 2020;  Trevisan et al., 2021; Gozzi et al., 2020; Kruspe et  al., 2020; Hussain et al., 2021; Chakraborty et al.,  2020; Nemes e Kiss, 2020; Jelodar et al., 2021;  Lamsal, 2020; Duong et al., 2021; Gupta et al.,  2021; Sullivan et al., 2021; Su et al., 2020; Garcia  et Berton, 2021; Ahmed et al., 2020).</p><p>In particular, Topic Modeling aims at finding hidden semantic structures within the texts and to model them into concepts. The unsupervised clustering technique LDA (Latent Dirichlet Allocation), developed by Blei (2003), has been used extensively in analyses conducted on social media data during the pandemic (Dashtian et Murthy,  2021; Feng et Zhou, 2020; Ordun et al., 2020;  Wang et al., 2020; Kabir et Mandria, 2020; Amara  et al., 2020; Abd-Alzaraq et al., 2020; Naseem et  al., 2021; Low et al. 2020, Andreadis et al., 2021). LDA is a statistical model that represents each document in a corpus as a probabilistic distribution over latent topics and each topic as a probabilistic distribution over words. A topic has a probability of generating various words, where the words are all the observed words in the corpus. Thus, the terms in the set of documents are used to discover hidden topics in a large corpus.</p><p>As is well known, the language of the web is characterized by deviation from the standard language that challenges the use of NLP tools. Several classifications have been proposed to label the nature of web and social media language. In general, the labels aim to define a variety of language that is diaphasically low and at an indefinite point on the diamesic axis, e.g., "netspeak" (Crystal, 2001). Web and social media language is characterized by little planning in text structure and a greater propensity for parataxis, absence of revision and punctuation, abrupt interruption of periods, and an imitation of the continuous flow of speech (Fiorentino, 2013). Although some persistent traits of web and social media language can be described, it does not constitute a single variety of language from a sociolinguistic perspective (Fiorentino, 2013). This poses a double challenge in the use of NLP tools. First, because the tools are calibrated to standard language variety resources. Secondly, even if we created models that are better suited to web and social media languages, they would not be generalizable to every language variety on the web (Sanguinetti et al.,  2018).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">ConteCorpus Construction</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Data Collection</head><p>We have downloaded 4,860,395 comments and 534 posts published during the year 2020 on Giuseppe Conte's Facebook official profile. We made call to any 2020 post ID of Giuseppe Conte's official page to retrieve text, object id, and created time of comments. The calls to the Facebook API Graph<ref type="foot" target="#foot_3">4</ref> were made month to month in the same fashion. Nevertheless, as Table <ref type="table" target="#tab_0">1</ref> shows, a larger amount of data has been retrieved in the month of March, April, and October. In the same period in Italy the more restrictive measures to fight pandemic were taken by the government.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Processing with the Neural Pipeline Stanza</head><p>After the data collection, we processed the data with the Neural Pipeline Stanza<ref type="foot" target="#foot_4">5</ref> to enrich the texts with some annotations.</p><p>Stanza is an opensource Python NLP toolkit, which "features a language-agnostic fully neural pipeline for text analysis, including tokenization, multiword token expansion, lemmatization, part-of-speech and morphological feature tagging, dependency parsing, and named entity" (Qi et al., 2020). The kit supports more than 77 human languages and uses the formalism Universal Dependencies<ref type="foot" target="#foot_5">6</ref> Knowing the difficulties of annotating non standard texts such as those derived from social media, we chose to use this pipeline because the evaluation of its models found that Stanza neural language agnostic architecture "adapts well to text of different genres […] achieving state-of-the-art or competitive performance at each step of the pipeline" (Qi  et al., 2020). Moreover, models that can be downloaded from Stanza have been trained each on a single language and on a specific text genre dataset. We chose to download the model trained on PoSTWITA-UD. <ref type="bibr">7</ref> PoSTWITA-UD is an Italian Twitter treebank in Universal Dependencies (Sanguinetti et al., 2018). Although the language of social media is very peculiar and changes from one social media to another and from groups to groups (Fiorentino, 2013), we thought that the model downloadable from Stanza -trained on this dataset -could be generalizable to our data, being indomain. Moreover, Sanguinetti et al. (2018) have added customized tags to the UD scheme to deal with some social media peculiar phenomena: "discourse:emo" for emojis and emoticons, and "parataxis:hashtag" for hashtags. They tagged the link found in some sentences as "dep" (unspecified relation) and used the "upos" (universal part-ofspeech) tag "SYM" (symbol) for hashtags and emojis. Additionally, they manually inserted the lemma of non-standard word forms not recognized by the lemmatizer (Sanguinetti et al., 2018). We processed the data divided in 12 packages; each correspond to one month data. We used every processor of the pipeline, besides the Named Entity Recognition module (TokenizeProcessor, POSProcessor, LemmaProcessor, Dep-parseProcessor). We personalized the model in or-der not to split the sentences,<ref type="foot" target="#foot_7">8</ref> forcing the To-kenizeProcessor to consider each comment as a sentence. Furthermore, we added two metadata to each sentence: one refers to the id of the post from which the comment was retrieved, and the other is the creation time of the comment. The aim is to make it easier to retrieve the comments from the corpus by their created time or post id if one needs to analyze a particular period of time or a particular post.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">End-to-End Evaluation</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Construction of the Gold Standard</head><p>We built a gold standard with a dual purpose: to evaluate the performance of the model on this new collection of social media texts, and to create a standard that can be used for future training and testing. We randomly selected 83 sentences from each file of the corpus annotated automatically (one file is composed of one-month comments), and manually revised the 1,000 sentences collected. The manual revision has followed the principle that what is understandable by a human would be correct.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Evaluation with CoNLL 2018 UD Shared Task Official Evaluation Script</head><p>To perform the evaluation, we used CoNLL 2018 UD shared task official evaluation script. <ref type="bibr">9</ref> Table <ref type="table" target="#tab_2">2</ref> shows the scores of evaluation metrics resulting from the performance of Stanza model on the test set of the ConteCorpus. resulting scores on Universal part-of-speech tagging metric, XPOS on language-specific part-ofspeech tagging metric, and UFeats on morphological features tagging metric. The last 5 rows show scores in five different parsing metrics.</p><p>What we found most challenging during the manual revision of the 1,000 sentences annotated automatically was correcting the errors in tokenization: many words that the tokenizer should have splitted were joined together. This type of tokenization error is often found when punctuation is used with non standard function. For example: we found that the token "oneste…volevo" ("hon-est…I wanted to") -an adjective, a punctuation mark and a verb -are conflated in a single token. In the manual revision, tokens like this have been splitted in three different tokens and other missing tags were added. The presence of such conflated words mayhave caused a worse score in the metric that evaluates the performance of segmentation, and consequentially in the other scores. The evaluation on the parser starts with aligning system nodes and gold nodes; their respective parent nodes are also considered; if the system parent is not aligned with the gold parent or if the relation label differs, the word is not counted as correctly attached. Despite errors in segmentation seem frequent in the corpus, this did not cause an excessive lowering of the scores on the various metrics reported in Table <ref type="table" target="#tab_3">2 and 3</ref>. Another error that appears frequently regards the lemma assigned to the abbreviations that are not present in PoSTTWITA-UD. Canonical abbreviations are tagged correctly, for example "cmq" for "comunque" ("however"). The abbreviations tagged incorrectly are those which appeared few times: such as "ql" that stands for "quelli" (those). An unexpected good result has been achieved on parsing metrics. This result could be due to the "preference of UD scheme in assigning headedness to content words" (Sanguinetti et al., 2018); therefore, the tendency of the social media languages to eliminate function words does not affect the performance of the parser. Another explanation can be found in the very similar frequencies distribution of part-ofspeeches and syntactic relations in the training set and the gold standard, as shown in Figure <ref type="figure" target="#fig_1">1 and 2</ref>.</p><p>Overall, the model trained on PoSTWITA-UD turned out to perform well on the test set of the ConteCorpus because PoSTWITA-UD tagset has been adapted with attention to some recurrent features of social media languages. Our evaluation showed that a model trained on texts retrieved by social media can adapt well to other social media texts if one pays attention to the neural architecture of the model and the annotation format being used.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Topic Modeling</head><p>To provide an overall view of the content of this large corpus we performed a Topic Modeling training and testing an LDA model on the Con-teCorpus.  To perform topic modelling, we sampled 10% of the sentences in our dataset and trained a LDA model. We treated each sentence as a document. We pre-processed lemmas removing stopwords, downloading Italian stopwords list from the NLTK (Natural Language Toolkit) library 10 and manually inserting missing stopwords. We filtered out tokens that appear in less than 15 documents and tokens with less than three letters; additionally, we kept only the 100,000 most frequent words. We transformed the documents into vectors creating a bag-of-words representation of each document. Then, we performed the term frequency-inverse document frequency (TF-IDF) on the whole corpus to assign higher weights to the most important words. Gensim LDA model 11 was applied first to the bags-of-words and secondly on the TF-IDF corpus to extract latent topics. Better performances were achieved with the LDA model applied to bags-of-words. We determined the optimal number of topics in LDA using the Coherence Value metric. <ref type="bibr">12</ref> The underlying idea is that a good model will generate topics with high topic Coherence Value score. We ran different LDA ex-10 https://www.nltk.org/. 11 https://radimrehurek.com/gensim/models/ldamodel. html.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>5.1</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methodology</head><p>periments varying the number of topics and selected the model with the highest medium topic Coherence Value score. Our final model generated 5 topics and has a topic medium Coherence Value score of 0.5. Table <ref type="table">4</ref> illustrates the top ten most representative terms associated with each detected topic.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Results</head><p>As expected, all the topics extracted from the corpus are related to the concerns about the emergency. The focus is on the economic aspect of the emergency. The first ten most frequent words in Economics topic (Table <ref type="table">4 and Figure</ref>  <ref type="figure" target="#fig_2">3</ref>) are economic terms: "loan", "company", "to pay" "money" etc. In all the other topics at least one of the 10 most frequent words comes from the economic sphere. Among the ten most frequent words of each topic there are only two words regarding the pandemic, found in Pandemic topic: "virus" and "pandemic". It is no coincidence that the most frequent word in this topic is "to go out". The need to face the emergency through the intervention of the institutions is evident. This is shown espe Table <ref type="table">4</ref>. Topic generated from the LDA model and the ten most frequent terms. cially by Prime Minister and Politics topics (Table 4). Prime Minister topic most frequent words are related to the Prime Minister. Perhaps words like "bravo" and "thank you" and "dear" show a positive judgement towards him. In Politics topic one finds words of the institutional sphere such as: "country", "government", "people", "bank". Home topic is related to the private sphere with words like "to hope", "home", "to wait", "to lose", although there is no shortage of words from the economic sphere. In Figure <ref type="figure" target="#fig_2">3</ref>  ). In the days that followed, the prevailing topic is Economics: on 28 October, the "ristoro" decree was approved to financially support commercial activities. A peak in the topic of Economics occurred on 18 March: on those days, discussions were taking place on whether to ask the European Union for financial aid to overcome the pandemic. The prevailing topics are therefore usually related to current events.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Concluding Observations</head><p>As mentioned before, models trained with data from social media are hardly generalizable. This stems from the fact that from a sociolinguistic perspective, the language of social media does not constitute a single variety. So, we expected that the results in the various evaluation metrics we performed would be worse than the results in the evaluation conducted on the PoSTWITA-UD test set. Surprisingly, in some metrics the results on evaluating the ConteCorpus test set were better than the results on the PoSTWITA-UD test set. To offer an overall view of the content of the Con-teCorpus we performed topic modeling. The topics generated by the LDA model cover various aspects of the pandemic emergency, with a preponderance of political and economic issues. Unexpectedly, topics identified do not show concern regard the risk of contagion and the possibility of catching the disease.   </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. Frequency distribution of syntactic relation tags in the training set and the gold standard.</figDesc><graphic coords="4,71.50,70.90,452.77,167.95" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 .</head><label>2</label><figDesc>Figure 2. Frequency distribution of part-ofspeech tags in the training set and the gold standard.</figDesc><graphic coords="4,71.65,261.50,217.20,115.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 .</head><label>3</label><figDesc>Figure 3. Intertopic distance Map and Top-30 most relevant terms for Topic 1. For a better view visit: https://sites.google.com/view/ldavisualizationcontecorpus/home-page.</figDesc><graphic coords="5,121.27,174.75,352.80,187.75" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>the distance between the centre of the circles indicates the similarity between the topics. Here you can see that only Economics topic and Prime Minister topic overlap; this indicates that the two topics are more similar with respect to the other topics. Moreover, the size of the area of each circle represents the importance of the topic relative to the corpus. Economics topic is the most important topic in the corpus. Finally, we tested our model on unseen documents: the comments published between 15 February and 30 March 2020, before and after the announcement of the first period of restrictions to combat the pandemic, and between 1 October and 14 November 2020, before and after the announcement of the second period of restrictions. Figures4, 5and 6 show trends in topics over time. Each line represents a topic and the x-axis shows the time progression. On 23 February, the first restrictive policies were announced for some Italian cities: Figure5shows a peak in the pandemic topic on that day. Figure4shows how the prevalence of the five topics changes on 8-12 March 2020. The Figure shows a peak on 9 March in Prime Minister topic: on that day he announced the first national restrictions period to combat the pandemic. Overall, the prevalent topics on those days are economics and pandemic. On 13 October, after a summer without major restrictions, with a new exponential increase in the curve of contagions, the Italian Parliament passed a decree limiting the possibility of aggregation. That day we have a new peak in the Pandemic theme (Figure 6</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 4 .</head><label>4</label><figDesc>Figure 4. Prevalence of topics during the days 8-12 March 2020.</figDesc><graphic coords="6,70.90,227.15,208.75,128.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 5 .</head><label>5</label><figDesc>Figure 5. Prevalence of topics during the days 15 February-30 March 2020.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 6 .</head><label>6</label><figDesc>Figure 6. Prevalence of topics during the days 1 October-15 November 2020.</figDesc><graphic coords="6,306.15,70.90,211.20,125.65" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Number of posts and comments retrieved for each month.</figDesc><table><row><cell></cell><cell cols="3">January February March</cell><cell>April</cell><cell>May</cell><cell>June</cell><cell>July</cell><cell cols="6">August September October November December Tot</cell></row><row><cell>Post</cell><cell>48</cell><cell>59</cell><cell>48</cell><cell>45</cell><cell>26</cell><cell>44</cell><cell>61</cell><cell>24</cell><cell>43</cell><cell>75</cell><cell>33</cell><cell>28</cell><cell>534</cell></row><row><cell cols="3">Comment 115,971 154,266</cell><cell cols="7">681,221 775,972 361,179 335,772 449,913 190,777 260,237</cell><cell cols="2">666,126 441,822</cell><cell>427,139</cell><cell>4,860,395</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3</head><label>3</label><figDesc></figDesc><table><row><cell>compares the</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 .</head><label>2</label><figDesc>Performance of Stanza's UD pre-trained model tested on the test set of ConteCorpus.</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 .</head><label>3</label><figDesc>Performance of Stanza's UD pre-trained model tested on official test set of PoSTWITA-UD and on test set of ConteCorpus. The scores shown are calculated using the F-measure.</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://github.com/Viviana-dev/Conte_Corpus</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.facebook.com/GiuseppeConte64/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://universaldependencies.org/format.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://developers.facebook.com/docs/graph-api?lo-cale=it_IT</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://stanfordnlp.github.io/stanza/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">Universal Dependencies (UD) is a "framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages" (https://universaldependencies.org/).</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">https://universaldependencies.org/treebanks/it_postwita/index.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7"> 8  Sentence segmentation and tokenization are jointly performed by the TokenizeProcessor<ref type="bibr" target="#b26">(Qi et al., 2020)</ref>.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_8">Coherence Value metric is developed byRoder  (2015). It evaluates a single topic by measuring the degree of semantic similarity between high scoring words in the topic.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Top concerns of tweeters during the COVID-19 pandemic: infoveillance study</title>
		<author>
			<persName><forename type="first">A</forename><surname>Abd-Alrazaq</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Alhuwail</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Househ</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hamdi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Shah</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of medical Internet research</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">e19016</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Ahmed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R I</forename><surname>Rabin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">N</forename><surname>Chowdhury</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2006.00804</idno>
		<title level="m">COVID-19: Social media sentiment analysis on reopening</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis</title>
		<author>
			<persName><forename type="first">A</forename><surname>Amara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A H</forename><surname>Taieb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">B</forename><surname>Aouicha</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Intelligence</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="3052" to="3073" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A social media analytics platform visualising the spread of COVID-19 in Italy via exploitation of automatically geotagged tweets</title>
		<author>
			<persName><forename type="first">S</forename><surname>Andreadis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Antzoulatos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mavropoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Giannakeris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tzionis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Pantelidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Kompatsiaris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Online Social Networks and Media</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page">100134</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Latent dirichlet allocation</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">I</forename><surname>Jordan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">the Journal of machine Learning research</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="993" to="1022" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
	<note>JMach)</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Sentiment Analysis of COVID-19 tweets by Deep Learning Classifiers-A study to show how popularity is affecting accuracy in social media</title>
		<author>
			<persName><forename type="first">K</forename><surname>Chakraborty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bhatia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bhattacharyya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Platos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Bag</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E</forename><surname>Hassanien</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Soft Computing</title>
		<imprint>
			<biblScope unit="volume">97</biblScope>
			<biblScope unit="page">106754</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Language and the Internet</title>
		<author>
			<persName><forename type="first">D</forename><surname>Crystal</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Dashtian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Murthy</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2101.12202</idno>
		<title level="m">Cml-covid: A large-scale covid-19 twitter dataset with latent topics, sentiment and location information</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">An Infoveillance System for Detecting and Tracking Relevant Topics from Italian Tweets During the COVID-19 Event</title>
		<author>
			<persName><forename type="first">E</forename><surname>De Santis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Martino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rizzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="132527" to="132538" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Wild language&quot; goes Web: new writers and old problems in the elaboration of the written code</title>
		<author>
			<persName><forename type="first">G</forename><surname>Fiorentino</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Languages Go Web. Standard and non-standard languages on the Internet</title>
				<editor>
			<persName><forename type="first">E</forename><surname>Miola</surname></persName>
		</editor>
		<meeting><address><addrLine>Alessandria,</addrLine></address></meeting>
		<imprint>
			<publisher>Edizioni dell&apos;Orso</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="67" to="90" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Deep biaffine attention for neural dependency parsing</title>
		<author>
			<persName><forename type="first">T</forename><surname>Dozat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2017 International Conference on Learning Representations (ICLR)</title>
				<meeting>the 2017 International Conference on Learning Representations (ICLR)</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The ivory tower lost: How college students respond differently than the general public to the covid-19 pandemic</title>
		<author>
			<persName><forename type="first">V</forename><surname>Duong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Social Networks Analysis and Mining (ASONAM)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="126" to="130" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Is working from home the new norm? an observational study based on a large geo-tagged covid-19 twitter dataset</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Feng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhou</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2006.08581</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">L&apos;impatto emotivo della comunicazione istituzionale durante la pandemia di Covid-19: uno studio di Twitter Sentiment Analysis</title>
		<author>
			<persName><forename type="first">G</forename><surname>Gagliardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Gregori</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Suozzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<meeting>the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020<address><addrLine>Bologna, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">2769</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA</title>
		<author>
			<persName><forename type="first">K</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Berton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Soft Computing</title>
		<imprint>
			<biblScope unit="volume">101</biblScope>
			<biblScope unit="page">107057</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Collective Response to Media Coverage of the COVID-19 Pandemic on Reddit and Wikipedia: Mixed-Methods Analysis</title>
		<author>
			<persName><forename type="first">N</forename><surname>Gozzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tizzani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Starnini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Ciulla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Paolotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Panisson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Perra</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of medical Internet research</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page">e21597</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">An emotion care model using multimodal textual analysis on COVID-19</title>
		<author>
			<persName><forename type="first">V</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Jain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Katariya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mohan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ahmadian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ferrara</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Chaos, Solitons &amp; Fractals</title>
		<imprint>
			<biblScope unit="volume">144</biblScope>
			<biblScope unit="page">110708</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Artificial Intelligence-Enabled Analysis of Public Attitudes on Facebook and Twitter Toward COVID-19 Vaccines in the United Kingdom and the United States: Observational Study</title>
		<author>
			<persName><forename type="first">A</forename><surname>Hussain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tahir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Hussain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Sheikh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gogate</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Dashtipour</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of medical Internet research</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">e26627</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Deep sentiment classification and topic discovery on novel coronavirus or covid-19 online discussions: Nlp using lstm recurrent neural network approach</title>
		<author>
			<persName><forename type="first">H</forename><surname>Jelodar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Orji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Huang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Journal of Biomedical and Health Informatics</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="2733" to="2742" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Kruspe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Häberle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Kuhn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><forename type="middle">X</forename><surname>Zhu</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2008.12172</idno>
		<title level="m">Cross-language sentiment analysis of European Twitter messages during the COVID-19 pandemic</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Design and analysis of a large-scale COVID-19 tweets dataset</title>
		<author>
			<persName><forename type="first">R</forename><surname>Lamsal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Intelligence</title>
		<imprint>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Using APIs for data collection on social media</title>
		<author>
			<persName><forename type="first">S</forename><surname>Lomborg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bechmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Information Society</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="256" to="265" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Natural Language Processing Reveals Vulnerable Mental Health Support Groups and Heightened Health Anxiety on Reddit During COVID-19: Observational Study</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Low</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Rumker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Talkar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Torous</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Cecchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Ghosh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of medical Internet research</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page">e22635</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">COVIDSenti: a large-scale benchmark Twitter data set for COVID-19 sentiment analysis</title>
		<author>
			<persName><forename type="first">U</forename><surname>Naseem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Razzak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Khushi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">W</forename><surname>Eklund</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE transactions on computational social systems</title>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Social media sentiment analysis based on COVID-19</title>
		<author>
			<persName><forename type="first">L</forename><surname>Nemes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kiss</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Information and Telecommunication</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Ordun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Purushotham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Raff</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2005.03082</idno>
		<title level="m">Exploratory analysis of covid-19 tweets using topic modeling, umap, and digraphs</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Stanza: A Python Natural Language Processing Toolkit for Many Human Languages</title>
		<author>
			<persName><forename type="first">P</forename><surname>Qi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bolton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Association for Computational Linguistics (ACL) System Demonstrations</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Exploring the space of topic coherence measures</title>
		<author>
			<persName><forename type="first">M</forename><surname>Röder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Both</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hinneburg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the eighth ACM international conference on Web search and data mining</title>
				<meeting>the eighth ACM international conference on Web search and data mining</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="399" to="408" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">PoST-WITA-UD: an Italian Twitter Treebank in universal dependencies</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sanguinetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bosco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lavelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mazzei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Antonelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Tamburini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</title>
				<meeting>the Eleventh International Conference on Language Resources and Evaluation (LREC</meeting>
		<imprint>
			<date type="published" when="2018-05">2018. May. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">COVID-19 Outbreak through Tweeters&apos; Words: Monitoring Italian Social Media Communication about COVID-19 with Text Mining and Word Embeddings</title>
		<author>
			<persName><forename type="first">A</forename><surname>Sciandra</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE Symposium on Computers and Communications (ISCC)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2020">2020. 2020</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">#lockdown: Network-enhanced emotional profiling in the time of Covid-19</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Restocchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>De Deyne</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Big Data and Cognitive Computing</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">14</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Cognitive network science reconstructs how experts, news outlets and social media perceived the COVID-19 pandemic</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stella</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Systems</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">38</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Stella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Vitevitch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Botta</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2103.15909</idno>
		<title level="m">Cognitive networks identify the content of English and Italian popular posts about COVID-19 vaccines: Anticipation, logistics, conspiracy and loss of trust</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Examining the impact of COVID-19 lockdown in Wuhan and Lombardy: a psycholinguistic analysis on Weibo and Twitter</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Su</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Xue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International journal of environmental research and public health</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page">4552</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Characterization of Anonymous Physician Perspectives on COVID-19 Using Social Media Data</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">J</forename><surname>Sullivan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Burden</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Keniston</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Banda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Hunter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Pac Symp Biocomput</title>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><surname>Tamburini</surname></persName>
		</author>
		<ptr target="http://corpora.fic-lit.unibo.it/EmoItaly/" />
		<title level="m">EmoItaly</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Debate on online social networks at the time of COVID-19: An Italian case study</title>
		<author>
			<persName><forename type="first">M</forename><surname>Trevisan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Vassio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Giordano</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Online Social Networks and Media</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page">100136</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">Concerns Expressed by Chinese Social Media Users During the COVID-19 Pandemic: Content Analysis of Sina Weibo Microblogging Data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Evans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of medical Internet research</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page">e22152</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">#andràtuttobene: Images, Texts, Emojis and Geodata in a Sentiment Analysis Pipeline</title>
		<author>
			<persName><forename type="first">P</forename><surname>Vitale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pelosi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Falco</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2769/paper_62.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<meeting>the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020<address><addrLine>Bologna, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">2769</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
