<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">You Said it? How Mis-and Disinformation Tweets Surrounding the Corona-5G-conspiracy Communicate Through Implying</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<affiliation key="aff0">
								<orgName type="department">Lynn de Rijk</orgName>
								<orgName type="institution">Radboud University</orgName>
								<address>
									<country key="NL">Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">You Said it? How Mis-and Disinformation Tweets Surrounding the Corona-5G-conspiracy Communicate Through Implying</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">3B333921BEFB2D189D27F5F95D94AD7B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:12+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper aims to investigate if implied meaning plays a role in mis-/disinformation tweets and what linguistic cues might signal this. A qualitative analysis of 130 mis-/disinformation tweets regarding the corona-5G-conspiracy using Speech Act Theory, shows that often meaning is implied by leaving out coherence markers, putting the words in someone else's mouth through citing and ambiguous phrasing/punctuation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">RESULTS</head><p>Users seem to employ a couple of strategies to avoid outright claiming there is a conspiracy, often communicating through</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Systems that automatically recognize mis-/disinformation are challenged by certain basic communicative features, such as sarcasm or implied meaning. It is therefore relevant to find out if indirect communication plays a role in mis-/disinformation and how large this role is. Speech Act Theory <ref type="bibr" target="#b0">[1]</ref> can be used as a framework to index implied meaning. It distinguishes three forces in every utterance: 1) locution, what is said literally, 2) illocution, what the utterance does and 3) perlocution, what happens as a result. For example, in: "Peter, you are standing on my foot", the locutionary force is asserting this state of affairs. The illocution, would generally be requesting that this Peter lifts his foot, now that he is made aware. The perlocutionary act would then be that Peter indeed places his foot elsewhere. However, Micheal Geis argues that that the illocutionary force of a Speech Act (SA) is mostly dependent on the context, not linguistic cues <ref type="bibr" target="#b1">[2]</ref>. Geis gives the example of a teacher asking a student if they can solve a quadratic equation vs. the same question being asked by a fellow student. The first would count as a request for information (Do you need help?), where the second is more likely a request for action (I don't understand, help me.) The importance of context is what makes recognizing illocutionary force difficult for an automated system. When dealing with data from a platform such as Twitter, however, this becomes less of an issue, since in one-to-many communication every member of the audience is addressed relatively equally and the general context is the same for each tweet (Twitter). This exploratory study aims to find if indirect SA's play a role in mis-/disinformation tweets and if so, if there are linguistic features identifiable that can capture indirect SA's.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">METHODS</head><p>Using the dataset compiled for the MediaEval 2020 FakeNews Task <ref type="bibr" target="#b2">[3]</ref>, a qualitative analysis of 130 tweets was performed, coding for direct and indirect SA's in Atlas.ti (version 8.4.5). The coding process was iterative and additional codes were added based on patterns found in the data, such as recurring linguistic features, like coherence markers (e.g., 'no meaningful connectives') and certain SA's (e.g., 'citing'). Some codes were mutually exclusive (such as 'Indirect SA: None' with other Indirect SA's), where other codes were not (e.g., tweets were often coded for multiple direct SA's, see Example 1). Only tweets supporting a conspiracy (e.g., corona is a coverup for 5G deaths or 5G causes corona) were included in the result section, as tweets with a different stance were rarely found (n = 7). Furthermore, only tweets that were not part of a thread were analyzed, to ensure that the context did not differ in regard to one-to-many vs. one-to-one interaction. The data was coded iteratively until a saturation point was reached within these inclusion criteria (n = 90). The final code list can be found in Appendix 1. Indirect SA's were only coded for when these were the primary communicative force. For example, the tweet below was coded for 'Indirect SA: concluding', as the implied meaning is most likely the primary meaning. Without the implication the assertions made are simply loose statements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Anyone else curious about the majority of deaths in china seem to be the same areas they rolled out their stand alone 5G just a couple months ago. Verry few deaths being reported in other areas in comparison. #5G #CoronavirusOutbreak #COVID19 #5gamechanger</head><p>(Example 1)</p><p>The user asks a question in the first sentence, evidenced by the syntactic structure of the sentence ('direct SA: asking'), even though they did not use punctuation. This is followed by two assertions ('direct SA: asserting'). The intended relation between the three sentences is not made explicit through coherence markers such as meaningful connectives ('no meaningful connectives'). The last sentence does have a lexical cue phrase (phrases that show the relation between sentences or the attitude of the speaker, e.g., 'in my opinion' or, in this tweet, 'in comparison'), that shows the relation between areas the user wishes to point out. As the causal relation is not made explicit, the act of concluding that these assertions are causally related is an indirect SA.</p><p>implication (see Table <ref type="table" target="#tab_0">2</ref>). First, they often omit connectives, leaving the relation between sentences implicit (e.g., Example 1 and Table <ref type="table" target="#tab_3">3</ref>). Second, they tend to cite others (  or only the part describing a possible relation between 5G and corona (there might be a relation between corona and 5G)? Depending on the interpretation, the meaning of the tweet changes completely, even with regard to the user's stance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">DISCUSSION</head><p>This study found that there are linguistic cues identifiable that capture indirect SA's, such as omission of certain connectives and lexical cue phrases. A possible explanation for these findings is that users might (subconsciously) try to circumvent the forewarning effect. This effect has been studied extensively in psychology and suggests that forewarning is a factor that causes resistance to persuasion <ref type="bibr" target="#b3">[4]</ref>. Using connectives and lexical cue phrases helps reader comprehension in informative texts, but in persuasive texts can build up the reader's resistance, since they recognize more easily when they are being persuaded <ref type="bibr" target="#b3">[4]</ref>. Apart from the omission of connectives and lexical cue phrases, the ambiguity of certain tweets also points to this explanation. Alternatively, the omission of certain words might also be a result of the affordances <ref type="bibr" target="#b4">[5]</ref> of Twitter, where the limited characters per tweet might incentivize users to leave out words they deem unnecessary. However, this does not seem to be the case, as one would expect that emojis would be used quite often, since they leave room for ambiguity while simultaneously only taking up one character space. As seen in Table <ref type="table" target="#tab_1">1</ref>, this is not the case. Additionally, emojis are often used decoratively as well, such as arrows or bullet points, without making use of their potential for ambiguity. Furthermore, citing linked article titles is an interesting practice with these affordances in mind, as it leaves little room for the user's own view. The limited space Twitter affords gives weight to the chosen citation, which in turn creates the implication that the citation is important/true/relevant. Lastly, leaving things ambiguous and citing others, could point to a user orientation to distance themselves from conspiracythinking -a way for users to keep plausible deniability for their support of what is said. It should be noted though, that grammatical or punctuation ambiguity might also be a result of users not being native English-speakers or simply inattentiveness or oversight by the user. Similar to <ref type="bibr" target="#b5">[6]</ref>'s findings on Indonesian hoax data, I found that SA Theory can be useful to analyze mis-/disinformation online data. Where they focused on direct SA's, finding that assertive, directive and expressive SA's were most common. In this study, indirect SA were also considered, showing that communication is often indirect in mis-/disinformation tweets. Future research could compare these findings to nonconspiracy-tweets, to shed some light on which explanation provided here is more plausible and to show if the found linguistic features might aid in distinguishing information from mis-/disinformation. It would also be useful to see if machine-learning might have already been able to distinguish between the two regardless of picking up on the indirect SA's.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2 )</head><label>2</label><figDesc>, such as citing the headline of an article they then link to, or use other means to put a middleman between themselves and what is said, as can be seen in Example 2:</figDesc><table><row><cell>I've been reading a few posts from ppl I know about how the</cell></row><row><cell>#CoronavirusOutbreak is because of 5G trials and Wuhan was</cell></row><row><cell>the place that first rolled this out and therefore is seen to be</cell></row><row><cell>used as a biologocal warfare weapon... 😬😬 #coronavirus</cell></row><row><cell>(Example 2)</cell></row><row><cell>In this tweet, connectives and lexical cue phrases (because of,</cell></row><row><cell>therefore) are used to make author intent clear, but the user</cell></row><row><cell>puts the words in the mouths of 'ppl I know'.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 : Frequency of most used direct SA's</head><label>1</label><figDesc></figDesc><table><row><cell>Code (not mutually</cell><cell>Number of tweets</cell><cell>Percentage of the</cell></row><row><cell>exclusive)</cell><cell>(n = 90)</cell><cell>dataset</cell></row><row><cell>Direct SA:</cell><cell></cell><cell></cell></row><row><cell>-asking</cell><cell>28</cell><cell>32.2%</cell></row><row><cell>-asserting</cell><cell>69</cell><cell>76.7%</cell></row><row><cell>-citing</cell><cell>24</cell><cell>26.6%</cell></row><row><cell>-describing</cell><cell>29</cell><cell>32.2%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 : Distribution of indirect SA's (suggesting, concluding, inviting, describing)</head><label>2</label><figDesc></figDesc><table><row><cell>Code</cell><cell>Number of tweets</cell><cell>Percentage of</cell></row><row><cell></cell><cell>(n=90)</cell><cell>the dataset</cell></row><row><cell>Indirect SA</cell><cell>59</cell><cell>65.6%</cell></row><row><cell>No indirect SA</cell><cell>31</cell><cell>34.4%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 : Distribution of linguistic features</head><label>3</label><figDesc></figDesc><table><row><cell>Code (not mutually</cell><cell>Number of tweets</cell><cell>Percentage of</cell></row><row><cell>exclusive)</cell><cell>(n = 90)</cell><cell>the dataset</cell></row><row><cell>No meaningful</cell><cell>52</cell><cell>57.7%</cell></row><row><cell>connectives</cell><cell></cell><cell></cell></row><row><cell>Lexical cue phrases:</cell><cell></cell><cell></cell></row><row><cell>-Opinion</cell><cell>26</cell><cell>28.9%</cell></row><row><cell>-Relation</cell><cell>19</cell><cell>21.1%</cell></row><row><cell>Emoji use</cell><cell>12</cell><cell>13.3%</cell></row><row><cell cols="3">Third, users employ ambiguous phrasing. This can be seen in</cell></row><row><cell cols="3">Example 1, where the question is not clearly stated, since a</cell></row><row><cell cols="3">question mark is omitted and instead an assertion follows</cell></row><row><cell cols="3">immediately. It is thus phrased initially as a question, but the</cell></row><row><cell cols="3">question itself does not seem of much importance. In Example</cell></row><row><cell cols="3">2, this strategy can also be seen in '😬😬'. It is left to the reader</cell></row><row><cell cols="3">to infer what the target of the emoji is; is it the entire prior</cell></row><row><cell cols="3">statement (how awkward that people I know say these things)</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ACKNOWLEDGMENTS</head><p>I thank Martha Larson for her input and critical eye. I also thank John Keates and Zhengyu Zhao for help with preprocessing.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">What is a Speech Act?</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Searle</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Philosophy in America</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Black</surname></persName>
		</editor>
		<imprint>
			<publisher>Allen and Unwin</publisher>
			<date type="published" when="1965">1965</date>
			<biblScope unit="page" from="221" to="239" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Geis</surname></persName>
		</author>
		<title level="m">Speech acts and conversational interaction</title>
				<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">FakeNews: Corona Virus and 5G Conspiracy Task at MediaEval</title>
		<author>
			<persName><forename type="first">K</forename><surname>Pogorelov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">T</forename><surname>Schroeder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Burchard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Moe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Brenner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Filkukova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Langguth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2020 Workshop</title>
				<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The Forewarning Effect of Coherence Markers in Persuasive Discourse: Evidence from Persuasion and Processing</title>
		<author>
			<persName><forename type="first">J</forename><surname>Kamalski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lentz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Sanders</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Zwaan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Discourse Processes</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="545" to="579" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Technologies, Texts and Affordances</title>
		<author>
			<persName><forename type="first">I</forename><surname>Hutchby</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Sociology</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="441" to="456" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Manipulation and Persuasion through Language Features in Fake News</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Iswara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">A</forename><surname>Bisena</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">RETORIKA: Jurnal Ilmu Bahasa</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="26" to="32" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
