<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Hybrid Human-In-The-Loop Framework for Fact Checking</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">David</forename><forename type="middle">La</forename><surname>Barbera</surname></persName>
							<email>david.labarbera@uniud.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Udine</orgName>
								<address>
									<addrLine>Via Delle Scienze 206</addrLine>
									<settlement>Udine</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Kevin</forename><surname>Roitero</surname></persName>
							<email>kevin.roitero@uniud.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Udine</orgName>
								<address>
									<addrLine>Via Delle Scienze 206</addrLine>
									<settlement>Udine</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stefano</forename><surname>Mizzaro</surname></persName>
							<email>stefano.mizzaro@uniud.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Udine</orgName>
								<address>
									<addrLine>Via Delle Scienze 206</addrLine>
									<settlement>Udine</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Hybrid Human-In-The-Loop Framework for Fact Checking</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">006239C6BC58CD5239E063F2CE507205</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T14:50+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Misinformation</term>
					<term>Human-in-the-loop</term>
					<term>Artificial Intelligence . Mizzaro) 0000-0002-8215-5502 (D. L. Barbera); 0000-0002-9191-3280 (K. Roitero)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Online misinformation is posing a serious threat for the modern society. Assessing the veracity of online information is a complex problem which nowadays is addressed by heavily relying on trained fact-checking experts. This solution is not scalable and, also due the importance of the problem the issue gained the attention of the scientific community, which proposed many AI-based automatic solutions. Despite the efforts made, the effectiveness of such approaches is not yet enough to allow them to be used without supervision. In this position paper, we propose a hybrid human-in-the-loop framework for fact-checking: we address the misinformation issue by relying on a combination of automatic AI methods, crowdsourcing ones, and experts. We study the single components of the frameworks as well as their interactions, and we propose an interleaving of the different components which we believe will serve as useful starting point for the future research towards effective and scalable fact-checking.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Modern times have highlighted the centrality of the threat for the modern society of fake news and misinformation. Traditionally, misinformation detection is a slow and costly process that is made solely by expert trained fact-checkers, that can not cope with the ever-increasing amount of information shared online everyday. To address this issue, researchers are developing automatic techniques to identify misinformation at scale, and significant efforts have been made to develop fast and scalable state-of-the-art Artificial Intelligence (AI) algorithms <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b3">4]</ref>. Another less traditional approach to tackle such issue is to take advantage of the wisdom of the crowd <ref type="bibr" target="#b4">[5]</ref> and leverage crowdsourcing workers <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13]</ref>. Both approaches have pro-and contra: while AI is usually cheaper and scalable, crowd-workers can perform more reliable and explainable classifications. To take the best from both worlds, researchers proposed hybrid Human-In-The-Loop (HITL) approaches that integrate AI, crowd, and experts, even though only few implementations exist <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b16">17]</ref>. Differently from previous work <ref type="bibr" target="#b16">[17]</ref>, in this paper we propose a concrete architecture for fact-checking, and we inspect the responsibilities of each component as well as their interactions. In particular, we detail a pragmatical workflow which should be implemented to effectively classify the veracity of a set of statements at scale.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>There are both numerous examples of AI techniques for misinformation detection <ref type="bibr" target="#b1">[2]</ref> as well as of academic interest on their development and evaluation <ref type="bibr" target="#b17">[18]</ref>. Many different AI approaches exist: Ozbay and Alatas <ref type="bibr" target="#b2">[3]</ref> tested 23 supervised AI algorithms on public datasets, Zhao et al. <ref type="bibr" target="#b3">[4]</ref> integrated linguistic, topic, sentiment, and behavioral features to develop a model for health misinformation, Stammbach and Neumann <ref type="bibr" target="#b19">[19]</ref> used evidence retrieval techniques and finetune a BERT-based model for the FEVER challenge, Konstantinovskiy et al. <ref type="bibr" target="#b20">[20]</ref> developed a pipeline to identify misinformation using a multi-task learning approach. Related to that, many approaches addressed the issue of credibility in social media <ref type="bibr" target="#b21">[21]</ref>.</p><p>Focusing on misinformation detection using crowdsourcing, La Barbera et al. <ref type="bibr" target="#b6">[7]</ref> first found an effect of judgment scales and evidence of worker assessors' bias on political statements, Soprano et al. <ref type="bibr" target="#b10">[11]</ref> used the dataset from Roitero et al. <ref type="bibr" target="#b7">[8]</ref> to leverage a multidimensional scale to measure different aspects of a statement, Draws et al. <ref type="bibr" target="#b12">[13]</ref> found that workers generally overestimate the truthfulness and that different type of workers show different biases when evaluating a given statement, Pennycook and Rand <ref type="bibr" target="#b5">[6]</ref> used the crowd to study effects of reducing social media users' exposure to low-quality news, and Allen et al. <ref type="bibr" target="#b11">[12]</ref> compared the accuracy ratings between fact-checkers and crowd-workers.</p><p>Finally, some work investigated the combination of AI and humans: Demartini et al. <ref type="bibr" target="#b16">[17]</ref> introduced a theoretical hybrid HITL framework for misinformation, Qu et al. <ref type="bibr" target="#b22">[22]</ref> used selfreported scores from both AI and crowd to develop a hybrid system, Shabani et al. <ref type="bibr" target="#b13">[14]</ref> used humans to provide feedback on news stories about statement contextual information and integrated those features into an AI pipeline, and Yang et al. <ref type="bibr" target="#b14">[15]</ref> showed the potential speed up to the fact-checking process by organizing and selecting representative statements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Limitations of Current Approaches</head><p>As highlighted by Demartini et al. <ref type="bibr" target="#b16">[17,</ref><ref type="bibr">Figure 2</ref>] each of the three state-of-the-art approaches for misinformation detection i.e., experts, AI tools, and crowd has its own advantages and disadvantages in terms of accuracy, scale, cost, explainability, and bias control. We detail these aspects in this section, focusing on the limitations of each approach.</p><p>Certainly AI tools outperform both crowd and experts when considering costs<ref type="foot" target="#foot_0">1</ref> and evaluation speed, but despite recent works <ref type="bibr" target="#b26">[26,</ref><ref type="bibr" target="#b27">27]</ref>, they provide less or no explainability. More importantly, such models achieve lower accuracy than crowd or experts. To provide some examples, classical machine learning models achieved 74% accuracy on a two-level scale <ref type="bibr" target="#b28">[28]</ref>, and the best model of this year CLEF CheckThat! Lab reached 54.7% accuracy on a four-level scale <ref type="bibr" target="#b17">[18]</ref>. Considering the accuracy from the crowd, experimental results <ref type="bibr" target="#b11">[12]</ref> show a high correlation with the experts in terms of agreement, whereas other work reports accuracy values that are lower and comparable to those obtained by AI methods <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b12">13]</ref>; although further studies are needed to draw definitive conclusions it seems reasonable to assume that crowd accuracy can be higher than automatic AI solutions. The highest accuracy is achieved by the experts, which is always set to the value of 1 for practical reason. Nevertheless, even domain experts need confrontation and discussion phases to reach a final consensus (see for example the process used by PolitiFact<ref type="foot" target="#foot_2">2</ref> ).</p><p>Bias is also a crucial limitation of current approaches. Experts and crowd-workers being humans are subject to cognitive biases <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b6">7]</ref>, which can be mitigated by the discussion phase in the case of experts, but are difficult to remove for crowd-workers <ref type="bibr" target="#b29">[29]</ref>. Moreover, all the aforementioned biases can be propagated from humans to AI models, e.g., when training or fine-tuning a model.</p><p>Another limitation of current approaches is given by the specific truthfulness scales used; different scales exist and are used, and such heterogeneity, apart from making a fair comparison difficult, has an impact on the quality of the collected data <ref type="bibr" target="#b6">[7]</ref>.</p><p>We believe that a HITL framework for misinformation detection should address and overcome all of the limitations detailed above by fruitfully combining the capabilities of AI, crowd, and experts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">HITL Framework for Misinformation Detection</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Possible Architectures</head><p>A natural solution to the task investigated in this paper is to employ a pipeline model where the components are sorted with an increasing accuracy (i.e., first the AI, then the crowd, and finally the experts). Thus, if a statement is not adequately classified by a component, the subsequent pipeline component will perform a more accurate classification. Also, such a pipeline concatenates each component according to their increasing cost and evaluation time. This allows to perform a pipeline of annotation tasks where the majority of the statements are quickly and automatically labeled by AI, only a subset of the statements is sent for a slower evaluation to the crowd, and the few remaining statements are sent to experts for an in-depth investigation.</p><p>The key advantage of this configuration is that it takes the best from each component, and that it allows to minimize the overall costs. Particularly, this configuration lets the experts (i.e., the more costly component) to evaluate a very small number of statements. Nevertheless, the pipeline model has important limitations as it does not provide feedback among the components: a statement is simply forwarded until it is eventually classified with not much cooperation among the components.</p><p>Another possible combination of the components is by means of a blackboard architecture, a common solution in distributed multi-agent settings <ref type="bibr" target="#b30">[30]</ref>. Such an approach allows the components to select which statements to evaluate. Each component is an autonomous agent that can access a central repository that contains both the statements and the partial contributions provided by each component. This approach would require both a high synergy between the components and to split a classification task in atomic sub-tasks to take advantage from each specific component of the architecture.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">General Framework</head><p>An ideal framework should maximize accuracy while minimizing the cost of each component and strengthening the cooperation between and within its modules. Therefore, we propose first a basic framework, where each component provides feedback to, and cooperates with, the others. We then discuss possible variants and extensions.</p><p>Our proposal is summarized in Figure <ref type="figure" target="#fig_0">1</ref>. Given a statement, each of the three components (AI, crowd, and experts) generates: a classification on a chosen scale and a confidence score for the performed classification. Whenever the component AI or crowd generates a prediction with a high confidence score, the statement is considered as correctly classified. Otherwise if the confidence is low, the statement is forwarded to the subsequent component. If this is the case, the output of the component (such as the confidence score and the classification) can be optionally forwarded along with the statement. This could allow the subsequent component to perform an informed assessment, if necessary. Also samples of statements considered as correctly classified by the component (i.e, with high confidence) should be propagated, to double check their classification score and deal with the problem of unknown-unknowns (i.e., statements for which AI is highly confident about its predictions but is wrong) using humans <ref type="bibr" target="#b31">[31,</ref><ref type="bibr" target="#b32">32,</ref><ref type="bibr" target="#b33">33]</ref>. This allows each component to provide feedback to the previous ones, thereby improving their classifications.</p><p>In the following sections we will detail for each component: its possible internal structure, its specific interactions with other components, and additional outputs that can be added to the general framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">First Component: AI</head><p>Assuming the use of a state-of-the-art model for misinformation detection <ref type="bibr" target="#b28">[28,</ref><ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b19">19,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b34">34,</ref><ref type="bibr" target="#b17">18]</ref>, the output provided by the AI component should be at least a classification score on a chosen truthfulness scale, and a confidence score. While the classification score is straightforward, the confidence can be reliably calibrated following the methodology by Guo et al. <ref type="bibr" target="#b35">[35]</ref>. To provide an adequate classification, AI tool can rely on a Knowledge Base (KB) to perform evidence retrieval.</p><p>Examples of such a system are the ones proposed by La Barbera et al. <ref type="bibr" target="#b36">[36]</ref> and Stammbach and Neumann <ref type="bibr" target="#b19">[19]</ref>, who both use a transformer architecture who rely on retrieved evidence. The choice of the Knowledge Base (KB) to use to produce a classification and an explanation is not straightforward, since there is no evidence of a "universally best" KB <ref type="bibr" target="#b37">[37]</ref>. Thus, the choice of the specific KB should be performed ad-hoc by leveraging statements and domain specific features, as for example the topic, speaker, year, etc. of the set of statements being processed.</p><p>To evaluate the classification score given by the component, we can use optional output. For example, many AI models are able to provide reasons for their predictions <ref type="bibr" target="#b26">[26,</ref><ref type="bibr" target="#b27">27]</ref>. Some implementations are delivered by Kazemi et al. <ref type="bibr" target="#b38">[38]</ref> and by Brand et al. <ref type="bibr" target="#b39">[39]</ref> who develop models able to generate an explanation for their misinformation assessment. The generation of an explanation could improve the framework by providing additional and human-readable information useful for both the subsequent human-based components and the final classification.</p><p>Finally, the AI component could provide self-feedback by using counterfactual explanations <ref type="bibr" target="#b40">[40]</ref>: generating instances that the model finds hard to classify or deceiving could improve the model performances, robustness, and generalization abilities.</p><p>The output of the AI component is thus made by classification, confidence, and optional information, such as explanation and retrieved evidence. The decision whether the statement has been adequately classified or not can be then performed by relying on the confidence of the model <ref type="bibr" target="#b22">[22]</ref> as detailed in Section 4.2. To help this decision, it could be used the optional explanation, for example considering its readability or semantic scores. The decision for some statements might be more critical and not straightforward: a very recent statement made by an important public figure over a highly relevant topic with not much evidence available might be worth further investigation. Hence, it might be worth studying the effectiveness of an importance score using the statement's metadata.</p><p>Finally, if the assessment for the statement has a low confidence, the explanation is not satisfactory, or the assessment needs to be refined for any other reason, the statement is sent to the subsequent component: the crowd.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Second Component: Crowd</head><p>As for the AI, the crowd component should perform two tasks: misinformation classification and provide feedback to itself and to the AI component. There are many examples of misinformation classification directly performed by the crowd <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13]</ref>. It could also be reasonable to perform an informed assessment relying on the output of the AI component <ref type="bibr" target="#b41">[41]</ref>. Nevertheless, the use of this additional information could introduce biases into the assessment performed by the crowd, hence further studies in this direction are required. Moreover, to reduce workers cognitive effort, it is possible to design a two steps task using disjoints sets of workers: the first set will search for evidence for a given statement, the second will classify the statement using the provided evidence (and additional data). While all of the different mentioned tasks are indeed reasonable, it is necessary to perform ad-hoc studies to find the best possible setting. Along this line, we can leverage work done in related fields <ref type="bibr" target="#b42">[42]</ref> to identify the subset of best workers and exploit their features to be able to minimize the workforce needed and at the same time maximize its effectiveness.</p><p>Also, the crowd can be asked to provide additional rationales to motivate their classification <ref type="bibr" target="#b43">[43,</ref><ref type="bibr" target="#b44">44]</ref>. The classifications can be used to improve the AI component by fine-tuning the models with additional data, or even both workers and AI rationales can be used to adjust the confidence of the final assessment; nevertheless, this should be implemented with caution, as workers rationales might contain bias that can be involuntary injected into AI models. Finally, a subset of crowd-workers should look for counterfactual examples that could highlight AI classification errors with high confidence. While these methodologies still need to be tested in the field of misinformation detection, some work <ref type="bibr" target="#b45">[45]</ref> shows the promising results of this approach applied to different domains.</p><p>As for the AI component, the output of the crowd component is composed by the default classification and confidence, along with optional additional data such as evidence, explanation, and rationales. Therefore, to decide if a statement is correctly classified or not it is possible to rely not only on the data generated by the crowd, but also to check for agreement and inconsistencies between crowd and AI <ref type="bibr" target="#b22">[22]</ref>.</p><p>At this point of the evaluation, the majority of the statements have been classified by the framework, and only a very small subset will reach the final step of the workflow: the experts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.5.">Third Component: Experts</head><p>The last step of the framework is made by the experts. It is possible to let them evaluate a statement using a pre-defined fact-checking methodology, and ideally to provide to them all the outputs from the previous components to perform an informed assessment. The effects of such a decision need to be studied since, as discussed for the crowd, the use of additional information could introduce bias in the final evaluation. We remark that we believe that critical, important, and difficult statements should always be evaluated or at least checked by the experts. Note that to identify those statements it would be necessary to find a metric to be able to automatically evaluate the importance of a statement in a given context. Also, to increase the robustness of the framework, the experts should be able to directly look at the statements classified by the previous components and to decide whether some of them need to be re-assessed or not. Finally, each classification performed by the experts should be used to re-train the AI models, and used as an example to train the crowd before performing the task. This final aspect could also be performed interactively, following an active-learning scenario.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>In this work we study the limitations of the current approaches for misinformation detection and propose a hybrid HITL framework that combines AI, crowd, and experts. Our main contributions are the following: we frame the problem and review the related work detailing frameworks for fact-checking; we study possible framework architectures detailing their respective advantages and disadvantages; we propose a solid architecture for performing fact-checking at scale, and we describe each component focusing on its role and outputs, as well as its interactions with other components. The main advantages of our framework are given by an efficient combination of the components in terms of increasing accuracy and evaluation time, decreasing costs, and by the feedback between and within each component.</p><p>Future work aims at proving a full framework implementation. More in detail, further study will be done on the synergies between crowd and AI to investigate the effects of an informed assessment made by the crowd leveraging AI outputs, and to set thresholds to decide about statement forwarding among components.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Overview of the proposed framework.</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">while training language models from scratch can cost up to millions of dollars<ref type="bibr" target="#b23">[23]</ref>, once trained they can be used multiple times leveraging few-or zero-shot learning[24,  </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="25" xml:id="foot_1">].</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">https://www.politifact.com/article/2018/feb/12/principles-truth-o-meter-politifacts-methodology-i/ #Truth-O-Meter%20ratings</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Preface to the sixth workshop on natural language for artificial intelligence (nl4ai)</title>
		<author>
			<persName><forename type="first">D</forename><surname>Nozza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Passaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Polignano</surname></persName>
		</author>
		<idno>CEUR-WS.org</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Sixth Workshop on Natural Language for Artificial Intelligence (NL4AI 2022) co-located with 21th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022)</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Nozza</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">C</forename><surname>Passaro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Polignano</surname></persName>
		</editor>
		<meeting>the Sixth Workshop on Natural Language for Artificial Intelligence (NL4AI 2022) co-located with 21th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022)</meeting>
		<imprint>
			<date type="published" when="2022-11-30">November 30, 2022. 2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Yao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Yu</surname></persName>
		</author>
		<idno type="DOI">10.48550/ARXIV.1909.03654</idno>
		<title level="m">The Future of Misinformation Detection: New Perspectives and Trends</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Fake news detection within online social media using supervised artificial intelligence algorithms</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Ozbay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Alatas</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.physa.2019.123174</idno>
	</analytic>
	<monogr>
		<title level="j">Physica A: Statistical Mechanics and its Applications</title>
		<imprint>
			<biblScope unit="volume">540</biblScope>
			<biblScope unit="page">123174</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Detecting health misinformation in online health communities: Incorporating behavioral features into machine learning based approaches</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Da</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yan</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.ipm.2020.102390</idno>
	</analytic>
	<monogr>
		<title level="j">Information Processing &amp; Management</title>
		<imprint>
			<biblScope unit="volume">58</biblScope>
			<biblScope unit="page">102390</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">The Wisdom of Crowds</title>
		<author>
			<persName><forename type="first">J</forename><surname>Surowiecki</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
			<publisher>Anchor</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Fighting misinformation on social media using crowdsourced judgments of news source quality</title>
		<author>
			<persName><forename type="first">G</forename><surname>Pennycook</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">G</forename><surname>Rand</surname></persName>
		</author>
		<idno type="DOI">10.1073/pnas.1806781116</idno>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences</title>
		<imprint>
			<biblScope unit="volume">116</biblScope>
			<biblScope unit="page" from="2521" to="2526" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Crowdsourcing Truthfulness: The Impact of Judgment Scale and Assessor Bias</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">La</forename><surname>Barbera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 42nd European Conference on Information Retrieval</title>
				<meeting>the 42nd European Conference on Information Retrieval</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="207" to="214" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor&apos;s Background</title>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 43rd Conference on Research and Development in Information Retrieval, SIGIR</title>
				<meeting>the 43rd Conference on Research and Development in Information Retrieval, SIGIR</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="439" to="448" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively?</title>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Portelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Della Mea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Serra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<idno type="DOI">10.1145/3340531.3412048</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 29th Conference on Information &amp; Knowledge Management</title>
				<meeting>the 29th Conference on Information &amp; Knowledge Management</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1305" to="1314" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Can the Crowd Judge Truthfulness? A Longitudinal Study on Recent Misinformation about COVID-19</title>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Portelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Luise</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Della Mea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Serra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00779-021-01604-6</idno>
	</analytic>
	<monogr>
		<title level="m">Personal and Ubiquitous Computing</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">The many dimensions of truthfulness: Crowdsourcing misinformation assessments on a multidimensional scale</title>
		<author>
			<persName><forename type="first">M</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>La</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Barbera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ceolin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><surname>Demartini</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.ipm.2021.102710</idno>
	</analytic>
	<monogr>
		<title level="j">Information Processing &amp; Management</title>
		<imprint>
			<biblScope unit="volume">58</biblScope>
			<biblScope unit="page">102710</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Scaling up fact-checking using the wisdom of crowds</title>
		<author>
			<persName><forename type="first">J</forename><surname>Allen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Arechar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pennycook</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">G</forename><surname>Rand</surname></persName>
		</author>
		<idno type="DOI">10.1126/sciadv.abf4393</idno>
	</analytic>
	<monogr>
		<title level="j">Science Advances</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">4393</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The Effects of Crowd Worker Biases in Fact-Checking Tasks</title>
		<author>
			<persName><forename type="first">T</forename><surname>Draws</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>La</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Barbera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ceolin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Checco</surname></persName>
		</author>
		<author>
			<persName><surname>Mizzaro</surname></persName>
		</author>
		<idno type="DOI">10.1145/3531146.3534629</idno>
	</analytic>
	<monogr>
		<title level="m">Conference on Fairness, Accountability, and Transparency, FAccT</title>
				<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="2114" to="2124" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">SAMS: Human-in-the-loop Approach to Combat the Sharing of Digital Misinformation</title>
		<author>
			<persName><forename type="first">S</forename><surname>Shabani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Charlesworth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sokhn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Schuldt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">CEUR Workshop Proc</title>
		<imprint>
			<biblScope unit="page">2846</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Scalable Fact-checking with Human-in-the-Loop</title>
		<author>
			<persName><forename type="first">J</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Vega-Oliveros</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Seibt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rocha</surname></persName>
		</author>
		<idno type="DOI">10.1109/WIFS53200.2021.9648388</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE Workshop on Information Forensics and Security, WIFS</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Karagiannis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Saeed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Papotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Trummer</surname></persName>
		</author>
		<idno>CoRR abs/2003.06708</idno>
		<title level="m">Scrutinizer: A Mixed-Initiative Approach to Large-Scale, Data-Driven Claim Verification</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Human-in-the-loop Artificial Intelligence for Fighting Online Misinformation: Challenges and Opportunities</title>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bulletin of IEEE Computer Society</title>
		<imprint>
			<biblScope unit="volume">43</biblScope>
			<biblScope unit="page" from="65" to="74" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Nakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Barrón-Cedeño</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Da San Martino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Alam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Struß</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mandl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Míguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Caselli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kutlu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zaghouani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Shaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">K</forename><surname>Shahi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Mubarak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Babulkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">S</forename><surname>Kartal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Wiegand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Siegel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Köhler</surname></persName>
		</author>
		<title level="m">Overview of the CLEF</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection</title>
	</analytic>
	<monogr>
		<title level="m">Experimental IR Meets Multilinguality, Multimodality, and Interaction</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Barrón-Cedeño</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Da San Martino</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Degli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Esposti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Sebastiani</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Macdonald</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Pasi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Hanbury</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Potthast</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Faggioli</surname></persName>
		</editor>
		<editor>
			<persName><surname>Ferro</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="495" to="520" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Team DOMLIN: Exploiting evidence enhancement for the FEVER shared task</title>
		<author>
			<persName><forename type="first">D</forename><surname>Stammbach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Neumann</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/D19-6616</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2nd Workshop on Fact Extraction and VERification, FEVER, ACL</title>
				<meeting>the 2nd Workshop on Fact Extraction and VERification, FEVER, ACL</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="105" to="109" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Toward Automated Factchecking: Developing an Annotation Schema and Benchmark for Consistent Automated Claim Detection</title>
		<author>
			<persName><forename type="first">L</forename><surname>Konstantinovskiy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Price</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Babakar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zubiaga</surname></persName>
		</author>
		<idno type="DOI">10.1145/3412869</idno>
	</analytic>
	<monogr>
		<title level="j">Digital Threats</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Credibility in social media: opinions, news, and health information-a survey</title>
		<author>
			<persName><forename type="first">M</forename><surname>Viviani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pasi</surname></persName>
		</author>
		<idno type="DOI">10.1002/widm.1209</idno>
		<idno>doi:</idno>
		<ptr target="https://doi.org/10.1002/widm.1209" />
	</analytic>
	<monogr>
		<title level="j">WIREs Data Mining and Knowledge Discovery</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">e1209</biblScope>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Combining Human and Machine Confidence in Truthfulness Assessment</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Qu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Barbera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<idno type="DOI">10.1145/3546916</idno>
	</analytic>
	<monogr>
		<title level="j">Data and Information Quality</title>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><surname>Sharir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Peleg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shoham</surname></persName>
		</author>
		<title level="m">The Cost of Training NLP Models: A Concise Overview</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv</note>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Language models are few-shot learners</title>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ryder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Subbiah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Kaplan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dhariwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Neelakantan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Shyam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sastry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Askell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Advances in neural information processing systems</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="1877" to="1901" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Large Language Models are Zero-Shot Reasoners</title>
		<author>
			<persName><forename type="first">T</forename><surname>Kojima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Reid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Matsuo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Iwasawa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Workshop on Knowledge Retrieval and Language Models</title>
				<imprint>
			<publisher>ICML</publisher>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">Generating Fact Checking Explanations</title>
		<author>
			<persName><forename type="first">P</forename><surname>Atanasova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Simonsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Lioma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Augenstein</surname></persName>
		</author>
		<idno>CoRR abs/2004.05773</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Kotonya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Toni</surname></persName>
		</author>
		<idno>CoRR abs/2011.03870</idno>
		<title level="m">Explainable Automated Fact-Checking: A Survey</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Fake news detection using naive Bayes classifier</title>
		<author>
			<persName><forename type="first">M</forename><surname>Granik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Mesyura</surname></persName>
		</author>
		<idno type="DOI">10.1109/UKRCON.2017.8100379</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE First Ukraine Conference on Electrical and Computer Engineering</title>
				<meeting><address><addrLine>UKRCON</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="900" to="903" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">A Checklist to Combat Cognitive Biases in Crowdsourcing</title>
		<author>
			<persName><forename type="first">T</forename><surname>Draws</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rieger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Inel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Gadiraju</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tintarev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Human Computation and Crowdsourcing</title>
				<meeting>the AAAI Conference on Human Computation and Crowdsourcing</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="48" to="59" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Event-based blackboard architecture for multi-agent systems</title>
		<author>
			<persName><forename type="first">J</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-J</forename><surname>Jeng</surname></persName>
		</author>
		<idno type="DOI">10.1109/ITCC.2005.149</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Conference on Information Technology: Coding and Computing</title>
				<meeting>the Conference on Information Technology: Coding and Computing</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="379" to="384" />
		</imprint>
	</monogr>
	<note>of ITCC</note>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Beat the Machine: Challenging Humans to Find a Predictive Model&apos;s &quot;Unknown Unknowns</title>
		<author>
			<persName><forename type="first">J</forename><surname>Attenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Ipeirotis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Provost</surname></persName>
		</author>
		<idno type="DOI">10.1145/2700832</idno>
	</analytic>
	<monogr>
		<title level="j">J. Data and Information Quality</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration</title>
		<author>
			<persName><forename type="first">H</forename><surname>Lakkaraju</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kamar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Caruana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Horvitz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 31st AAAI Conference on Artificial Intelligence</title>
				<meeting>the 31st AAAI Conference on Artificial Intelligence</meeting>
		<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="2124" to="2132" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Towards Hybrid Human-AI Workflows for Unknown Unknown Detection</title>
		<author>
			<persName><forename type="first">A</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Guerra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Fung</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Matute</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kamar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lasecki</surname></persName>
		</author>
		<idno type="DOI">10.1145/3366423.3380306</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of The Web Conference, WWW, ACM</title>
				<meeting>The Web Conference, WWW, ACM</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="2432" to="2442" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<author>
			<persName><forename type="first">B</forename><surname>Taboubi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A B</forename><surname>Nessir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Haddad</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">iCompass at CheckThat! 2022: combining deep language models for fake news detection</title>
		<title level="s">Working Notes of CLEF</title>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">On Calibration of Modern Neural Networks</title>
		<author>
			<persName><forename type="first">C</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pleiss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">Q</forename><surname>Weinberger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 34th Conference on Machine Learning</title>
				<meeting>the 34th Conference on Machine Learning</meeting>
		<imprint>
			<publisher>JMLR</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">70</biblScope>
			<biblScope unit="page" from="1321" to="1330" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">BUM at Check-That! 2022: A Composite Deep Learning Approach to Fake News Detection using Evidence Retrieval</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">La</forename><surname>Barbera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mackenzie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mizzaro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes of CLEF 2022-Conference and Labs of the Evaluation Forum</title>
				<imprint>
			<publisher>CLEF</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="564" to="572" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<monogr>
		<title level="m" type="main">The Choice of Knowledge Base in Automated Claim Checking</title>
		<author>
			<persName><forename type="first">D</forename><surname>Stammbach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ash</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2111.07795</idno>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Kazemi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Pérez-Rosas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Mihalcea</surname></persName>
		</author>
		<idno>CoRR abs/2104.12918</idno>
		<title level="m">Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">A Neural Model to Jointly Predict and Explain Truthfulness of Statements</title>
		<author>
			<persName><forename type="first">E</forename><surname>Brand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Soprano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rahimi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<idno type="DOI">10.1145/3546917</idno>
	</analytic>
	<monogr>
		<title level="j">Data and Information Quality</title>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Ziegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Stiennon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">B</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Amodei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">F</forename><surname>Christiano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Irving</surname></persName>
		</author>
		<idno>CoRR abs/1909.08593</idno>
		<title level="m">Fine-Tuning Language Models from Human Preferences</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<analytic>
		<title level="a" type="main">Humans and Algorithms Detecting Fake News: Effects of Individual and Contextual Confidence on Trust in Algorithmic Advice</title>
		<author>
			<persName><forename type="first">C</forename><surname>Snijders</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Conijn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fouw</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Berlo</surname></persName>
		</author>
		<idno type="DOI">10.1080/10447318.2022.2097601</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Human-Computer Interaction</title>
		<imprint>
			<biblScope unit="page" from="1" to="12" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">Cheaper and Better: Selecting Good Workers for Crowdsourcing</title>
		<author>
			<persName><forename type="first">H</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Human Computation and Crowdsourcing</title>
				<meeting>the AAAI Conference on Human Computation and Crowdsourcing</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="20" to="21" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<analytic>
		<title level="a" type="main">Why Is That Relevant? Collecting Annotator Rationales for Relevance Judgments</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mcdonnell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lease</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kutlu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Elsayed</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th Conference on Human Computation and Crowdsourcing</title>
				<meeting>the 4th Conference on Human Computation and Crowdsourcing</meeting>
		<imprint>
			<publisher>HCOMP</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="139" to="148" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<analytic>
		<title level="a" type="main">Annotator Rationales for Labeling Tasks in Crowdsourcing</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kutlu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mcdonnell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lease</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Elsayed</surname></persName>
		</author>
		<idno type="DOI">10.1613/jair.1.12012</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial Intelligence Research</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="page" from="143" to="189" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<monogr>
		<title level="m" type="main">TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">X</forename><surname>Morris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lifland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Yoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Grigsby</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Qi</surname></persName>
		</author>
		<idno type="DOI">10.48550/ARXIV.2005.05909</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
