<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Bayesian Framework for Reputation in Citizen Science</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Joan</forename><surname>Garriga</surname></persName>
							<email>jgarriga@ceab.csic.es</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Centre de Recerca Ecològica i Aplicacions Forestals (CREAF)</orgName>
								<address>
									<addrLine>Carrer de les Cases Sert 54</addrLine>
									<postCode>08193</postCode>
									<settlement>Cerdanyola del Vallès, Barcelona</settlement>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="laboratory">Centre d&apos;Estudis Avançats de Blanes (CEAB-CSIC)</orgName>
								<address>
									<addrLine>Carrer Accés Cala Sant Francesc 14</addrLine>
									<postCode>17300</postCode>
									<settlement>Girona</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jaume</forename><surname>Piera</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">Centre de Recerca Ecològica i Aplicacions Forestals (CREAF)</orgName>
								<address>
									<addrLine>Carrer de les Cases Sert 54</addrLine>
									<postCode>08193</postCode>
									<settlement>Cerdanyola del Vallès, Barcelona</settlement>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Institut de Ciències del Mar (ICM-CSIC)</orgName>
								<address>
									<addrLine>Passeig Marítim de la Barceloneta 37-49</addrLine>
									<postCode>08003</postCode>
									<settlement>Barcelona</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Frederic</forename><surname>Bartumeus</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">Centre de Recerca Ecològica i Aplicacions Forestals (CREAF)</orgName>
								<address>
									<addrLine>Carrer de les Cases Sert 54</addrLine>
									<postCode>08193</postCode>
									<settlement>Cerdanyola del Vallès, Barcelona</settlement>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="laboratory">Centre d&apos;Estudis Avançats de Blanes (CEAB-CSIC)</orgName>
								<address>
									<addrLine>Carrer Accés Cala Sant Francesc 14</addrLine>
									<postCode>17300</postCode>
									<settlement>Girona</settlement>
								</address>
							</affiliation>
							<affiliation key="aff3">
								<orgName type="department">Institució Catalana de Recerca i Estudis Avançats (ICREA)</orgName>
								<address>
									<addrLine>Passeig de Lluís Companys 23</addrLine>
									<postCode>08010</postCode>
									<settlement>Barcelona</settlement>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Bayesian Framework for Reputation in Citizen Science</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">67F05BC8FF573CBF04E4325562F14A9A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T12:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>citizen science</term>
					<term>reputation system</term>
					<term>Bayesian network</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The viability of any Citizen Science (CS) research program is absolutely conditioned to the engagement of the citizen. In a CS framework in which participants are expected to perform actions that can be later on validated, the incorporation of a reputation system can be a successful strategy to increase the overall data quality and the likelihood of engagement, and also to evaluate how close citizens fulfill the goals of the CS research program. Under the assumption that participant actions are validated using a simple discrete rating system, current reputation models, thoroughly applied in e-platform services, can be easily adapted to be used in CS frameworks. However, current reputation models implicitly assume that rated items and scored agents are the same entity, and this does not necessarily hold in a CS framework, where one may want to rate actions but score the participants generating it. We present a simple approach based on a Bayesian network representing the flow described above (user, action, validation), where participants are aggregated in a discrete set of user classes and we use the global evidence in the data base to estimate both the prior and the posterior distribution of the user classes. Afterwards, we evaluate the expertise of each participant by computing the user-class likelihood of the sequence of actions/validations observed for that user. As a proof of concept we implement our model in a real CS case, namely the Mosquito Alert project.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Since its origins, back in the mid-90's, citizen science (CS) has been questioned by the scientific community as an adequate scientific methodology <ref type="bibr" target="#b6">[7]</ref>. Pros and cons aside, a basic principle to bring citizens and scientists into a productive relationship is to match the public understanding of science with the science's understanding of the public <ref type="bibr" target="#b6">[7]</ref>. To this end, modern citizen science is rethinking methods for citizen engagement <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b0">1]</ref>. Key concepts in participants engagement are connection and reward. Connection refers to connecting the scientific goals of the CS research program with the citizen perception of a social worry or interest (the basic motivation to start cooperating). Reward refers to providing feedback that can be neatly perceived as a reward (the basic motivation to keep cooperating). Nevertheless, it is well known by psychologists <ref type="bibr" target="#b0">[1]</ref> that the effect of a reward resides in its expectation and vanishes as soon as it is achieved. Thus, in order to increase the likelihood of participation in the long run, it is necessary to generate continuous reward expectations. A successful strategy to achieve sustained participation requires the implementation of a reputation system as a core component of any CS research program. In addition, wellgrounded reputation systems provide back-end information of participants that is valuable to augment data quality and to increase the fitness for use <ref type="bibr" target="#b11">[12]</ref> of CS research programs.</p><p>Reputation is a broad concept not only suitable to people but also to many kinds of things or services <ref type="bibr" target="#b1">[2]</ref>. Extending the notion given in <ref type="bibr" target="#b10">[11]</ref>, reputation is the perception that an agent (or item) creates through past actions about its intentions, norms, knowledge, expertise or value. Reputation can be seen as an asset, not only to promote oneself, but also to help us to make sound judgments in the absence of any better information. However, reputation is highly contextual and what works well in a specific context may inevitably fail in many others. As a consequence details about reputation systems are profusely treated in the literature <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b13">14]</ref>. The simplest reputation systems scale down to a ranking/voting system where information is aggregated into a single score used to qualify and sort items (e.g. songs in iTunes, users in Stackoverflow ). Systems for collecting and representing reputation information are usually based on simple rating mechanisms such as thumbs-up/down or a five star rating. The difficulties arise at the time of aggregating this information.</p><p>Many rating aggregation systems recently proposed (e.g. Amazon, iTunes, YouTube) are different forms of Bayesian Rating (BR), a pseudo-average computed as a weighted combination of the average rating of a particular item and the average rating for all items. In a k-way rating system, (i.e with k discrete rating levels r ∈ {1, . . . , k}), with a total of m rates and an overall rating r (all) = m j=1 r (y) j /m, the BR of an item y with n ratings, and an average rating r (y) = n j=1 r (y) j /n is given by,</p><formula xml:id="formula_0">BR (y) = n r (y) + m r (all) m + n = w r (y) + (1 − w) r (all)<label>(1)</label></formula><p>with w = n/ (n + m). A clear benefit of BR is that an item with only a few ratings (i.e w → 0) will approach the overall mean rating, hence, does not receive the lowest (unfair and discouraging) rate but the average rate, while the more the item is rated (i.e n 0) the largest the weight of its own average rating. In any case m n, hence the scoring is focused on the quality of the ratings rather than on the quantity of ratings. The Beta reputation system <ref type="bibr" target="#b8">[9]</ref> (binomial) and the Dirichlet reputation system <ref type="bibr" target="#b9">[10]</ref> (DR), the multinomial generalization of the former, are reputation models based on a sound statistical machinery that explains away the Bayesian rating concept and frames it in a real Bayesian perspective. Consider a k-way rating system and let the rating level be indexed by i (i.e. 1 ≤ i ≤ k). Let n (y) i be the rating counts for item y (the observed evidence), and let a i be a base rate expressing the biases in our prior belief about each rating level. A Dirichlet (or Beta for k = 2) rating yields a multinomial probability distribution S (y) i over the k rating levels, where the expectation value for each rating level is computed as,</p><formula xml:id="formula_1">S (y) i = n (y) i + C a i C + n (y)<label>(2)</label></formula><p>where n (y) = k i=1 n (y) i and C is the a priori constant that can be set to C = k if we consider a uniform prior (i.e. a i = 1/k). In this case Equation <ref type="formula" target="#formula_1">2</ref>defaults to the classical Laplace smoothing <ref type="bibr" target="#b8">[9]</ref>. The larger the value of C with respect to k the less the influence of the observed ratings and the more S (y) i will approach the base rate a i . Assuming the k rating levels evenly distributed in the range [0, 1], a point estimate reputation score is computed as,</p><formula xml:id="formula_2">DR (y) = k i=1 i k S (y) i<label>(3)</label></formula><p>Multinomial form aside, the similarity with BR (Eq. 1) is clear. But the difference can not be obviated: while the weighting in Equation 1 emerges from a pure frequentist persepective, in a DR the factor C a i can convey specific a priori information provided by domain experts or any other external source <ref type="bibr" target="#b8">[9]</ref>.</p><p>Agents (and in particular human agents) may change their behaviour over time. This issue is usually approached either by incorporating a cutoff factor that limits the series of ratings to the most recent ones, up to a given period or a given number, or by introducing a longevity factor that assigns a time relative weight to ratings <ref type="bibr" target="#b8">[9]</ref>. An additional concern in reputation systems for e-service platforms is its resistance against typical strategies for reputation cheating (e.g. whitewashing, sybil attacks, fraudulent ranking) reviewed in e.g. <ref type="bibr" target="#b3">[4]</ref>.</p><p>CS research programs constitute a different scenario where the aim is not to promote user interaction but to collect useful data for their scientific goals. Hence, reputation issues do not arise from peer-to-peer interaction but from the need to increase citizen engagement and data quality. However, a systematic review of 234 CS research programs presented in <ref type="bibr" target="#b4">[5]</ref> reveals that despite of this general concern on data quality, very few has been made in terms of participants' reputation. Data validation is usually performed by a core of domain experts or project coordinators, eventually assited by authomated methods or with some level of intra-community interaction (e.g. eBird, Galaxy Zoo, iSpot) or more broadly via crowd-sourcing (e.g. www.crowdcrafting.org). In a few cases, local coordinators take into account the participants' experience for validating data (e.g Common Bird Monitoring, Weather Observations Website), and just in a handful of them it is the community of participants itself who directly validate data (e.g Galaxy Zoo, iSpot, oldWeather ). Among the later, Notes from Nature and iSpot <ref type="bibr" target="#b14">[15]</ref> are the ones going further in terms of community-based validation and participants' reputation, implementing simple agreement based algorithms to rank participants and assign digital badges in recognition of specific achievements. Community-based validation explicitly requires a core reputation system integrated with the CS research program. However, there is not a general approach and each research program implements reputation in a functional way to fit its needs, neither framing its system in general conceptual frameworks, nor making it available to the scientific community.</p><p>Notably, an implicit assumption in any of the reputation models above is that the agent (or item) being rated is the one that is scored and, more explicitely, that the rating system used to collect ratings for an agent is the rating system used to score that agent (e.g. Equation2). This apparently obvious and irrelevant assumption might become more subtle in a CS framework. CS research programs expect participants to perform a set of actions (basically, reporting information in specific formats) and these actions are later on validated. In this case, the rated items are the actions, but the scored agents are the participants. Importantly, each kind of action might require its own discrete rating system (not necessarily coincident in the number of levels). Yet the expertise of participants might be expressed based on a specific set of user classes (with its own number of levels), and scored based on the ratings of all their possibly different actions. A straightforward way to overcome this problem is to compute separate scores for each type of action and get an overall score using a weighted combination of the former. Alternatively, we propose a novel model for user reputation based on a Bayesian network describing the characteristic flow of CS research programs, (i.e. user, action, validation). The proposed method (i) decouples action rating from participant scoring, (ii) provides a unified framework to process validation information regardless of the rating levels used for each kind of action, (iii) accounts for a good balance of both quality and quantity of evidence, and (iv) is more responsive to participants' actions, which may augment engagement dynamics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Mosquito Alert: CS for public health</head><p>Mosquito Alert (MA) is a CS research program initially devised to monitor the expansion in Spain of the Asian tiger mosquito (Aedes albopictus), a disease carrying mosquito. Since the expansion of the Zika virus threat in 2016, MA included the monitoring of the yellow fever mosquito (Aedes aegypti ). Both species are world wide distributed, living in urban environments, and being specially invasive and agressive vectors of tropical infectious diseases such as Dengue, Chikungunya, Yellow Fever, and Zika.</p><p>Aside from its scientific goals (e.g. unveiling the population expansion mechanisms, forecasting vector and disease threats), a particular challenge for MA arises from its impact on the public health domain. MA is aimed to provide reliable early warning information (in recently colonised areas), and real-time management information (in areas where it resides) to public health administrators. Public health administrations at different organizational levels in the territory, use MA to improve their surveillance and control programs with the goal of decreasing mosquito populations, specially in urban areas. Because of all this, MA is designed as a multi-platform system structured as follows:</p><p>1. The MA smartphone app (freely available for Android and iOS), by means of which citizen can send reports of observations of mosquitoes (and their breeding sites) potentially belonging to disease vector species (namely the Asian Tiger and the Yellow Fever mosquito). 2. The corresponding server-side functionality (Django, SQL) managing the reception and storage of data, along with an ever evolving set of tools for the management and analysis of the data, including machine-learning algorithms to help automating the validation of information <ref type="bibr" target="#b12">[13]</ref>. 3. A private platform called Ento Lab. This is a restricted access service through which a set of experts can make a previous filtering of inappropriate reports and classify the rest as either positive or negative ones. Only classified reports are afterwards made visible to the rest of the services. 4. Another private platform named Managers Portal which grants on-demandaccess to stakeholders (e.g. public health administrations, mosquito control services, private mosquito control companies), open GIS tools to visualize all the available data in the portal (including their own imported management data), and the possibility to directly comunicate control actions through the app. 5. A public platform http://www.mosquitoalert.com, providing data and visualization tools to all the public via interactive maps, where participants can find their individual contributions validated by the experts (Figure <ref type="figure" target="#fig_0">1</ref>, right).</p><p>The direct involvement of public health institutions make citizen truly conscious of the usefulness of their contributions (much beyond science). Triggering mosquito control actions in the territory through MA participation constitutes the necessary reward to keep citizens engaged in the research program. The work described in this paper is based on data corresponding to the last two years (2015-2016) of MA, summarized in Table <ref type="table" target="#tab_0">1</ref>, with more than 30000 app downloads, 2993 active users and 5349 reports submited. Reports are of type adult (4177) correponding to observations of adult mosquitoes (either Asian tiger or Yellow Fever) or bSite (1172) corresponding to potential mosquitoes' breeding sites. Reports of type adult and bSite are reviewed by experts who manually label them. Reports of type adult are labelled as: −2, definitely not corresponding to the species of interest; −1, probably not corresponding to the species of interest; 0, can not tell; 1, probably corresponding to the species of interest; and 2, definitely corresponding to the species of interest. Reports of type bSite are labelled as: −1, does not correspond to a breeding site; 0, can not tell; and +1, does correspond to a breeding site. NC stands for not-classified reports which either do not provide an image or have not yet been reviewed by the experts. The hd stands for hidden reports which correspond to reports with improper images that are hidden by the experts (not shown in the map). -Users are participants of the CS research program aggregated in a variable U = {1, . . . , k} specifying an ordinal set of user-classes with increasing levels of expertise (e.g. beginner, advanced, expert). Beyond the intuition that U should be a discrete ordinal variable, we have no prior idea of the optimal number of user classes we should define nor about their prior distribution. -Actions consist in submitting reports either of type adult or bSite. Thus, we define A = {adult, bSite} such that P (A|U ) specifies the probability that a user of a given class submits a report of a particular type. We assume that each submitted report is an independent event. -Validations are expressed in terms of ratings and each kind of action can be rated with a different number of discrete rating levels. Thus, we define a variable V such that P (V |A) specifies the probability that an action of a given type gets the corresponding rating (or, transitorily, no rating), namely (V |A = adult) = {N one, hd, −2, −1, 0, 1, 2} and (V |A = bSite) = {N one, hd, −1, 0, 1}.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Methods</head><p>From the joint distribution P (U, A, V ) and by simple Bayes' rule we have,</p><formula xml:id="formula_3">P (U |A, V ) = 1 P (A, V ) P (V |A) P (A|U ) P (U )</formula><p>where P (A, V ) is just a normalization constant. So, if we have some means to estimate a prior distribution P (U ), and the conditional distributions P (A|U ) and P (V |A), we can evaluate the posterior distribution of user-class expertise for a user y given the observed sequence of actions/validations S (y) (Figure <ref type="figure" target="#fig_2">2</ref>, right),</p><formula xml:id="formula_4">P (U |S (y)) = A,V ∈S(y) P (U |A, V )</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Guessing a prior P (U )</head><p>We start by guessing a number of user expertise levels. For each user we consider three features regarding to the sequence S (y): (i) the quantity of reports, (ii) the quality of the reports, and (iii) a user's mobility index mI describing the average area covered by the user defined as the variance of the pairwise geolocation distances between the reports,</p><formula xml:id="formula_5">mI (y) = 1 2 |S (y) | 2 (p,q)∈S(y) [ (p x − q x ) 2 + (p y − q y ) 2 ]<label>(4)</label></formula><p>where (p x , p y ) , (q x , q y ) are the geolocation coordinates.</p><p>Based on these features we define the following proxy variables of the userclass U (Figure <ref type="figure" target="#fig_2">2</ref>, left, darkgrey nodes): (i) a quantitative proxy aggregating users sending less or more than a given number θ 1 of reports (N = {less, more}); (ii) a mobility proxy aggregating users with a mobility index lower/higher than a given value θ 2 (M = {lower, higher}); (iii) a quality proxy aggregating reports in four categories: hidden, low quality (those labeled as (−2, −1)), medium quality (those labeled as 0), and high quality (those labeled as (1, 2)), (Q = {hidden, low, medium, high}, we do not count here not-classified reports). Note that (i) and (ii) account for the attitude of the participants, while (iii) accounts for their skills, and both aspects are deemed important. The joint combination of the above three proxys results in a primary partition of the users' expertise space into 16 categories summarized in Table <ref type="table" target="#tab_1">2</ref>. The threshold values where selected by looking at the corresponding histograms and taking the values that yield the most possible balanced distribution.  By looking at this table, we should now infer a set of user classes with increasing levels of expertise. We prioritize as following: (i) the quality of the reports before the quantity (low quality reports just result in a waste of experts' time); (ii) the quantity of reports before the mobility index of the users (we give the lowest priority to the mobility index because the meaning of this variable is double folded: for surveillance purposes it is important that participants send reports covering the broadest geographical area possible, but for control purposes it is also important that participants keep sending reports within their neighbourhoods). Also, we are not looking here for a fine grain discretization of the expertise space. Taking into account the unbalances present in Table <ref type="table" target="#tab_1">2</ref> it looks reasonable to impose an ordering of the 16 expertise categories into a set of k = 6 user-classes, i.e U = {1, . . . , 6}, as shown in Table <ref type="table" target="#tab_3">3</ref>. Tables <ref type="table" target="#tab_3">2 and 3</ref> together express a joint distribution P (N, Q, M, U ) from which the prior P (U ) follows straightforwardly by marginalization,</p><formula xml:id="formula_6">= 2, θ2 = 10 −9 . N ≤ θ1, M ≤ θ2 N ≤ θ1, M &gt; θ2 N &gt; θ1, M ≤ θ2 N &gt; θ1, M &gt;</formula><formula xml:id="formula_7">P (U ) = N,Q,M P (N, Q, M, U )<label>(5)</label></formula><p>and we get a temptative prior for the user-class variable (Table <ref type="table" target="#tab_4">4</ref>).</p><p>Having defined the user classes we know the user-class value of each report, and we can make estimations (maximum a posteriori, MAP) for the action con-   <ref type="table" target="#tab_5">5</ref>) and also for the validation conditional distributions P (V |A = adult) and P (V |A = bSite) (Table <ref type="table" target="#tab_6">6</ref>).</p><formula xml:id="formula_8">N ≤ θ1, M ≤ θ2 N ≤ θ1, M &gt; θ2 N &gt; θ1, M ≤ θ2 N &gt; θ1, M &gt; θ2 hiddden 1 1</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Computing the posterior P (U |A, V )</head><p>Applying Bayes' rule we compute the posterior distribution P (U |A, V ),</p><formula xml:id="formula_9">P (U |A, V ) = 1 W P (V |A) P (A|U ) P (U )<label>(6)</label></formula><p>where W = k i=1 p (u i |A, V ). Equation <ref type="formula" target="#formula_9">6</ref>evaluates the probability that an action (report) of a given type, with a given validation (rating), belongs to a particular user-class (Table <ref type="table" target="#tab_7">7</ref>).</p><p>Note that, in our Bayesian approach ratings become a fuzzy qualification of the user-class. Also note that this is indeed a two parameter model (θ 1 , θ 2 ) allowing a degree of control over the user-class prior and, ultimately, over the user-class posterior distributions, (i.e. we can push the classification of reports to a lower/upper user-class by increasing/decreasing either one or both of the parameters (this is shown later in Figure <ref type="figure">4</ref>). Note that, so far, our UAV-network (Figure <ref type="figure" target="#fig_2">2</ref> , left) just yields the user-class distribution of a single action, not the users' scoring that we aim. To compute the scores we consider the report sequences S (y) (Figure <ref type="figure" target="#fig_2">2</ref>, right). To add some dynamics to the model we consider a third parameter θ 3 (a cutoff factor ) that limits the sequences to the last θ 3 reports. Assuming an iid sequence of reports, the corresponding user-class posterior distribution is given by,</p><formula xml:id="formula_10">P (U |S (y)) = P 0 P w θ3 j=1 P (U |A j , V j ) (<label>7</label></formula><formula xml:id="formula_11">)</formula><p>where P 0 = (1/k, . . . , 1/k) sets a starting uniform user-class distribution and P w = k i=1 P (u i |S) is a normalization factor. Afterwards, an expertise score can be computed as the user's expected user-class,</p><formula xml:id="formula_12">X (y) = 1 k E[U ] P (U |S) = 1 k k i=1 u i p (u i |S)<label>(8)</label></formula><p>Equation 8 yields a normalized score with a lower bound given by 1 k E[U ] P0 which avoids a discouraging zero-score for new comers. Usually, the computation of Equation 7 is subject to numerical precision problems and therefore we implement a log computation as,</p><formula xml:id="formula_13">log P (U |S (y)) = log P 0 + θ3 j=1 log P (U |A j , V j ) − log P w<label>(9)</label></formula><p>For gamification purposes, users are ranked based on their scores. Ties are solved by mobility index. The rank position, not the score, is notified to the users via the smartphone app, together with a quantile based category label as either gold, silver or bronze (Figure <ref type="figure" target="#fig_0">1</ref>, left).</p><p>In summary, we use the global evidence in the data base to guess the joint distribution P (U, Q, N, M ) and estimate a prior P (U ), from which we can derive the posteriors P (U |A, V ). Afterwards, we use the evidence observed for each particular user S (y), to evaluate the posterior distribution P (U |S (y)) and compute a score for that user. Essentially, our scoring model is a naïve Bayes classifier where the number of features varies with the number of reports used to qualify the user. The larger the number of reports, the better the profiling of the user. Because we use the global evidence to estimate the user-class distribution, the scores change dynamically as the contents of the data base grow and all individual expertise scores are in part dependent on the overall average performance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Results</head><p>Based on the user-class posterior distributions (Table <ref type="table" target="#tab_7">7</ref>) and applying Equations 9 and 8 we get the results shown in Figure <ref type="figure">3</ref>. In the x-axis, users are ranked by score (blue line). Scores are plotted together with scaled versions of the mobility index (yellow), the number of breedingSite reports (cyan) and the number of adultMosquito reports (magenta). The scoring yields several plateaus corresponding to typical number of submitted reports. We highlight (darkgrey rows in Table <ref type="table">8</ref>) the position of users who only submitted one report, classified as +1 (positions 1494:2053, 560 users), or classified as +2 (positions 2244:2684, 441 users), which correspond to the largest plateaus in Figure <ref type="figure">3</ref>. As expected, the larger the number of positive reports the higher the rank (Table <ref type="table">8</ref> bottom rows), and the larger the number of negative reports the lower the rank (Table <ref type="table">8</ref> top rows), with hidden reports being strongly punished.</p><p>We also analyze the effect of the threshold parameters (θ 1 , θ 2 , θ 3 ) (Figure <ref type="figure">4</ref>). In the x-axis, participants are ranked by score (blue line). The red line depicts the score coresponding to a different value of the threshold parameters. In general, the scores do not change much in terms of value though sudden breaks in the increasing trend of the red line reveal users whose position in the ranking has been affected by the change of the parameter value. The plateaus remain almost invariant and we only appreciate some changes of position at the borders of the plateaus. Increasing (decreasing) θ 1 and θ 2 (Figure <ref type="figure">4</ref>, left) together, move reports to the left (right) columns of Table <ref type="table" target="#tab_1">2</ref> and consequently force a change in the prior distribution. As a result, the plateaus are globally pushed lower (higher). This change is propagated to the posteriors and originates also the rank changes that can be observed at the borders of the plateaus (Figure <ref type="figure">4</ref>, left). In the case of θ 3 we observe that by not looking so far in the past (Figure <ref type="figure">4</ref>, right), some low rank users are upgraded (users who clearly improved their performance over time) while some high rank users are downgraded (users who worsened their performance over time).</p><p>The dynamics of the scoring are also shown. In Figure <ref type="figure" target="#fig_4">5</ref> (top) we simulate the evolution of the score (blue line) and rank (red line) for a particular user in a static situation where nothing is changing, no new users are coming and no reports are submitted by third users. Each submitted report is shown as a coloured dot, where the color indicates the validation value of the report. It is apparent that good/bad reports push the score/rank up/down. Figure <ref type="figure" target="#fig_4">5</ref> (bottom) shows the evolution of the score in a realistic situation to show the effect of third users' actions or new comers to MA. Note the double effect of the overall dynamics, inducing soft fluctuations in the score but really significant changes in the rank. The stronger dynamics of the rank makes it much more effective for gamification purposes.  We also show a comparison of our scores with Bayesian Rating and Dirichlet reputation scores (Figure <ref type="figure" target="#fig_5">6</ref>). The scores used in the comparison have been computed taking into account only the reports of type adult. In this way we avoid to analyze second order effects due to weight-averaging of adult and bSite reports, given that in BR or DR these must be independently computed and combined later on. In the x-axis, participants are ranked by our score (blue line). BR scores (Figure <ref type="figure" target="#fig_5">6</ref>, left, magenta line) are clearly affected by the weight of the overall average rating (note the scale of the right y-axis). However, BR still yields the same plateaus and we only observe slight ranking changes at the borders of the plateaus. These changes are due to the differences in the leverage of the rate values (i.e. the values P (U |A, V ) in Tables 7 versus the rating levels r = {1, . . . , k} used in Equation <ref type="formula" target="#formula_0">1</ref>). DR scores (Figure <ref type="figure" target="#fig_5">6</ref>, right) are computed for different values of the C constant. It is clear that C is playing the role of the overall average factor in BR, but DR gives us some control over it. The most important plateaus are also found, but the differences at the borders of the plateaus are more significant. Notably, there is a great difference in the sensibility of our model in comparison to BR and DR. In this context, sensibility represents a better responsiveness of the scoring in relation to participants actions, which we consider it to be a good property to improve participants' engagement in CS research programs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Discussion</head><p>The model we propose is similar to a naïve-bayes classifier where the number of feature nodes varies with the number of actions performed (i.e. reports submitted) by the user. Starting from a uniform user-class distribution, each validated report contributes with new evidence to refine the profiling of the user.</p><p>The key issue of our approach is to estimate a user-class prior that suffices for our scoring purpose. We suggest to select a set of proxy variables of the userclass, with a clear semantics in terms of user expertise, to make a guess about this prior. Nonetheless, any alternative to compute the prior can be considered and applied as well. Anyway, it is crucial to make a guess of the prior that leads to a well balanced (as much as possible) prior and to well behaved (as much as possible) posteriors (i.e. good ratings favouring higher user classes and bad ratings favouring lower user classes). If the probability mass distribution of the posteriors is not in clear correlation with the user classes the behaviour of the algorithm can become non-monotonic with respect to increasing evidence about a certain class. Thus, this step must be carefully considered.</p><p>In the same way that it is not good to score new users excesively low, it is also not good to score them excessively high. The reason to initialize the score with a unifom instead of the prior user-class distribution is that the later will usually be unbalanced, in our case, clearly unbalanced towards the high expertise classes (Table <ref type="table" target="#tab_4">4</ref>), and consequently users with no validated ativity would be ranked either excesively high or excessively low. Using the prior to initialize the score, not-active users get a score of 0.74011666 (i.e. P (U |S) = P (U ) in Equation <ref type="formula" target="#formula_12">8</ref>), while using a uniform distribution their score is 0.58333333 (arround 0.5) and they are positioned by the middle-low part of the rank (rank positions 1069:1278 in Table <ref type="table">8</ref>) which is fairly reasonable.</p><p>As scores are relative to the performance of the whole community, scores are quite dynamic. As participants increase their expertise, all good scores are globally pushed higher. Nevertheless, it is the rank what is ultimately notified to the participants, thus along a period of no activity, a participant might be downgraded with time. Furthermore, in periods of no activity the score can indeed increase if better positioned participants suddenly start sending reports of low quality. These unexpected dynamics could easily generate some confusion or disappointment among the participants. We avoid this situation by giving the basic hints of our scoring system in the project's web page <ref type="foot" target="#foot_0">5</ref> where, indeed, we promote the gamification side of these features in order to use them in our favour. Alternatively, unexpected dynamics as described above could be controlled by implementing an age weighted rating as proposed in <ref type="bibr" target="#b8">[9]</ref>. In our case, this solution should be implemented with special care because of the seasonality of mosquito population and, consequently, minimal report activity during winter and spring. This long periods of minimal activity would uniform all scores and many experienced participants might feel disappointed. At the moment, our decision is to keep participants' scores from one season to another.</p><p>With respect to BR and DR, while essentially capturing the same concept of rating-based reputation, our model shows a much higher sensibility to the observed evidence, and a good balance of both, evidence of quality (the rates themselves) and quantity of evidence (the number of ratings). The reason lies in the way that evidence is cumulated, i.e. by multiplication (Equation <ref type="formula" target="#formula_12">8</ref>) instead of by addition as in BR (Equation <ref type="formula" target="#formula_0">1</ref>) and in DR (Equation <ref type="formula" target="#formula_1">2</ref>). A larger sensibility results in a stronger responsiveness to specific participant actions. Augmenting engagement dynamics with more sensible reputation systems may probably bind better the participants to the long term goals of CS research programs. Furthermore, our model decouples action validation from participant scoring by means of an integrative and unified treatment of any action under consideration, independently of the rating system used for each type of action. By summer 2017, MA is going to collect extra data from participants with a recently added tool designed to reinforce citizen participation in the research program, whilst easing the experts' validation task. This new tool, natively incorporated to the app, allows citizens to validate mosquito and breeding site images sent by third users and challenge their expertise in identifying mosquito species. This new action will provide valuable information in terms of participants' expertise, based on a binary rating system, i.e. right/wrong. Given the structure of our scoring model, this information can be readily translated into a new user-class posterior distribution and incorporated to the scoring algorithm.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>We propose a novel reputation system based on a Bayesian network representing the characteristic flow typically present in CS research programs where participants are expected to perform actions that are validated later on (i.e. user, action, validation), what we call the UAV network. In this network, the users node represents an aggregation of participants into expertise classes. The key issue of our approach is to estimate a prior for the user-class distribution that suffices for our scoring pourpose. We suggest to select a set of proxy variables of the user-class, with a clear semantics in terms of user expertise, to make a guess about this prior. However, any other means to get a valid estimate of the prior can readily be used. With respect to Bayesian rating and the Dirichlet reputation models, our approach presents some advantages: (i) is more responsive to the observed evidence, and thus, it bridges better participants with their actions, (ii) it decouples action rating from user scoring, providing a unified processing of any action under consideration, no matter the number of rating levels defined for each, and (iii) it yields a better balance of both, evidence of quality (the rates themselves) and quantity of evidence (the number of ratings). As a proof of concept this model is implemented as part of the Mosquito Alert CS research program.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Mosquito Alert (left) home screen of the app showing the rank and category; (right) web map showing the validated reports.</figDesc><graphic coords="5,136.16,115.84,127.55,141.73" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>Let's consider a Bayesian network describing the characteristic flow in a CS research program, what we call the User-Action-Validation (UAV) network (Figure 2, left, lightgrey nodes). In our particular case, the nodes of the UAV network represent the following:</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. The User-Action-Validation Bayesian network to model users' expertise on the Mosquito Alert's platform.</figDesc><graphic coords="7,136.16,115.84,173.47,99.21" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 3 .Fig. 4 .</head><label>34</label><figDesc>Fig. 3. Ranked score plotTable 8. Ranked score table</figDesc><graphic coords="12,173.03,115.83,269.29,136.07" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. 5 .</head><label>5</label><figDesc>Fig. 5. Score dynamics: (top) simulating a static situation in which the rest of participants do not perform any action; (bottom) real situation with new reports submitted by third participants and new participants joining the Mosquito Alert research program.</figDesc><graphic coords="14,134.77,270.08,354.32,147.40" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Fig. 6 .</head><label>6</label><figDesc>Fig. 6. Comparison of scores: (left) UAV-network vs. Bayesian Rating; (right) UAVnetwork vs. Dirichlet reputation system with different values of C.</figDesc><graphic coords="14,134.77,481.72,368.50,141.73" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Summary of Mosquit Alert's data-base (2015-2016).</figDesc><table><row><cell></cell><cell>total NC hd -2 -1 0 +1 +2</cell></row><row><cell cols="2">adult 4177 19 188 429 128 655 1317 1441</cell></row><row><cell>%</cell><cell>0.00 0.05 0.10 0.03 0.16 0.32 0.34</cell></row><row><cell></cell><cell>total NC hd -1 0 +1</cell></row><row><cell cols="2">bSite 1172 564 160 90 129 229</cell></row><row><cell>%</cell><cell>0.48 0.14 0.08 0.11 0.20</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Join distribution of the proxy variables, P (N, M, Q), resulting in 16 expertise categories. Threshold values: θ1</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 .</head><label>3</label><figDesc>Definition of user classes</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 4 .</head><label>4</label><figDesc>Expertise-class prior distribution, P (U )</figDesc><table><row><cell>1</cell><cell>2</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 5 .</head><label>5</label><figDesc>Action conditional distribution, P (A|U )</figDesc><table><row><cell>U</cell><cell>1</cell><cell>2</cell><cell>3</cell><cell>4</cell><cell>5</cell><cell>6</cell></row><row><cell cols="7">adult .3454 0.7994 0.8456 0.4234 0.9339 0.9144</cell></row><row><cell cols="7">bSite 0.6546 0.2006 0.1544 0.5766 0.0661 0.0856</cell></row><row><cell></cell><cell cols="6">1.0000 1.0000 1.0000 1.0000 1.0000 1.0000</cell></row><row><cell cols="3">3.3 Computing users' scores.</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 6 .</head><label>6</label><figDesc>Validation conditional distribution, P (V |A, U )</figDesc><table><row><cell>adult reports</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>U</cell><cell>1</cell><cell>2</cell><cell>3</cell><cell>4</cell><cell>5</cell><cell>6</cell></row><row><cell>NC</cell><cell cols="6">0.0059 0.0059 0.0059 0.0059 0.0059 0.0059</cell></row><row><cell>hd</cell><cell cols="6">0.9643 0.0559 0.0022 0.0033 0.0008 0.0006</cell></row><row><cell>-2</cell><cell cols="6">0.0060 0.7326 0.1401 0.0033 0.0008 0.0006</cell></row><row><cell>-1</cell><cell cols="6">0.0060 0.1996 0.0657 0.0033 0.0008 0.0006</cell></row><row><cell>0</cell><cell cols="6">0.0060 0.0020 0.7817 0.9778 0.0008 0.0006</cell></row><row><cell>+1</cell><cell cols="6">0.0060 0.0020 0.0022 0.0033 0.5511 0.4135</cell></row><row><cell>+2</cell><cell cols="6">0.0060 0.0020 0.0022 0.0033 0.4397 0.5780</cell></row><row><cell></cell><cell cols="6">1.0000 1.0000 1.0000 1.0000 1.0000 1.0000</cell></row><row><cell>bSite reports</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>U</cell><cell>1</cell><cell>2</cell><cell>3</cell><cell>4</cell><cell>5</cell><cell>6</cell></row><row><cell>NC</cell><cell cols="6">0.4742 0.4742 0.4742 0.4742 0.4742 0.4742</cell></row><row><cell>hd</cell><cell cols="6">0.5153 0.0932 0.0063 0.0064 0.0060 0.0035</cell></row><row><cell>-1</cell><cell cols="6">0.0035 0.4193 0.1837 0.0064 0.0060 0.0035</cell></row><row><cell>0</cell><cell cols="6">0.0035 0.0067 0.3294 0.5066 0.0060 0.0035</cell></row><row><cell>+1</cell><cell cols="6">0.0035 0.0067 0.0063 0.0064 0.5079 0.5152</cell></row><row><cell></cell><cell cols="6">1.0000 1.0000 1.0000 1.0000 1.0000 1.0000</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 7 .</head><label>7</label><figDesc>User-class posterior distributions</figDesc><table><row><cell>adult reports</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>L</cell><cell>NC</cell><cell>hd</cell><cell>-2</cell><cell>-1</cell><cell>0</cell><cell>+1</cell><cell>+2</cell></row><row><cell>U = 1</cell><cell cols="7">0.0272 0.7810 0.0016 0.0052 0.0012 0.0005 0.0004</cell></row><row><cell>U = 2</cell><cell cols="7">0.1157 0.1926 0.8351 0.7363 0.0018 0.0007 0.0006</cell></row><row><cell>U = 3</cell><cell cols="7">0.1137 0.0074 0.1571 0.2382 0.6856 0.0007 0.0007</cell></row><row><cell>U = 4</cell><cell cols="7">0.0408 0.0040 0.0013 0.0042 0.3074 0.0004 0.0004</cell></row><row><cell>U = 5</cell><cell cols="7">0.3055 0.0075 0.0025 0.0080 0.0019 0.5050 0.3683</cell></row><row><cell>U = 6</cell><cell cols="7">0.3972 0.0075 0.0025 0.0080 0.0019 0.4927 0.6296</cell></row><row><cell></cell><cell cols="7">1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000</cell></row><row><cell>bSite reports</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>L</cell><cell>NC</cell><cell>hd</cell><cell>-1</cell><cell>0</cell><cell>+1</cell><cell></cell><cell></cell></row><row><cell>U = 1</cell><cell cols="5">0.2389 0.8849 0.0107 0.0050 0.0058</cell><cell></cell><cell></cell></row><row><cell>U = 2</cell><cell cols="5">0.1346 0.0901 0.7252 0.0054 0.0062</cell><cell></cell><cell></cell></row><row><cell>U = 3</cell><cell cols="5">0.0963 0.0044 0.2273 0.1922 0.0042</cell><cell></cell><cell></cell></row><row><cell>U = 4</cell><cell cols="5">0.2575 0.0119 0.0212 0.7900 0.0115</cell><cell></cell><cell></cell></row><row><cell>U = 5</cell><cell cols="5">0.1003 0.0043 0.0077 0.0036 0.3545</cell><cell></cell><cell></cell></row><row><cell>U = 6</cell><cell cols="5">0.1724 0.0044 0.0078 0.0037 0.6178</cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="5">1.0000 1.0000 1.0000 1.0000 1.0000</cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_0">http://www.mosquitoalert.com/en/project/send-data/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Acknowledgments</head><p>We would like to thank the Mosquito Alert team for continuous effort and support and the Mosquito Alert community for its unvaluable cooperation. This work is part of Mosquito Alert CS program research funded by the Spanish Ministry of Economy and Competitiveness (MINECO, Plan Estatal I+D+I CGL2013-43139-R) and la Caixa Banking Foundation. Mosquito Alert is currently promoted by la Caixa Banking Foundation.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>adult bSite rank score hd -2 -1 0 +1 +2 hd -1 0 +1 </p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Hooked: How to Build Habit-Forming Products</title>
		<author>
			<persName><forename type="first">N</forename><surname>Eyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hoover</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
			<publisher>Portfolio Penguin</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Building Web Reputation Systems</title>
		<author>
			<persName><forename type="first">Randy</forename><surname>Farmer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bryce</forename><surname>Glass</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>Yahoo! Press</publisher>
			<pubPlace>USA</pubPlace>
		</imprint>
	</monogr>
	<note>1st edition</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A behavior model for persuasive design</title>
		<author>
			<persName><surname>Bj Fogg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th International Conference on Persuasive Technology, Persuasive &apos;09</title>
				<meeting>the 4th International Conference on Persuasive Technology, Persuasive &apos;09<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="1" to="40" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Manipulation-resistant reputation systems</title>
		<author>
			<persName><forename type="first">Eric</forename><surname>Friedman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paul</forename><surname>Resnick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rahul</forename><surname>Sami</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Algorithmic Game Theory</title>
				<editor>
			<persName><forename type="first">Noam</forename><surname>Nisan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Tim</forename><surname>Roughgarden</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Eva</forename><surname>Tardos</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Vijay</forename><surname>Vazirani</surname></persName>
		</editor>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="677" to="698" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Understanding Citizen Science &amp; Environtmental Monitoring</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">E</forename><surname>Roy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Preston</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">B</forename><surname>Roy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Savage</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Tweddle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robinson</forename><forename type="middle">L D</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Final Report on behalf of UK-EOF</title>
				<imprint>
			<date type="published" when="2012-11">November 2012</date>
		</imprint>
		<respStmt>
			<orgName>NERC Centre for Ecology &amp; Hidrology and Natural History Museum</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Reputation systems</title>
		<author>
			<persName><forename type="first">Ferry</forename><surname>Hendrikx</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kris</forename><surname>Bubendorfer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ryan</forename><surname>Chard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Parallel Distrib. Comput</title>
		<imprint>
			<biblScope unit="volume">75</biblScope>
			<biblScope unit="page" from="184" to="197" />
			<date type="published" when="2015-01">jan 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Citizen Science: A Study of People, Expertise and Sustainable Development</title>
		<author>
			<persName><forename type="first">A</forename><surname>Irwin</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1995">1995</date>
			<publisher>Routledge</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Trust and reputation systems</title>
		<author>
			<persName><forename type="first">Audun</forename><surname>Jøsang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Foundations of Security Analysis and Design IV: FOSAD 2006/2007 Tutorial Lectures</title>
				<editor>
			<persName><forename type="first">Alessandro</forename><surname>Aldini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Roberto</forename><surname>Gorrieri</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin, Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="209" to="245" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Dirichlet reputation systems</title>
		<author>
			<persName><forename type="first">Audun</forename><surname>Josang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jochen</forename><surname>Haller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Availability, Reliability and Security</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2007">2007. 2007</date>
			<biblScope unit="page" from="112" to="119" />
		</imprint>
	</monogr>
	<note>The Second International Conference on</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">The beta reputation system</title>
		<author>
			<persName><forename type="first">Audun</forename><surname>Jøsang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Roslan</forename><surname>Ismail</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">BLED 2002 Proceedings</title>
				<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="page">41</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">A computational model of trust and reputation for e-businesses</title>
		<author>
			<persName><forename type="first">L</forename><surname>Mui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mohtashemi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Halberstadt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS&apos;02)</title>
				<meeting>the 35th Annual Hawaii International Conference on System Sciences (HICSS&apos;02)<address><addrLine>Washington, DC, USA</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2002">2002</date>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">188</biblScope>
		</imprint>
	</monogr>
	<note>HICSS &apos;02</note>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Whitepaper on citizen science for environmental research</title>
		<author>
			<persName><forename type="first">Colin</forename><surname>Robertson</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Machine learning assists the classification of reports by citizens on disease-carrying mosquitoes</title>
		<author>
			<persName><forename type="first">Antonio</forename><surname>Rodriguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Frederic</forename><surname>Bartumeus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ricard</forename><surname>Gavaldà</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First Workshop on Data Science for Social Good co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Dicovery in Databases, SoGood@ECML-PKDD 2016</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">Ricard</forename><surname>Gavaldà</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Indre</forename><surname>Zliobaite</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">João</forename><surname>Gama</surname></persName>
		</editor>
		<meeting>the First Workshop on Data Science for Social Good co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Dicovery in Databases, SoGood@ECML-PKDD 2016<address><addrLine>Riva del Garda, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016-09-19">September 19, 2016. 2017</date>
			<biblScope unit="volume">1831</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Review on computational trust and reputation models</title>
		<author>
			<persName><forename type="first">Jordi</forename><surname>Sabater</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carles</forename><surname>Sierra</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artif. Intell. Rev</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="33" to="60" />
			<date type="published" when="2005-09">September 2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Crowdsourcing the identification of organisms: A casestudy of ispot</title>
		<author>
			<persName><forename type="first">J</forename><surname>Silvertown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Harvey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Greenwood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dodd</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Rosewell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Rebelo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ansine</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Mcconway</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ZooKeys</title>
		<imprint>
			<biblScope unit="volume">480</biblScope>
			<biblScope unit="page" from="125" to="146" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
