<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">In the Context of Narrative, we Never Properly Defined the Concept of Valence</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Peter</forename><surname>Boot</surname></persName>
							<email>peter.boot@huygens.knaw.nl</email>
							<affiliation key="aff0">
								<orgName type="institution">Huygens Institute for the History and Culture of the Netherlands (KNAW)</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Angel</forename><surname>Daza</surname></persName>
							<email>j.daza@esciencecenter.nl</email>
							<affiliation key="aff1">
								<orgName type="institution">Netherlands eScience Center</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Carsten</forename><surname>Schnober</surname></persName>
							<email>c.schnober@esciencecenter.nl</email>
							<affiliation key="aff1">
								<orgName type="institution">Netherlands eScience Center</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Willem</forename><surname>Van Hage</surname></persName>
							<email>w.vanhage@esciencecenter.nl</email>
							<affiliation key="aff1">
								<orgName type="institution">Netherlands eScience Center</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">In the Context of Narrative, we Never Properly Defined the Concept of Valence</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">7F5A89A71AC0823B0E2D3E42AC3509E4</idno>
					<idno type="DOI">10.5281/zenodo.13942218</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:47+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>valence</term>
					<term>polarity</term>
					<term>sentiment</term>
					<term>word-embedding</term>
					<term>narrative</term>
					<term>computational literary studies</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Valence is a concept that is increasingly being used in the computational study of narrative texts. We discuss the history of the concept and show that the word has been interpreted in various ways. Then we look at a number of Dutch tools for measuring valence. We use them on sample fragments from a large collection of narrative texts and find only moderate correlations between the valences as established by the various tools. We discuss these differences and how to handle them. We argue that the root cause of the problem is that Computational Literary Studies never properly defined the concept of valence in a narrative context.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The study of emotion and sentiment is increasingly popular in computational literary studies <ref type="bibr" target="#b19">[20]</ref>. In this paper we will look specifically at the concept of valence, the positive or negative sentiment associated with a word or a text passage. The concept is used in a number of recent studies, but it seems to be used in quite different ways. This calls for a deeper look at the history of the concept.</p><p>One of the most-quoted studies in the field is the article by Reagan et al. about six basic shapes in the emotional arcs in stories <ref type="bibr" target="#b29">[30]</ref>. The emotional arcs that the authors create represent the flow of what they call 'sentiment', measured by the 'Hedonometer', a dictionary-based tool that assigns sentiment to words <ref type="bibr" target="#b10">[11]</ref>. There is no discussion in the article about the status of this sentiment: does it correspond to the sentiment that readers experience? Does it correspond to sentiment of the characters or the narrator? The paper includes an annotated emotion arc for Harry Potter and the Deathly Hallows which seems to show that the highs and lows of the story correspond to the maxima and minima in the arc. But the paper treats the construction of the arcs on the basis of the sentiment data as a purely technical problem, without asking what it is exactly that these arcs are modelling.</p><p>Bizzoni and Feldkamp <ref type="bibr" target="#b1">[2]</ref> do address the issue in a case study on The Old Man and the Sea. For each sentence in the novel, two annotators rated 'the sentiment expressed by the sentence'. They were instructed 'to avoid rating how a sentence made them feel and to try to report only on the sentiments actually embedded in the sentence, i.e., to think about the valence of each sentence individually, without overthinking the story's narrative to reduce contextual interpretation'. This is an interesting instruction, in that it explicitly states the sentiment is not in the reader and it shouldn't relate to the story events. It assumes that there is such a thing as 'the sentiment embedded in a sentence'. The paper then goes on to check whether LLM or dictionary-based sentiment models correlate with the annotators' ratings. We will come back to this study below.</p><p>Rebora, in his survey of sentiment analysis in literary studies <ref type="bibr" target="#b30">[31]</ref>, also asks where the sentiment is supposed to reside: in the text or in the reader. He notes that in literary studies, narratologists would opt for the text, while students of reader response would look at the reader. But he also points at a third possibility: the characters as vehicles of emotion. Nalisnick and Baird, e.g., have applied sentiment analysis to Shakespeare's plays in order to study the relations between the plays characters <ref type="bibr" target="#b26">[27]</ref>. And there are other possibilities: the sentiment that one finds in the text can also be used to gauge the sentiment of the author, as in <ref type="bibr">Stirman and</ref> Pennebaker's study of suicidal poets <ref type="bibr" target="#b34">[35]</ref>. It is even possible to study the sentiment in novels and other texts not out of an interest in anything that has to with the book itself, but to read larger social attitudes. For example, in their study of the perception of coal and oil in recent U.S. works <ref type="bibr" target="#b13">[14]</ref>, Grubert and Algee-Hewitt are interested in the perception of these energy sources in contemporary U.S. society.</p><p>It is clear that all of the above approaches to sentiment can be enlightening. But there is a danger that we forget that sentiment in texts can have these different aspects, and our current tooling is certainly not able to distinguish them. In an effort to create some clarity, we will in this paper briefly recount the history of the concept of valence, focusing on the various definitions researchers have used as well as on how it was established or computed. In a second, empirical part, we look at a number of tools for computing valence in Dutch. In so far as they are dictionary-based we look at the overlap between dictionaries and the degree to which they assign the same values to their shared words. Then we use a sample of fragments from Dutch fiction to assess how the various tools compare in their assignment of valence to these fragments. <ref type="foot" target="#foot_0">1</ref>Note that our main interest here is not in the sentiment arcs that can be derived from book segments' valences (see also <ref type="bibr" target="#b11">[12]</ref>). What we want to contribute to is the much more elementary question: what is a word or chunk valence in the first place, and how do we compute it?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background: Valence, Sentiment and Polarity</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Valence in psycho-linguistic studies</head><p>We start our short history of the concept of valence with the 1957 book The measurement of meaning, by Osgood, Suci and Tannenbaum <ref type="bibr" target="#b28">[29]</ref>. Osgood and his colleagues wanted to describe  the meaning of certain concepts, such as the word 'lady', and asked test subjects to associate these nouns with positions on a scale between opposite adjectives, such as good vs. bad, hard vs. soft or kind vs. cruel. They used a Likert scale, and the form might look as in Figure <ref type="figure" target="#fig_0">1</ref>.</p><p>After averaging, this gave them, for twenty concepts and fifty pairs of adjectives, 1000 measurements. A factor analysis identified the hidden dimensions underlying the measurements. A fragment of the resulting table is reproduced in Figure <ref type="figure" target="#fig_1">2</ref>. We see that the results of some adjective pairs, such as good vs. bad and beautiful vs. ugly, are almost completely explained by the hidden variable I, the scores on strong vs. weak are mostly explained by hidden variable II, etc. But what are these hidden variables? Osgood et al. then write 'The problem of labelling factors is somewhat simpler here than in the usual case. (...) The first factor is clearly identifiable as evaluative (...). The second variable identifies itself fairly well as a potency variable (...). The third factor appears to be mainly an activity variable (...)' (italics original).</p><p>Later, these dimension would become known by other names: evaluativeness as pleasure or valence, activity as arousal, potency as power or dominance. This is not the whole story, but Osgood and colleagues made a fundamental contribution to the three-factor theory of emotion. It is interesting to note that they already mention that these factor loadings depend on cultures: e.g., for Japanese and Korean respondents, the adjective pair delicate vs. rugged clearly belonged to the evaluative dimension, for U.S. respondents it did not.</p><p>We continue with a look at The General Inquirer, one of the first text analysis programs <ref type="bibr" target="#b35">[36]</ref>. The 1966 book describes among others the Harvard III dictionary, developed for use with the General Inquirer. Four groups of words were included to account for the high and low ends of the evaluativeness and potency variables found by <ref type="bibr">Osgood (pp. 176,</ref><ref type="bibr">185)</ref>. We see here that what for Osgood were hidden variables, the result of a computational process that required the interpretative step of labelling, have now become measurable entities underlying texts. A next step in our tale was set by Bradley and Lang in 1999, in their paper 'Affective Norms for English Words' <ref type="bibr" target="#b6">[7]</ref>. As prompts in psychological research they needed words with known values for the dimensions of (what they called) Pleasure, Arousal and Dominance. They asked subjects to rate the words using a 'self-assessment mannikin' (see Figure <ref type="figure" target="#fig_2">3</ref> for an example). For pleasure, the instruction that they gave their subjects was 'At one extreme of this scale, you are happy, pleased, satisfied, contented, hopeful. When you feel completely happy you should indicate this by bubbling in the figure at the left. The other end of the scale is when you feel completely unhappy, annoyed, unsatisfied, melancholic, despaired, or bored' (italics ours). The valence associated with a word now equals (possibly) complete happiness or unhappiness of the subject, i.e., a subjective feeling.</p><p>The creation of ever larger dictionaries of words with associated valence and other variables has been a constant in psycholinguistic research ever since. We mention a few: Warriner and colleagues created a list of almost 14000 English lemmas with Valence, Arousal and Dominance <ref type="bibr" target="#b39">[40]</ref>. They used Likert scales rather than the self-assessment mannikins, but they stuck to the subjective language: 'At one extreme of this scale, you are happy, pleased, satisfied, contented, hopeful. When you feel completely happy you should indicate this by choosing rating 1'. One reason why their work is remarkable is that they found that there are systematic differences between men and women in how they rate various categories of words. The last study for English words that we mention here is that by Mohammad <ref type="bibr" target="#b23">[24]</ref>. Mohammad used a procedure called best-worst scaling where respondents were asked: 'Which of the four words below is associated with the MOST happiness / pleasure / positiveness / satisfaction / contentedness / hopefulness OR LEAST unhappiness / annoyance / negativeness / dissatisfaction / melancholy / despair?' The choice of words 'is associated with' is less subjective and seems to ask for a more objective relation between the words and the associated values than formulations such as 'when you feel completely happy'. Still, Mohammad too found important differences between various groups of people (by gender, age and self-assessed Big 5 personality characteristics) with respect to the values they associate with the various words.</p><p>One study, especially relevant for the second part of this paper, was done by Moors and colleagues <ref type="bibr" target="#b25">[26]</ref>. They created a list of 4300 Dutch words with associated valence and other values. Participants 'were asked to judge the extent to which the words in the study referred to something that is positive/pleasant ("positief/aangenaam") or negative/unpleasant ("negatief/onaangenaam")'. Introducing a third perspective, rather than giving a subjective response to a word <ref type="bibr">(Bradley)</ref> or a judgment about the language system (Mohammad), subjects were now asked about properties of the objects in the world that the words represent. Moors presents separate valence scales for the general population, for women and for men. The examples that we could give of differences in evaluation between women and men are all distressingly predictable.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Valence or sentiment in consumer reviews</head><p>With the appearance of consumer reviews, the problem of establishing the opinions that they expressed and the related sentiments became important subjects for marketeers <ref type="bibr" target="#b20">[21]</ref>. As an example of early studies we mention Hu and Liu's work <ref type="bibr" target="#b14">[15]</ref>. Hu and Liu find sentences in reviews that contain both a product feature and an opinion. The opinion words and their polarity (= whether they express a positive or negative sentiment) are deduced from WordNet by starting with some seedwords. Over the years, their work has produced a 6800-word opinion lexicon. In the study of consumer reviews, the question is no longer how a word strikes the reader, but what was the opinion that a writer wanted to express. That also means that the attention moves to adjectives rather than the nouns that the tradition established by Bradley <ref type="bibr" target="#b6">[7]</ref> was typically interested in.</p><p>A Dutch sentiment dictionary with a focus on review sentiment was created by De Smedt and Daelemans <ref type="bibr" target="#b7">[8]</ref>. They used frequently occurring adjectives from book reviews, and asked annotators 'to classify each adjective in terms of positive-negative polarity and subjectivity'. The question here no longer refers to subjective feeling or properties of an object, but to a property of the sentiment word. The dictionary is special (among sentiment dictionaries) in that it distinguishes between multiple word senses. E.g. the word 'scherp' (sharp) applied to a sound has negative polarity, but in 'a sharp thinker' the word's polarity is positive. Using computational tools and based on several linguistic resources, the initial list of words has been expanded to 5500 word senses. De Smedt and Daelemans' dictionary can be used as part of the Pattern toolset <ref type="bibr" target="#b8">[9]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Valence from word embeddings</head><p>Under the name of SentiArt, Arthur Jacobs introduced a completely different way of estimating valence in the field of computational literary studies <ref type="bibr" target="#b16">[17]</ref>. The method uses a set of positive and negative seed words in combination with a word embedding. The valence of a word is then computed summing its similarities to the positive seed words and subtracting the similarities to the negative ones. The method is based on the assumption that, in the word embedding, words with similar meanings cluster together. Because there is no need for human ratings, this seems like an objective procedure. However, the choice of the texts that are used to create the word embedding, the procedure that is used for its computation as well as the choice of seed words are to some extent arbitrary. It is also not self-evident that the procedure should work at all. Jacobs <ref type="bibr">[17, p. 3]</ref> states that the method was able to explain 34% of the variance between words found by Warriner <ref type="bibr" target="#b39">[40]</ref>, which isn't exactly promising.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Valence beyond the word</head><p>The various approaches to the valence concept that we have discussed all assign valence at the word level. It is not self evident that it is possible to define valence at a higher level, e.g. that of sentences, paragraphs or even book chapters.</p><p>Bradley and Lang extended their work on word valence to small texts (one to a few sentences) <ref type="bibr" target="#b5">[6]</ref> These small texts describe, in the second person, situations that would probably cause an emotional state with a certain valence (as well as arousal and dominion). We give an example with low valence: 'You gag, seeing a roach moving slowly over the surface of the pizza. You knock the pie on the floor. Warm cheese spatters on your shoes'.</p><p>Specifically in the context of narrative, Rebora asked students to rate the sentiment in paragraphs of a story by Pirandello <ref type="bibr" target="#b31">[32]</ref>. Their agreement was very weak. As we saw, in <ref type="bibr" target="#b1">[2]</ref> two researchers rated sentiment in sentences in The Old Man and the Sea. Their result was better than Rebora's: the correlation between their ratings was strong, after detrending even very strong. Kaakinen and colleagues report on a multilingual database of short stories (ca. 1,000 characters) <ref type="bibr" target="#b18">[19]</ref>, for which raters established valence and arousal using self-assessment mannikins. Finally, Jacobs <ref type="bibr">[16, pp. 117-119]</ref> briefly reports on a number of experiments where readers were asked to rate the valence of sentiments or short sections of among others a Harry Potter novel and Pippi Longstocking. He does not report agreement measures, but the resulting sentiment arcs could be predicted reasonably well using his SentiArt toolset.</p><p>Outside of the domains of psychology or narrative, there exists a plethora of studies on the polarity of especially reviews and social media texts. We just mention work on the orientation of tweets <ref type="bibr" target="#b33">[34]</ref>, targeted not so much at establishing a text's valence, but at classifying a text as positive, negative or neutral, and work on product reviews, where beyond establishing a text's overall valence the aim is to find the aspects of a product that reviewers are positive or negative about <ref type="bibr" target="#b40">[41]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5.">Provisional conclusion</head><p>We have seen how the concept of valence morphed from a hidden dimension of meaning into something that could be measured in text using just a few adjectives, and further into a subjective feeling, a property of the language or a property of the things that we talk about. It is clear that in practice, these are not unrelated. If I consider peace a good thing, I'll probably feel good about the word and I'll use positive words in talking about it. But conceptually they are distinct, and the extent to which they agree in practice is an empirical question. Another thing that we learned in this short review is that valence judgments vary by gender, age, personality and culture. Anyone who confidently writes about 'the sentiment' of a narrative work should be aware of that.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Method</head><p>After this brief look at the history and the operationalisation of the notion of valence, we turn to a comparison of various tools that assign valence to Dutch words and texts. Except for the approaches already discussed, we also include two transformer-based language models. These tools by themselves have no notion of valence, but have been trained to fulfill classification tasks that assign short texts to evaluative categories.</p><p>Our interest here is not in finding the tool that best approximates the 'true' sentiment value, if such a thing exists, or the gold labels, which we don't have. What we are interested in is whether these tools agree or disagree and what that says about the current state of sentiment analysis for narrative, at least for Dutch.</p><p>The tools that we use are:</p><p>LiLaH the sentiment mapping of LiLaH <ref type="bibr" target="#b21">[22]</ref>, a manually corrected version of an automatic translation of the NRC emotion lexicon <ref type="bibr" target="#b24">[25]</ref>, see A.2.2.</p><p>LIWC a general-purpose tool for text analysis <ref type="bibr" target="#b4">[5]</ref> based on an underlying dictionary, see A.2.3. For a further description of the tools that we will use, as well as for the settings that we apply in computing the SentiArt valences, we refer to the Appendix (A.1).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Moors</head><p>For the dictionary-based tools (that includes the tools with curated dictionaries as well as SentiArt), we first compare the dictionaries. We report a number of measures:</p><p>• dictionary size; • number of shared words; • word-level agreement of assigned values; • words where the dictionaries disagree.</p><p>After the dictionary comparison, we compare the result of the tools on a sample of Dutch novels. We create the sample as follows: from a collection of 10,921 recently published Dutch books we remove non-fiction and books with less than 5000 words, then select every fifth book. For every selected book (n=2087) we select a random newline character, which usually corresponds with a paragraph start. Starting from that location we select the first hundred tokens.</p><p>Then, for each of the tools mentioned in the Appendix we compute the valence assigned to the fragment. For Moors, LiLaH and SentiArt, the valence of the text fragment is the average lemma valence for all words whose lemma occurs in the relevant lexicon. Words whose lemma does not occur in the resource are ignored. For the other tools, see the Appendix for the computational procedure.</p><p>We then compute correlations between the tools' valence assignments. For some pairs of tools we also look at fragments where the two tools produce very different results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Results</head><p>In describing correlations, we use the labels proposed by Evans <ref type="bibr" target="#b12">[13]</ref>: 0.00 -0.19: very weak, 0.20 -0.39: weak, 0.40 -0.59: moderate, 0.60 -0.79: strong, 0.80 -1.00: very strong.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Comparison at dictionary level</head><p>For simplicity's sake, in this section we ignore LIWC15, as it did much worse than LIWC07 in the analysis of the fragment valences.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.1.">Overlap at dictionary level</head><p>Table <ref type="table" target="#tab_3">A</ref>.4 in the Appendix presents the tool dictionary sizes and their overlap. We notice a few things: (i) The SentiArt dictionary obviously dwarfs the curated dictionaries. Because of the better coverage of the text, we might expect the SentiArt dictionary to perform better than the other ones. More importantly: (ii) the overlap between the other dictionaries is in most cases quite small, also in comparison with the number of words that they do not share. Only LiLaH and Moors share more than 1000 words (1481), and for Moors, that is about one third of the words it contains. In all other comparisons, the number of overlapping words as a fraction of a dictionary's total words is (much) smaller. It would be surprising if tools that share so small a part of their vocabularies would report similar valences.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.2.">Agreement at dictionary level</head><p>We look first at the agreement between the continuously valued (SentiArt, Moors and Pattern) and the binary-valued tools (LIWC07 and LiLaH), given in table <ref type="table" target="#tab_1">1</ref>. These look as one would expect. In all cases there is a clear distinction between the mean values of the positive and the negative words in LiLaH and LIWC07. For LIWC07, the difference between the means is somewhat larger than for LILaH. LIWC07 seems to agree better with the continuous tools than LiLaH does.</p><p>Then we look at the correlations between the continuously valued valences (Table <ref type="table" target="#tab_2">2</ref>). The correlations of Moors and Pattern with SentiArt are strong, the correlation of the two curated dictionaries Moors and Pattern is very strong. We should be aware, however, that these are correlations over a relatively small number of words.  Finally, for all dictionary pairs we also checked whether there are words where the dictionaries disagree, and if so, whether there is an obvious culprit. For LIWC07 and Moors and to a lesser extent Pattern we hardly found apparent mistakes. LiLaH contains a number of clear errors, maybe due to the limited availability of the Dutch translator <ref type="bibr">[22, p. 154</ref>]. In the Sen-tiArt dictionary, there are countless misclassifications. See Table <ref type="table" target="#tab_3">A</ref>. <ref type="bibr" target="#b4">5</ref> for examples. We also note here that among the top positive words in the SentiArt valences, there appears a curious group of words related to hospitality, such as dinner, hostess, sommelier, catering, service and culinary, which also raises some questions about the adequacy of the procedure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Comparison at text fragment level</head><p>After computing the valences assigned to the fragments we computed their correlations (see Figure <ref type="figure" target="#fig_3">4</ref>). We see that the correlations between the results of the various sentiment analysis tools that we have looked at doesn't get better than moderate (the only strong correlation is between the two LIWC flavours). As we don't know the 'true' valences, we're not in a position to say which is the best tool, but we can say something.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">The correlation of the XLM transformer-based model is with the others is at best weak,</head><p>for the Van Roy model there is no correlation. But we already knew that these tools were very different than the other tools, and it confirms (if confirmation were needed) that the type of training text is really important. 2. Of the other dictionaries, the agreement of Pattern with the rest is at best moderate but mostly weak. The reason is probably Pattern's background in consumer review analysis. 3. From the two LIWC dictionaries, LIWC15 performs noticeably less than LIWc07. This is probably due to the Dutch LIWC15 being an automatic translation of the English dictionary. 4. The remaining curated dictionaries (LiLaH, LIWC07 and Moors) and the SentiArt approach have moderate correlations with each other.</p><p>For the tools Moors, SentiArt and LIWC07 we also did an analysis of text fragments that scored high on one tool and low on another. The main causes for differences were apparent errors in the SentiArt word valence, homonymies, the word 'niet' (not) in Moors being assigned a low valence, and words not being present in the curated dictionaries. Some of those are unavoidable in the context of a dictionary approach, others could be avoided by better curation. Some of the options that we used for the SentiArt computations resulted from preliminary testing with these differently-rated fragments. See the Appendix for details.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion</head><p>As the main results of the empirical part of this paper we see:</p><p>1. The SentiArt procedure to assign valence to large collections of words has serious limitations, even when computed on the basis of a domain-specific word embedding. 2. These limitations can to some extent be overcome by computing distances to a centroid vector rather than to the individual seed words, by only looking at the top and bottom quartile of the resulting valence distribution, and by excluding punctuation and function words (see Appendix for details). However, while leaving out punctuation and function words from the fragment valence computation helps, it is not a real solution to the problem that apparently the word embedding-based valence assignment is producing flawed results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">The agreement between other tools for computing valence of Dutch narrative text are never better than moderate.</head><p>With respect to the first two items in this list, this suggests that we may have to look beyond word2vec for a better answer to questions of semantic relatedness between lemmas (e.g. to contextual text representations as provided by Transformer-based language models <ref type="bibr" target="#b9">[10]</ref>). The limited number of pretty arbitrary seed words seems another limitation of the SentiArt approach. A better way of obtaining valence ratings for many more words than can be manually curated might be machine learning with as target the Moors valences, and as features (among others) the word2vec distances to some of the top and bottom Moors words.</p><p>With respect to the last item, the question is: how bad is it that these tools only agree moderately? If we knew that one of the tools is mostly correct, it wouldn't matter, we could just stop using the others. But we suspect that this is not the case. We have seen enough limitations in all of the tools and in the dictionary approach as such that it is unlikely that any of these tools presents us with more than a rough approximation of correctness.</p><p>That might lead us to asking why we have focussed here on dictionary-based approaches. In their survey of sentiment analysis in literary studies, Kim and Klinger <ref type="bibr" target="#b19">[20]</ref> wrote that 'much digital humanities research (especially dealing with text) uses the methods of text analysis that were in fashion in computational linguistics twenty years ago'. And in a direct comparison, current machine learning tools usually perform better in predicting human valence ratings (see e.g. <ref type="bibr" target="#b37">[38]</ref> for a study by Van Atteveldt and colleagues in the field of politicology). So why do we still study these methods, rather than follow the lead of computational linguistics?</p><p>One answer to that question could come from Teodorescu and Mohammad <ref type="bibr" target="#b36">[37]</ref>, who show that, in spite of instance-level inaccuracy, dictionary-based methods work very well for larger bins of texts (e.g. for groups of 30 or a 100 tweets). They argue that '[f]or applications where simple, interpretable, low-cost, and low-carbon-footprint systems are desired, the lexiconbased systems [...] are often more suitable'. That suggests that dictionary-based methods might be better suited to study sentiment at the chapter than at the sentence level of a novel. Another answer comes from Öhman <ref type="bibr" target="#b27">[28]</ref>, who argues that for the large texts that in the humanities we are often interested in, the annotation efforts on which machine learning tools depend are are just not feasible. But the best answer is maybe one that Öhman also hints at when she proposes to leave the term 'sentiment analysis' to the computational linguists and argues that even if it is not computational sentiment, differences between texts that are made visible by dictionary-based tools, if statistically significant, are still relevant research findings.</p><p>We wouldn't go so far as as to say that what dictionary-based methods can do is not sentiment analysis. But it is true that most of the work in computational linguistics has been on the detection of sentiment in the sense of stance, where the aim is to detect the view that a text's author expresses about some object. In narrative, and especially literary narrative, the aim of the text is not to convey the author's view about the characters or the events, and if it were, it wouldn't necessarily be the aim of researchers to uncover that view. This doesn't mean that we don't need to work on well-annotated corpora of narrative on which we can apply the tools of machine learning, far from that, but it does mean that current pre-trained sentiment analysis tools have been trained on corpora so different from the corpora that we are interested in that they may not be very relevant to the analysis of narrative.</p><p>Returning to the question of how much of a problem we have with these moderate correlations, and assuming, for the sake of argument, that the correlation of our tools with the 'true' valence is about equal to the best of their mutual correlations, that is .51, what is it can we do with a measurement that misses so much information? Maybe we could look at some patterns, very carefully. But it certainly would not make sense to use these measurements as ingredients in e.g. predictive modelling or the construction of narrative arcs.</p><p>We see some ways of moving forward:</p><p>1. creating a much larger dictionary than the present curated dictionaries for Dutch along the lines sketched earlier in this section; 2. using some sort of ensemble measure, in the hope that the tools can compensate for each other's weaknesses; 3. only using sentiment analysis in a narrative context on larger text segments; 4. starting an annotation effort for valence in narrative fragments.</p><p>All of these, however, are only stop-gap measures for what we believe is the real problem, which is that we have as a discipline not really defined the concept of valence in a narrative context. As we saw in our overview of the history of the concept of word valence in section 2, many completely different definitions and operationalisations have have been proposed. It has been possible to get away with these differences in the analysis of by and large simple and straightforward texts such as social media posts. In his survey of sentiment analysis in literary studies <ref type="bibr" target="#b30">[31]</ref>, Rebora writes 'S[entiment] A[nalysis], in fact, can be performed by selecting or combining an ample variety of approaches [...]. Choosing one approach over the other means also defining the very nature of the object under examination'. We might add that to 'define the very nature of the object under examination' is what, with respect to valence, we have up to now, and to our peril, shied away from. The texts that we used for the word-embedding are the full texts of 13,210 novels taken from a larger collection of 18,467 books in Dutch.</p><p>We use a number of different options in the SentiArt computation of the word and the fragment valences (see table A.1). Table <ref type="table" target="#tab_3">A</ref>.2 gives the seed words that we use for computing the SentiArt valence. The fact that we included among the SentiArt options the possibilities to exclude punctuation and/or function words is the result of preliminary testing. We saw in these tests that punctuation and function word valences had a sizable effect on the SentiArt-assigned fragment valences, even under the 'lens' condition. E.g. the comma, the full stop and the indefinite article 'een' ('a') all have a SentiArt valence in the upper quartile of the distribution.</p><p>After computing the four SentiArt valence dictionaries, we look at their distributions. We also list the 50 words with the highest and lowest valence, in order to check whether the computation makes sense.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2.5. Transformer-based models</head><p>As mentioned, we use two transformer-based models on the fragments. One model is the cardiffnlp/twitter-xlm-roberta-base-sentiment model <ref type="bibr" target="#b0">[1]</ref>. This is a multilingual model, trained on tweets. It does not assign a continuous valence value, but classifies a text as positive, negative or neutral. <ref type="foot" target="#foot_1">2</ref> The other model is robbert-v2-dutch-base-hebban-reviews5. It is a model trained on Dutch book reviews from the book discussion site Hebban, and aims to predict the rating associated with the review. <ref type="foot" target="#foot_2">3</ref> For both tools, the text types that they were trained on are very different from the book fragments that we will use them on. For this reason, we did not expect that they would agree with the other tools, but were willing to be surprised. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.3. Computing the SentiArt valences</head><p>We computed four SentiArt valences, as described above. Here we used only the 17,306 lemmas that occur in the novel fragments that we will analyse.</p><p>As a first check of the results, we looked at the 50 words with highest or lowest valence for each of the computations. The words with the reportedly lowest valence are for all four computations indeed words that describe very unpleasant things. As an example, here are (English translations of) the 10 words with lowest valence for the centroid -nouns and adjectives condition: fear, distraught, anger, anxious, rage, shame, misunderstood, powerlessness, anger, confounded. For the words with highest valence, the picture is somewhat different. There are many words that no doubt represent a positive evaluation (excellent, fantastic, great), but there also appears a curious group of words that seem somehow related to hospitality, such as dinner, hostess, sommelier, catering, service and culinary; words that certainly are far removed from the positive seed words that went into the process.</p><p>Next we look at the distribution of the computed valences. Figure A.1 shows that the centroid-based computations, and especially the one with nouns and adjectives as seed words, have a somewhat wider distribution. That seems an attractive property, as it provides stronger distinctive power to the valence assignment.</p><p>The correlations between the SentiArt valences are very strong (Table <ref type="table" target="#tab_3">A</ref>.3). We see there is a 2 to 3 percent disagreement between the centroid and non-centroid versions, and a 6 to 7 percent disagreement between the nouns versus the nouns and adjectives seed words.</p><p>For each dictionary pair, we selected 10 words that get a high valence rating in one dictionary but a low rating in another. In none of the pairs, this created word lists where intuitively we would consider one of the dictionaries wrong, except for the pair centroid and nouns / noncentroid and nouns. Here, the words that were rated high in the centroid but low in the  non-centroid condition were all obviously positive: humour, eagerness, liveliness, surrender, delight, self-assurance, cheerfulness, approval, passion, lust. This provides another argument in favour of the centroid-based computation.</p><p>As explained in the previous section, for SentiArt we have five binary options, and therefore 32 different results. In initial testing, it appeared that the computation without centroid consistently led to results with lower correlations to the other tools than the computation with centroid. We dropped the computation without centroid from further consideration. From the remaining sixteen SentiArt valences, the best correlation with the other tools was reached with the options lens, just the original (noun) labels, and not considering punctuation and stop words (see Figure A.2 for Pearson correlation with the other dictionary-based tools.). We continue with this SentiArt valence, which in the rest of the paper we just call SentiArt. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.4. Other tables and figures</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Lay-out of Osgood et al. 's questions. The Measurement of Meaning, p. 34. Image from the Internet Archive.</figDesc><graphic coords="3,130.95,84.17,333.37,51.51" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Factor loadings in Osgood et al. The Measurement of Meaning, p. 37. Image with student notes from the Internet Archive.</figDesc><graphic coords="3,130.95,179.85,333.37,189.14" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Self-assessment mannikin.</figDesc><graphic coords="4,130.95,84.17,333.37,192.15" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Spearman correlations of computed valences.</figDesc><graphic coords="9,178.98,302.22,237.32,142.81" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure A. 1 :</head><label>1</label><figDesc>Figure A.1: Distribution of the four SentiArt valences. 'ct': centroid, 'nct': non-centroid, 'n': noun labels, 'na': noun and adjective labels.</figDesc><graphic coords="19,130.95,84.18,333.37,166.69" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure A. 2 :</head><label>2</label><figDesc>Figure A.2: Pearson correlations of all valences excluding the ones from the Transformer-based models. The Sentiart valences are named based on the options used in their computation. 'n': noun labels, 'na': noun and adjective labels, 'l': lens method, 'nl': no lens, 'f': include function words, 'nf': no function words, 'p': include punctuation, 'np': no punctuation.</figDesc><graphic coords="21,164.12,129.47,593.61,279.05" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Means for the continuously valued tools' values for the positive and negative words in the binary-valued tools.</figDesc><table><row><cell cols="6">tool stdev LILaH pos LILaH neg LIWC07 pos LIWC07 neg</cell></row><row><cell>Moors</cell><cell>1.06</cell><cell>4.95</cell><cell>2.98</cell><cell>5.50</cell><cell>2.60</cell></row><row><cell>Pattern</cell><cell>0.42</cell><cell>0.33</cell><cell>-0.34</cell><cell>0.52</cell><cell>-0.44</cell></row><row><cell>Sentiart</cell><cell>0.08</cell><cell>0.04</cell><cell>-0.07</cell><cell>0.07</cell><cell>-0.07</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2</head><label>2</label><figDesc>Correlation for continuously valued valences.</figDesc><table><row><cell></cell><cell></cell><cell cols="2">word overlap correlation</cell></row><row><cell>Moors</cell><cell>Pattern</cell><cell>726</cell><cell>0.86</cell></row><row><cell cols="2">Moors SentiArt</cell><cell>4279</cell><cell>0.63</cell></row><row><cell cols="2">Pattern Sentiart</cell><cell>3223</cell><cell>0.69</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table A .</head><label>A</label><figDesc></figDesc><table><row><cell>1</cell><cell></cell></row><row><cell cols="2">SentiArt valence computation options</cell></row><row><cell>Applied when com-</cell><cell>Option</cell></row><row><cell>puting valence of</cell><cell></cell></row><row><cell>word</cell><cell>To compute or not compute the centroids (average value) for the pos-</cell></row><row><cell></cell><cell>itive words and the negative seed words before we compute and sub-</cell></row><row><cell></cell><cell>tract the similarities.</cell></row><row><cell>word</cell><cell>To use a list of noun-only seed words or to include also corresponding</cell></row><row><cell></cell><cell>adjectives (and one verb, where there is no corresponding adjective).</cell></row><row><cell>fragment</cell><cell>To use or not to use the lens method [18, p. 22], which excludes the</cell></row><row><cell></cell><cell>second and third quartiles of the valence distribution from the compu-</cell></row><row><cell></cell><cell>tation.</cell></row><row><cell>fragment</cell><cell>To exclude or not punctuation characters.</cell></row><row><cell>fragment</cell><cell>To exclude or not function words. We use the function words as de-</cell></row><row><cell></cell><cell>fined by Dutch LIWC 2007.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table A .</head><label>A</label><figDesc></figDesc><table><row><cell>2</cell><cell></cell></row><row><cell>Valence seed words</cell><cell></cell></row><row><cell cols="2">Condition Positive</cell><cell>Negative</cell></row><row><cell>Basic</cell><cell cols="2">tevredenheid (contentment) walging (disgust)</cell></row><row><cell></cell><cell>blijdschap (joy)</cell><cell>verlegenheid (shyness)</cell></row><row><cell></cell><cell>genot (pleasure)</cell><cell>angst (anxiety)</cell></row><row><cell></cell><cell>trots (pride)</cell><cell>verdriet (sadness)</cell></row><row><cell></cell><cell>opluchting (relief)</cell><cell>schaamte (shame)</cell></row><row><cell></cell><cell>voldoening (satisfaction)</cell></row><row><cell></cell><cell>verrassing (surprise)</cell></row><row><cell>Extended</cell><cell>Basic seed words &amp;</cell></row><row><cell></cell><cell>tevreden (contented)</cell><cell>walgend (disgusted)</cell></row><row><cell></cell><cell>blij (glad)</cell><cell>verlegen (shy)</cell></row><row><cell></cell><cell>genieten (enjoy)</cell><cell>angstig (anxious)</cell></row><row><cell></cell><cell>trots (proud)</cell><cell>verdrietig (sad)</cell></row><row><cell></cell><cell>opgelucht (relieved)</cell><cell>beschaamd (ashamed)</cell></row><row><cell></cell><cell>voldaan (satisfied)</cell></row><row><cell></cell><cell>verrast (surprised)</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table A . 3</head><label>A3</label><figDesc>Pearson correlations between the four SentiArt dictionaries. 'ct': centroid, 'nct': non-centroid, 'n': noun labels, 'na': noun and adjective labels.</figDesc><table><row><cell></cell><cell cols="4">valence_nct_n valence_ct_n valence_nct_na valence_ct_na</cell></row><row><cell>valence_nct_n</cell><cell>1.00</cell><cell></cell><cell></cell><cell></cell></row><row><cell>valence_ct_n</cell><cell>0.97</cell><cell>1.00</cell><cell></cell><cell></cell></row><row><cell>valence_nct_na</cell><cell>0.91</cell><cell>0.87</cell><cell>1.00</cell><cell></cell></row><row><cell>valence_ct_na</cell><cell>0.94</cell><cell>0.93</cell><cell>0.98</cell><cell>1.00</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The notebooks and datasets underlying this paper are available at https://doi.org/10.5281/zenodo.13942218.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://huggingface.co/cardiffnlp/twitter-xlm-roberta-base-sentiment.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://huggingface.co/BramVanroy/robbert-v2-dutch-base-hebban-reviews5.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research was funded by the Netherlands eScience Center, grant number ASDI.2020.032.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Appendix</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.1. Tools</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2. Moors: Norms of valence, (...) for 4,300 Dutch words</head><p>The Moors approach is based on the Moors et al. article <ref type="bibr" target="#b25">[26]</ref> discussed in the text. The valences reported by Moors vary from 1 (lowest) to 7 (highest).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2.1. Pattern: A Subjectivity Lexicon for Dutch Adjectives</head><p>We use the De Smedt and Daelemans subjectivity dictionary discussed in the text <ref type="bibr" target="#b7">[8]</ref>. For the dictionary comparison, if a word occurs in the dictionary multiple times (because of multiple word senses) we take the average valence. For the computation of the fragment valence, we do not use the dictionary directly, but apply the Pattern toolset to the unlemmatised text fragment. Pattern does not just look at individual words, but uses some aspects of the context, such as the presence of intensifying adverbs ('awfully good').</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2.2. LiLaH: The LiLaH Emotion Lexicon of Croatian, Dutch and Slovene</head><p>The LiLaH dictionary <ref type="bibr" target="#b21">[22]</ref> is a manually corrected version of an automatic translation of the NRC emotion lexicon <ref type="bibr" target="#b24">[25]</ref> for three languages. It assigns words to positive or negative sentiment (+1 or -1) as well as to specific emotions. For Dutch, however, only the sentiment values are available. It contains 5746 Dutch words, 2519 are positive, 3431 are negative and 204 are both positive and negative. In the computation of the fragment valence, if a word is both positive and negative, we count its value as 0.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2.3. LIWC: Linguistic Inquiry and Word Count</head><p>LIWC is a general-purpose tool for text analysis created by psychologist James Pennebaker and colleagues. Its latest version is LIWC 2022 <ref type="bibr" target="#b4">[5]</ref>. Underlying the tool is an English-language dictionary that assigns words to (multiple) categories, including categories for positive and negative emotion. There exist Dutch translations for the 2007 <ref type="bibr" target="#b3">[4]</ref> and 2015 <ref type="bibr" target="#b38">[39]</ref> versions of LIWC. The translation of the LIWC 2007 dictionary is a manual translation. It includes wildcards. The translation of LIWC 2015 is an automatic translation that resolved wildcards.</p><p>What LIWC reports is the relative frequency in a text of words in the various categories. We compute the relative frequencies using the LIWCTools python package <ref type="bibr" target="#b2">[3]</ref>. For computation of the fragment valence we subtract the relative frequency of negative emotion from the relative frequency of positive emotion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A.2.4. SentiArt: a word-embedding based computation</head><p>For SentiArt, we use a word2vec-computed <ref type="bibr" target="#b22">[23]</ref> word embedding. We used the gensim package <ref type="bibr" target="#b32">[33]</ref> to do the computation; we pre-tokenized, lemmatized, and lowercase the text with a window size of 8 words and only counted lemmas that appear in the corpus 5 or more times.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table A.4</head><p>Overlap in words between the tools' dictionaries. A limitation to take into account: in LIWC07, some terms include wildcards. The number of words that it covers are therefore larger than the number reported here. The computation of the overlap does not take into account the wildcards. </p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond</title>
		<author>
			<persName><forename type="first">F</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Anke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Camacho-Collados</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Thirteenth Language Resources and Evaluation Conference</title>
				<meeting>the Thirteenth Language Resources and Evaluation Conference<address><addrLine>Marseille, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="258" to="266" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Comparing Transformer and Dictionary-Based Sentiment Models for Literary Texts: Hemingway as a Case-Study</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Bizzoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Feldkamp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Joint 3rd International Conference on Natural Language Processing for Digital Humanities and 8th International Workshop on Computational Linguistics for Uralic Languages</title>
				<meeting>the Joint 3rd International Conference on Natural Language Processing for Digital Humanities and 8th International Workshop on Computational Linguistics for Uralic Languages<address><addrLine>Tokyo, Japan</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="219" to="228" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">P</forename><surname>Boot</surname></persName>
		</author>
		<idno>3.3. 2016</idno>
		<ptr target="https://github.com/pboot/LIWCtools" />
	</analytic>
	<monogr>
		<title level="j">LIWCTools. Version</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The Dutch Translation of the Linguistic Inquiry and Word Count (LIWC) 2007 Dictionary</title>
		<author>
			<persName><forename type="first">P</forename><surname>Boot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zijlstra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Geenen</surname></persName>
		</author>
		<idno type="DOI">10.1075/dujal.6.1.04boo</idno>
	</analytic>
	<monogr>
		<title level="j">Dutch Journal of Applied Linguistics</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="65" to="76" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">The Development and Psychometric Properties of LIWC-22</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">L</forename><surname>Boyd</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ashokkumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Seraj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pennebaker</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page">10</biblScope>
			<pubPlace>Austin, TX</pubPlace>
		</imprint>
		<respStmt>
			<orgName>University of Texas at Austin</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Affective Norms for English Text (ANET): Affective ratings of text and instruction manual</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M</forename><surname>Bradley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Lang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
			<pubPlace>Gainesville, FL</pubPlace>
		</imprint>
		<respStmt>
			<orgName>University of Florida</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Techical Report. D-1</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M</forename><surname>Bradley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Lang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
	<note type="report_type">Tech. rep. Technical report C-1</note>
	<note>the center for research in psychophysiology</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Vreselijk mooi!&quot; (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">De</forename><surname>Smedt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Daelemans</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Lrec</title>
				<meeting><address><addrLine>Istanbul, Turkey</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="3568" to="3572" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Pattern for Python</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">De</forename><surname>Smedt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Daelemans</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Machine Learning Research</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="2063" to="2067" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1810.04805</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Human Language Reveals a Universal Positivity Bias</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">S</forename><surname>Dodds</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">M</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Desu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Frank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Reagan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Mitchell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">D</forename><surname>Harris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">M</forename><surname>Kloumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Bagrow</surname></persName>
		</author>
		<idno type="DOI">10.1073/pnas.1411678112</idno>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the national academy of sciences</title>
		<imprint>
			<biblScope unit="volume">112</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="2389" to="2394" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">The shapes of Stories: Sentiment Analysis for Narrative</title>
		<author>
			<persName><forename type="first">K</forename><surname>Elkins</surname></persName>
		</author>
		<idno type="DOI">10.1017/9781009270403</idno>
		<imprint>
			<date type="published" when="2022">2022</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Straightforward Statistics for the Behavioral Sciences</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Evans</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1996">1996</date>
			<publisher>Thomson Brooks/Cole Publishing Co</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Villainous or Valiant? Depictions of Oil and Coal in American Fiction and Nonfiction Narratives</title>
		<author>
			<persName><forename type="first">E</forename><surname>Grubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Algee-Hewitt</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.erss.2017.05.030</idno>
	</analytic>
	<monogr>
		<title level="j">Energy research &amp; social science</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="page" from="100" to="110" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Mining and Summarizing Customer Reviews</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Liu</surname></persName>
		</author>
		<idno type="DOI">10.1145/1014052.1014073</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining</title>
				<meeting>the tenth ACM SIGKDD international conference on Knowledge discovery and data mining</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="168" to="177" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Neurocomputational Poetics: How the Brain Processes Verbal Art</title>
		<author>
			<persName><forename type="first">A</forename><surname>Jacobs</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2023">2023</date>
			<publisher>Anthem Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Sentiment Analysis for Words and Fiction Characters from the Perspective of Computational (Neuro-) Poetics</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Jacobs</surname></persName>
		</author>
		<idno type="DOI">10.3389/frobt.2019.00053</idno>
	</analytic>
	<monogr>
		<title level="j">Frontiers in Robotics and AI</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page">53</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Computing the Affective-Aesthetic Potential of Literary Texts</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Jacobs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kinder</surname></persName>
		</author>
		<idno type="DOI">10.3390/ai1010002</idno>
	</analytic>
	<monogr>
		<title level="j">Ai</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="11" to="27" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">IDEST: International database of emotional short texts</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">K</forename><surname>Kaakinen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Werlen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kammerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Acartürk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Aparicio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Baccino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Ballenghein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Bergamin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Castells</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Costa</surname></persName>
		</author>
		<idno type="DOI">10.1371/journal.pone.0274480</idno>
	</analytic>
	<monogr>
		<title level="j">PLOS one</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page">e0274480</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">A Survey on Sentiment and Emotion Analysis for Computational Literary Studies</title>
		<author>
			<persName><forename type="first">E</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Klinger</surname></persName>
		</author>
		<idno type="DOI">10.17175/2019\_008\_v2</idno>
	</analytic>
	<monogr>
		<title level="j">Zeitschrift für digitale Geisteswissenschaften</title>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Sentiment Analysis and Opinion Mining</title>
		<author>
			<persName><forename type="first">B</forename><surname>Liu</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-031-02145-9</idno>
		<imprint>
			<date type="published" when="2012">2012</date>
			<publisher>Morgan Claypool</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">The LiLaH emotion lexicon of Croatian, Dutch and Slovene</title>
		<author>
			<persName><forename type="first">N</forename><surname>Ljubešić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Markov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Fišer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Daelemans</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Third Workshop on Computational Modeling of People&apos;s Opinions, Personality, and Emotion&apos;s in Social Media</title>
				<meeting>the Third Workshop on Computational Modeling of People&apos;s Opinions, Personality, and Emotion&apos;s in Social Media<address><addrLine>Barcelona, Spain; Barcelona, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020-12">December, 2020. 2020</date>
			<biblScope unit="page" from="1" to="5" />
		</imprint>
	</monogr>
	<note>(Online), ACL</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Distributed Representations of Words and Phrases and their Compositionality</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Corrado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dean</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 26th International Conference on Neural Information Processing Systems -Volume 2. Nips&apos;13</title>
				<meeting>the 26th International Conference on Neural Information Processing Systems -Volume 2. Nips&apos;13<address><addrLine>Red Hook, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Curran Associates Inc</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="3111" to="3119" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words</title>
		<author>
			<persName><forename type="first">S</forename><surname>Mohammad</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/P18-1017</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 56th annual meeting of the association for computational linguistics (volume 1: Long papers)</title>
				<meeting>the 56th annual meeting of the association for computational linguistics (volume 1: Long papers)<address><addrLine>Melbourne, Australia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="174" to="184" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Crowdsourcing a Word-emotion Association Lexicon</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Mohammad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D</forename><surname>Turney</surname></persName>
		</author>
		<idno type="DOI">10.1111/j.1467-8640.2012.00460</idno>
	</analytic>
	<monogr>
		<title level="j">Computational intelligence</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="436" to="465" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Norms of Valence, Arousal, Dominance, and Age of Acquisition for 4,300 Dutch Words</title>
		<author>
			<persName><forename type="first">A</forename><surname>Moors</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">De</forename><surname>Houwer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hermans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wanmaker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Van Schie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A.-L</forename><surname>Van Harmelen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">De</forename><surname>Schryver</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>De Winne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Brysbaert</surname></persName>
		</author>
		<idno type="DOI">10.3758/s13428-012-0243-8</idno>
	</analytic>
	<monogr>
		<title level="j">Behavior research methods</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="169" to="177" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Character-to-character Sentiment Analysis in Shakespeare&apos;s Plays</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">T</forename><surname>Nalisnick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">S</forename><surname>Baird</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics</title>
		<title level="s">Short Papers</title>
		<meeting>the 51st Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="479" to="483" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">The validity of lexicon-based sentiment analysis in interdisciplinary research</title>
		<author>
			<persName><forename type="first">E</forename><surname>Öhman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the workshop on natural language processing for digital humanities</title>
				<meeting>the workshop on natural language processing for digital humanities<address><addrLine>NIT Silchar, India</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="7" to="12" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">E</forename><surname>Osgood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Suci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Tannenbaum</surname></persName>
		</author>
		<title level="m">The Measurement of Meaning</title>
				<imprint>
			<publisher>University of Illinois press</publisher>
			<date type="published" when="1957">1957</date>
			<biblScope unit="volume">47</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">The Emotional Arcs of Stories are Dominated by Six Basic Shapes</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Reagan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Mitchell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kiley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">M</forename><surname>Danforth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">S</forename><surname>Dodds</surname></persName>
		</author>
		<idno type="DOI">10.1140/epjds/s13688-016-0093-1</idno>
	</analytic>
	<monogr>
		<title level="j">EPJ data science</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="12" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Sentiment Analysis in Literary Studies. A Critical Survey</title>
		<author>
			<persName><forename type="first">S</forename><surname>Rebora</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">DHQ: Digital Humanities Quarterly</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Shared Emotions in Reading Pirandello. An Experiment with Sentiment Analysis</title>
		<author>
			<persName><forename type="first">S</forename><surname>Rebora</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Atti del IX Convegno Annuale AIUCD. La svolta inevitabile: sfide e prospettive&apos;per l&apos;Informatica Umanistica</title>
				<editor>
			<persName><forename type="first">C</forename><surname>Marras</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Passarotti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Franzini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Litta</surname></persName>
		</editor>
		<meeting><address><addrLine>Milano</addrLine></address></meeting>
		<imprint>
			<publisher>Università Cattolica del Sacro Cuore</publisher>
			<date type="published" when="2020">2020. 2020</date>
			<biblScope unit="page" from="216" to="221" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Gensim-Python Framework for Vector Space Modelling</title>
		<author>
			<persName><forename type="first">R</forename><surname>Rehurek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Sojka</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">NLP Centre</title>
				<meeting><address><addrLine>Brno, Czech Republic</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="volume">3</biblScope>
		</imprint>
		<respStmt>
			<orgName>Faculty of Informatics, Masaryk University</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<title level="m" type="main">SemEval-2017 task 4: Sentiment Analysis in Twitter</title>
		<author>
			<persName><forename type="first">S</forename><surname>Rosenthal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Farra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Nakov</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/S17-2088</idno>
		<idno type="arXiv">arXiv:1912.00741</idno>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Word Use in the Poetry of Suicidal and Nonsuicidal Poets</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">W</forename><surname>Stirman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pennebaker</surname></persName>
		</author>
		<idno type="DOI">10.1097/00006842-200107000-00001</idno>
	</analytic>
	<monogr>
		<title level="j">Psychosomatic medicine</title>
		<imprint>
			<biblScope unit="volume">63</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="517" to="522" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<title level="m" type="main">The General Inquirer: A Computer Approach to Content Analysis</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Stone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Dunphy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Ogilvie</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1966">1966</date>
			<publisher>MIT press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<analytic>
		<title level="a" type="main">Evaluating Emotion Arcs across Languages: Bridging the Global Divide in Sentiment Analysis</title>
		<author>
			<persName><forename type="first">D</forename><surname>Teodorescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mohammad</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.findings-emnlp.271</idno>
	</analytic>
	<monogr>
		<title level="m">Findings of the Association for Computational Linguistics: EMNLP 2023</title>
				<meeting><address><addrLine>Singapore</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="4124" to="4137" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">The Validity of Sentiment Analysis: Comparing Manual Annotation, Crowd-coding, Dictionary approaches, and Machine Learning Algorithms</title>
		<author>
			<persName><forename type="first">W</forename><surname>Van Atteveldt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Van Der Velden</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Boukes</surname></persName>
		</author>
		<idno type="DOI">10.1080/19312458.2020.1869198</idno>
	</analytic>
	<monogr>
		<title level="j">Communication Methods and Measures</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="121" to="140" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<analytic>
		<title level="a" type="main">An Electronic Translation of the LIWC Dictionary into Dutch</title>
		<author>
			<persName><forename type="first">L</forename><surname>Van Wissen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Boot</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Electronic lexicography in the 21st century: Proceedings of eLex 2017 conference. Lexical Computing</title>
				<meeting><address><addrLine>Leiden, The Netherlands</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="703" to="715" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">Norms of Valence, Arousal, and Dominance for 13,915 English Lemmas</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">B</forename><surname>Warriner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Kuperman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Brysbaert</surname></persName>
		</author>
		<idno type="DOI">10.3758/s13428-012-0314-x</idno>
	</analytic>
	<monogr>
		<title level="j">Behavior research methods</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="1191" to="1207" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<analytic>
		<title level="a" type="main">A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges</title>
		<author>
			<persName><forename type="first">W</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bing</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lam</surname></persName>
		</author>
		<idno type="DOI">10.1109/TKDE.2022.3230975</idno>
		<idno>doi:</idno>
		<ptr target="https://doi.ieeecomputersociety.org/10.1109/TKDE.2022.3230975" />
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Knowledge and Data Engineering</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page" from="11019" to="11038" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
