<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Sound Symbolism in Automatic Emotion Recognition and Sentiment Analysis</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Alexander</forename><forename type="middle">J</forename><surname>Kilpatrick</surname></persName>
							<email>alexander_kilpatrick@nucba.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="institution">Nagoya University of Commerce and Business</orgName>
								<address>
									<settlement>Nisshin</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Sound Symbolism in Automatic Emotion Recognition and Sentiment Analysis</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">A74920955A9AFE77ABB523401503DF39</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T18:16+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Sound Symbolism</term>
					<term>Automatic Emotion Recognition</term>
					<term>Automatic Sentiment Analysis</term>
					<term>XGBoost</term>
					<term>Extreme Gradient Boosting</term>
					<term>Artificial Intelligence</term>
					<term>Human-AI Interaction</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This report documents the construction and output of extreme gradient boosted algorithms that were trained using the phonemes that make up American English words to identify how different sounds express emotion and sentiment. The data comprised of two corpora that consist of words that have been assigned scores according to how they reflect certain emotions and sentiments. The models are trained only on the phonemes that make up each word. This is a unique approach to automatic emotion recognition and sentiment analysis which typically does not consider individual phonemes. In addition to the boosted algorithms, linear regression is used to examine the relationships between word length, and emotions and sentiments.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The principle of the arbitrariness of the sign <ref type="bibr" target="#b0">[1]</ref> is a foundational concept in linguistics. It posits that there is no inherent or logical connection between the sound of a word and its meaning. In other words, linguistic signs are considered to be arbitrary, with their meanings assigned by convention rather than by any intrinsic relationship between sound and sense. In recent years, there has been a growing interest in exploring sound symbolism, challenging the notion of the arbitrariness of the sign (e.g., <ref type="bibr" target="#b1">[2]</ref>, <ref type="bibr" target="#b2">[3]</ref>, <ref type="bibr" target="#b3">[4]</ref>, <ref type="bibr" target="#b4">[5]</ref>). These studies have revealed that concepts such as size and shape exhibit stochastic relationships with sounds. Furthermore, many of these relationships are found to be consistent across different languages. For instance, the mil/mal effect, which observes that vowels like /i/ and /a/ are more frequently used in the names of small and large referents respectively, and the kiki/bouba effect, which notes that spiky shapes are often associated with sounds resembling kiki, while rounded shapes are often associated with sounds resembling bouba, have been found to hold true in numerous languages around the world (e.g., <ref type="bibr" target="#b1">[2]</ref>, <ref type="bibr" target="#b3">[4]</ref>). Emotional sound symbolism refers to the phenomenon where the sounds of words are associated with, and convey, specific emotional or affective qualities. In this linguistic concept, certain phonemes or combinations of sounds are thought to evoke or symbolize particular emotional states or feelings. In a cross-linguistic study of five languages <ref type="bibr" target="#b5">[6]</ref>, researchers observed a pattern whereby phonemes at the beginnings of words predict emotional valence most strongly and that phonemes associated with negative valence were uttered more quickly, drawing parallels between emotional sound symbolism and alarm calls in the animal kingdom. Important to the present study, they observed that in English, phonemes like /ʌ/, /d/, /ɪ/, and /oʊ/ were associated with negative valence while phonemes like /tʃ/, /ɛ/, and /p/ were associated with positive valence.</p><p>Automatic emotion recognition and sentiment analysis is a subfield of natural language processing which endeavors to construct algorithms that understand human emotion and sentiment in language. Sound symbolism has been largely overlooked in automatic emotion recognition and sentiment analysis although there are a few studies that explore sound symbolism through the lens of machine learning. For example, Winter and Perlman <ref type="bibr" target="#b6">[7]</ref> constructed random forest algorithms to show a systematic sizesound relationship in English size adjectives. As with the present study, the data was engineered in a manner so that each sample returned a count of the number of times each phoneme occurs in each word. The outcome of this is a dataset comprised of mostly null values. Following on this method researchers constructed algorithms to classify Pokémon names according to their evolution level using sound symbolism <ref type="bibr" target="#b7">[8]</ref>. They showed that the random forest algorithms were able to classify novel Pokémon names more accurately than Japanese university students assigned to an identical task. An issue of overfitting due to the high number of null values in the dataset was uncovered and resolved using crossvalidation. In the present study, word length is found to be a significant predictor of several emotions and sentiments and this effect is taken into consideration in the design and analysis of the algorithms. Potentially related, Li et al. <ref type="bibr" target="#b8">[9]</ref> examined the relationship between utterance length and word error rates in automatic speech recognition and speech emotion recognition. They found that shorter utterances tended to have higher word error rates likely due to a lack of contextual information.</p><p>The present report outlines the construction and output of algorithms designed to combine sound symbolism with automatic emotion recognition and sentiment analysis. Two corpora, with a combined total of almost 20,000 words, are used to train 19 algorithms that are each designed to classify samples according to specific emotions and sentiments. All algorithms return significant accuracy estimates.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Method</head><p>All data, files, codes, and links to a YouTube series documenting this project can be found in the following online repository: https://osf.io/brus3/?view_only=63412.</p><p>The present study uses two separate corpora to train machine learning algorithms. The first is the Glasgow Norms <ref type="bibr" target="#b9">[10]</ref>, a list of 5,500 words that have been assigned Likert scores for 9 sentiments. A full list of the sentiments in the Glasgow Norms is provided in Table <ref type="table" target="#tab_0">1</ref>. The second corpus is the NRC Word-Emotion Association Lexicon <ref type="bibr" target="#b10">[11]</ref>, a list of 14,000 words that have been assigned a binary score as to whether each word is associated with 10 emotions and sentiments. A full list of the emotions and sentiments in the NRC Lexicon can be found in Table <ref type="table" target="#tab_1">2</ref>. Words from both corpora were cross referenced with the Carnegie Mellon University Pronouncing Dictionary (CMU <ref type="bibr" target="#b11">[12]</ref>) to obtain American English phonemes for each word. Words that did not find a match in the CMU were manually checked. Instances of mismatched spelling were corrected. All other unmatched samples were discarded.</p><p>All analyses were conducted in the R environment <ref type="bibr" target="#b12">[13]</ref>. Word length was calculated by summing the number of phonemes in each word. No additional considerations were made for diphthongs or long vowels which were counted as single phonemes. The relationships between word length, and emotions and sentiments were analyzed using a series of regression equations, dependent variables being the average Likert scores in the Glasgow Norms and the binary classification in the NRC Lexicon; independent variables being word length. The XGBoost algorithms were constructed using the XGBoost <ref type="bibr" target="#b13">[14]</ref> and caret <ref type="bibr" target="#b14">[15]</ref> packages. K-fold cross-validation (K = 28) was used to avoid the overfitting issue reported in <ref type="bibr" target="#b7">[8]</ref>. The data was split into 8 subsets (A-H) and recombined using a Latin square resulting in 28 subsets with a 3:1 training to testing split. For example, the first iteration of each model is trained using subsets A through F and tested on subsets G and H. The following results report on the aggregate of each series of 28 iterations. Combined significance was calculated using Stouffer's <ref type="bibr" target="#b15">[16]</ref> and Fisher's <ref type="bibr" target="#b16">[17]</ref> methods; however only Fisher's method is reported as it returned more conservative significance estimates. The algorithms for both corpora were designed to classify samples so the Likert scale values in the Glasgow Norms were assigned to binary categories using a median split. The XGBoost algorithm was found to be susceptible to distribution skew, so categories were balanced by randomly removing samples from the majority category. This had little effect on the Glasgow Norms dataset due to the median split but removed around 80% of samples in the NRC dataset because only around 10% of samples in that dataset have a value of 1 in the binary dependent variable. To increase variability, balancing was conducted after cross-validation sub-setting. To limit the influence of word length in the XGBoost models, phoneme counts were divided by word length so that features were a percentage how much a phoneme makes up each word. This resulted in a convergence issue during tuning, so α was manually adjusted and the same learning rate was applied to all models (α = 0.1). All other hyperparameters were automatically tuned by inputting diverse hyperparameter settings into a tuning grid.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Linear Regression and Word Length</head><p>A series of linear regression models were calculated to test the relationship between word length, and the emotions and sentiments in the Glasgow Norms (Likert score) and the NRC Lexicon (binary). Table <ref type="table" target="#tab_0">1</ref> reports on the findings of the analyses conducted on the Glasgow Norms. Increased Age of Acquisition, Arousal, Size, and Valence had a significant positive correlation with word length while Concreteness, Familiarity, and Imaginability had a negative one. All significant relationships observed in the analyses conducted on the NRC Lexicon (Table <ref type="table" target="#tab_1">2</ref>) showed a positive correlation. These include Anger, Sadness, and Trust emotions while the Negative and Positive sentiments were also significant. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">XGBoost Accuracy</head><p>All models constructed and tested using the Glasgow Norms achieved an accuracy greater than chance and a Fisher's combined p-value &lt; 0.001. Table <ref type="table" target="#tab_2">3</ref> reports on the aggregated accuracy and standard deviation of these models. A similar result was found in the NRC models except in the case of the Surprise algorithm (p = 0.022 using Fisher's method and p = 0.021 using Stouffer's method). The NRC models are presented in Table <ref type="table" target="#tab_3">4</ref>. The Glasgow Norms models did report a greater accuracy than the NRC models; however, it is important to note that these were constructed and tested on larger datasets due to the balancing outlined in 2. The accuracy was, on average, higher and variability was lower in the models constructed using the Glasgow Norms models compared to the NRC models. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">XGBoost Feature Importance</head><p>Tables <ref type="table">5 and 6</ref> report the 15 most important features for the Glasgow Norms and NRC models respectively. Certain features are consistently important across models despite measuring different emotions. Features with high feature importance across models include voiceless plosives (/t/ and /k/), the alveolar nasal (/n/), approximant consonants (/ɹ/ and /l/), the alveolar fricative (/s/), and the openmid back vowel (/ʌ/) which appears particularly important, but it should be noted that this is also the most common phoneme in American English according to the CMU.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 5</head><p>The most important phonemes (IPA) and their feature importance (IMP) in the Glasgow Norms models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 6</head><p>The most important phonemes (IPA) and their feature importance (IMP) in the NRC models. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>All models achieved accuracy significantly greater than chance (p &lt; 0.001 in all cases but one).</p><p>Although further investigation is recommended, the results suggest that it is unlikely that sound symbolism in American English expresses fine-grained emotions and sentiments because the feature importance scores suggest many models are using the same features to make decisions. Rather, it seems that sound symbolism communicates emotional and sentimental weight. Consider that the Valence model in the Glasgow Norms-where high valence is positive and low valence is negative-and the Positive and Negative models in the NRC Lexicon all showed a positive correlation with word length. Positivity and negativity are sound symbolically expressed through longer words, although this is slightly stronger for positive sentiments as shown by the NRC lexicon models and the significant, but relatively weak, positive correlation in the Valence regression model. Those sounds that have high feature importance scores across models include voiceless plosives (/t/ and /k/), the alveolar nasal (/n/), approximant consonants (/ɹ/ and /l/), the alveolar fricative (/s/) and the open-mid back vowel (/ʌ/). Most of the consistently important consonants are produced at the alveolar ridge. /ʌ/ appears to be an especially important feature across models. This observation falls in line with <ref type="bibr" target="#b5">[6]</ref> who showed that /ʌ/ was associated with negative valence; however, few other patterns can be drawn between the that study and the present report. The high importance of /ʌ/ might also be due to a combination of its high occurrence frequency, being the most common phoneme in the CMU, and the distribution of null values in independent variables (NRC = 85%; Glasgow Norms = 88%). XGBoost algorithms are constructed using decision trees which base their decisions upon the outcomes of nodes. At each node a certain number of features are tested. /ʌ/ is the most commonly occurring phoneme in English and it will often be tested against low frequency features with null values. This issue was somewhat mitigated by dividing phoneme counts against word length, but it doesn't solve the problem entirely. Take for example Age of Acquisition Likert scores which were shown to have the strongest association with increased word length across all models, this is an unsurprising finding. However, Age of Acquisition XGBoost model feature importance scores revealed that /ʌ/ was the most important feature in that model, to a much greater degree than other models. This suggests that word length is still contributing to the models despite attempts to mitigate its influence through model tuning and data engineering. Word length could be included in the XGBoost models; however, given that length has a greater range than phoneme counts and no null values, this would likely mask weaker features <ref type="bibr" target="#b7">[8]</ref>.</p><p>That said, all models reported significant accuracy. Given that most automatic emotion recognition and sentiment analysis systems rely heavily on lexical and syntactic features, this study underscores the potential of phonemic information as an additional valuable resource for improving the accuracy of such systems, especially when dealing with emotional and sentimental aspects of language. While the current study provides valuable insights into the role of sound symbolism in sentiment analysis, future research could delve further into the interplay between phonemic features and linguistic and contextual factors to enhance the robustness and generalizability of sentiment analysis models across different languages and domains.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Word Length and the Glasgow Norms.</figDesc><table><row><cell>Sentiment</cell><cell cols="2">F-statistic p-value</cell><cell>R 2</cell></row><row><cell>Age of Acquisition</cell><cell>986.4</cell><cell cols="2">&lt; 0.001 0.152</cell></row><row><cell>Arousal</cell><cell>152.7</cell><cell cols="2">&lt; 0.001 0.027</cell></row><row><cell>Concreteness</cell><cell>321.5</cell><cell cols="2">&lt; 0.001 0.055</cell></row><row><cell>Dominance</cell><cell>0.273</cell><cell>0.601</cell><cell>0</cell></row><row><cell>Familiarity</cell><cell>63.72</cell><cell cols="2">&lt; 0.001 0.011</cell></row><row><cell>Gender</cell><cell>0.617</cell><cell>0.432</cell><cell>0</cell></row><row><cell>Imaginability</cell><cell>333.4</cell><cell cols="2">&lt; 0.001 0.057</cell></row><row><cell>Size</cell><cell>578.3</cell><cell cols="2">&lt; 0.001 0.095</cell></row><row><cell>Valence</cell><cell>5.928</cell><cell>0.015</cell><cell>0.001</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Word Length and the NRC Lexicon</figDesc><table><row><cell>Variable</cell><cell cols="3">Measure t-value p-value</cell></row><row><cell>Anger</cell><cell>Emotion</cell><cell>2.099</cell><cell>0.036</cell></row><row><cell cols="2">Anticipation Emotion</cell><cell>1.835</cell><cell>0.067</cell></row><row><cell>Disgust</cell><cell>Emotion</cell><cell>-0.241</cell><cell>0.809</cell></row><row><cell>Fear</cell><cell>Emotion</cell><cell>0.252</cell><cell>0.801</cell></row><row><cell>Joy</cell><cell>Emotion</cell><cell>-0.02</cell><cell>0.984</cell></row><row><cell>Negative</cell><cell cols="2">Sentiment 3.571</cell><cell>&lt; 0.001</cell></row><row><cell>Positive</cell><cell cols="2">Sentiment 7.572</cell><cell>&lt; 0.001</cell></row><row><cell>Sadness</cell><cell>Emotion</cell><cell>2.467</cell><cell>0.014</cell></row><row><cell>Surprise</cell><cell>Emotion</cell><cell>0.125</cell><cell>0.901</cell></row><row><cell>Trust</cell><cell>Emotion</cell><cell>4.965</cell><cell>&lt; 0.001</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Glasgow Norms model accuracy (ACC) and standard deviation (STD) and Fisher's combined p values (p).</figDesc><table><row><cell>Sentiment</cell><cell cols="2">ACC STD</cell><cell>p</cell></row><row><cell cols="2">Age of Acquisition 63%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Arousal</cell><cell>58%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Concreteness</cell><cell>61%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Dominance</cell><cell>53%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Familiarity</cell><cell>56%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Gender</cell><cell>58%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Imaginability</cell><cell>62%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Size</cell><cell>58%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Valence</cell><cell>56%</cell><cell cols="2">1% &lt; 0.001</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>NRC model accuracy (ACC) and standard deviation (STD) and Fisher's combined p values (p).</figDesc><table><row><cell>Variable</cell><cell cols="2">ACC STD</cell><cell>p</cell></row><row><cell>Anger</cell><cell>54%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell cols="2">Anticipation 53%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell>Disgust</cell><cell>55%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell>Fear</cell><cell>54%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell>Joy</cell><cell>53%</cell><cell cols="2">3% &lt; 0.001</cell></row><row><cell>Negative</cell><cell>54%</cell><cell cols="2">1% &lt; 0.001</cell></row><row><cell>Positive</cell><cell>54%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell>Sadness</cell><cell>54%</cell><cell cols="2">2% &lt; 0.001</cell></row><row><cell>Surprise</cell><cell>51%</cell><cell>3%</cell><cell>0.022</cell></row><row><cell>Trust</cell><cell>52%</cell><cell cols="2">2% &lt; 0.001</cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Age of</head></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">D</forename><surname>Saussure</surname></persName>
		</author>
		<title level="m">Cours de linguistique générale</title>
				<meeting><address><addrLine>Paris</addrLine></address></meeting>
		<imprint>
			<publisher>Payot</publisher>
			<date type="published" when="1916">1916</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The bouba/kiki effect is robust across cultures and writing systems</title>
		<author>
			<persName><forename type="first">A</forename><surname>Ćwiek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fuchs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Draxler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">L</forename><surname>Asu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dediu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hiovain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Winter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Philosophical Transactions of the Royal Society B</title>
		<imprint>
			<biblScope unit="volume">377</biblScope>
			<biblScope unit="page">20200390</biblScope>
			<date type="published" when="1841">2022. 1841</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Symbouki: a meta-analysis on the emergence of sound symbolism in early language acquisition</title>
		<author>
			<persName><forename type="first">M</forename><surname>Fort</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Lammertink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Peperkamp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Guevara-Rukoz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fikkert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tsuji</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Developmental science</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page">e12659</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A cross-linguistic study of sound symbolism: The images of size</title>
		<author>
			<persName><forename type="first">K</forename><surname>Shinohara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kawahara</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Annual meeting of the berkeley linguistics society</title>
				<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="396" to="410" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Five mechanisms of sound symbolic association</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Sidhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">M</forename><surname>Pexman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychonomic bulletin &amp; review</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="1619" to="1643" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Emotional sound symbolism: Languages rapidly signal valence via phonemes</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Adelman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Estes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Cossu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cognition</title>
		<imprint>
			<biblScope unit="volume">175</biblScope>
			<biblScope unit="page" from="122" to="130" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Size sound symbolism in the English lexicon</title>
		<author>
			<persName><forename type="first">B</forename><surname>Winter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Perlman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Glossa: a journal of general linguistics</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Random forests, sound symbolism and Pokémon evolution</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Kilpatrick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ćwiek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kawahara</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PloS one</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">e0279350</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Klejch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Bell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Lai</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2305.16065</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">The Glasgow Norms: Ratings of 5,500 words on nine scales</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">G</forename><surname>Scott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Keitel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Becirspahic</surname></persName>
		</author>
		<idno type="DOI">10.3758/s13428-018-1099-3</idno>
		<ptr target="https://doi.org/10.3758/s13428-018-1099-3" />
	</analytic>
	<monogr>
		<title level="j">Behav Res</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="page" from="1258" to="1270" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">NRC Emotion Lexicon</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Mohammad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D</forename><surname>Turney</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">National Research Council Canada</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page">234</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<ptr target="http://www.speech.cs.cmu.edu/cgi-bin/cmudict" />
		<title level="m">CMU Pronouncing Dictionary</title>
				<imprint>
			<date type="published" when="2023-06-16">June 16, 2023</date>
		</imprint>
		<respStmt>
			<orgName>Carnegie Mellon University</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">Team</forename><surname>Core</surname></persName>
		</author>
		<title level="m">R: A language and environment for statistical computing</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>Computer software</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">XGBoost: Extreme Gradient Boosting</title>
		<author>
			<persName><forename type="first">T</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>He</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">R package version</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">5</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note>Computer software</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">caret: Classification and Regression Training</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kuhn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">R package version</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="0" to="88" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>Computer software</note>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Stouffer</surname></persName>
		</author>
		<title level="m">The American Soldier: Adjustment During Army Life</title>
				<imprint>
			<publisher>Princeton University Press</publisher>
			<date type="published" when="1949">1949</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Statistical methods for research workers</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Fisher</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1925">1925</date>
			<publisher>Oliver and Boyd</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
