<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">LLM embeddings on test items predict post hoc loadings in personality tests</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">𝑀𝑜𝑛𝑖𝑐𝑎</forename><surname>𝐶𝑎𝑠𝑒𝑙𝑙𝑎</surname></persName>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of humanistic studies</orgName>
								<orgName type="department" key="dep2">Natural and artificial cognition laboratory &quot;Orazio Miglino&quot;</orgName>
								<orgName type="institution">University of Naples Federico II</orgName>
								<address>
									<addrLine>via Porta di Massa 1</addrLine>
									<postCode>80125</postCode>
									<settlement>Naples</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Maria</forename><surname>Luongo</surname></persName>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of humanistic studies</orgName>
								<orgName type="department" key="dep2">Natural and artificial cognition laboratory &quot;Orazio Miglino&quot;</orgName>
								<orgName type="institution">University of Naples Federico II</orgName>
								<address>
									<addrLine>via Porta di Massa 1</addrLine>
									<postCode>80125</postCode>
									<settlement>Naples</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Davide</forename><surname>Marocco</surname></persName>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of humanistic studies</orgName>
								<orgName type="department" key="dep2">Natural and artificial cognition laboratory &quot;Orazio Miglino&quot;</orgName>
								<orgName type="institution">University of Naples Federico II</orgName>
								<address>
									<addrLine>via Porta di Massa 1</addrLine>
									<postCode>80125</postCode>
									<settlement>Naples</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Nicola</forename><surname>Milano</surname></persName>
							<email>nicola.milano@unina.it</email>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of humanistic studies</orgName>
								<orgName type="department" key="dep2">Natural and artificial cognition laboratory &quot;Orazio Miglino&quot;</orgName>
								<orgName type="institution">University of Naples Federico II</orgName>
								<address>
									<addrLine>via Porta di Massa 1</addrLine>
									<postCode>80125</postCode>
									<settlement>Naples</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michela</forename><surname>Ponticorvo</surname></persName>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of humanistic studies</orgName>
								<orgName type="department" key="dep2">Natural and artificial cognition laboratory &quot;Orazio Miglino&quot;</orgName>
								<orgName type="institution">University of Naples Federico II</orgName>
								<address>
									<addrLine>via Porta di Massa 1</addrLine>
									<postCode>80125</postCode>
									<settlement>Naples</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">LLM embeddings on test items predict post hoc loadings in personality tests</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">103DA5AEC0E2410CCD54F447F6683225</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:57+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Embeddings</term>
					<term>language models</term>
					<term>personality traits</term>
					<term>semantic similarity 1</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this article we examine the application of Large Language Models (LLMs) in predicting factor loadings in personality tests through the semantic analysis of test items. By leveraging text embeddings generated from LLMs, we assess the semantic similarity of test items and their alignment with hypothesized factorial structures, without relying on human response data. Our methodology uses embeddings from the Big Five personality test to explore correlations between item semantics and their grouping in factorial analyses. Preliminary results suggest that LLM-derived embeddings can effectively capture semantic similarities among test items, potentially serving as a valid measure for initial survey design and refinement. This approach offers insights into the robustness of embedding techniques in psychological evaluations, indicating a significant correlation with traditional test structures and providing a novel perspective on test item analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In psychological testing and, more in general, in the different contexts that include evaluation, it is very important to assess item quality. This process, known as item analysis, foresees the evaluation of different item characteristics including item difficulty, item discrimination, item and test reliability. The two main approaches to run item analysisclassical test theory (CTT) and item response theory (IRT) -(see <ref type="bibr" target="#b0">[1]</ref>) share the starting point of a person per item matrix: a matrix where examinees are rows and items are columns. This raises some problems, for example the matrix can be sparse if a lot of missing data are present. Whereas the item formulation relies on procedures that come before the test (and items) administration, for example focus group with experts <ref type="bibr" target="#b1">[2]</ref>, item analysis is typically based on post hoc analysis that used data coming from the administration to a sample of respondents. In this phase, together with item characteristics, evaluation is run on the test, especially in the framework of TCT, including the study reliability and of test structure in terms of latent variables by the means of factor analysis <ref type="bibr" target="#b3">[3]</ref>. This is a consolidated approach. Factor analysis is used to reduce a large number of variables into fewer numbers of factors, which have a psychological meaning. This technique extracts maximum common variance from all variables and expresses them into a common score, that is used for further analysis. This analysis allows also to calculate factor loadings that are basically the correlation coefficient between the single variable and the factor. Factor analysis has been widely used in psychological research both in cognitive domain (consider the factorial theories of intelligence, <ref type="bibr" target="#b4">[4]</ref><ref type="bibr" target="#b5">[5]</ref>) and in personality domain. In this latter case, between the theoretical proposals up the 1990s ( <ref type="bibr" target="#b6">[6]</ref><ref type="bibr" target="#b7">[7]</ref>), the Big Five Theory of personality is a notable example of how personality can be conceived and described as a constellation of different dimensions, factors in the terms we have used before. Evidence of this theory has grown over the years and five broad personality traits described by the theory are identified in extraversion, agreeableness, openness, conscientiousness, and neuroticism. This approach was developed in 1949 by <ref type="bibr" target="#b8">[8]</ref> and later expanded by other researchers up to the work by <ref type="bibr" target="#b9">[9]</ref>. For the development of this approach, a key role has been played by traits measurement. Traditionally, a Big Five personality test is taken with a questionnaire and response on a Likert scale <ref type="bibr" target="#b10">[10]</ref>. The Big Five Questionnaire <ref type="bibr" target="#b11">[11]</ref>, and its newer version BFQ-2 <ref type="bibr" target="#b12">[12]</ref>, is used in different contexts and represents a golden standard for measuring personality, according to the Big Five theory. It is formed of 132 items. Some questions ask how much a person agrees or disagrees that he or she is someone who exemplifies various specific statements, such as: "Is open to trying new experiences" (for openness, or open-mindedness) or "Is anxious about the future all the time" (for neuroticism, or negative emotionality). The responses, "Strongly agree" to "Strongly Disagree" (with alternatives in between) determines to what degrees the respondent show that specific traits. In this contribution, we propose a method to understand the strength of the connection between items and factors based only on the item considered as a linguistic material before the test administration. In order to do this, we used LLM, large language models <ref type="bibr" target="#b13">[13]</ref>, artificial intelligence models that are able to process and generate natural language. In these models, embeddings take a piece of text -a word, sentence, paragraph or even a whole article, and convert that into an array of floating-point numbers, the embedding vector. This way, the artificial neural network can code the linguistic information. Embeddings are a numerical representation of the semantic meaning of the content in a many-multidimensional space.</p><p>We used this representation to analyze item and check if the proximity of items in this multidimensional space corresponds to post-hoc loadings in personality test factor analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Materials and Methods: Utilizing Item Embeddings for Semantic Analysis</head><p>Advanced large language models (LLMs), have set new benchmarks in processing and generating text that is understandable by humans, seeing widespread application across countless tasks by millions globally. Within the realm of psychology, the ability of LLMs to interpret, contextualize, and extrapolate from human language without prior exposure has sparked considerable interest due to their potential in exploring unseen texts through a zero-shot approach <ref type="bibr" target="#b14">[14]</ref><ref type="bibr" target="#b16">[15]</ref><ref type="bibr" target="#b17">[16]</ref>. This section outlines the methodology for leveraging LLMs to assess semantic similarities among test items, determining if these similarities align with a hypothesized factorial structure subsequently identified in participant responses. LLMs internally convert input text into vector representations, known as embeddings, through the training phase. Each word or sentence is transformed into a fixed-size vector of floating-point numbers.</p><p>Research indicates that the space of these embeddings possesses metric characteristics <ref type="bibr" target="#b20">[18]</ref>, allowing for the mapping of similar concepts, such as colors or synonyms, closer together than disparate ones <ref type="bibr" target="#b20">[18]</ref><ref type="bibr" target="#b21">[19]</ref>. By utilizing this feature, embeddings serve as a tool for gauging semantic similarity across various domains.</p><p>We propose employing embeddings from established psychological tests to examine item similarities and verify whether the test exhibits the anticipated factorial structure by focusing solely on the items, excluding participant responses. To this end, we employ the Bidirectional Encoder Representations from Transformers (BERT) model <ref type="bibr" target="#b22">[20]</ref>, a pre-trained, open-source language model developed by Google. BERT, which has been trained on over five billion sentences from Wikipedia and the Google Book Corpus, aims to predict missing words in sentences. Since its introduction in 2018, numerous enhancements to the BERT model have been suggested. Our methodology utilizes roBERTa, a variant that has achieved top results on standard LLM benchmarks <ref type="bibr">[21]</ref>. As an initial step, we apply the Big5 personality test <ref type="bibr" target="#b23">[22]</ref>, mapping each of its 50 items into the embedding space with BERT to achieve a 1024dimensional vector representation for each item.</p><p>To determine the proximity of embeddings for different items within this space, we use cosine similarity, which calculates the cosine of the angle between two vectors. This measure, dependent solely on the angle and not the magnitude of vectors, is obtained by dividing the dot product of two vectors by the product of their magnitudes.</p><formula xml:id="formula_0">cosinesimilarity(A, B) = 𝐀 ⋅ 𝐁 ‖𝐀‖‖𝐁‖ (1)</formula><p>The resulting cosine similarity equation, where A and B represent the embeddings of two different items, produces a similarity matrix. This matrix has various applications, such as clustering to observe if the embeddings align with the hypothesized factorial structure or directly applying principal component analysis. The cosine similarity matrix, particularly when vectors are centered to have a zero mean, equates to the Pearson correlation coefficient. Thus, by analyzing items through the lens of LLMs, we can deduce whether our test's structure conforms to the expected factorial arrangement. Moreover, this approach allows for the modification or elimination of items that either poorly correlate with others or duplicate the same construct, streamlining the test process.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results : Alignment with Human Responses</head><p>Our analysis aims to determine whether embeddings derived from large language models can successfully identify semantic similarities among items and predict human responses to them. To this end, we first examine whether there is similarity between construct-related items in the embedding space.</p><p>Figure <ref type="figure" target="#fig_0">1</ref> shows a t-SNE <ref type="bibr" target="#b24">[23]</ref> projection of the 1024dimensional embeddings of the Big Five items into a two-dimensional space. The colors and letters indicate the factors underlying the items; for example, yellow and 'O' represent openness, with the numbers specifying the particular item. Different categories of items are mapped closer together and occupy substantially different zones of the space. There is some overlap due to single items that are more difficult to classify and to specific constructs that share overlapping meanings, even for humans, such as Agreeableness and Extraversion. We then applied Principal Component Analysis (PCA) to the cosine correlation matrix of these embeddings, which revealed a latent space that accurately groups items belonging to the same construct under a single principal component. This process not only facilitates the interpretation of outcomes but also confirms the theoretically anticipated structure. The focus now shifts to comparing the latent structure unearthed from the semantic similarity analysis with that derived from actual human responses. Specifically, we aim to investigate whether the item loadings generated from the embedded item representations mirror those obtained from analyzing human responses. By examining the correlation between the two sets of loadings, we gain insight into the extent to which item semantics predict human response patterns. Ideally, a high correlation between construct loadings would indicate not only that related items are grouped accurately but also that both crossloadings and other factor loadings exhibit similar trends.</p><p>For this purpose, we sourced human response data from the Open Psychometrics website for the Big 5 personality test. We then replicated the previously outlined embedding analysis on this response data, starting with the calculation of Pearson correlations for each pair of items, followed by PCA to determine item loadings based on the theoretical number of latent factors. For items phrased in reverse, we adjusted their scales to align with the correct direction, a necessary step because the embedding model does not differentiate between reversed and non-reversed items. This adjustment ensures that the comparison of loadings between the models remains valid.</p><p>The results, depicted in Figure <ref type="figure" target="#fig_1">2</ref>, show the Spearman correlation coefficient for the Big 5 test examined. We observed a high correlation between the embedding loadings and the human response loadings within the same constructs (R &gt; 0.4, p-value &lt;&lt; 0.001). Additionally, a significant correlation (R &gt; 0.4, p-value &lt;&lt; 0.001) was noted between the constructs of Agreeableness and Extraversion. These findings suggest that the semantic similarities among items effectively reflect the relationships among factors as found in human subjects. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions</head><p>From our analysis we found that the t-SNE projection of the embeddings maps items related to similar constructs close together in the embedding space. Despite some overlap due to ambiguous items or constructs with overlapping meanings, such as Agreeableness and Extraversion, the overall pattern suggests that embeddings derived from large language models capture semantic similarities among items.</p><p>Moreover, the application of Principal Component Analysis (PCA) on the cosine correlation matrix of the embeddings reveals a latent space where items belonging to the same construct are grouped under a single principal component. This alignment with the theoretical structure provides further validation of the embedding analysis. Considering the comparison with human response data, the correlation between the item loadings derived from the embedded item representations and those obtained from analyzing human responses indicates a significant relationship. The high correlation observed, particularly within the same constructs, suggests that the semantic similarities captured by the embeddings effectively mirror the relationships observed in human subjects. Notably, a significant correlation is observed between the constructs of Agreeableness and Extraversion, indicating a meaningful association between these factors in both the embedding analysis and human responses. Overall, these findings support the notion that embeddings derived from large language models can successfully identify semantic similarities among items and, so, serve as a valid preliminary measure in the context of survey design.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. T-SNE projection of the embeddings in a 2dimensional space. Yellow points are openness (O) related items, Green points are Conscientiousness (C) items, Blue points are Neuroticism (N) items, Black points are extra-version (E) items and violet points are Agreeableness (A) items.</figDesc><graphic coords="3,304.60,85.05,191.25,191.25" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 .</head><label>2</label><figDesc>Figure 2. Spearman correlation between Item embedding loadings (x-axis) and human response loadings (y-axis). Sperman r greater than 0.40 shows significative correlation between the loadings (pvalue &lt;&lt; 0.001)</figDesc><graphic coords="4,85.05,293.97,236.00,177.00" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Psychological testing: A practical approach to design and evaluation</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Kline</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
	<note>Sage publications</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">B</forename><surname>Mallinckrodt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Miles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Recabarren</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Using focus groups and Rasch item response theory to improve instrument development</title>
		<author>
			<persName><forename type="first">A</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Counseling Psychologist</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="146" to="194" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Utility of confirmatory factor analysis in test validation research</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Cole</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of consulting and clinical psychology</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">584</biblScope>
			<date type="published" when="1987">1987</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Factor theories of intelligence are all right almost</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Sternberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Educational Researcher</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="6" to="18" />
			<date type="published" when="1980">1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A three-stratum theory of intelligence: Spearman&apos;s contribution</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Carroll</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Human abilities</title>
				<imprint>
			<publisher>Psychology Press</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="1" to="17" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Five (or three) robust questionnaire scale factors of personality without culture</title>
		<author>
			<persName><forename type="first">M</forename><surname>Zuckerman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Kuhlman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Thornquist</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Kiers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Personality and individual Differences</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="569" to="578" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">The structure of human personality (Psychology Revivals)</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">J</forename><surname>Eysenck</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1953">1953</date>
			<publisher>Routledge</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Consistency of the factorial structures of personality ratings from different sources</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">W</forename><surname>Fiske</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Abnormal and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page">329</biblScope>
			<date type="published" when="1949">1949</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Four ways five factors are basic</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">T</forename><surname>Costa</surname><genName>Jr</genName></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">R</forename><surname>Mccrae</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Personality and individual differences</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="653" to="665" />
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">A review of key Likert scale development advances: 1995-2019</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Jebb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Tay</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in psychology</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page">637547</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The &quot;Big Five Questionnaire&quot;: A new questionnaire to assess the five factor model</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">V</forename><surname>Caprara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Barbaranelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Borgogni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Perugini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Personality and individual Differences</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="281" to="288" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">V</forename><surname>Caprara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Barbaranelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Borgogni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vecchione</surname></persName>
		</author>
		<title level="m">BFQ-2. Big five questionnaire</title>
				<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Attention is all you need</title>
		<author>
			<persName><forename type="first">A</forename><surname>Vaswani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Shazeer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Parmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Uszkoreit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">N</forename><surname>Gomez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Polosukhin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in neural information processing systems</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">30</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ryder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Subbiah</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Language models are few-shot learners</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Kaplan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dhariwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Amodei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Advances in neural information processing systems</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="1877" to="1901" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Turning large language models into cognitive models</title>
		<author>
			<persName><forename type="first">M</forename><surname>Binz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Schulz</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2306.03917</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">M S</forename><surname>Buschoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Akata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bethge</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Have we built machines that think like people?</title>
		<author>
			<persName><forename type="first">E</forename><surname>Schulz</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.16093</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Simulating Opinion Dynamics with Networks of LLM-based Agents</title>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">S</forename><surname>Chuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Harlalka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Suresh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hawkins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Rogers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">T</forename></persName>
		</author>
		<idno type="arXiv">arXiv:2311.09618</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Improving semantic similarity retrieval with word embeddings</title>
		<author>
			<persName><forename type="first">F</forename><surname>Yan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Fan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Concurrency and Computation: Practice and Experience</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="issue">23</biblScope>
			<biblScope unit="page">e4489</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Novel metrics for computing semantic similarity with sense embeddings</title>
		<author>
			<persName><forename type="first">D</forename><surname>Colla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Mensa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Radicioni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Knowledge-Based Systems</title>
		<imprint>
			<biblScope unit="volume">206</biblScope>
			<biblScope unit="page">106346</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">Bert: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1810.04805</idno>
		<ptr target="https://www.sbert.net/docs/pretrained_models.html" />
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Updating Norman&apos;s&quot; adequacy taxonomy&quot;: Intelligence and personality dimensions in natural language and in questionnaires</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">R</forename><surname>Mccrae</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">T</forename><surname>Costa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of personality and social psychology</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page">710</biblScope>
			<date type="published" when="1985">1985</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">L</forename><surname>Van Der Maaten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hinton</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Visualizing data using t-SNE</title>
	</analytic>
	<monogr>
		<title level="j">Journal of machine learning research</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">11</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
