<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Text Mining to uncover Prehistoric Pastness in Museums</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Haley</forename><forename type="middle">Anne</forename><surname>Schwartz</surname></persName>
							<email>hschwartz@ub.edu</email>
							<affiliation key="aff0">
								<orgName type="department">Departament de Didàctiques Aplicades</orgName>
								<orgName type="institution">Universitat de Barcelona</orgName>
								<address>
									<addrLine>Pg. de la Vall d&apos;Hebron</addrLine>
									<postCode>171, 08035</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">DIDPATRI Grup de Recerca</orgName>
								<orgName type="institution">d&apos;Hebron</orgName>
								<address>
									<addrLine>Pg. de la Vall</addrLine>
									<postCode>171, 08035</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Paula</forename><surname>Jardón Giner</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">DIDPATRI Grup de Recerca</orgName>
								<orgName type="institution">d&apos;Hebron</orgName>
								<address>
									<addrLine>Pg. de la Vall</addrLine>
									<postCode>171, 08035</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Departament de Didàctica de les Ciències Experimentals i Socials</orgName>
								<orgName type="institution">Universitat de València</orgName>
								<address>
									<addrLine>Av. de Blasco Ibáñez, 13, El Pla del Real</addrLine>
									<postCode>46010</postCode>
									<settlement>València, Valencia</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Xavier</forename><surname>Rubio Campillo</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Departament de Didàctiques Aplicades</orgName>
								<orgName type="institution">Universitat de Barcelona</orgName>
								<address>
									<addrLine>Pg. de la Vall d&apos;Hebron</addrLine>
									<postCode>171, 08035</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">DIDPATRI Grup de Recerca</orgName>
								<orgName type="institution">d&apos;Hebron</orgName>
								<address>
									<addrLine>Pg. de la Vall</addrLine>
									<postCode>171, 08035</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Text Mining to uncover Prehistoric Pastness in Museums</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">D238BE1D3E14EC6287864E3BBE0F1F25</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:50+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Text Mining</term>
					<term>Topic Modelling</term>
					<term>Museums</term>
					<term>Prehistory</term>
					<term>Digital Humanities</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper is a presentation of a current work in progress, specifically the exploratory phase for determining a methodological framework, clear objectives, and establishing preliminary results to guide the future direction of the project. The paper sees the application of text analysis to a corpus body of texts with a focus on highlighting heritage and intersectional data present within these texts. The approach of text analysis allows for a quantitative analysis of modern perceptions of the past, narratives given to the past by modern people, and the resulting context elements of the past are placed in stemming from modern influences. With a focus on how prehistory is presented to modern people, in the specific context of museums, it is necessary to trace the contents of texts depicting the past in these museums. The overall goal of this paper is to have a deeper understanding of the impact modern narratives attributed to the past has on the prehistoric past in an educational context. Specifically, looking at narratives focused on the process of neolithization as discussed in museums. Additionally, preliminary explorations give insight into the benefits of the methodology and how to best establish next steps to propel future research.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Museums are integral tools through which the past is understood and interpreted using tangible scientific evidence. These institutions provide modern people with both physical and experiential elements for developing interpretations and relationships with the past <ref type="bibr" target="#b32">[33]</ref> and educational opportunities <ref type="bibr" target="#b18">[19]</ref>. As far as the physical, museums make accessible surviving material culture through exhibits and displays, granting visitors direct visual and physical access to remnants of the past <ref type="bibr" target="#b6">[7]</ref>. Museums use storytelling and display techniques that cultivate an experience for visitors to further build connections <ref type="bibr" target="#b22">[23]</ref>. The contextualization of the past in the form of digestible interpretations for visitors are linked to the place and time of origin for the material culture used to aid in the storytelling and educational process of the archaeological information available <ref type="bibr" target="#b20">[21]</ref>. These include, but are not limited to photography, audiovisual supplementation, digital reconstructions, and texts. These modalities consumed by visitors are available and ripe for analysis.</p><p>The following paper details the exploratory stage for an in-progress research project utilizing quantitative textual analysis applied to a corpus of archaeological museum texts. This stage of the analysis was used to formulate the best methodological approach to carry out text mining for this corpus, set parameters, and solidify next steps in this project. This research is focused on the types of narratives linked with production societies within museums, specifically museums along the east of the Iberian Peninsula. The interest is in how prehistory-producing societies are taught in museums by analyzing associated texts.</p><p>As this is the first stage of the project all results are beneficial to evaluate the data, any need for additional data, and exploring the research questions through quantitative text analysis. The proposed questions are: 1) Through the texts produced in these museums is it possible to determine the key issues from those societies and periods? 2) Do the narratives fit with the current accepted research surrounding these societies and periods? 3) Are any problems from past societies related to problems of the present? and 4) How is the information treated and presented within these museums? These early results are indicative for next steps in this research project.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Archaeological Heritage Discourse and Text Analysis</head><p>In the context of archaeological heritage, textual sources are an important source highlighting shifting narratives, contexts, and interpretations linked to tangible and intangible archaeological heritage -including landscapes, surviving sites, museums, etc. Discourse studies and continually evolving quantitative methods continue to highlight the benefits of textual analysis applied to archaeological heritage data <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b26">27,</ref><ref type="bibr" target="#b30">31]</ref>. Museums contain a surplus of textual sources discussing tangible and intangible archaeological heritage.</p><p>In previous research, this method has been used to discern how archaeologists discuss social issues over time <ref type="bibr" target="#b26">[27]</ref>, temporal shifts within academic articles about archaeological heritage landscapes [schwartz2023text ] and tracing geographical dispersion of archaeological research in regions using historical archive data and newspaper collections <ref type="bibr" target="#b21">[22]</ref>. Applying textual analysis tools on museum texts aids in uncovering subjectivity and biases within museums in how archaeological knowledge is presented to visitors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Museums as facilitators of Memory and Relationships with the Past</head><p>Data from surveys and other research provides insight into how modern people cultivate relationships with the past through perceptions, interpretations, and context in which people are exposed to the past <ref type="bibr" target="#b1">[2]</ref>. These interpretations are correlated with individual experiences and cultural constructs illustrating that the past is seen by modern people not through the lens of what the past was but through the lens of the present <ref type="bibr" target="#b1">[2]</ref>. Our ability to connect with the past is relegated to individual and group perceptions, stemming from the context in which the past is interpreted and presented . There is a cycle in the interpretation, presentation, perception, and reinterpretation of the past that is dependent upon the era, new discoveries, new research, and shifting cultural structures.</p><p>Looking at museums as agents facilitating that cycle, it is important to look to the existing presentation of the past within these museums. In order to determine what constitutes the narratives currently presented to people about the past, to see the effects of archaeological education with this modality <ref type="bibr" target="#b10">[11]</ref>. It is necessary to make clear that archaeological knowledge -in whichever context -is a produced knowledge linked to era, bias, interpretation, and existing research and discoveries <ref type="bibr" target="#b8">[9]</ref>. Any and all educational pursuits are carried out subjectively, which is further reflected in museums -that span countries, cultures, and contexts. Furthermore, the dissemination of archaeological knowledge -in various stages of production or reproduction -is heavily influenced through social methods rather than more passive techniques <ref type="bibr" target="#b8">[9]</ref>. The more the presentation of the past is considered in the museum context, the more insight we gain into the specific educational methods and narratives consumed by visitors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">The Context: Archaeological Museums and Neolithization in the Iberian Peninsula</head><p>Texts appearing within museums linked to the development of early farming societies and metal age societies-including discussions on the neolithization process -were utilized in this study. This project aims to gain a better understanding of the words and concepts linked to this part of the past. Specifically, the way in which narratives of neolithization and early producing societies are presented to the public in museums. Within the Iberian Peninsula, there are regional-specific variations and circumstances in which the overall development of these production societies occurred <ref type="bibr" target="#b0">[1]</ref>. There are differences in which neolithization impacted specific regions and the people within, in conjunction with clear neolithic traits not dependent on a region (i.e., farming practices, economic structures, technological advancements, etc.) <ref type="bibr" target="#b11">[12]</ref>. Additionally, changes to social structures and interactions between others which impacted the role of reciprocity, growth of social inequalities, and the development of social networks <ref type="bibr" target="#b0">[1]</ref>, will be explored in future research. Within the Iberian Peninsula, new research and discoveries focused on the region <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b17">18]</ref> brings attention to the museums displaying these processes and. As the production and reproduction of this type of archaeological knowledge evolves, how quickly and accurately does this knowledge evolve within the museums. Analyzing museum texts allows one to see what contexts and narratives surrounding production societies and neolithization in the Iberian Peninsula is presented.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Materials and Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Materials</head><p>The corpus used in this project was built from a collection of texts in Català, compiled from permanent exhibitions appearing in eight museums over a period of thirty-years along the east of the Iberian Peninsula. These texts -with an overall sum of twenty-five thousand wordswere extrapolated from panels that describe the different themes of the museum. They are explanatory texts of general Neolithic culture characterizations (i.e., farming, work techniques, religion, etc.). They exclude display cases as the sole interest is the main narratives presented to visitors. The texts are general descriptors of Neolithic elements, with a select few descriptors of specific sites. R Statistical Software (v4.2.2) <ref type="bibr" target="#b29">[30]</ref> was used to carry out all processing, analysis, and visualizations of the texts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Methods</head><p>The texts were all collected using the same process of OCR with manual and automatic inference. The individual texts were photographed and converted using OCR, with the texts manually verified when necessary. During this process, it was decided to record all texts as a single document per museum. The reasoning is each museum splits the content differently. Therefore, a side-by-side comparison of each text (i.e., panel) would not be useful and would include strong biases (i.e., museums where text is split in several panels would have a higher weight that other ones where text is concentrated on a few large panels). Following the text collection, Topic modelling was applied as a type of 'distant reading' to trace linguistic patterns regarding the storytelling process between different museums presentation of archaeological knowledge. R was used to explore content, word appearance, word frequency, and context. All texts were loaded into, processed, and the corpus built within R using a number of packages: tidytext (v0.3.4) <ref type="bibr" target="#b31">[32]</ref>, tm (v0.7.9) <ref type="bibr" target="#b5">[6]</ref> and topicmodels (v0.2.13) <ref type="bibr" target="#b7">[8]</ref>.</p><p>Topic modelling was selected for its machine learning prowess in the sorting and classification of documents in a corpus, assigning topics, and highlighting temporal distribution of topics <ref type="bibr" target="#b4">[5]</ref>. The process of topic modelling itself is a Bayesian analytic approach which is capable of identifying semantic structures within documents that make up a corpus and then reorder the entities within a corpus (words, documents, topics) based on probability distribution between the entities of a corpus (topics linked with words, documents linked with topics) <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b25">26]</ref>. There are a range of methods for carrying out topic modelling with Latent Dirichlet allocation (LDA) chosen due to how the algorithm searches for topics, assigns topics, and having unstructured topics which fits best with both the small size of the corpus and the datatype <ref type="bibr" target="#b25">[26]</ref>.Previous research highlighted the benefits of applying LDA to archaeological heritage texts <ref type="bibr" target="#b26">[27,</ref><ref type="bibr" target="#b30">31]</ref>.</p><p>Previous research was consulted to determine a preprocessing chain to help remove bias and noise as much as possible. A metrics test was run to determine the appropriate number of topics per the corpus, parameters of utilizing stopwords and applying lemmatization, and training an LDA model was decided upon using previous research as guidelines <ref type="bibr" target="#b25">[26,</ref><ref type="bibr" target="#b26">27,</ref><ref type="bibr" target="#b30">31]</ref>. Once the parameters were set, metrics tests were run, and the model trained -the processed corpus was analyzed using the previously mentioned R packages in conjunction with the additional package ldatuning (v1.0.2) <ref type="bibr" target="#b23">[24]</ref> to run the LDA algorithm.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Preliminary Explorations</head><p>This first exploration of the data used text mining techniques using R to analyze word frequency and occurrence. The focus was to find the overall most frequently appearing terms throughout the corpus and then the most frequently occurring terms for each text per museum. Looking first at the frequency of term appearance overall, as seen in Figure <ref type="figure">1</ref>, the entirety of the terms present are of importance. The terms all are indicative and relative to the overall process of neolithization. As the terms are in Català, translations include: neolitic(neolithic), pedra(stone), poblats(villages), cova/coves(cave(s)), restes(remains), edat(age), silex(flint), objectes(objects). Now looking at the frequency of word appearance per text per museum, differences in the narratives per museum are discernible within Figure <ref type="figure">2</ref>. For starters, in the entries for M BBAA Castelló (Museo de Bellas Artes de Castellón), key terms include: coure(copper), vida(life), canvis(changes), and all centered around the province Castelló -also a term -located in Valencia. Whereas, in the entries for MAC Barcelona (Museu d'Arqueologia de Catalunya, Barcelona), key terms are: sílex(flint), cabanes(huts), and fossa(moats).The last example are the entries for Museu Lleida( Museu de Lleida) with the terms: restes(remains), eines(tools), lloc(site) all pertaining to Minferri -an additional term -which is an Early Bronze Age settlement located in Lleida. Just from this intervention, there is a delineation between narratives in museumslinked to location and elements of value in each location. The next step was running topic modelling. In this iteration, the model is made up of nine topics composed of the ten most frequently occurring terms per topic. These preliminary explorations made visible the types of topics within the corpus and the trends in each topic, as illustrated in Figure <ref type="figure" target="#fig_1">3</ref>.</p><p>Inference based on the terms give insight to the general theme of the topic itself. For example, in Topic 4 terms include: lloc(place), bronz(bronze), vid(life), cult(worship), and art -which indicates the context of art and culture within the bronze age. Whereas in topic 8 key terms are: cov(cave), cultur(culture), décor(decoration), and bronz(bronze) -leading to infer this topic pertains to cave art during this period. Additionally, it is through interpreting these topics that names are developed as a result. Proposed names per topic are presented in Table <ref type="table" target="#tab_0">1</ref>.</p><p>Focusing on the themes within each of the topics highlight contexts within the text, of what terms were occurring in proximity together. These early results provided the researchers of the project the chance to consider potential changes to the dataset, adjustments to the methodological approach, and other ways of analyzing the contents of the texts. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion</head><p>Museums are one example of a center for archaeological heritage discourse in which individuals can experience material and immaterial culture which helps develop one's relationship with the past <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b15">16]</ref>. Focusing on archaeological museums specifically, they are a respected place in which visitors experience, perceive, and learn about archaeological knowledge in a public setting. Before continuing further, it is important to note the many criticisms and controversies that are necessary in museum discourse today regarding problems and unethical museum-practices of the past -and in some cases -the present <ref type="bibr" target="#b9">[10]</ref>. Regarding the scope of this paper as the exploratory phase, these points will not be discussed further presently but are integral to consider and include in the future. Museums are integral with cultivating memory of the past seen through the interpretations used to create narratives presented to visitors <ref type="bibr" target="#b14">[15]</ref>. Yet, narratives surrounding the past are not stagnant and as new information becomes available, new technologies allow for more complex study <ref type="bibr" target="#b19">[20]</ref>. It is necessary to trace shifts in changing narratives and interpretations, looking at how often museums update their presentations and exhibits <ref type="bibr" target="#b14">[15]</ref>. Archaeological heritage discourse sheds light on the narratives, contexts, and perceptions of the dominating views of the time -and that extends to discourse in museums. Texts existing within museums should be utilized to the same degree as other archaeological heritage textual sources to gain insight into the untapped data within. These first results are valuable for reviewing the current methodology process and determining how the project can best progress.</p><p>As far as assessing how text analysis can illuminate key issues from the past as it is connected to early production societies, the present themes and word frequency illuminate the focal point of the narratives and interpretations presented to visitors. If museums are determined to provide visitors with insight into all facets of the past, terms and themes related to said issues should be visible through text mining. At this stage of the project, any answers to research questions are preliminary and are more valuable in determining next steps. Again, the primary focus has been on the efÏcacy of the methods, potential changes to the data, and what adjustments need to be made for parameters.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Challenges</head><p>Originally, during the preprocessing of the texts before building the corpus, there was a limit to the cleaning of the text. In part, this is due to the text in Català and the need to see how the language is recognized within R. There was no general list of stopwords for Català, though there is one in English. Nor a personalized list of stopwords, something that would aid in removing noise <ref type="bibr" target="#b27">[28]</ref>. Early explorations indicated the necessity and additions to the preprocessing chain that include: general stopwords (i.e., s'hi(in), d'un(of one), and s'han(they have), lemmatization, and removing special characters. While these changes were employed and explored, more intervention into the preprocessing chain in the context of working with a non-English language is needed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Exploratory Results</head><p>The preliminary figures illustrate how with the strengthening of the methodological approach, the research questions can be answered through a quantitative analysis. With the goal of exploring the extent current exhibitions offering this archaeological knowledge consider all advances of knowledge over the last three decades.</p><p>With Figure <ref type="figure">1</ref>, focusing on early production societies and neolithization, the terms we can expect about these peoples and societies in general are present (i.e., pedra(stone), poblats(villages), cerámica(ceramics), silex(flint). For Figure <ref type="figure">2</ref>, it is further possible to discern the relationship between topics and museums. For example, looking at M BBAA Castelló, in this area neolithization is in relation to animals, with habitat more often in caves which is reflective of the mountain region. Where, MAC Barcelona sees terms that fit more describing the neolithic cultures in general within Catalunya, with more specialized concepts used by archaeologists. This is reflective of the museum having an academic and technological discourse.</p><p>Regarding these texts reflecting narratives in accepted research, there are discrepancies. For example, within these texts there is no mention of Mesolithic peoples, nor any mention of the transformations of neolithic landscapes. There is also a focus on specific neolithic activities that take precedence over others. There is a larger appearance of farming compared to other neolithic activities, such as herding. Additionally, there is no mention of the impact of neolithic on prehistoric landscapes. This reflects that some issues are the importance of purely economic activities and technology beyond other types of traits that define neolithic (i.e., social structures, sexual division of labor, environmental dynamics, etc.). Furthermore, with the goal of being able to interpret and understand the treatment and presentation of these texts, additional analysis can consult linked metadata or comparing the evolution or stagnation of the information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Next Steps</head><p>Regarding the selected texts, at the present moment, these are the only texts and only museums utilized -but there is a potential to bring in texts from an additional four museums in the specified region. As far as narratives present within museums remaining up to date with current research, the use of a qualitative approach -'close reading' -of these texts and recent articles could provide more details. Another option could be adding an additional corpus made up of recent research to analyze separately and compare results. Comparing corpus to corpus, to see potential similarities and differences between the narratives presented in museums and those produced in peer-reviewed journals. Based on the early explorations of the corpus, the final analysis filtered the corpus for any term not present in at least 2 museums and the number of topics was set at nine following metrics testing. The number of term appearances can also be adjusted, along with additional metrics testing to further evaluate the number of topics. The end result of this stage is a future direction for how this project can further develop, solidify the code and methodological framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>Our ability to interpret, perceive, and relate to the past is only possible resulting from the value and significance of surviving material and immaterial culture. The increasing accessibility of this material and immaterial culture present within museums is linked to what modern people find valuable and significant enough from the past, to hold this level of presence within modern culture -on display in the modern collective memory <ref type="bibr" target="#b12">[13]</ref>. Much like value and significance assigned to remnants of the past can change, so does the knowledge and interpretation of the surviving past change. Text analysis is a valuable tool to trace shifts in the knowledge and interpretation of archaeological heritage -including texts surrounding archaeological heritage currently residing in museums. This project will continue applying the methodological approach to better understand and disseminate current patterns of archaeological knowledge, and their respective narratives and contexts as they exist within museums.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :Figure 2 :</head><label>12</label><figDesc>Figure 1: Overall Term-Frequency</figDesc><graphic coords="5,89.28,84.17,416.72,166.47" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Top Ten Terms per Topic</figDesc><graphic coords="6,89.28,84.17,416.72,305.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Interpreted Topic Names</figDesc><table><row><cell>Topic Number</cell><cell>Topic Name</cell><cell>Terms</cell></row><row><cell>Topic 1</cell><cell>ritus funeraris</cell><cell>necropol, funer, case</cell></row><row><cell>Topic 2</cell><cell>agricultura i ramaderia</cell><cell>domestic, societ, agric, cour</cell></row><row><cell>Topic 3</cell><cell>materials</cell><cell>pedr, ceramic, silex, neolitic</cell></row><row><cell>Topic 4</cell><cell>art del bronze</cell><cell>bronz, lloc, art, cult, conserve</cell></row><row><cell>Topic 5</cell><cell cols="2">tipus d'estructures arqueològiques fet, enterr, caracter, period, estr</cell></row><row><cell>Topic 6</cell><cell>paleolític</cell><cell>paleolitic, punt, represent, comun, product</cell></row><row><cell>Topic 7</cell><cell>edat del ferro</cell><cell>epoc, ferr, cultur, anim, pobl, trob</cell></row><row><cell>Topic 8</cell><cell>art rupestre</cell><cell>object, cultur, pres, bronz, cov, decor</cell></row><row><cell>Topic 9</cell><cell>treball de camp en arqueologia</cell><cell>excav, trev, tip, jac, mater, fin</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The social and symbolic context of Neolithization</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Aubán</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">SAGVNTVM Extra</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="209" to="234" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Exhibiting archaeology: archaeology and museums</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">W</forename><surname>Barker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annual Review of Anthropology</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="293" to="308" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Topic models</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Lafferty</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Text mining. Chapman and Hall/CRC</title>
				<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="101" to="124" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The Words that Archaeologists Choose: A Maltese Case Study in Artifact Terminology, Corpus Linguistics and Discourse Analysis</title>
		<author>
			<persName><forename type="first">A</forename><surname>Burkette</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Skeates</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Mediterranean Archaeology</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Knowledge discovery through directed probabilistic topic models: a survey</title>
		<author>
			<persName><forename type="first">A</forename><surname>Daud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Muhammad</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers of computer science in China</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="280" to="301" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">tm: Text Mining Package</title>
		<author>
			<persName><forename type="first">I</forename><surname>Feinerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hornik</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=tm" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">The museum environment and the visitor experience</title>
		<author>
			<persName><forename type="first">C</forename><surname>Goulding</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">European Journal of marketing</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="issue">3/4</biblScope>
			<biblScope unit="page" from="261" to="278" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">topicmodels: Topic Models</title>
		<author>
			<persName><forename type="first">B</forename><surname>Grün</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hornik</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=topicmodels" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Archaeology and the politics of pedagogy</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Hamilakis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">World archaeology</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="287" to="309" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Museums and controversy: Some introductory reflections</title>
		<author>
			<persName><forename type="first">N</forename><surname>Harris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of American History</title>
		<imprint>
			<biblScope unit="volume">82</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="1102" to="1110" />
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Archaeology and education</title>
		<author>
			<persName><forename type="first">D</forename><surname>Henson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Key concepts in public archaeology</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="43" to="59" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Cultivation of Perception and the Emergence of the Neolithic World</title>
		<author>
			<persName><forename type="first">V.-P</forename><surname>Herva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Nordqvist</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lahelma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ikäheimo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Norwegian archaeological review</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="141" to="160" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Situating (in) significance</title>
		<author>
			<persName><forename type="first">T</forename><surname>Ireland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Schofield</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Heritage Studies</title>
		<imprint>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="826" to="844" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">The Neolithic transition in the Iberian Peninsula: data analysis and modeling</title>
		<author>
			<persName><forename type="first">N</forename><surname>Isern</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fort</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">F</forename><surname>Carvalho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>Gibaja</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">J</forename><surname>Ibañez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Archaeological Method and Theory</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="447" to="460" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Dialogues between past, present and future: Reflections on engaging the recent past</title>
		<author>
			<persName><forename type="first">S</forename><surname>Jones</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Archaeology, the Public and the Recent Past</title>
				<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="163" to="176" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Museum Best Practices for Managing Controversy</title>
		<author>
			<persName><forename type="first">O</forename><surname>Knauss</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
			<publisher>National Coalition Against Censorship</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Recent data and approaches on the Neolithization of the Iberian Peninsula</title>
		<author>
			<persName><forename type="first">Í</forename><forename type="middle">G</forename></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Lagrán</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">European Journal of Archaeology</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="429" to="453" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Solutions or illusions? An analysis of the available palaeogenetic evidence from the origins of the Neolithic in the Iberian Peninsula</title>
		<author>
			<persName><forename type="first">Í</forename><forename type="middle">G</forename></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Lagrán</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fernández-Domıńguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Rojo-Guerra</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Quaternary International</title>
		<imprint>
			<biblScope unit="volume">470</biblScope>
			<biblScope unit="page" from="353" to="368" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Teaching the past in museums</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lea</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Museums and Archaeology</title>
				<imprint>
			<publisher>Routledge</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="473" to="484" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">New developments in the use of spatial technology in archaeology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">D</forename><surname>Mccoy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">N</forename><surname>Ladefoged</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Archaeological Research</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="263" to="295" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Representing archaeological knowledge in museums: Exhibiting human origins and strategies for change</title>
		<author>
			<persName><forename type="first">S</forename><surname>Moser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Public Archaeology</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="3" to="20" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Further frontiers in GIS: Extending spatial analysis to textual sources in archaeology</title>
		<author>
			<persName><forename type="first">P</forename><surname>Murrieta-Flores</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Gregory</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Open Archaeology</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Museum communication and storytelling: articulating understandings within the museum structure</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">K</forename><surname>Nielsen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Museum management and curatorship</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="440" to="455" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">ldatuning: Tuning of the Latent Dirichlet Allocation Models Parameters</title>
		<author>
			<persName><forename type="first">M</forename><surname>Nikita</surname></persName>
		</author>
		<ptr target="https://CRAN.R-project.org/package=ldatuning" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Museums and controversy: you can&apos;t have one without the other</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">L</forename></persName>
		</author>
		<author>
			<persName><forename type="first">-A</forename><forename type="middle">M</forename><surname>O'mara</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">LDA-based topic modelling in text sentiment classification: An empirical analysis</title>
		<author>
			<persName><forename type="first">A</forename><surname>Onan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Korukoglu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bulut</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Int. J. Comput. Linguistics Appl</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="101" to="119" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">How do archaeologists write about racism? Computational text analysis of 41 years of Society for American Archaeology annual meeting abstracts</title>
		<author>
			<persName><forename type="first">G</forename><surname>Park</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L.-Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Marwick</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Antiquity</title>
		<imprint>
			<biblScope unit="volume">96</biblScope>
			<biblScope unit="page" from="696" to="709" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">A document recommendation system of stemming and stopword removal impact: A web-based application</title>
		<author>
			<persName><forename type="first">W</forename><surname>Parwita</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Physics: Conference Series</title>
		<imprint>
			<biblScope unit="volume">1469</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">12050</biblScope>
			<date type="published" when="2020">2020</date>
			<publisher>IOP Publishing</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Excavating archaeological texts: Applying digital humanities to the study of archaeological thought and banal nationalism</title>
		<author>
			<persName><forename type="first">G</forename><surname>Plets</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Huijnen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Van Oeveren</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Field Archaeology</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="289" to="302" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<author>
			<persName><forename type="first">Team</forename><surname>Core</surname></persName>
		</author>
		<ptr target="https://www.R-project.org/" />
		<title level="m">R: A Language and Environment for Statistical Computing</title>
				<meeting><address><addrLine>Vienna, Austria</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note>R Foundation for Statistical Computing</note>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Text Mining Analysis of Perception in Archaeological Landscapes: The Case of Stonehenge</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">A</forename><surname>Schwartz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 7th ACM SIGSPATIAL International Workshop on Geospatial Humanities</title>
				<meeting>the 7th ACM SIGSPATIAL International Workshop on Geospatial Humanities</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="40" to="43" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">tidytext: Text Mining and Analysis Using Tidy Data Principles in R</title>
		<author>
			<persName><forename type="first">J</forename><surname>Silge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Robinson</surname></persName>
		</author>
		<idno type="DOI">10.21105/joss.00037</idno>
		<ptr target="http://dx.doi.org/10.21105/joss.00037" />
	</analytic>
	<monogr>
		<title level="j">Joss</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Thinking about others through museums and heritage</title>
		<author>
			<persName><forename type="first">A</forename><surname>Witcomb</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Palgrave handbook of contemporary heritage research</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="130" to="143" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
