<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Text Mining to uncover Prehistoric Pastness in Museums</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Haley Anne Schwartz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paula JardónGiner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xavier RubioCampill o</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CHR 2024: Computational Humanities Research Conference</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>DIDPATRI Grup de Recerca, Pg. de la Vall d'Hebron</institution>
          ,
          <addr-line>171, 08035 Barcelona</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Departament de Didàctica de les Ciències Experimentals i Socials, Universitat de València</institution>
          ,
          <addr-line>Av. de Blasco Ibáñez, 13, El Pla del Real, 46010 València, Valencia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Departament de Didàctiques Aplicades, Universitat de Barcelona</institution>
          ,
          <addr-line>Pg. de la Vall d'Hebron, 171, 08035 Barcelona</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <fpage>130</fpage>
      <lpage>143</lpage>
      <abstract>
        <p>This paper is a presentation of a current work in progress, specifically the exploratory phase for determining a methodological framework, clear objectives, and establishing preliminary results to guide the future direction of the project. The paper sees the application of text analysis to a corpus body of texts with a focus on highlighting heritage and intersectional data present within these texts. The approach of text analysis allows for a quantitative analysis of modern perceptions of the past, narratives given to the past by modern people, and the resulting context elements of the past are placed in stemming from modern influences. With a focus on how prehistory is presented to modern people, in the specific context of museums, it is necessary to trace the contents of texts depicting the past in these museums. The overall goal of this paper is to have a deeper understanding of the impact modern narratives attributed to the past has on the prehistoric past in an educational context. Specifically, looking at narratives focused on the process of neolithization as discussed in museums. Additionally, preliminary explorations give insight into the benefits of the methodology and how to best establish next steps to propel future research.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Text Mining</kwd>
        <kwd>Topic Modelling</kwd>
        <kwd>Museums</kwd>
        <kwd>Prehistory</kwd>
        <kwd>Digital Humanities</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Museums are integral tools through which the past is understood and interpreted using
tangible scientific evidence. These institutions provide modern people with both physical and
experiential elements for developing interpretations and relationships with the p3a3s]t a[nd
educational opportunities1[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. As far as the physical, museums make accessible surviving
material culture through exhibits and displays, granting visitors direct visual and physical access
to remnants of the past [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Museums use storytelling and display techniques that cultivate an
experience for visitors to further build connection23s][. The contextualization of the past in
the form of digestible interpretations for visitors are linked to the place and time of origin for
the material culture used to aid in the storytelling and educational process of the
archaeological information available21[]. These include, but are not limited to photography, audiovisual
supplementation, digital reconstructions, and texts. These modalities consumed by visitors are
available and ripe for analysis.
      </p>
      <p>The following paper details the exploratory stage for an in-progress research project utilizing
quantitative textual analysis applied to a corpus of archaeological museum texts. This stage of
the analysis was used to formulate the best methodological approach to carry out text mining
for this corpus, set parameters, and solidify next steps in this project. This research is focused
on the types of narratives linked with production societies within museums, specifically
museums along the east of the Iberian Peninsula. The interest is in how prehistory-producing
societies are taught in museums by analyzing associated texts.</p>
      <p>As this is the first stage of the project all results are beneficial to evaluate the data, any need
for additional data, and exploring the research questions through quantitative text analysis.
The proposed questions are: 1) Through the texts produced in these museums is it possible
to determine the key issues from those societies and periods? 2) Do the narratives fit with
the current accepted research surrounding these societies and periods? 3) Are any problems
from past societies related to problems of the present? and 4) How is the information treated
and presented within these museums? These early results are indicative for next steps in this
research project.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <sec id="sec-2-1">
        <title>2.1. Archaeological Heritage Discourse and Text Analysis</title>
        <p>
          In the context of archaeological heritage, textual sources are an important source highlighting
shifting narratives, contexts, and interpretations linked to tangible and intangible
archaeological heritage – including landscapes, surviving sites, museums, etc. Discourse studies and
continually evolving quantitative methods continue to highlight the benefits of textual analysis
applied to archaeological heritage dat2a9[
          <xref ref-type="bibr" rid="ref27 ref31 ref4">, 4, 27, 31</xref>
          ]. Museums contain a surplus of textual
sources discussing tangible and intangible archaeological heritage.
        </p>
        <p>
          In previous research, this method has been used to discern how archaeologists discuss
social issues over time 2[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], temporal shifts within academic articles about archaeological
heritage landscapes [schwartz2023text ] and tracing geographical dispersion of archaeological
research in regions using historical archive data and newspaper collectio22n]s. [Applying
textual analysis tools on museum texts aids in uncovering subjectivity and biases within museums
in how archaeological knowledge is presented to visitors.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Museums as facilitators of Memory and Relationships with the Past</title>
        <p>
          Data from surveys and other research provides insight into how modern people cultivate
relationships with the past through perceptions, interpretations, and context in which people are
exposed to the past [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. These interpretations are correlated with individual experiences and
cultural constructs illustrating that the past is seen by modern people not through the lens of
what the past was but through the lens of the present2[]. Our ability to connect with the past
is relegated to individual and group perceptions, stemming from the context in which the past
is interpreted and presented . There is a cycle in the interpretation, presentation, perception,
and reinterpretation of the past that is dependent upon the era, new discoveries, new research,
and shifting cultural structures.
        </p>
        <p>
          Looking at museums as agents facilitating that cycle, it is important to look to the
existing presentation of the past within these museums. In order to determine what constitutes the
narratives currently presented to people about the past, to see the efects of archaeological
education with this modality1[
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. It is necessary to make clear that archaeological knowledge – in
whichever context – is a produced knowledge linked to era, bias, interpretation, and existing
research and discoveries9[]. Any and all educational pursuits are carried out subjectively, which
is further reflected in museums – that span countries, cultures, and contexts. Furthermore, the
dissemination of archaeological knowledge – in various stages of production or reproduction
– is heavily influenced through social methods rather than more passive technique9s][. The
more the presentation of the past is considered in the museum context, the more insight we
gain into the specific educational methods and narratives consumed by visitors.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. The Context: Archaeological Museums and Neolithization in the Iberian</title>
      </sec>
      <sec id="sec-2-4">
        <title>Peninsula</title>
        <p>Texts appearing within museums linked to the development of early farming societies and
metal age societies– including discussions on the neolithization process – were utilized in this
study. This project aims to gain a better understanding of the words and concepts linked to
this part of the past. Specifically, the way in which narratives of neolithization and early
producing societies are presented to the public in museums. Within the Iberian Peninsula, there
are regional-specific variations and circumstances in which the overall development of these
production societies occurred1][. There are diferences in which neolithization impacted
specific regions and the people within, in conjunction with clear neolithic traits not dependent on
a region (i.e., farming practices, economic structures, technological advancements, etc1.2)].[
Additionally, changes to social structures and interactions between others which impacted the
role of reciprocity, growth of social inequalities, and the development of social netwo1]r,ks [
will be explored in future research.</p>
        <p>
          Within the Iberian Peninsula, new research and discoveries focused on the regio1n4, [
          <xref ref-type="bibr" rid="ref17 ref18">17,
18</xref>
          ] brings attention to the museums displaying these processes and. As the production and
reproduction of this type of archaeological knowledge evolves, how quickly and accurately
does this knowledge evolve within the museums. Analyzing museum texts allows one to see
what contexts and narratives surrounding production societies and neolithization in the Iberian
Peninsula is presented.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Materials and Methods</title>
      <sec id="sec-3-1">
        <title>3.1. Materials</title>
        <p>
          The corpus used in this project was built from a collection of texts in Català, compiled from
permanent exhibitions appearing in eight museums over a period of thirty-years along the east
of the Iberian Peninsula. These texts – with an overall sum of twenty-five thousand words –
were extrapolated from panels that describe the diferent themes of the museum. They are
explanatory texts of general Neolithic culture characterizations (i.e., farming, work techniques,
religion, etc.). They exclude display cases as the sole interest is the main narratives presented
to visitors. The texts are general descriptors of Neolithic elements, with a select few descriptors
of specific sites. R Statistical Software (v4.2.2)[
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] was used to carry out all processing, analysis,
and visualizations of the texts.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Methods</title>
        <p>
          The texts were all collected using the same process of OCR with manual and automatic
inference. The individual texts were photographed and converted using OCR, with the texts
manually verified when necessary. During this process, it was decided to record all texts as
a single document per museum. The reasoning is each museum splits the content diferently.
Therefore, a side-by-side comparison of each text (i.e., panel) would not be useful and would
include strong biases (i.e., museums where text is split in several panels would have a higher
weight that other ones where text is concentrated on a few large panels). Following the text
collection, Topic modelling was applied as a type of ‘distant reading’ to trace linguistic patterns
regarding the storytelling process between diferent museums presentation of archaeological
knowledge. R was used to explore content, word appearance, word frequency, and context. All
texts were loaded into, processed, and the corpus built within R using a number of packages:
tidytext (v0.3.4)[
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], tm (v0.7.9)[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] and topicmodels (v0.2.13)8[].
        </p>
        <p>
          Topic modelling was selected for its machine learning prowess in the sorting and
classification of documents in a corpus, assigning topics, and highlighting temporal distribution of topics
[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The process of topic modelling itself is a Bayesian analytic approach which is capable of
identifying semantic structures within documents that make up a corpus and then reorder the
entities within a corpus (words, documents, topics) based on probability distribution between
the entities of a corpus (topics linked with words, documents linked with topic3s,)2[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. There
are a range of methods for carrying out topic modelling with Latent Dirichlet allocation (LDA)
chosen due to how the algorithm searches for topics, assigns topics, and having unstructured
topics which fits best with both the small size of the corpus and the datatype 2[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].Previous
research highlighted the benefits of applying LDA to archaeological heritage text2s7[
          <xref ref-type="bibr" rid="ref31">, 31</xref>
          ].
        </p>
        <p>
          Previous research was consulted to determine a preprocessing chain to help remove bias and
noise as much as possible. A metrics test was run to determine the appropriate number of topics
per the corpus, parameters of utilizing stopwords and applying lemmatization, and training an
LDA model was decided upon using previous research as guidelin2e6s, [
          <xref ref-type="bibr" rid="ref27 ref31">27, 31</xref>
          ]. Once the
parameters were set, metrics tests were run, and the model trained – the processed corpus
was analyzed using the previously mentioned R packages in conjunction with the additional
package ldatuning (v1.0.2) [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] to run the LDA algorithm.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Preliminary Explorations</title>
      <p>This first exploration of the data used text mining techniques using R to analyze word frequency
and occurrence. The focus was to find the overall most frequently appearing terms throughout
the corpus and then the most frequently occurring terms for each text per museum. Looking
ifrst at the frequency of term appearance overall, as seen in Figure 1, the entirety of the terms
present are of importance. The terms all are indicative and relative to the overall process of
neolithization. As the terms are in Català, translations include: neo(lnietioclithic), pedra(stone),
poblats(villages), cova/coves(cave(s)), restes(remains), edat(age), silex(flint) , objectes(objects).</p>
      <p>Now looking at the frequency of word appearance per text per museum, diferences in the
narratives per museum are discernible within Figure 2. For starters, in the entries for M BBAA
Castelló(Museo de Bellas Artes de Castellón), key terms include: cour(ecopper), vida(life),
canvis(changes), and all centered around the province Castelló – also a term – located in Valencia.
Whereas, in the entries for MAC Barcelon(aMuseu d’Arqueologia de Catalunya, Barcelona),
key terms are: sílex(flint) , cabanes(huts), and fossa(moats).The last example are the entries for
Museu Lleida( Museu de Lleida) with the terms: restes(remains), eines(tools), lloc(site) all
pertaining to Minferri – an additional term – which is an Early Bronze Age settlement located
in Lleida. Just from this intervention, there is a delineation between narratives in museums –
linked to location and elements of value in each location.</p>
      <p>The next step was running topic modelling. In this iteration, the model is made up of nine
topics composed of the ten most frequently occurring terms per topic. These preliminary
explorations made visible the types of topics within the corpus and the trends in each topic, as
illustrated in Figure 3.</p>
      <p>Inference based on the terms give insight to the general theme of the topic itself. For example,
in Topic 4 terms include: llo(pclace), bronz(bronze), vid(life), cult(worship), and art – which
indicates the context of art and culture within the bronze age. Whereas in topic 8 key terms
are: cov(cave), cultur(culture), décor(decoration), and bronz(bronze) – leading to infer this topic
pertains to cave art during this period. Additionally, it is through interpreting these topics that
names are developed as a result. Proposed names per topic are presented in Table 1.</p>
      <p>Focusing on the themes within each of the topics highlight contexts within the text, of what
terms were occurring in proximity together. These early results provided the researchers of
the project the chance to consider potential changes to the dataset, adjustments to the
methodological approach, and other ways of analyzing the contents of the texts.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>
        Museums are one example of a center for archaeological heritage discourse in which
individuals can experience material and immaterial culture which helps develop one’s relationship
with the past [
        <xref ref-type="bibr" rid="ref16 ref25">25, 16</xref>
        ]. Focusing on archaeological museums specifically, they are a respected
place in which visitors experience, perceive, and learn about archaeological knowledge in a
public setting. Before continuing further, it is important to note the many criticisms and
controversies that are necessary in museum discourse today regarding problems and unethical
museum-practices of the past – and in some cases – the present1[0]. Regarding the scope of
this paper as the exploratory phase, these points will not be discussed further presently but are
integral to consider and include in the future.
      </p>
      <p>
        Museums are integral with cultivating memory of the past seen through the interpretations
used to create narratives presented to visitor1s5[]. Yet, narratives surrounding the past are not
stagnant and as new information becomes available, new technologies allow for more complex
study [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. It is necessary to trace shifts in changing narratives and interpretations, looking at
how often museums update their presentations and exhibits [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Archaeological heritage
discourse sheds light on the narratives, contexts, and perceptions of the dominating views of the
time – and that extends to discourse in museums. Texts existing within museums should be
utilized to the same degree as other archaeological heritage textual sources to gain insight into the
untapped data within. These first results are valuable for reviewing the current methodology
process and determining how the project can best progress.
      </p>
      <p>As far as assessing how text analysis can illuminate key issues from the past as it is
connected to early production societies, the present themes and word frequency illuminate the
focal point of the narratives and interpretations presented to visitors. If museums are
determined to provide visitors with insight into all facets of the past, terms and themes related to
said issues should be visible through text mining. At this stage of the project, any answers to
research questions are preliminary and are more valuable in determining next steps. Again,
the primary focus has been on the efÏcacy of the methods, potential changes to the data, and
what adjustments need to be made for parameters.</p>
      <sec id="sec-5-1">
        <title>5.1. Challenges</title>
        <p>
          Originally, during the preprocessing of the texts before building the corpus, there was a limit to
the cleaning of the text. In part, this is due to the text in Català and the need to see how the
language is recognized within R. There was no general list of stopwords for Català, though there
is one in English. Nor a personalized list of stopwords, something that would aid in removing
noise [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. Early explorations indicated the necessity and additions to the preprocessing chain
that include: general stopwords (i.e., s’(hini), d’un(of one), and s’han(they have), lemmatization,
and removing special characters. While these changes were employed and explored, more
intervention into the preprocessing chain in the context of working with a non-English language
is needed.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Exploratory Results</title>
        <p>The preliminary figures illustrate how with the strengthening of the methodological approach,
the research questions can be answered through a quantitative analysis. With the goal of
exploring the extent current exhibitions ofering this archaeological knowledge consider all
advances of knowledge over the last three decades.</p>
        <p>With Figure 1, focusing on early production societies and neolithization, the terms we can
expect about these peoples and societies in general are present (i.e., ped(srtaone), poblats(villages),
cerámica(ceramics), silex(flint) . For Figure 2, it is further possible to discern the relationship
between topics and museums. For example, looking at M BBAA Castelló, in this area
neolithization is in relation to animals, with habitat more often in caves which is reflective of the
mountain region. Where, MAC Barcelona sees terms that fit more describing the neolithic cultures
in general within Catalunya, with more specialized concepts used by archaeologists. This is
reflective of the museum having an academic and technological discourse.</p>
        <p>Regarding these texts reflecting narratives in accepted research, there are discrepancies. For
example, within these texts there is no mention of Mesolithic peoples, nor any mention of the
transformations of neolithic landscapes. There is also a focus on specific neolithic activities
that take precedence over others. There is a larger appearance of farming compared to other
neolithic activities, such as herding. Additionally, there is no mention of the impact of
neolithic on prehistoric landscapes. This reflects that some issues are the importance of purely
economic activities and technology beyond other types of traits that define neolithic (i.e.,
social structures, sexual division of labor, environmental dynamics, etc.). Furthermore, with the
goal of being able to interpret and understand the treatment and presentation of these texts,
additional analysis can consult linked metadata or comparing the evolution or stagnation of
the information.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Next Steps</title>
        <p>Regarding the selected texts, at the present moment, these are the only texts and only museums
utilized – but there is a potential to bring in texts from an additional four museums in the
specified region. As far as narratives present within museums remaining up to date with current
research, the use of a qualitative approach – ‘close reading’ – of these texts and recent articles
could provide more details. Another option could be adding an additional corpus made up of
recent research to analyze separately and compare results. Comparing corpus to corpus, to see
potential similarities and diferences between the narratives presented in museums and those
produced in peer-reviewed journals. Based on the early explorations of the corpus, the final
analysis filtered the corpus for any term not present in at least 2 museums and the number of
topics was set at nine following metrics testing. The number of term appearances can also be
adjusted, along with additional metrics testing to further evaluate the number of topics. The
end result of this stage is a future direction for how this project can further develop, solidify
the code and methodological framework.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>Our ability to interpret, perceive, and relate to the past is only possible resulting from the
value and significance of surviving material and immaterial culture. The increasing
accessibility of this material and immaterial culture present within museums is linked to what modern
people find valuable and significant enough from the past, to hold this level of presence within
modern culture – on display in the modern collective memo1r3y].[Much like value and
significance assigned to remnants of the past can change, so does the knowledge and interpretation
of the surviving past change. Text analysis is a valuable tool to trace shifts in the knowledge
and interpretation of archaeological heritage – including texts surrounding archaeological
heritage currently residing in museums. This project will continue applying the methodological
approach to better understand and disseminate current patterns of archaeological knowledge,
and their respective narratives and contexts as they exist within museums.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. B.</given-names>
            <surname>Aubán</surname>
          </string-name>
          . “
          <article-title>The social and symbolic context of Neolithization”</article-title>
          .
          <source>ISnA:GVNTVM Extra 5</source>
          (
          <year>2002</year>
          ), pp.
          <fpage>209</fpage>
          -
          <lpage>234</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Barker</surname>
          </string-name>
          . “
          <article-title>Exhibiting archaeology: archaeology and museums”</article-title>
          .
          <source>AInn:nual Review of Anthropology 39.1</source>
          (
          <issue>2010</issue>
          ), pp.
          <fpage>293</fpage>
          -
          <lpage>308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Blei</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Laferty</surname>
          </string-name>
          . “
          <article-title>Topic models”</article-title>
          .
          <source>In: Text mining. Chapman and Hall/CRC</source>
          ,
          <year>2009</year>
          , pp.
          <fpage>101</fpage>
          -
          <lpage>124</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Burkette</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Skeates</surname>
          </string-name>
          . “
          <article-title>The Words that Archaeologists Choose: A Maltese Case Study in Artifact Terminology, Corpus Linguistics and Discourse Analysis”</article-title>
          .
          <source>JIonu:rnal of Mediterranean Archaeology 35.1</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Daud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Muhammad</surname>
          </string-name>
          . “
          <article-title>Knowledge discovery through directed probabilistic topic models: a survey”</article-title>
          .
          <source>InF:rontiers of computer science in China 4</source>
          (
          <year>2010</year>
          ), pp.
          <fpage>280</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>I.</given-names>
            <surname>Feinerer</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Hornikt.m: Text Mining Package</surname>
          </string-name>
          .
          <year>2022</year>
          . url: https://CRAN.R-project.
          <source>o rg/package=tm.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Goulding</surname>
          </string-name>
          . “
          <article-title>The museum environment and the visitor experience”</article-title>
          .
          <source>IEnu:ropean Journal of marketing 34.3/4</source>
          (
          <year>2000</year>
          ), pp.
          <fpage>261</fpage>
          -
          <lpage>278</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>B.</given-names>
            <surname>Grün</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Hornik</surname>
          </string-name>
          .topicmodels: Topic Models.
          <year>2022</year>
          . url: https://CRAN.R-project.or g/package=topicmodels.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hamilakis</surname>
          </string-name>
          . “
          <article-title>Archaeology and the politics of pedagogy”</article-title>
          .
          <source>IWn:orld archaeology 36.2</source>
          (
          <issue>2004</issue>
          ), pp.
          <fpage>287</fpage>
          -
          <lpage>309</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Harris</surname>
          </string-name>
          . “
          <article-title>Museums and controversy: Some introductory reflections”</article-title>
          .
          <source>ITnh: e Journal of American History 82.3</source>
          (
          <issue>1995</issue>
          ), pp.
          <fpage>1102</fpage>
          -
          <lpage>1110</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D.</given-names>
            <surname>Henson</surname>
          </string-name>
          . “
          <article-title>Archaeology and education”. InK: ey concepts in public archaeology (</article-title>
          <year>2017</year>
          ), pp.
          <fpage>43</fpage>
          -
          <lpage>59</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>V.-P.</given-names>
            <surname>Herva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Nordqvist</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lahelma</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Ikäheimo</surname>
          </string-name>
          . “
          <article-title>Cultivation of Perception and the Emergence of the Neolithic World”</article-title>
          .
          <source>InN:orwegian archaeological review 47.2</source>
          (
          <issue>2014</issue>
          ), pp.
          <fpage>141</fpage>
          -
          <lpage>160</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ireland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brown</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Schofield</surname>
          </string-name>
          . “
          <article-title>Situating (in) significance”</article-title>
          .
          <source>In: International Journal of Heritage Studies 26.9</source>
          (
          <issue>2020</issue>
          ), pp.
          <fpage>826</fpage>
          -
          <lpage>844</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>N.</given-names>
            <surname>Isern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. F.</given-names>
            <surname>Gibaja</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Ibañez</surname>
          </string-name>
          . “
          <article-title>The Neolithic transition in the Iberian Peninsula: data analysis and modeling”</article-title>
          .
          <source>IJno:urnal of Archaeological Method and Theory</source>
          <volume>21</volume>
          (
          <year>2014</year>
          ), pp.
          <fpage>447</fpage>
          -
          <lpage>460</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Jones</surname>
          </string-name>
          . “
          <article-title>Dialogues between past, present and future: Reflections on engaging the recent past”</article-title>
          .
          <source>In: Archaeology, the Public and the Recent Past</source>
          (
          <year>2013</year>
          ), pp.
          <fpage>163</fpage>
          -
          <lpage>176</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>O.</given-names>
            <surname>Knauss</surname>
          </string-name>
          . “
          <article-title>Museum Best Practices for Managing Controversy”</article-title>
          .
          <source>InN:ational Coalition Against Censorship</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Í. G.-M. de Lagrán</surname>
          </string-name>
          . “
          <article-title>Recent data and approaches on the Neolithization of the Iberian Peninsula”</article-title>
          .
          <source>In:European Journal of Archaeology 18.3</source>
          (
          <issue>2015</issue>
          ), pp.
          <fpage>429</fpage>
          -
          <lpage>453</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Í. G.-M. de Lagrán</surname>
            , E. Fernández-Domńıguez, and
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Rojo-Guerra</surname>
          </string-name>
          .
          <article-title>“Solutions or illusions? An analysis of the available palaeogenetic evidence from the origins of the Neolithic in the Iberian Peninsula”</article-title>
          .
          <source>InQ: uaternary International</source>
          <volume>470</volume>
          (
          <year>2018</year>
          ), pp.
          <fpage>353</fpage>
          -
          <lpage>368</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lea</surname>
          </string-name>
          .
          <article-title>“Teaching the past in museums”</article-title>
          .
          <source>In: Museums and Archaeology. Routledge</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>473</fpage>
          -
          <lpage>484</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>M. D. McCoy</surname>
            and
            <given-names>T. N.</given-names>
          </string-name>
          <string-name>
            <surname>Ladefoged</surname>
          </string-name>
          . “
          <article-title>New developments in the use of spatial technology in archaeology”</article-title>
          .
          <source>In:Journal of Archaeological Research</source>
          <volume>17</volume>
          (
          <year>2009</year>
          ), pp.
          <fpage>263</fpage>
          -
          <lpage>295</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Moser</surname>
          </string-name>
          . “
          <article-title>Representing archaeological knowledge in museums: Exhibiting human origins and strategies for change”</article-title>
          .
          <source>In:Public Archaeology 3.1</source>
          (
          <issue>2003</issue>
          ), pp.
          <fpage>3</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>P.</given-names>
            <surname>Murrieta-Flores</surname>
          </string-name>
          and
          <string-name>
            <surname>I. Gregory.</surname>
          </string-name>
          “
          <article-title>Further frontiers in GIS: Extending spatial analysis to textual sources in archaeology”</article-title>
          .
          <source>InO:pen Archaeology 1.1</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>J. K.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          . “
          <article-title>Museum communication and storytelling: articulating understandings within the museum structure”</article-title>
          .
          <source>In:Museum management and curatorship 32.5</source>
          (
          <issue>2017</issue>
          ), pp.
          <fpage>440</fpage>
          -
          <lpage>455</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>M.</given-names>
            <surname>Nikita</surname>
          </string-name>
          .
          <source>ldatuning: Tuning of the Latent Dirichlet Allocation Models Parameters</source>
          .
          <year>2020</year>
          . url: https://CRAN.R-project.org/package=ldatunin.g
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>B. L.-A. M. O'Mara</surname>
          </string-name>
          . “
          <article-title>Museums and controversy: you can't have one without the other</article-title>
          .”
          <source>PhD thesis</source>
          .
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>Onan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Korukoglu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Bulut</surname>
          </string-name>
          . “
          <article-title>LDA-based topic modelling in text sentiment classification: An empirical analysis</article-title>
          .
          <source>” In:Int. J. Comput. Linguistics Appl</source>
          .
          <volume>7</volume>
          .
          <issue>1</issue>
          (
          <issue>2016</issue>
          ), pp.
          <fpage>101</fpage>
          -
          <lpage>119</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>G.</given-names>
            <surname>Park</surname>
          </string-name>
          , L.-
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Marwick</surname>
          </string-name>
          . “
          <article-title>How do archaeologists write about racism? Computational text analysis of 41 years of Society for American Archaeology annual meeting abstracts”</article-title>
          .
          <source>In: Antiquity</source>
          <volume>96</volume>
          .387 (
          <year>2022</year>
          ), pp.
          <fpage>696</fpage>
          -
          <lpage>709</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>W.</given-names>
            <surname>Parwita</surname>
          </string-name>
          . “
          <article-title>A document recommendation system of stemming and stopword removal impact: A web-based application”</article-title>
          .
          <source>In:Journal of Physics: Conference Series</source>
          . Vol.
          <volume>1469</volume>
          . 1.
          <string-name>
            <given-names>IOP</given-names>
            <surname>Publishing</surname>
          </string-name>
          .
          <year>2020</year>
          , p.
          <fpage>012050</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>G.</given-names>
            <surname>Plets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Huijnen</surname>
          </string-name>
          , and
          <string-name>
            <surname>D. van Oeveren. “</surname>
          </string-name>
          <article-title>Excavating archaeological texts: Applying digital humanities to the study of archaeological thought and banal nationalism”</article-title>
          .
          <source>In: Journal of Field Archaeology 46.5</source>
          (
          <issue>2021</issue>
          ), pp.
          <fpage>289</fpage>
          -
          <lpage>302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>R Core</given-names>
            <surname>Team. R:</surname>
          </string-name>
          <article-title>A Language and Environment for Statistical Computing. R Foundation for Statistical Computing</article-title>
          . Vienna, Austria,
          <year>2022</year>
          . urlh:ttps://www.R-project.org./
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Schwartz</surname>
          </string-name>
          . “
          <article-title>Text Mining Analysis of Perception in Archaeological Landscapes: The Case of Stonehenge”</article-title>
          .
          <source>In: Proceedings of the 7th ACM SIGSPATIAL International Workshop on Geospatial Humanities</source>
          .
          <year>2023</year>
          , pp.
          <fpage>40</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J.</given-names>
            <surname>Silge</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Robinson</surname>
          </string-name>
          .
          <article-title>“tidytext: Text Mining and Analysis Using Tidy Data Principles in R”</article-title>
          .
          <source>In: Joss 1.3</source>
          (
          <year>2016</year>
          ). doi:
          <volume>10</volume>
          .21105/joss.00037. url: http://dx.doi.org/10.21105/joss.0
          <fpage>0037</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>