<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Impact of Digital Analysis and Large Language Models in Digital Humanity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Cigliano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesca Fallucchi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Gerardi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Engineering Science, Guglielmo Marconi University</institution>
          ,
          <addr-line>00193 Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Leibniz Institute for Educational Media, Georg Eckert Institute</institution>
          ,
          <addr-line>38118 Brunswick</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Core Technologies: Artificial Intelligence and Machine Learning [2</institution>
          ,
          <addr-line>3], Neural networks and trasformers [4, 5], Natural Language Processing [6], Big Data Analytics [7, 8], Cloud Computing [9, 10]</addr-line>
          ,
          <institution>Blockchain. • Research Tools: Text Analysis Software, Data Visualization Tools, Geographic Information Systems(GIS)</institution>
          ,
          <addr-line>Digital Asset Management Systems, Collaborative Platforms</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The advent of digital analysis tools and Large Language Models (LLMs) has significantly altered the landscape of digital humanities, introducing new methodologies for processing and interpreting vast amounts of data. This paper provides a comprehensive analysis of these technologies, examining their implications for research within digital humanities. We focus on the transformative efects of digital analysis tools and LLMs, assessing their potential to enhance understanding and accessibility of complex humanistic data, while also discussing the inherent challenges and ethical considerations these technologies introduce. By analyzing case studies and reviewing recent developments in the field, this study aims to provide a nuanced understanding of how digital analysis and LLMs is reshaping scholarly practices in humanities disciplines.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;LLM</kwd>
        <kwd>Large Language Model</kwd>
        <kwd>NLP</kwd>
        <kwd>Natural Language Model</kwd>
        <kwd>GenI</kwd>
        <kwd>Generative Artificial Intelligence</kwd>
        <kwd>DH</kwd>
        <kwd>Digital Humanity</kwd>
        <kwd>AI</kwd>
        <kwd>Artificial Intelligence</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Digital humanities integrate computational tools into
traditional humanities disciplines to explore new research
methodologies. The incorporation of advanced digital
analysis and LLMs promises substantial enhancements in text
analysis, data visualization, and cultural data analytics. This
paper aims to systematically evaluate these impacts,
ofering insights into both the advancements and complications
posed by these technologies. Digital humanities encompass
a multidisciplinary field where digital technology intersects
with arts, humanities, and social sciences research. The
advent of advanced AI technologies, especially LLMs like
OpenAI’s GPT series, has introduced new tools for textual
analysis, data interpretation, and even the generation of
human-like text, ofering both unprecedented
opportunities and significant challenges. This paper explores these
dynamics, ofering insights into the integration of LLMs in
DH and discussing the broader implications for researchers.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Landscape</title>
      <p>
        Landscape represents the interconnected technologies,
methodologies, and considerations that shape the field of
DH [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Each of these areas contributes to how
humanities scholars engage with digital tools and data to conduct
research, preserve cultural heritage, and disseminate
knowledge. We can distinguish the following components of a
landscape relating to the DH:
      </p>
      <sec id="sec-2-1">
        <title>2.1. Digital Analysis</title>
        <p>Digital analysis involves the use of software and algorithms
to analyze cultural and historical data. It includes
techniques like data mining, visualization, and statistical
analysis, which allow scholars to uncover patterns and trends
that are not apparent through traditional methods.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Large Language Models (LLMs)</title>
        <p>
          LLMs like OpenAI’s GPT-3 represent a breakthrough in
natural language processing technology from November 2022.
These models, trained on extensive datasets, are capable of
generating coherent text, completing linguistic tasks, and
providing new tools for textual interpretation and
generation in the humanities. Generative AI refers to algorithms
capable of creating content, from text to images, by learning
from large datasets. LLMs, a subset of generative AI, are
trained on diverse internet corpora [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] and can generate
coherent, contextually relevant text based on input prompts.
Models such as GPT-3 have demonstrated capabilities that
include answering questions, writing essays, summarizing
texts, and more. Generative AI and Large Language Models
(LLMs) have gained significant attention within the digital
humanities due to their ability to process and generate large
volumes of text-based data. Their classification within this
context can be broadly organized into diferent categories
based on their applications, methodologies, and impacts.
Here’s an overview of how Generative AI and LLMs are
classified in the digital humanities.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>
        This study synthesizes data from various case studies,
peerreviewed articles, and firsthand experiments with LLMs
in digital humanities projects. Through qualitative and
quantitative analysis [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], we assess how these technologies
are currently applied and theorize their future trajectories
within the field.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Applications of Digital Analysis and LLMs</title>
      <p>
        The applications of digital analysis tools and Large Language
Models (LLMs) in digital humanities are vast and diverse.
These technologies have revolutionized how researchers,
educators, and practitioners approach the humanities,
ofering new insights and methodologies that were previously
unattainable. Below is an exploration of several key
applications and how they impact the field of digital humanities.
This is particularly useful in experimental literature,
automated storytelling, and the creation of dialogue or narrative
for digital artifacts [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. These models are adept at
interpreting and providing context to vast archives of text,
helping scholars identify themes, trends, and patterns across
large datasets that would be unmanageable for manual
review. In this context we can distinguish the following
subapplications:
• Semantic Analysis: LLMs can analyze large bodies
of text to understand contextual meanings, identify
themes, and detect nuances in language that are
often missed by traditional analysis methods. This
capability is crucial for Literary analysis, historical
documentation, and cultural studies.
• Sentiment Analysis: Digital tools can assess the
sentiment of historical texts, literary works, and
even large collections of social media posts to gauge
public sentiments over time or reactions to specific
events or figures in history.
• Stylistic Analysis: LLMs are used to study stylistic
elements of writing across diferent time periods or
authors. This application helps in authorship
attribution and understanding the evolution of language
and writing styles in literature.
      </p>
      <sec id="sec-4-1">
        <title>4.0.2. Data Mining and Big Data Analysis</title>
        <p>
          We can distinguish two macro components: Pattern
Recognition, in this case Digital analysis tools can sift through
vast amounts of data to identify patterns and trends [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
In historical studies, for example, this can reveal migration
patterns, economic changes, or social dynamics of a
particular era. Network Analysis, where in digital humanities,
network analysis tools can map relationships among various
entities within literature, such as characters in novels,
historical figures, or even concepts and ideas that are prevalent
within a specific cultural or intellectual community.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.0.3. Language Translation and Transcription</title>
        <p>Machine Translation in LLMs models, provide tools for
translating rare or endangered languages, making ancient
manuscripts or texts accessible to a global audience. This
application is particularly significant in preserving
linguistic heritage and making non-English academic resources
available to a broader audience. Another case is Automatic
Transcription, where LLMs and digital tools automate the
transcription of audio recordings into text, such as oral
histories and lectures, which can then be analyzed or archived
digitally for future research. In general LLMs and GenAI
aids in transcribing handwritten or archaic documents into
digital formats, which are then more accessible for
analysis and interpretation(Historical Text Transcription). AI
models facilitate the translation of texts across languages,
making non-native and ancient literature accessible to a
global audience without the immediate need for human
translators(Translation).</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.0.4. Digital Archiving and Preservation</title>
        <p>We can distinguish Document Digitization and
Categorization, in this case Digital tools automate the scanning and
categorization of physical documents into digital archives.
LLMs can enhance this process by automatically tagging and
organizing the documents based on their content.
Preservation of Digital Artifacts: Digital humanities also involve the
preservation of digital art and online culture, where digital
analysis tools help in archiving web-based art and social
media content.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.0.5. Semantic and Sentiment Analysis</title>
        <p>Understanding the deeper meanings, connotations,
and contexts of words in historical and contemporary
texts(Semantic Analysis). Analyzing the emotions or
sentiments expressed in texts, which can be particularly
useful in studies of literature, social media, and historical
documents to gauge public opinion and cultural trends over
time(Sentiment Analysis).</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.0.6. Data Visualization and Mapping</title>
        <p>AI can generate dynamic visual representations of textual
data, making it easier for researchers to explore complex
datasets and uncover relationships (Interactive Data
Displays). Linking textual data with geographical information
systems (GIS) to map the occurrence of certain themes or
discussions across diferent locations(Geospatial Mapping).</p>
      </sec>
      <sec id="sec-4-6">
        <title>4.0.7. Ethical and Responsible AI Use</title>
        <p>
          In digital humanities, the classification of AI tools also
involves identifying potential biases in AI-generated content
and developing methodologies to mitigate these biases (Bias
Mitigation) [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Using AI to verify the authenticity of
digital artifacts and trace their historical and cultural
origins(Authenticity and Provenance Verification).
1–7
        </p>
      </sec>
      <sec id="sec-4-7">
        <title>4.0.8. Cultural Analytics and Critique</title>
        <p>LLMs identify and analyze patterns in art, music, literature,
and other cultural artifacts over large temporal and spatial
scales (Cultural Pattern Recognition). This includes using
AI to critique and analyze its own impact on digital
humanities, assessing how technology shapes cultural and societal
understanding(Critical AI Studies).</p>
      </sec>
      <sec id="sec-4-8">
        <title>4.0.9. Archival and Curation Tools</title>
        <p>
          Digital Archiving: AI facilitates the organization,
categorization, and searchability of digital archives, enhancing the
accessibility of vast digital collections. Curatorial Assistance:
AI tools assist in curating digital exhibitions and virtual
museum tours by selecting themes and artifacts based on
scholarly input and visitor interactions. These categories
relfect the multifaceted roles that generative AI and LLMs play
in the digital humanities [
          <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
          ]. They not only enhance
traditional research methodologies but also introduce new
modes of scholarly inquiry and interaction with digital texts
and datasets. As these technologies evolve, their
classification may expand to include more specialized functions and
applications tailored to the nuanced needs of humanities
research.
        </p>
      </sec>
      <sec id="sec-4-9">
        <title>4.0.10. Creation of Digital Exhibits and Virtual</title>
      </sec>
      <sec id="sec-4-10">
        <title>Spaces</title>
        <p>
          Interactive Exhibits: Museums and educational institutions
utilize digital tools to create virtual tours and interactive
exhibits that allow users to explore artifacts, artworks, and
historical documents from their devices, enhancing
accessibility and engagement. Virtual Reality (VR) and Augmented
Reality (AR): These technologies are employed to create
immersive educational experiences, such as virtual reality
environments that simulate historical sites or events [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
      </sec>
      <sec id="sec-4-11">
        <title>4.0.11. Educational Tools and E-Learning</title>
        <p>Adaptive Learning Systems: Digital analysis tools can help
develop educational platforms that adapt to the learning
pace and style of individual students, particularly in
humanities education, by analyzing student interactions and
providing customized feedback. Online Courses and
Workshops: LLMs can assist in the creation of educational
content, generating reading materials, quizzes, and interactive
discussions that facilitate online learning in the humanities.</p>
      </sec>
      <sec id="sec-4-12">
        <title>4.0.12. Public Engagement and Outreach</title>
        <p>Social Media Analysis: Digital tools analyze social media
data to study contemporary cultural trends and public
opinions, providing humanities researchers with real-time data
on human behavior and societal changes. Crowdsourcing
and Collaborative Research: Platforms designed with the
help of digital tools enable collaborative projects where
scholars and the public can contribute to and engage with
humanities research globally.</p>
        <sec id="sec-4-12-1">
          <title>4.1. Digital Humanities: Scope and Tools</title>
          <p>
            Digital humanities involve applying digital tools to
humanities subjects, enabling new research methodologies like text
mining, machine learning, and data visualization. These
tools facilitate the exploration of cultural patterns,
historical data, and literary analysis at scales previously
unmanageable. DH is an interdisciplinary field that merges the
techniques and methodologies of the digital world with
the study of the humanities. This field includes, but is not
limited to, disciplines like literature, history, art, music,
philosophy, cultural studies, and linguistics. The scope of DH
is vast, integrating computational methods into the
traditional humanities to enhance research, teaching, and the
dissemination of knowledge.
4.1.1. Enhanced Research Methods
• Textual Analysis: Digital tools allow for the analysis
of large volumes of text, enabling scholars to uncover
patterns, trends, and insights at a scale not feasible
manually;
• Historical GIS (Geographic Information Systems):
Integrates geographical data into humanities research,
allowing for the spatial analysis of historical events
and cultural developments.
• Cultural Analytics: Uses data analysis to study
patterns in visual culture, music, and other art forms,
identifying trends and influences over time.
4.1.2. Interdisciplinary Collaboration
• Promotes collaboration across disciplines,
combining methods from social sciences, computer sciences,
and humanities to tackle complex research
questions.
• Encourages partnerships between academicians,
technologists, and sometimes the public, fostering a
richer, more diverse research environment,
including emulation platforms [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ].
4.1.3. Public Engagement and Education
• Digital exhibitions and virtual reality experiences
make humanities more accessible to the public.
• Open-source projects and tools democratize access
to knowledge, allowing broader participation in
humanities scholarship.
          </p>
        </sec>
      </sec>
      <sec id="sec-4-13">
        <title>4.1.4. Preservation and Archiving</title>
        <p>Digital archiving ensures the preservation of important
historical and cultural materials, making them accessible
worldwide and safeguarding them against physical degradation.</p>
      </sec>
      <sec id="sec-4-14">
        <title>4.1.5. Scholarly Communication</title>
        <p>Digital platforms facilitate the sharing of research outputs
and scholarly discourse more dynamically and widely than
traditional publishing methods.</p>
        <sec id="sec-4-14-1">
          <title>4.2. Tools in Digital Humanities</title>
          <p>The tools used in digital humanities are diverse, ranging
from data analysis software to digital storytelling platforms,
each serving diferent aspects of humanities research:
1. Text Analysis Tools: - Natural Language Processing
(NLP) Software: Tools like NLTK, spaCy, and GPT-3 help
analyze text for syntax, sentiment, and semantic content.
- Concordance Software: Allows scholars to locate every
occurrence of a word or phrase, facilitating analysis of texts.
1–7
2. Data Visualization Tools: - Tableau, Gephi: Useful for
creating visual representations of data, helping to illustrate
complex relationships and patterns. - Timeline Tools: Tools
like TimelineJS help researchers create interactive timelines
that highlight historical events and trends.</p>
          <p>3. Mapping Tools: - GIS Software: ArcGIS and QGIS are
used to analyze and display spatial data, providing insights
into geographical aspects of historical and cultural studies.
StoryMap JS: Allows for the creation of maps that combine
narrative text with images and multimedia content.</p>
          <p>4. Digital Archiving Systems: - Omeka: Provides a web
platform for publishing collections and exhibitions, widely
used by libraries, museums, and archives. - Digital Asset
Management Systems (DAMS): Helps institutions manage
their digital content efectively.</p>
          <p>5. Programming and Development Environments:
Python and R: Popular programming languages for
handling data-intensive projects in DH. - Jupyter Notebooks:
Ofers an interactive environment for coding, visualizing,
and publishing data analysis projects.</p>
          <p>6. Collaborative Tools: - Wiki Software: Facilitates
collaborative writing and knowledge sharing. - Slack, GitHub:
Used for project management and collaboration in larger
digital projects.</p>
          <p>7. Content Management Systems (CMS): - WordPress,
Drupal: Often used to create and manage digital content,
facilitating dynamic web presence for DH projects.</p>
          <p>The integration of these tools into humanities research
has not only transformed traditional methodologies but also
expanded the boundaries of what can be explored and
understood in the humanities. As digital technologies continue to
evolve, the scope and tools of digital humanities will likely
expand further, continuing to redefine the engagement with
humanistic studies in the digital age.</p>
        </sec>
        <sec id="sec-4-14-2">
          <title>4.3. Methodology</title>
          <p>This study employs a qualitative research approach,
synthesizing information from various case studies, scholarly
articles, and empirical research findings to evaluate the
impact of LLMs in digital humanities. The research includes a
meta-analysis of published digital humanities projects
utilizing generative AI and a review of literature discussing
theoretical and methodological implications.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Applications of LLMs in Digital</title>
    </sec>
    <sec id="sec-6">
      <title>Humanities</title>
      <sec id="sec-6-1">
        <title>5.1. Text Analysis and Interpretation</title>
        <p>LLMs are particularly potent in their ability to process and
analyze large volumes of text, providing insights into
linguistic patterns, stylistic features, and historical language
evolution. Examples include the use of AI for analyzing
vast archives of unstructured text in libraries and museums.
Text analysis and interpretation form a core component of
the intersection between Large Language Models (LLMs)
and digital humanities. LLMs, due to their deep learning
foundations and substantial training datasets, ofer
sophisticated tools for automating and enhancing the examination
of textual data. This synergy is particularly influential in the
ifeld of digital humanities, where traditional hermeneutic
methods meet modern computational analytics. Text
analysis involves using algorithms to understand, interpret, and
derive meaningful information from textual data. In the
context of digital humanities, LLMs apply several techniques:
• Tokenization and Parsing: Breaking down text into
manageable pieces or tokens to analyze structural
and grammatical relationships.
• Named Entity Recognition (NER): Automatically
identifying and classifying key elements in text into
predefined categories such as names of people,
organizations, locations, etc.
• Sentiment Analysis: Determining the afective state
of the text, whether positive, negative, or neutral,
which can be particularly useful in analyzing
literary works or historical documents to gauge the
sentiments expressed.</p>
      </sec>
      <sec id="sec-6-2">
        <title>5.2. Enhancing Accessibility of Historical</title>
      </sec>
      <sec id="sec-6-3">
        <title>Documents</title>
        <p>Generative AI assists in the transcription and translation
of historical manuscripts, making them more accessible to
researchers worldwide. This not only democratizes access
but also enriches global understanding of historical contexts.</p>
      </sec>
      <sec id="sec-6-4">
        <title>5.3. Creative Writing and Artistic Endeavors</title>
        <p>In creative domains, LLMs have been used to generate
literary texts and assist in experimental poetry and narrative
projects, often blurring the lines between human and
machine creativity.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6. Challenges and Limitations</title>
      <p>
        Regarding the challenges and limitations we can identify
the following strands: Technological Dependence, the
reliance on sophisticated computational tools requires
ongoing technical support and significant resources, which can
be a barrier for institutions with limited funding. Data
Privacy and Ethical Concerns, as with all digital data, there are
concerns regarding privacy, particularly when dealing with
sensitive or personal historical records. Ethical
considerations also arise in how data is interpreted and presented
[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. The Impact on research and scholarship is relevant, in
particular scale and Scope: Digital tools allow researchers to
handle datasets of unprecedented size and complexity, often
referred to as "big data" in humanities [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. This capacity
has led to the emergence of "distant reading" techniques,
where scholars analyze patterns across thousands of texts,
rather than studying a single piece in depth ("close
reading"). And another fied is Interdisciplinary Research: Digital
analysis fosters interdisciplinary research approaches that
blend techniques from computer science, statistics, and the
arts and humanities. This crossover has enriched academic
research, leading to new insights and methodologies.
Enhanced Accuracy and New Insights: Automated analysis
can process information with a level of accuracy and speed
unattainable by manual methods. Moreover, it can reveal
insights that were previously obscured by the sheer volume
of data or the limitations of human analysis. Educational
Transformations in this scope are the following impact in
teaching and Learning: Digital humanities have transformed
pedagogical approaches within humanities disciplines.
Interactive tools and online archives provide students with
direct access to primary materials and analytical tools,
supporting more active learning environments. Open Access
1–7
and Collaboration: Digital humanities promote open access
to information and collaborative research projects.
Platforms like digital archives, open-source tools, and
collaborative research networks democratize access to knowledge
and facilitate global research partnerships.
      </p>
      <sec id="sec-7-1">
        <title>6.1. Data Privacy and Security</title>
        <p>Concerns about data privacy and the ethical use of digital
archives are paramount, particularly when dealing with
sensitive or personal historical data.</p>
      </sec>
      <sec id="sec-7-2">
        <title>6.2. Dependence on Technology Providers</title>
        <p>The reliance on proprietary AI technologies, often
developed by major corporations, poses risks related to
accessibility, transparency, and continuity of research.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>7. Future Directions and discussion</title>
      <p>
        DH continue to evolve, driven by advancements in AI,
Machine Learning and Data Visualization technologies. Future
research may focus on improving the interpretability of
machine-assisted analyses and enhancing the integration of
qualitative and quantitative methods. Ethical considerations
and bias are crucial aspects to address when discussing the
integration of AI and Large Language Models (LLMs) in
digital humanities. Inherent Biases in Training Data, LLMs
are trained on vast amounts of text data, which can
perpetuate existing biases in historical and cultural narratives.
These biases may disproportionately represent certain
perspectives, cultures, or time periods, potentially skewing
research outcomes. Another type of risk is the
underrepresenting minority voices, non-Western perspectives, and
marginalized groups in AI-assisted research. This could
lead to reinforcing existing power structures and narratives
in humanities scholarship [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The "black box" nature of
many AI systems makes it challenging to understand how
they arrive at certain conclusions or interpretations [
        <xref ref-type="bibr" rid="ref24 ref25">24, 25</xref>
        ].
This lack of transparency can be problematic in
humanities research, where understanding the reasoning behind
interpretations is crucial. Using AI to analyze historical
or cultural data raises questions about consent, especially
when dealing with sensitive or personal information from
the past [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. There’s a need to balance research benefits
with respect for privacy and cultural sensitivities. As AI
becomes more capable of generating human-like text, there
are concerns about maintaining the authenticity of
historical documents and distinguishing between AI-generated
and human-created content. Over-reliance on AI tools could
potentially lead to a reduction in critical thinking skills or
traditional research methods in humanities [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. There’s a
need to find a balance between leveraging AI capabilities and
maintaining core humanities research skills. Researchers
must consider the ethical implications of using AI-generated
insights, especially when these insights could impact
cultural understanding or policy decisions. The adoption of
AI tools in digital humanities could create or exacerbate
existing inequalities between well-funded institutions and
those with fewer resources [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. This raises questions about
equitable access to advanced research tools and
methodologies. AI systems may not always understand or respect
cultural nuances, leading to misinterpretations or
insensitive handling of cultural data. Determining responsibility
for errors or biased outcomes in AI-assisted research can
be challenging [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. There’s a need for clear guidelines on
accountability in collaborative human-AI research projects.
To address these concerns, researchers and institutions can:
• Implement rigorous bias detection and mitigation
strategies in AI systems used for humanities
research;
• Develop ethical guidelines specific to the use of AI
in digital humanities;
• Promote diverse and inclusive datasets for training
      </p>
      <p>AI models used in humanities research;
• Encourage interdisciplinary collaboration to ensure
AI tools are developed with input from humanities
scholars;
• Prioritize transparency in AI methodologies used
in research;
• Invest in education and training on ethical AI use
for humanities researchers.</p>
    </sec>
    <sec id="sec-9">
      <title>8. Conclusion</title>
      <p>In conclusion, digital analysis within digital humanities is
not just about incorporating new tools but fundamentally
transforming the ways that humanistic research can be
conducted, presented, and taught. This evolution continues
to ofer both exciting opportunities and significant
challenges that will shape the future of humanities scholarship.
Generative AI and LLMs ofer transformative potential for
digital humanities, enabling sophisticated textual analysis
and broadening the scope of what is technologically possible
in humanities research. However, their integration must be
handled with care, considering ethical, methodological, and
practical challenges. Future research should focus on
developing equitable and transparent use of these technologies
to enhance the digital humanities field.
1–7</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Risam</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>New Digital Worlds: Postcolonial Digital Humanities in Theory, Praxis, and Pedagogy</article-title>
          . Northwestern University Press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Capizzi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Lo Sciuto,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Napoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            and
            <surname>Tramontana</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>An advanced neural network based solution to enforce dispatch continuity in smart grids</article-title>
          .
          <source>Applied Soft Computing, n. 62</source>
          , pp.
          <fpage>768</fpage>
          -
          <lpage>775</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Lo</given-names>
            <surname>Sciuto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Capizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Shikler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            and
            <surname>Napoli</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Organic solar cells defects classification by using a new feature extraction algorithm and an EBNN with an innovative pruning algorithm</article-title>
          .
          <source>International Journal of Intelligent Systems</source>
          ,
          <volume>36</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>2443</fpage>
          -
          <lpage>2464</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pappalardo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tramontana</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nowicki</surname>
            ,
            <given-names>R. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Starczewski</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Woźniak</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Toward work groups classification based on probabilistic neural network approach</article-title>
          .
          <source>Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)</source>
          ,
          <volume>9119</volume>
          , pp.
          <fpage>79</fpage>
          -
          <lpage>89</lpage>
          , DOI 10.1007/978-3-
          <fpage>319</fpage>
          - 19324-
          <issue>3</issue>
          _
          <fpage>8</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Tibermacine</surname>
            ,
            <given-names>I. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tibermacine</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guettala</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2023</year>
          ).
          <article-title>Enhancing Sentiment Analysis on SEED-IV Dataset with Vision Transformers: A Comparative Study</article-title>
          . ACM International Conference Proceeding Series, pp.
          <fpage>238</fpage>
          -
          <lpage>246</lpage>
          , DOI 10.1145/3638985.3639024.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Kelleher</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Deep Learning for Natural Language Processing: Creating Neural Networks with Python</article-title>
          .
          <source>Apress.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>O</given-names>
            <surname>'Neil</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          (
          <year>2016</year>
          ).
          <source>Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy</source>
          . Crown.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Zook</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barocas</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyd</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crawford</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Keller, E.,
          <string-name>
            <surname>Gangadharan</surname>
            ,
            <given-names>S. P.</given-names>
          </string-name>
          , ... ,
          <string-name>
            <surname>Pasquale</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Ten simple rules for responsible big data research</article-title>
          .
          <source>PLoS Computational Biology</source>
          ,
          <volume>13</volume>
          (
          <issue>3</issue>
          ),
          <year>e1005399</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Bonanno</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Capizzi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Lo Sciuto,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Napoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Pappalardo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            and
            <surname>Tramontana</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>A cascade neural network architecture investigating surface plasmon polaritons propagation for thin metals in openmp</article-title>
          .
          <source>In Artificial Intelligence and Soft Computing: 13th International Conference, ICAISC</source>
          <year>2014</year>
          , Zakopane, Poland, June 1-5,
          <year>2014</year>
          , Proceedings,
          <string-name>
            <surname>Part I</surname>
          </string-name>
          13 (pp.
          <fpage>22</fpage>
          -
          <lpage>33</lpage>
          ). Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Illari</surname>
            ,
            <given-names>S. I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Avanzato</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2020</year>
          ).
          <article-title>A cloud-oriented architecture for the remote assessment and follow-up of hospitalized patients</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , 2694 pp.
          <fpage>29</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vizzarri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mazzenga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Giuliano</surname>
          </string-name>
          ,
          <article-title>Future technologies for train communication: The role of leo hts satellites in the adaptable communication system</article-title>
          ,
          <source>Sensors</source>
          <volume>23</volume>
          (
          <year>2023</year>
          ). URL: https: //www.mdpi.com/1424- 8220/23/1/68. doi:
          <volume>10</volume>
          .3390/s23010068.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Seargeant</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , ,
          <string-name>
            <surname>Tagg</surname>
          </string-name>
          , C. (Eds.). (
          <year>2018</year>
          ).
          <article-title>The Language of Social Media: Identity and Community on the Internet</article-title>
          . Palgrave Macmillan.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Bender</surname>
            ,
            <given-names>E. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gebru</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McMillan-Major</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shmitchell</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <source>On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency</source>
          (pp.
          <fpage>610</fpage>
          -
          <lpage>623</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Spiro</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>This Is Why We Fight: Defining the Values of the Digital Humanities</article-title>
          . In M. K. Gold (Ed.),
          <source>Debates in the Digital Humanities</source>
          (pp.
          <fpage>16</fpage>
          -
          <lpage>35</lpage>
          ). University of Minnesota Press.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Eficient Estimation of Word Representations in Vector Space</article-title>
          .
          <source>In Proceedings of the International Conference on Learning Representations (ICLR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Bostrom</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yudkowsky</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The Ethics of Artiifcial Intelligence</article-title>
          . In K. Frankish, W. M. Ramsey (Eds.),
          <source>The Cambridge Handbook of Artificial Intelligence</source>
          (pp.
          <fpage>316</fpage>
          -
          <lpage>334</lpage>
          ). Cambridge: Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ranaldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Ruzzetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Onorati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ranaldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Giannone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Favalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Romagnoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Zanzotto</surname>
          </string-name>
          ,
          <article-title>Investigating the impact of data contamination of large language models in text-tosql translation</article-title>
          ,
          <source>ArXiv abs/2402</source>
          .08100 (
          <year>2024</year>
          ). URL: https://api.semanticscholar.org/CorpusID: 267636801.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Brandizzi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Brociek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wajda</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>First Studies to Apply the Theory of Mind Theory to Green and Smart Mobility by Using Gaussian Area Clustering</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , 3118 pp.
          <fpage>71</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>R.</given-names>
            <surname>Giuliano</surname>
          </string-name>
          ,
          <article-title>From 5g-advanced to 6g in 2030: New services, 3gpp advances, and enabling technologies</article-title>
          ,
          <source>IEEE Access 12</source>
          (
          <year>2024</year>
          )
          <fpage>63238</fpage>
          -
          <lpage>63270</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACCESS.
          <year>2024</year>
          .
          <volume>3396361</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Berbineau</surname>
          </string-name>
          et al.
          <article-title>Zero on site testing of railway wireless systems: the Emulradio4Rail platforms</article-title>
          .
          <source>2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring)</source>
          , Helsinki, Finland,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          , doi: 10.1109/VTC2021-
          <fpage>Spring51267</fpage>
          .
          <year>2021</year>
          .
          <volume>9448903</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Straumsheim</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Digital Humanities and Ethics</article-title>
          . Inside Higher Ed.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Rieder</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , ,
          <string-name>
            <surname>Simon</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Big Data: A New Empiricism and its Epistemic and Socio-Political Consequences</article-title>
          .
          <source>Knowledge Organization</source>
          ,
          <volume>44</volume>
          (
          <issue>4</issue>
          ),
          <fpage>235</fpage>
          -
          <lpage>246</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Birhane</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , ,
          <string-name>
            <surname>Guest</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Towards decolonising computational sciences</article-title>
          .
          <source>Women, Gender &amp; Research</source>
          ,
          <volume>30</volume>
          (
          <issue>1</issue>
          ),
          <fpage>60</fpage>
          -
          <lpage>73</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>D</given-names>
            <surname>'Ignazio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            , ,
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. F.</surname>
          </string-name>
          (
          <year>2020</year>
          ).
          <article-title>Data Feminism</article-title>
          . MIT Press. Eubanks,
          <string-name>
            <surname>V.</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor</article-title>
          . St. Martin's Press.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Woźniak</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Połap</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nowicki</surname>
            ,
            <given-names>R. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pappalardo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Tramontana</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Novel approach toward medical signals classifier</article-title>
          .
          <source>Proceedings of the International Joint Conference on Neural Networks</source>
          ,
          <year>September 2015</year>
          , DOI 10.1109/IJCNN.
          <year>2015</year>
          .
          <volume>7280556</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Crawford</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence</article-title>
          . Yale University Press.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Fyfe</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2022</year>
          ).
          <article-title>Digital Humanities and the Climate Crisis: Opportunities and Challenges</article-title>
          .
          <source>Digital Scholarship in the Humanities</source>
          ,
          <volume>37</volume>
          (
          <issue>2</issue>
          ),
          <fpage>546</fpage>
          -
          <lpage>562</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Noble</surname>
            ,
            <given-names>S. U.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Algorithms of Oppression: How Search Engines Reinforce Racism</article-title>
          . NYU Press.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Underwood</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Distant Horizons: Digital Evidence and Literary Change</article-title>
          . University of Chicago Press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>