<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>IRCDL</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Enhancing Historical Documents: Deep Learning and Image Processing Approaches</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Zahra Ziran</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Massimo Mecella</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simone Marinai</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Sapienza University of Rome</institution>
          ,
          <addr-line>Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Florence</institution>
          ,
          <addr-line>Florence</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>21</volume>
      <fpage>20</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>Historical documents from Late Antiquity to the early Middle Ages often sufer from degraded image quality due to aging, inadequate preservation, and environmental factors, presenting significant challenges for paleographical analysis. These documents contain crucial graphical symbols representing administrative, economic, and cultural information, which are time-consuming and error-prone to interpret manually. This research investigates image processing algorithms and deep learning models for enhancing these historical documents. Using image processing techniques, we improve symbol readability and visibility, while our deep learning approach aids in reconstructing degraded content and identifying patterns. This work contributes to improving the quality of historical document analysis, particularly for graphical symbol interpretation in paleographical studies.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Historical Document</kwd>
        <kwd>Image Enhancement</kwd>
        <kwd>paper formatting</kwd>
        <kwd>Digital Paleography</kwd>
        <kwd>Graphical Symbols</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The field of paleography has experienced significant advancement through digital technologies,
particularly in the analysis and preservation of historical documents [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The digitization of ancient
manuscripts has transformed how humanities scholars access and study historical materials, enabling
worldwide collaboration and more sophisticated analytical approaches [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, the quality of
digital images remains a critical factor in paleographical research, directly impacting scholars’ ability to
interpret and analyze historical content accurately [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. Recent years have witnessed the emergence
of various computational methods to enhance the quality of historical document images [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. These
approaches, ranging from traditional image processing techniques to advanced deep learning models,
aim to address challenges such as degradation, poor preservation, and environmental damage that afect
historical documents [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The success of these enhancement methods depends heavily on understanding
specific requirements in humanities research, where scholars may need to focus on diferent aspects
of documents, from broad textual content to minute graphical details [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The collaboration between
computer scientists and humanities scholars has become increasingly important in developing efective
enhancement solutions [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. This interdisciplinary approach requires careful consideration of both
technical capabilities and scholarly needs, particularly in preserving and enhancing historical documents’
subtle features and symbols [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In our study, we focus on improving the quality of graphical symbols in
historical documents through two distinct approaches: image processing algorithms and deep learning
models. These methods aim to enhance symbol clarity while preserving their historical integrity and
significance.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset</title>
      <p>
        Several datasets are available for evaluating enhancement and analysis techniques applied to historical
documents. These include graphic symbols from documentary records dating from Late Antiquity to
the early medieval period [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and the Digital Image Database of Ancient Handwritings (DIDA) [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ],
which provides a collection of historical handwritten digit images. Another significant resource is the
IAM Historical Document Database (IAM-HistDB) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], which contains manuscripts from medieval
times, including the Saint Gall database of handwritten historical documents from the 9th century.
Additionally, the DIVA-HisDB [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] provides 150 annotated pages from three diferent medieval manuscripts
with challenging layouts, specifically designed for evaluating various Document Image Analysis (DIA)
tasks. All these datasets serve as valuable benchmarks for assessing document enhancement methods.
      </p>
      <p>
        In this research, we utilize the dataset of the project (NOTAE): NOT A writtEn word but graphic
symbols [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which comprises graphic symbols extracted from documentary sources dating from late
antiquity to the early Middle Ages. These symbols, encompassing alphabetic and non-alphabetic signs,
were employed in various documents to convey meanings beyond the written text, serving functions
such as authentication, authorization, or annotation. The documents originate from diverse European
regions and contexts, reflecting their time’s administrative, legal, and cultural practices. The massive
number of documents and symbols provides a rich resource for analyzing the use and evolution of
graphic symbols in historical manuscripts and is curated to ensure high-quality representations of these
symbols, facilitating detailed analysis and interpretation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Related Work</title>
      <p>
        In historical document processing, significant advancements have been made in enhancing and analyzing
ancient manuscripts through image-processing techniques and deep learning approaches. This section
reviews key studies relevant to document image enhancement, highlighting their strengths, limitations,
and how our approach builds upon these methods. Traditional image-processing techniques have
played a crucial role in document enhancement, particularly for restoring degraded historical texts.
Atanasiu and Marthot-Santaniello [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] employed histogram equalization, adaptive histogram equalization,
and local Laplacian filters to enhance papyri legibility. Their method preserved symbol clarity but
struggled with complex background noise, requiring manual refinement. Shi and Govindaraju [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]
introduced background estimation and pixel normalization to improve aged documents with uneven
illumination. While efective, it lacked adaptability for highly degraded manuscripts with missing
symbols or extensive fading. Similarly, Mittal et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] used stretch limits, histogram modifications,
and saturation adjustments for graphical symbol enhancement but required parameter tuning, limiting
scalability. Gupta et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] applied error difusion and multiresolution binarization for restoration,
improving binarization quality but struggling with complex multi-layered text, which deep learning
models address more efectively. With deep learning advancements, newer methods have addressed
traditional limitations. Oliveira et al. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] demonstrated the efectiveness of convolutional neural
networks (CNNs) in reconstructing degraded symbols and identifying repetitive patterns in ancient
manuscripts. While improving restoration, their model required extensive labeled training data, often
scarce for historical texts. Zhang et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] developed a deep learning model for graphical symbol
enhancement, recovering deteriorated details but lacking adaptability for multi-script textual restoration.
Our study integrates traditional image processing with deep learning to enhance symbol clarity and
reconstruct missing details, leveraging the strengths of both approaches for a more robust document
enhancement framework.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>This section explores two distinct image processing approaches alongside a deep learning method
to enhance the quality of graphical symbols. These techniques aim to improve readability, enhance
contrast, mitigate illumination variations, and uncover hidden details. By comparing the efectiveness
of diferent methods, we identify the most suitable approaches for the specific goal of graphical symbol
enhancement, enabling the extraction of valuable insights and fostering a deeper understanding of
these ancient symbols.</p>
      <sec id="sec-4-1">
        <title>4.1. Image Processing-Based Models</title>
        <p>Image processing techniques play a crucial role in enhancing image quality and addressing various
challenges across diferent datasets. For ancient graphical symbols from the medieval period, edge
detection and superpixel segmentation are employed to reveal hidden details and preserve intricate
patterns. Histogram equalization and adaptive histogram equalization enhance contrast and address
illumination variations, resulting in more visible and interpretable symbols. Model A and B, described
in the following, implement these ideas.</p>
        <sec id="sec-4-1-1">
          <title>4.1.1. Model A: Edge Detection and Segmentation Approach</title>
          <p>
            This model implements a comprehensive enhancement pipeline combining edge detection with advanced
segmentation techniques. The enhancement process consists of three main stages: edge detection,
superpixel segmentation, and histogram optimization. In the initial stage, we employ the Sobel operator
for edge detection [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ]. For a given grayscale image (, ) defined over a two-dimensional spatial
domain Ω ⊂ R2, where (, ) represents the spatial coordinates, the horizontal and vertical gradients
are computed as:
          </p>
          <p>(, ) =  ≈ ( + 1, ) − ( − 1, )</p>
          <p>(, ) =  ≈ (,  + 1) − (,  − 1)</p>
          <p>√︁
(, ) =</p>
          <p>(, )2 + (, )2
where  and  denote the approximated gradients in the horizontal and vertical directions,
respectively. The gradient magnitude (, ) is then computed as:</p>
          <p>
            Following edge detection, we implement the Simple Linear Iterative Clustering (SLIC) algorithm [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ]
for superpixel segmentation. For each pixel  = (, , , , ), where (, , ) represents
its color values in RGB space, the distance metric  to a cluster center  = (, , , , ) is
defined as:
          </p>
          <p>√︃
(, ) =
2 +
︂(  )︂ 2

2
where:
•  = √︀( − )2 + ( − )2 + ( − )2 is the color distance
•  = √︀( − )2 + ( − )2 is the spatial distance
•  represents the sampling interval in the pixel grid
•  is a compactness parameter controlling the relative importance of spatial proximity
The final enhancement stage employs adaptive histogram equalization. For each intensity level , we
compute the cumulative distribution function  ():</p>
          <p>() = ∑︁ (),  ∈ [0,  − 1]
=0
(1)
(2)
(3)
(4)
(5)
where () represents the probability of occurrence of intensity level , and  is the total number of
possible intensity levels. The new intensity value is then calculated as:</p>
          <p>(, ) = ( − 1) ·  ((, ))</p>
          <p>This transformation is applied locally within adaptive windows to account for spatial variations
in contrast and illumination. The efectiveness of this approach is demonstrated in Figure 1, where
significant improvements in both local contrast and edge definition are achieved while preserving the
structural integrity of the historical symbols.</p>
          <p>The enhancement process is applied iteratively until a convergence criterion  is met:
|+1 − |2 &lt; 
where  represents the enhanced image at iteration , and | · | 2 denotes the L2-norm. In our experiments,
we use a normalized convergence threshold  = 10− 4 · | 0|2, where 0 is the initial image. This relative
threshold ensures the stopping criterion scales appropriately with image size and intensity range,
providing consistent convergence behavior across diferent input images.
(6)
(7)</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Model B: Noise Reduction and Enhancement Approach</title>
          <p>The second model focuses on noise reduction while preserving symbol structures through a two-phase
process. The first phase, apply_filter , combines edge detection and watershed segmentation with SLIC
superpixels to create an initial enhanced representation. The second phase, remove_noise, implements
morphological operations for noise reduction.</p>
          <p>The process begins with binary thresholding to separate foreground and background elements. This is
followed by morphological opening operations that efectively remove small artifacts while maintaining
the integrity of larger structures. The final step combines the cleaned binary image with the enhanced
version to produce the result shown in Figure 2.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Deep Learning-Based Enhancement</title>
        <p>Our deep learning approach incorporates synthetic data generation and transformer-based feature
extraction to enhance historical document images [22]. The system utilizes a Faster R-CNN model [23]
trained on both original and synthetically generated data, enabling the preservation of unique document
characteristics. The OPTICS algorithm [24] is employed for stroke segmentation, with adaptive circle
construction handling variable stroke thickness. Figure 3 demonstrates the efectiveness of this approach.
Following the ideas used by Cai et al. [25], our study explores how GAN-based synthetic data generation
can improve symbol recognition in ancient documents. By creating enriched training datasets that
reflect the diverse and complex nature of historical symbols, we aim to achieve higher accuracy and
reliability in symbol detection for document analysis. The enriched datasets generated facilitate a more
comprehensive training environment, significantly enhancing the models’ ability to generalize across
new and unseen data. This capability is vital for practical applications where models are expected
to perform accurately outside their training set. Moreover, integrating synthetic data that accurately
reflects the complexity of real-world scenarios reduces the models’ tendencies to overfit the limited
nuances of smaller datasets. Instead, they learn more robust features that represent the true underlying
patterns in the data, substantially boosting performance and generalizability.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion and Results</title>
      <p>This section presents a detailed evaluation and comparative analysis of our three proposed approaches:
Model A (edge detection and segmentation), Model B (noise reduction and enhancement), and the deep
learning-based method. Through quantitative metrics and visual examples, we assess each method’s
efectiveness in enhancing historical documents and preserving graphical symbols. The comparison
examines multiple performance aspects including edge preservation, contrast enhancement, noise
reduction, and symbol reconstruction capabilities. Special attention is given to challenging cases, such
as severely degraded documents and partially missing symbols, to thoroughly understand each method’s
strengths and limitations. The analysis includes comparisons with existing solutions, particularly the
Hierax method, to demonstrate the advancements achieved by our approaches.</p>
      <sec id="sec-5-1">
        <title>5.1. Comparative Analysis</title>
        <p>To evaluate the efectiveness of our approaches, we conducted the comparative analysis using
challenging low-quality images (one example is shown in Figure 4). As discussed in the following, Model
A demonstrated superior edge preservation and contrast enhancement, while Model B excelled in
noise reduction and structure preservation. The deep learning approach showed particular strength in
reconstructing degraded regions and pattern recognition.</p>
        <p>When compared to existing approaches in historical document enhancement, such as Hierax [26]1,
our method represents a better advancement in revealing details that were previously obscured, as
illustrated in Figure 5.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Performance Analysis</title>
        <p>To quantitatively evaluate and compare our approaches, we employed several standard image quality
metrics and developed specific measurements for historical document enhancement. The evaluation
was conducted on a test set of 100 historical document images containing various types of degradation
and symbols. The metrics taken into account are summarized below.</p>
        <sec id="sec-5-2-1">
          <title>Peak Signal-to-Noise Ratio (PSNR):</title>
          <p>PSNR = 10 · log10
︂(  2 )︂</p>
          <p>MSE
where   is the maximum possible pixel value, and MSE is the Mean Square Error between the
enhanced and reference images.</p>
        </sec>
        <sec id="sec-5-2-2">
          <title>Structural Similarity Index (SSIM):</title>
          <p>SSIM(, ) =</p>
          <p>(2   + 1)(2  + 2)
( 2 +  2 + 1)( 2 +  2 + 2)
where  ,   are local means,  2,  2 are variances, and   is the covariance.</p>
        </sec>
        <sec id="sec-5-2-3">
          <title>Symbol Preservation Rate (SPR):</title>
          <p>SPR = preserved
total
where preserved is the number of correctly preserved symbols and total is the total number of symbols
in the original image.</p>
          <p>Edge Preservation Assessment For edge preservation evaluation, we computed the Edge
Preservation Index (EPI):</p>
          <p>EPI =
∑︀, ||∇(, )| − |∇</p>
          <p>(, )||2
∑︀, |∇(, )|2
where ∇ and ∇ are the gradients of enhanced and original images, respectively.</p>
          <p>Our experimental results demonstrate the relative strengths of each approach in Table 1. Model A
achieved significant improvements in edge preservation and contrast enhancement, with quantitative
measurements showing an average improvement of 45% in contrast ratio and 38% in edge preservation
compared to baseline measurements. Model B demonstrated exceptional capability in noise reduction,
achieving a 52% reduction in background noise while maintaining 94% of essential edge information.
The deep learning approach showed a 73% accuracy in reconstructing damaged symbols, particularly
excelling in areas where traditional methods struggled. Model A achieved superior performance in edge
preservation and contrast enhancement achieving a PSNR of 32.4 dB and an SSIM of 0.89, indicating
better preservation of structural information and overall image quality. The high SPR of 94.2% confirms
its efectiveness in maintaining symbol integrity during the enhancement process.</p>
          <p>Model B excelled in noise reduction while maintaining edge preservation (EPI: 0.85), making it
particularly efective for badly degraded documents. The deep learning approach, despite lower traditional
metric scores (PSNR: 29.5 dB, SSIM: 0.84), demonstrated superior symbol reconstruction and pattern
recognition, especially in cases of severely degraded or missing symbols, as illustrated in Figure 3.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and Future Work</title>
      <p>This study investigates image enhancement techniques for historical documents by integrating
traditional image processing with deep learning. Our proposed methods consistently outperform the Hierax
approach across various metrics. Model A excels in edge preservation and contrast enhancement, while
Model B is more efective for noise reduction and structural preservation. The deep learning model,
though slightly lower in traditional metrics, proves superior in reconstructing damaged symbols and
handling severely degraded cases. Each method has distinct strengths suited to diferent document
conditions. While tested on the NOTAE dataset, our framework’s adaptability suggests broader
applicability to collections like DIVA-HisDB and IAM-HistDB. Future work will focus on a hybrid system with
an adaptive selection mechanism and human-in-the-loop validation to optimize enhancement based on
document conditions.
(2012) 2274–2282.
[22] Z. Ziran, F. Leotta, M. Mecella, Enhancing object detection in ancient documents with
synthetic data generation and transformer-based models, 2023. URL: https://arxiv.org/abs/2307.16005.
arXiv:2307.16005.
[23] S. Ren, K. He, R. Girshick, J. Sun, Faster R-CNN: Towards real-time object detection with region
proposal networks, IEEE transactions on pattern analysis and machine intelligence 39 (2016)
1137–1149.
[24] M. Ankerst, M. M. Breunig, H.-P. Kriegel, J. Sander, Optics: Ordering points to identify the
clustering structure, ACM Sigmod record 28 (1999) 49–60.
[25] J. Cai, L. Peng, Y. Tang, C. Liu, P. Li, Th-gan: generative adversarial network based transfer learning
for historical chinese character recognition, in: 2019 International Conference on Document
Analysis and Recognition (ICDAR), IEEE, 2019, pp. 178–183.
[26] V. Atanasiu, I. Marthot-Santaniello, Hierax legibility enhancement software, 2020.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Stokes</surname>
          </string-name>
          ,
          <article-title>Digital approaches to paleography and book history: some challenges, present and future</article-title>
          ,
          <source>Frontiers in Digital Humanities</source>
          <volume>2</volume>
          (
          <year>2015</year>
          )
          <article-title>5</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hassner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rehbein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Stokes</surname>
          </string-name>
          , L. Wolf,
          <article-title>Computation and palaeography: Potentials and limits (dagstuhl perspectives workshop</article-title>
          12382),
          <source>Dagstuhl Manifestos</source>
          <volume>2</volume>
          (
          <year>2013</year>
          )
          <fpage>14</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Baechler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ingold</surname>
          </string-name>
          <article-title>, Multi resolution layout analysis of medieval manuscripts using dynamic mlp</article-title>
          ,
          <source>in: 2011 International Conference on Document Analysis and Recognition</source>
          , IEEE,
          <year>2011</year>
          , pp.
          <fpage>1185</fpage>
          -
          <lpage>1189</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Pintus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Weyrich</surname>
          </string-name>
          , E. Gobbetti,
          <string-name>
            <given-names>H.</given-names>
            <surname>Rushmeier</surname>
          </string-name>
          ,
          <article-title>A survey of geometric analysis in cultural heritage</article-title>
          , in: Computer Graphics Forum, volume
          <volume>35</volume>
          , Wiley Online Library,
          <year>2016</year>
          , pp.
          <fpage>4</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Likforman-Sulem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zahour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Taconet</surname>
          </string-name>
          ,
          <article-title>Text line segmentation of historical documents: a survey</article-title>
          ,
          <source>International Journal of Document Analysis and Recognition (IJDAR) 9</source>
          (
          <year>2007</year>
          )
          <fpage>123</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>W.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-H.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Liao</surname>
          </string-name>
          ,
          <article-title>Deep learning for single image superresolution: A brief review</article-title>
          ,
          <source>IEEE Transactions on Multimedia</source>
          <volume>21</volume>
          (
          <year>2019</year>
          )
          <fpage>3106</fpage>
          -
          <lpage>3121</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Stutzmann</surname>
          </string-name>
          ,
          <article-title>Clustering of medieval scripts through computer image analysis: towards an evaluation protocol</article-title>
          ,
          <source>Digital Medievalist</source>
          <volume>10</volume>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Simeone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Guiliano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kooper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bajcsy</surname>
          </string-name>
          ,
          <article-title>Digging into data using new collaborative infrastructures supporting humanities-based computer science research</article-title>
          , First
          <string-name>
            <surname>Monday</surname>
          </string-name>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>J.-B. Camps</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Clérice</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Pinche</surname>
          </string-name>
          ,
          <article-title>Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating paul meyer's hagiographic hypothesis</article-title>
          ,
          <source>Digital Scholarship in the Humanities</source>
          <volume>36</volume>
          (
          <year>2021</year>
          )
          <fpage>49</fpage>
          -
          <lpage>71</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Bernasconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Boccuzzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Briasco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Catarci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ghignoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Leotta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mecella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sietis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Veneruso</surname>
          </string-name>
          , et al.,
          <article-title>Notae: Not a written word but graphic symbols</article-title>
          .,
          <source>in: CEUR WORKSHOP PROCEEDINGS</source>
          , volume
          <volume>3144</volume>
          ,
          <year>2022</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kusetogullari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Yavariabdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lavesson</surname>
          </string-name>
          ,
          <article-title>Digitnet: A deep handwritten digit detection and recognition method using a new historical handwritten digit dataset</article-title>
          ,
          <source>Big Data Research</source>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kusetogullari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Yavariabdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lavesson</surname>
          </string-name>
          ,
          <article-title>Dida: The largest historical handwritten digit dataset with 250k digits</article-title>
          , https://github.com/didadataset/DIDA/,
          <year>2020</year>
          . Accessed:
          <fpage>2024</fpage>
          -12-19.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>U.-V.</given-names>
            <surname>Marti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Bunke</surname>
          </string-name>
          ,
          <article-title>The IAM-database: an english sentence database for ofline handwriting recognition</article-title>
          ,
          <source>International journal on document analysis and recognition 5</source>
          (
          <year>2002</year>
          )
          <fpage>39</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>F.</given-names>
            <surname>Simistira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Seuret</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Eichenberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Garz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Liwicki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ingold</surname>
          </string-name>
          ,
          <article-title>Diva-hisdb: A precisely annotated large dataset of challenging medieval manuscripts</article-title>
          ,
          <source>in: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>471</fpage>
          -
          <lpage>476</lpage>
          . doi:
          <volume>10</volume>
          .1109/ICFHR.
          <year>2016</year>
          .
          <volume>0093</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Govindaraju</surname>
          </string-name>
          ,
          <article-title>Historical document image enhancement using background light intensity normalization</article-title>
          ,
          <source>in: Proceedings of the 17th International Conference on Pattern Recognition (ICPR)</source>
          , volume
          <volume>1</volume>
          , IEEE,
          <year>2004</year>
          , pp.
          <fpage>473</fpage>
          -
          <lpage>476</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>S.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gulati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mittal</surname>
          </string-name>
          ,
          <article-title>Histogram equalization techniques for contrast enhancement: A review</article-title>
          ,
          <source>International Journal of Computer Applications</source>
          <volume>114</volume>
          (
          <year>2015</year>
          )
          <fpage>32</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. P.</given-names>
            <surname>Jacobson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. K.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <article-title>Ocr binarization and image pre-processing for searching historical documents</article-title>
          ,
          <source>Pattern Recognition</source>
          <volume>40</volume>
          (
          <year>2007</year>
          )
          <fpage>389</fpage>
          -
          <lpage>397</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Seguin</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          <article-title>Kaplan, dhsegment: A generic deep-learning approach for document segmentation</article-title>
          ,
          <source>in: Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)</source>
          , IEEE,
          <year>2018</year>
          , pp.
          <fpage>7</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , W. Zuo,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Learning deep cnn denoiser prior for image restoration</article-title>
          ,
          <source>in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          , IEEE,
          <year>2017</year>
          , pp.
          <fpage>3929</fpage>
          -
          <lpage>3938</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>N.</given-names>
            <surname>Kanopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Vasanthavada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Baker</surname>
          </string-name>
          ,
          <article-title>Design of an image edge detection filter using the sobel operator</article-title>
          ,
          <source>IEEE Journal of solid-state circuits 23</source>
          (
          <year>1988</year>
          )
          <fpage>358</fpage>
          -
          <lpage>367</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>R.</given-names>
            <surname>Achanta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shaji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lucchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Süsstrunk</surname>
          </string-name>
          ,
          <article-title>Slic superpixels compared to stateof-the-art superpixel methods</article-title>
          ,
          <source>IEEE transactions on pattern analysis and machine intelligence 34</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>