=Paper=
{{Paper
|id=Vol-3838/paper4
|storemode=property
|title=A new methodology to automatically detect cracks in existing RC bridges
|pdfUrl=https://ceur-ws.org/Vol-3838/paper4.pdf
|volume=Vol-3838
|authors=Vincenzo Mario Di Mucci,Angelo Cardellicchio,Sergio Ruggieri,Andrea Nettis,Vito Renò,Giuseppina Uva
|dblpUrl=https://dblp.org/rec/conf/viperc/MucciCRNRU24
}}
==A new methodology to automatically detect cracks in existing RC bridges==
A new methodology to automatically detect cracks in
existing RC bridges
Vincenzo Mario Di Mucci1, *, †, Angelo Cardellicchio2, †, Sergio Ruggieri1, †, Andrea
Nettis1, †, Vito Renò2, † and Giuseppina Uva1, †
1 DICATECH Department, Polytechnic University of Bari, Via Orabona 4, Bari, Italy
2 STIIMA Institute, National Research Council of Italy, Via Amendola 122D/O, Bari, Italy
Abstract
The paper presents a novel approach to detect cracks in existing reinforced concrete (RC) bridges
using computer vision (CV) techniques as smart sensors and to identify existing damages from
photos. This method involves training specialized convolutional neural networks (CNNs) to
identify cracks in RC components, focusing on automated detection. The process begins with
defining a detailed dataset of labeled crack images by domain experts in the field. Subsequently,
CNNs designed for crack detection are trained and assessed. The effectiveness of the method is
initially evaluated through visual comparisons, with more specific evaluations planned to use
defined metrics upon completion of development. This innovative methodology aims to drive
digital progress and artificial intelligence applications in advanced visual inspections, ultimately
safeguarding the structures of existing bridge stock.
Keywords
Existing bridges, Conservation, Visual inspections, Crack detection, Structural Health
management, Computer vision, Artificial Intelligence1
2
1. Introduction
In recent years, bridge collapses [1] have highlighted the importance of the safety of existing
infrastructures, especially historic ones. This concerns not only ancient masonry bridges,
but also reinforced concrete (RC) bridges, which are crucial for their function and cultural
value. Events such as earthquakes have shown the vulnerability of these structures, making
careful monitoring necessary to avoid economic losses and protect the built heritage [2].
VIPERC2024: 3rd International Conference on Visual Pattern Extraction and Recognition for Cultural Heritage
Understanding, 1 September 2024
∗ Corresponding author.
† These authors contributed equally.
v.dimucci1@phd.poliba.it (V. M. Di Mucci); angelo.cardellicchio@stiima.cnr.it (A. Cardellicchio);
sergio.ruggieri@poliba.it (S. Ruggieri); a.nettis@poliba.it (A. Nettis); vito.reno@stiima.cnr.it (V. Renò);
giuseppina.uva@poliba.it (G. Uva)
0009-0002-6239-2743 (V. M. Di Mucci); 0000-0003-3313-4817 (A. Cardellicchio); 0000-0001-5119-
8967(S. Ruggieri); 0000-0001-8133-6830(A. Nettis); 0000-0003-1830-4961 (V. Renò); 0000-0001-6408-167X
(G. Uva)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
The focus has been on developing systematic and non-invasive methods for monitoring and
maintaining these critical infrastructures [3], [4].
Existing RC bridges, which are often more than 50 years old, suffer from several
problems including concrete deterioration and steel corrosion [5]. These issues underline
the urgency of assessing the state of conservation of existing bridges as a fundamental step
for their efficient management. Two critical aspects emerge:
1. Limitations of economic and temporal resources.
2. Huge number of structures to be assessed.
To address this problem, the Ministry of Infrastructure and Transport (MIT) has released
the new Guidelines for the management of bridges safety [6]. The decree provides a multi-
level approach aimed at defining risk-based priority lists to direct accurate assessments and
interventions on the most critical bridges, and then to drive available resources on the
worst cases.
Level 1 of the Guidelines consists of visual inspection activities on bridges, necessary to
identify the current state of conservation and the presence of any degradation phenomena.
Traditional methods consist of inspecting bridges by trained inspectors, which identify
defects and define their intensity and extension using numerical coefficients.
It is worth observing that this operation requires significant human and economic
resources that infrastructure managers should face. Furthermore, traditional visual
inspection methods are time-consuming, laborious and highly dependent on the inspectors'
experience, which can lead to inconsistent assessments [7]. Visual inspection of a bridge
requires access to all parts of it, such as the piers and supports, which is not always possible,
as shown in [8]. In addition, these inspections often require the limitation of the bridge to
traffic, causing issues to the bridge serviceability. For this reason, research is underway to
find innovative solutions that automate inspections, reducing time and costs and improving
the safety of inspectors.
One of the most alarming defects is represented by cracks [9], which have specific
geometric characteristics such as width, length and orientation (e.g., longitudinal or
diagonal) [10]. With the aim of improving the current practice in cracks detection, this
paper explores the possibility to automatically detect cracks on bridge surface, through
advanced computer vision (CV) technologies, leveraging machine learning (ML) and deep
learning (DL) algorithms for defects detection.
The paper proposes a processing pipeline for automatic crack detection in existing RC
bridges. The system uses a pixel-based method to generate several patches from a limited
number of images showing cracks on RC bridge surfaces. These images are then used to
train a convolutional neural network (CNN) to identify the presence of cracks in the images.
The document has been organized as follows: Section 2 reports a review of the state-of-the-
art techniques of ML and DL for civil engineering; Section 3 presents the proposed
framework, detailing the steps of the process; Section 4 discusses the preliminary findings
and finally Section 5 provides the concluding remarks anticipating future developments.
2. State-of-the-art on crack detection
ML has been applied in various fields of civil and structural engineering [11], including
earthquake engineering [12], structural property identification and structural health
monitoring [13]. CV, which is the application of DL in the field of image analysis, has shown
promising results in assessing the state of conservation of structures.
One interesting application in this field is represented by VULMA [14], a tool able to
derive a simplified vulnerability index using images of existing buildings. This tool is based
on the use of Google Street View to automatically collect data, subjected to the labelling for
13 different geometrical parameters. Subsequently, by training a cascade of CNNs with
transfer learning and fine-tuning techniques, the tool extracts an accurate simplified
vulnerability index for each analyzed image.
Analogously, also for bridge analysis and the detection of structural defects such as
cracks, several studies have proposed the use of CV applications. In bridge damage
detection, CNNs have been mostly used to automatically identify defects through pixel-
based analysis, with a focus on crack detection and damage assessment. For example, Zhang
et al. [15] presented CrackNet, a CNN that achieved a remarkable accuracy score of 88.86%
on a 3D dataset containing 2,000 images of cracks present on asphalt surfaces. Similarly,
Yang et al. [16] developed a fully convolutional network for crack segmentation, achieving
an outstanding accuracy of 97.96% on a custom dataset.
Further progress was made in crack identification in concrete structures. Qiao et al. [17]
proposed an advanced method using the U-Net CNN, which outperformed standard U-Net
models by 11.7% in terms of average accuracy. Inam et al. [18] successfully used the U-Net
model for crack segmentation, accurately measuring attributes such as width, length, and
area.
Other innovative approaches include the YOLO algorithm, as proposed by Yu et al. [19],
to identify cracks in images. After training and testing on a large dataset of manually labeled
crack images, authors used the K-Means method to determine the optimal size of regions of
interest resulted in an average accuracy of 84.37%.
Finally, recent developments in crack detection adopted the integration of a Bottleneck
Transformer into an improved version of the YOLOv5 network, as proposed by Yu and Zou
[20]. This approach has been shown to accurately capture elongated features such as cracks,
achieving a higher accuracy score than the original version of YOLOv5. Similarly, the use of
semantic segmentation algorithms such as DeepLabv3+, as presented by Fu et al. [21], has
shown improved accuracy in crack segmentation, revealing finer details and improving the
overall effectiveness of the system.
A final contribution in the field of using CV for automatic defect identification in RC
bridges was presented by Cardellicchio et al. [22], which used CNNs and different DL
techniques to classify various common defects in bridges, and interpreting the results
through AI explainability techniques, such as Class Activation Maps (CAMs). Although the
initial performances were not promising, new evaluation metrics were proposed, which
proved to be effective in a real case study.
3. CNN-based crack detection framework
The objective of a crack detection problem is to determine if a specific pixel in an image of
an RC element is part of a crack. To solve this problem, a new framework is proposed to
detect cracks using CNNs. The method analyzes small portions of images to determine the
probability that the central pixel of each portion belongs to a crack.
This method represents a first step towards the automated generation of large amounts
of ground truth data that can be used to train pixel-based classifier models. The goal is to
simplify the training process and significantly increase the number of images available to
train such models. Figure 1 reports the flowchart illustrating the proposed framework.
Figure 1: Proposed framework.
3.1. Data preparation
The first step of the framework is to create the dataset with annotated cracks to train the
algorithm. This phase includes three main steps: proper image selection, manual
annotation, and extraction of the ground truth mask for each image (see Figure 2). The three
steps are following described:
1. Image selection: the first step consists of selecting high-resolution images where
cracks are clearly visible. However, including images with occlusions can also be
beneficial, as they represent real-world conditions and enhance the performance of
the ML model. Vegetation, shadow, reflection or elements that look like cracks (like
grout run-off) are some of the occlusions that make the dataset heterogeneous. This
variety improves the generalization and robustness of the proposed CNN.
2. Manual annotation: the second step consists of performing manual annotation of
cracks. To ensure accurate and high-quality labels, reducing biases, and improving
the generalization of the model, images need to be annotated by hand by domain
experts. Using the “Polyline” command of the Computer Vision Annotation Tool
(CVAT) [23], the annotations are then exported to the Dataset Management
Framework (Datumaro) format. In this way, the exported file includes the image
metadata (file name, dimensions) and the annotations that specify the object type
(class) and the array of point coordinates (x,y) of the polylines for a precise
segmentation of the cracks.
3. Ground truth mask extraction: the third step consists of generating the ground truth
image with the annotated cracks. In the pixels where the polyline (crack) is present,
the value 255 is assigned, corresponding to white, while all the other pixels are set
to 0, corresponding to black (absence of cracks). This allows to obtain a black image
with white cracks, providing a clear definition of the classes.
At this point, the dataset for training the CNN is complete and ready to be processed.
Figure 2: Data preparation workflow.
3.2. Dataset Preprocessing
To train the CNN, a preprocessing step is performed. During this phase small patches of
the original image are extracted. This is done by applying a sliding window that runs over
the image, capturing square parts of a fixed size (defined as “𝐹𝑤 × 𝐹ℎ”). Each patch is
automatically labeled as "positive" if the center is associated with a crack, "negative"
otherwise.
It is worth noting that splitting images into patches can lead to an unbalanced dataset
because most of the pixels do not contain cracks. In particular, the number of patches
containing cracks (positive patches) is much smaller than the number of patches without
cracks (negative patches), resulting in an unbalanced dataset. To address this imbalance,
the dataset is balanced by downsampling the negative patches. This involves randomly
selecting a number of negative patches equal to the number of positive patches, resulting in
a more balanced dataset.
Finally, the use of patches allows the application of data augmentation techniques, such
as rotations and translations. This process increases data diversity and makes the model
more robust to variations in the input data.
3.3. CNN model
This study proposes a CNN architecture similar to the one proposed by Cardellicchio et
al. in [24] for plant root segmentation.
The network model proposes a simple but efficient architecture with three stacked CNN
layers, each followed by a max-pooling and ReLU activation operation. In this architecture,
the RGB image is processed through three different convolutional layers, each applying
filters to explore and capture visual patterns in the image. In addition, there is a gradual
decrease in kernel density, which means that the filters used become smaller as one
proceeds through the convolutional layers. After the third convolutional layer, a max-
pooling layer is applied, the purpose of which is to reduce the spatial size of the data while
retaining the most significant features extracted from the previous layers. These features
are then passed to a fully connected layer, where each neuron is connected to all neurons in
the previous layer, facilitating the integration of the extracted information. Finally, the
results obtained are transferred to the decision layer, which is responsible for making the
final decisions, such as recognizing the class of the object in the image (presence of cracks,
in this case).
4. Preliminary Results
The proposed framework aims to predict the presence of cracks on concrete surfaces,
for which a software has been developed in Python [25] using OpenCV [26], NumPy [27],
Scikit-learn [28] and PyTorch [29] libraries. For this purpose, a dataset of photos related to
existing RC bridges was used, with 450 annotated images specifically used for the training
phase. Images of bridges are particularly well-suited for this procedure because, compared
to other RC structures, they have exposed structural surfaces where defects, such as cracks,
are directly visible.
The neural network was subjected only to preliminary tests, in order to qualitatively
evaluate its performance. In particular, the functionality of the method was verified by
visually comparing the original image, which contains the crack, with the automatic
segmentation generated by the model. As shown in Figure 3, the results clearly indicate that
the trained network can accurately follow the path of the crack during the segmentation
process.
Figure 3: Comparison between the original images (a) and the masks containing the
cracks segmented by the trained CNN.
This result is significant because it demonstrates the model ability to identify and
delineate cracks effectively, which is crucial for applications where accurate detection of
structural defects is required. The good visual match between the real crack and the
automatic segmentation suggests that the neural network training algorithm has been
properly configured and that the model has the potential to improve with additional data
and further optimizations. These preliminary tests provide a promising basis for future
development of the network, indicating that this could be the right track to achieve a robust
and reliable system for automatic crack segmentation.
5. Conclusions and further works
This paper proposes a CV-based methodology to automatically detect cracks in existing
RC bridges. Three main steps of the proposed framework were identified:
a) Definition of the dataset of RC bridge surface images with the annotated cracks.
b) Extraction of small patches from images in the training dataset.
c) Implementation of three stacked layers CNN model for automatic identification
of cracks.
Then, a CNN is trained to identify the presence of cracks in the images. Thus, from each
photo provided as input, the proposed framework is able to determine the presence or
absence of cracks. This approach is particularly practical in contexts with few labeled data,
as it allows the generation of numerous patches from a limited number of images, thus being
effective in reliably identifying complex cracks by reducing the computational effort.
The evaluation of the method has been based on preliminary visual comparisons. Once
the development is complete, a rigorous evaluation should be carried out using specific
evaluation metrics and quantitatively comparing this method with other existing
approaches. This should enable quantification of the model's effectiveness and verification
of its capability to accurately and reliably detect cracks.
In conclusion, this work proposes a preliminary promising framework for automatic
crack detection in reinforced concrete bridges, paving the way for automated and intelligent
inspection systems for health assessment of civil infrastructures. This innovative
methodology aims to enhance digital progress and utilize artificial intelligence for advanced
visual inspections, which are key to the development of automated inspection systems for
defect identification. This approach ultimately contributes to the preservation of the
existing bridge structure portfolio.
References
[1] Calvi, Gian Michele, et al. "Once upon a time in Italy: The tale of the Morandi Bridge."
Structural Engineering International 29.2 (2019): 198-217.
[2] Borzi, Barbara, et al. "Seismic vulnerability of the Italian roadway bridge stock."
Earthquake Spectra 31.4 (2015): 2137-2161.
[3] Assaad, Rayan, and Islam H. El-Adaway. "Bridge infrastructure asset management
system: Comparative computational machine learning approach for evaluating and
predicting deck deterioration conditions." Journal of Infrastructure Systems 26.3
(2020): 04020032.
[4] Bień, Jan, and Marek Salamak. "The management of bridge structures–challenges and
possibilities." Archives of Civil Engineering (2022): 5-35.
[5] Miluccio, Giacomo, et al. "Traffic-load fragility models for prestressed concrete girder
decks of existing Italian highway bridges." Engineering Structures 249 (2021): 113367.
[6] MIT, C. S. D. L. (2020). Linee Guida per la classificazione e gestione del rischio, la
valutazione della sicurezza ed il monitoraggio dei ponti esistenti. Technical report,
Consiglio Superiore dei Lavori Pubblici.
[7] Abdallah, Abdelrahman M., Rebecca A. Atadero, and Mehmet E. Ozbek. "A
comprehensive uncertainty-based framework for inspection planning of highway
bridges." Infrastructures 6.2 (2021): 27.
[8] Nettis, Andrea, Mirko Saponaro, and Massimo Nanna. "RPAS-based framework for
simplified seismic risk assessment of Italian RC-bridges." Buildings 10.9 (2020): 150.
[9] Soga, Kenʼichi, Ivan Vaníček, and A. Gens. Micro-Measurement and Monitoring System
for Ageing Underground Infrastructures. Czech Technical University in Prague, 2011.
[10] Golewski, Grzegorz Ludwik. "The phenomenon of cracking in cement concretes and
reinforced concrete structures: the mechanism of cracks formation, causes of their
initiation, types and places of occurrence, and methods of detection—a review."
Buildings 13.3 (2023): 765.
[11] Sun, H., Burton, H. V. and Huang, H. “Machine learning applications for building
structural design and performance assessment: State-of-the-art review.” Journal of
Building Engineering 33 (2021): 101816.
[12] Xie, Y, et al. “The promise of implementing machine learning in earthquake
engineering: A state-of-the-art review.” Earthquake Spectra, 36(4) (2020): 1769-1801.
[13] Flah, M, et al. “Machine learning algorithms in civil structural health monitoring: A
systematic review.” Archives of computational methods in engineering, 28(4) (2021):
2621-2643.
[14] Ruggieri, Sergio, et al. "Machine-learning based vulnerability analysis of existing
buildings." Automation in Construction 132 (2021): 103936.
[15] Zhang, Allen, et al. "Automated pixel-level pavement crack detection on 3D asphalt
surfaces using a deep-learning network." Computer-Aided Civil and Infrastructure
Engineering 32.10 (2017): 805-819.
[16] Yang, Xincong, et al. "Automatic pixel-level crack detection and measurement using
fully convolutional network." Computer-Aided Civil and Infrastructure Engineering
33.12 (2018): 1090-1109.
[17] Qiao, Wenting, et al. "A crack identification method for concrete structures using
improved U-Net convolutional neural networks." Mathematical Problems in
Engineering 2021 (2021): 1-16.
[18] Inam, Hina, et al. "Smart and automated infrastructure management: A deep learning
approach for crack detection in bridge images." Sustainability 15.3 (2023): 1866.
[19] Yu, Zhen. "YOLO V5s-based deep learning approach for concrete cracks detection." SHS
Web of Conferences. Vol. 144. EDP Sciences, 2022.
[20] Yu, Gui, and Xinglin Zhou. "An improved YOLOv5 crack detection method combined
with a bottleneck transformer." Mathematics 11.10 (2023): 2377.
[21] Fu, Huixuan, et al. "Bridge crack semantic segmentation based on improved
Deeplabv3+." Journal of Marine Science and Engineering 9.6 (2021): 671.
[22] Cardellicchio, Angelo, et al. "Physical interpretation of machine learning-based
recognition of defects for the risk management of existing bridge heritage."
Engineering Failure Analysis 149 (2023): 107237.
[23] CVAT. "CVAT: Computer Vision Annotation Tool." GitHub, 2023,
https://github.com/cvat-ai/cvat.
[24] Cardellicchio, A., et al. "Patch-based probabilistic identification of plant roots using
convolutional neural networks." Pattern Recognition Letters (2024).
[25] Python Software Foundation. Python Language Reference, version 3.8. Available at
http://www.python.org.
[26] Bradski, Gary. "The opencv library." Dr. Dobb's Journal: Software Tools for the
Professional Programmer 25.11 (2000): 120-123.
[27] Harris, Charles R., et al. "Array programming with NumPy." Nature 585.7825 (2020):
357-362.
[28] Pedregosa, Fabian, et al. "Scikit-learn: Machine learning in Python." the Journal of
machine Learning research 12 (2011): 2825-2830.
[29] Paszke, Adam, et al. "Pytorch: An imperative style, high-performance deep learning
library." Advances in neural information processing systems 32 (2019).