=Paper=
{{Paper
|id=Vol-2845/Paper_4
|storemode=property
|title=Identification of Rolling Stock of Railways Based on Multi-Projection Image Processing Methods
|pdfUrl=https://ceur-ws.org/Vol-2845/Paper_4.pdf
|volume=Vol-2845
|authors=Stepan Bilan
|dblpUrl=https://dblp.org/rec/conf/iti2/Bilan20
}}
==Identification of Rolling Stock of Railways Based on Multi-Projection Image Processing Methods==
<pdf width="1500px">https://ceur-ws.org/Vol-2845/Paper_4.pdf</pdf>
<pre>
Identification of Rolling Stock of Railways Based on Multi-
Projection Image Processing Methods
Stepan Bilana,b
a.   State University of Infrastructure and Technology, Kyrylivska, 9, Kyiv, 04071, Ukraine
b.   Taras Shevchenko National University of Kyiv, Volodymyrska Street, 60, Kyiv, 01033, Ukraine

                 Abstract
                 The paper considers the method for identifying railway rolling stock based on the parallel
                 shift technology and Radon transformation using cellular automata. The main stages of the
                 method are considered, which consist of: fixing a moving object, extracting an identifier
                 image, removing noise, extracting characteristic features and comparing with a standard. To
                 fix the rolling stock, the selection of pixels is used, which have changed their properties on
                 adjacent frames, taking into account the sensitivity and speed of movement. The selected
                 pixels are used to select a rectangular window containing a license plate. To remove noise in
                 the selected image, the parallel shift technology and the Radon transform, implemented on
                 cellular automata with a hexagonal covering, are used. The method is implemented for
                 images subject to various distortions that affect the identification result.

                 Keywords 1
                 Image, identification, moving object, cellular automata, parallel shift technology, Radon
                 transform

1. Introduction

   In modern intelligent systems, which are aimed at automating various processes in the transport
industry, there is a problem of identifying objects, both moving and stationary. Identification is a
partial case of object recognition. The identification process is based on the decision to classify the
image (image of the object) based on a comparison of its characteristic features (CF) with pre-known
CF reference images.
   Identification is based on obtaining a model of an object or system based on the results obtained in
their experimental study. In modern conditions, it is possible to build a number of models for any
physical object. In this regard, an important task is to choose the optimal model that gives the best
result in identifying the object. The model is also selected according to pre-established criteria.
   For effective identification of railway cars, a special numbering is used in the form of eight-digit
numbers, which are painted on the sides of the cars. Images of such license plates are read by special
optoelectronic sensors and then electronically are transmitted to a computer system for image
processing and recognition. In this case, the distances from the video sensors to the numbering license
plate can be located both at a fixed and at an arbitrary distance. The problem of automatic
identification is relevant, as it allows automating the structure of the tracking system for moving
objects of railway transport.
   This paper solves the problem of automatic identification of rolling stock in real time based on the
use of new technologies based on the parallel shift technology, cellular automata and Radon
transformation [1,2], which allow high-precision identification of rolling stock on railway transport.
Also, the tasks of effective image preparation are solved by removing noise and searching for
distortions of the resulting images.


IT&I-2020 Information Technology and Interactions, December 02–03, 2020, KNU Taras Shevchenko, Kyiv, Ukraine
EMAIL:bstepan@ukr.net
ORCID: 0000-0002-2978-5556
            ©️ 2020 Copyright for this paper by its authors.
            Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
            CEUR Workshop Proceedings (CEUR-WS.org)


                                                                                                               33
2. Existing methods and means of solving the identification problem
    To identify moving objects of railway transport, a large number of methods and means are used,
which are implemented on RFID technologies [3-5] and technologies based on transformation an
optical signal into an electrical [6-16].
    Among modern systems for identification of rolling stock, the following can be distinguished:
        «КАУ-В» (Ukraine);
        «ARSCIS» (Russia);
        «SL-Traffic» (Russia);
        «CarFlow»( Russia);
        SecurOS Transit (Ukraine)
    To implement RFID technologies, the following are used: radio frequency tags, special readers,
inductive sensors and software. RFID - technology allows realizing high physical reliability and
contactless reading of carriage numbers at a distance of up to several meters at high speed and
depends little on the state of the environment. However, RFID technology has a number of
disadvantages, which are the effect of metal and conductive surfaces, the possibility of simultaneous
superposition of several transponders, the influence of electromagnetic fields, high cost, and also the
effect on human health.
    The use of the optoelectronic method and means of control is a promising way to solve the
problem of informatization and control of transportation in railway transport. Optical - electronic
method has a number of advantage:
        relatively low cost;
        ease of operation, maintenance and upgrading;
        flexibility of the system due to the ability to adapt algorithms and software for various objects
    of implementation.
    The principle of optical - electronic transformation of optical pictures is used in almost all
countries with a developed railway infrastructure and on its basis a lot of software and hardware
systems have already been created that have high performance. At the same time, in all existing
commercial projects, algorithms and software for processing images of numbers are practically not
disclosed. They are used mainly for accounting of goods, but they are little used on railway sections
in real conditions.

3. Identification methodology

    As a rule, railway cars have identifiers in the form of a decimal number printed on the side surface.
In this case, the images of identifiers are located at certain levels and occupy a certain area in
advance. In the process of movement and the influence of weather conditions, the images of numbers
on the side surface of the carriage are distorted.
    The implementation of methods for identifying license plate images involves preliminary
preparation of initial images to form a vector of characteristic features. The general structure of image
identification is considered in [1, 17], in which the system consists of a memory block, a block of
standards and a block of preliminary preparation. The system can operate in the following modes:
        learning mode;
        Identification mode;
        Mode of combination of learning and identification.
    The third mode assumes that learning is carried out in the identification mode if the input image
cannot match one of the standards in the memory of the standards. In this case, the unidentified image
is assigned an identifier by the operator if the number plate image is previously known. In this mode,
the generated vector of characteristic features is entered into the memory of standards along with the
identifier.
    An algorithm consisting of four steps is used to identify moving objects of railway transport:
    1. Selection of moving objects on the video image;
    2. Selection of a field that outlines the number on a moving object in the input image;

                                                                                                       34
    3. Segmentation of numeric identifier characters;
    4. Recognition of selected characters of the image of the identifier.
    Selection of moving objects is carried out using several algorithms:
         Algorithm for overlaying several adjacent frames of the video stream;
         Algorithm for determining the difference between the states of pixels of two adjacent video
    frames.
    This work uses an algorithm for determining the difference between the states of two pixels [3].
The algorithm is a pixel-by-pixel comparison of two consecutive frames of the video stream. Pixels
that have changed their color code to a certain sensitivity threshold are considered to be those in
which motion is recorded. All others are recognized as belonging to the background area. This
method is the easiest to implement and is suitable for the case of a still video camera. A fixed video
camera is used to implement the method.
    Also, the video camera is positioned so that the moving object is fixed against a solid background
to facilitate image processing. To solve this problem, the video sensor is installed perpendicular to the
movement of railway cars. In addition, the camcorder is located at short distances from the subject,
which improves the quality of the resulting images. If it is not possible to realize a solid background,
then rather complex methods of selecting moving objects are used, which take into account the
presence of other small moving objects. The selection of the image of the identifier sign is
implemented by selecting a rectangular area covering the numeric identifier. Algorithms are applied
that implement the selection of a small area covering the image of the identifier.
    An example of selecting a moving object on Fig. 1 is shown.


Figure 1: An example of selecting a moving object (ball) based on the algorithm for determining the
difference between the states of two pixels

   This shows the entire moving object (ball), which is completely allocated, since the camera is
located far from it. This shooting video uses uniform light background, which gives a clear image,
and efficient allocation of the moving object. However, the background can be non-uniform, which
leads to false selection of objects, since there can also be movement in the background of a moving
object, which is fixed by the selection system. To eliminate such situations, a simple approach is used,
which consists in the near position of the video camera. If the camera is placed closer to the object,
then the background is actually the moving object itself, and the movement is visible only for the
applied images of numerical signs (Fig. 2).
   Here, a smaller area of the image is allocated and, accordingly, the algorithms for clearly
distinguishing the numbering plate image are simplified. However, different surfaces are used for
each moving object, which may not always give the desired result. At different sensitivity thresholds
and at different grayscale images used in calculations, different selection results are obtained (Fig. 3).
For the video sequence shown in Figure 2, 60% sensitivity and 40 grayscale were used.

                                                                                                       35
Figure 2: An example of a highlighted moving number plate against the background of a carriage


Figure 3: Examples of dedicated driving numbering plate for different sensitivities and grayscale

   In Fig. 3, the first number represents the sensitivity, and the second indicates the number of shades
(gradations) of gray used in the calculations. As can be seen from Fig. 3, the quality and size of the
selected image area is affected by the sensitivity. For a given image, the smallest area is defined as a
sensitivity of 70% - 80%.
   As mentioned earlier, the size of the selected window with the identifier image is affected by the
number of selected pixels that the rectangular selection area covers. For the example shown in Figure
3 shows the dependence of the selected pixels on the sensitivity.


                                                                                                     36
 Figure 4: Graphical dependence of the change in the number of selected pixels on the used
sensitivity

    The graph (Fig. 4) is presented for five video frames of the video sequence. Highest sensitivity
produces the least number of pixels.
    The next step is to generate a binary image. To implement this step, an approach is used, which is
as follows.
    The image field is divided into equal areas. This does not necessarily use the entire selected area of
the image. Pixel codes are analyzed in each selected field and the average value of each selected
image field is determined. By analyzing all the selected fields, the range of pixel codes that belong to
the background is determined. Pixels whose codes are included in the selected range take the same
code, which defines only one color. For example, white or black can be chosen, which is the opposite
of the pixel codes belonging to the numbering plate. If the background is black, then the image of the
numbering plate is formed in white and vice versa.
    An example of binarization of the image of the selected area based on the average values areas on
Figure 5 is shown.


Figure 5: An example of binarization of the selected area image based on the average values areas

    The area of the numbering plate is also determined by the frequency of the vertical lines of equal
height that describe each character. An area of the image with such frequent changes in color and
intensity is selected. If such a frequency of occurrence of vertical lines is high, then a rectangular area
is selected in which such vertical lines are present. Also, such an area can be determined by finding
the average value of the number codes while moving the scanning window of a given size. An area is
determined in which the average value of the color codes of pixels belonging to this window differs
by a large amount from the rest of the areas. The average value of such an area approaches the value
of the color code of the numbering plate characters.
    The resulting image has many distortions that are associated with the connection of individual
symbols of the numbering plate image. There are also gaps in the symbols themselves. This situation
can lead to false identification. However, the geometric shapes of the symbols have significant
differences that allow them to be distinguished with large distortions.
    For segmentation of numbering plate characters, a method is used, which consists in finding
vertical spaces in the selected area of the entire license plate. This is done using parallel shift
technology (PST) [1]. With the help of PST, the copy of the image of the selected area of the

                                                                                                        37
numbering plate is shifted to the right and the function of areas of intersection (FAI) is determined.
The FAI form determines the locations of the smallest number of pixels along the vertical and the
time steps of the shift, at which there are sharp jumps down, after the upper jumps. An example of an
FAI numbering plate on Fig. 6 is shown. Each FAI downward change indicates a gap between the
symbols. Moreover, if there is a merging of characters, then this will not affect the shape of the curve.
Horse racing will still be present on the FAI curve.
   The example (Fig. 6) shows that in images with distortions they give a sufficiently clear shape of
the curve, which can be used to determine the locations of the symbols.


Figure 6: Example of the numbering plate FAI

   Noise removal on the selected numbering plate symbols is carried out after applying the Radon
transform, which is implemented on cellular automata (CA) with a hexagonal coating [2]. With the
help of a CA with a hexagonal coating, six Radon projections are formed. The shape of these
projections carries out the search for extra pixels present in the image and located at the edges of the
images of the symbols or the entire numbering plate. An example of such projections is shown in Fig.
7.
   The CA is used to search for “salt and pepper” noises. According to the analysis of the
neighborhood, such pixels are removed. The CA selects the neighborhood for each cell and analyzes
each cell simultaneously at each time step.
   In Fig. 7 clearly displays noise on projections 00, 900 and 1200. They are present near the main
array of pixels and are removed.
   For highly distorted images, the Radon transform also gives projections that allow you to
determine the shape of the symbols. For the example shown in Fig. 5 the following symbols are
highlighted (Fig. 8). Each symbol is segmented according to FAI analysis by shifting a copy of this
image to the right.
   The image of each symbol is analyzed using the Radon transform. The obtained projections are
compared with the projections of images of ideal symbols recorded in the base of standards. These
symbols and their Radon projections are represented as a sequence of numbers in the base of

                                                                                                      38
standards. If the obtained projection codes correspond to the forms and projection codes of images of
ideal symbols and at the same time differ from other similar symbols in the form of symbols, then this
symbol is identified by the closest reference symbol and, at the user's discretion, can be written as an
additional standard of this symbol.


Figure 7: FAI example of one number plate character


Figure 8: Images of individual symbols of the numbering plate shown in Figure 5

   For example, the sixth symbol of the numbering plate image is considered, which is a distorted
image of the seven symbol. This image is close in shape to the image of the digit one symbol. The
Radon transform was applied to the image of the ideal seven and the ideal digit one, and also Radon
projections were generated for the image of the sixth symbol (Fig. 9). All Radon projections were
obtained with a binarization threshold of 50%. Other binarization thresholds can also be used. At the
same time, the projection forms practically do not change.
   Analysis of the obtained projections shows a significant similarity of the projections of the images
of the ideal and distorted sevens. A sharp difference in projection images for images of sevens and
digit one symbols is also clearly defined.

                                                                                                     39
    To obtain such projections, a cellular automaton with a hexagonal form of coverage was used,
which made it possible to obtain six Radon projections instead of the classical three Radon
projections. This circumstance makes it possible to significantly improve the accuracy of the analysis
of the symbol image.


Figure 9: Radon projections for the symbol of seven and for the symbol digit one images shown in
Fig. 8

   For all symbols of the identifier image (Fig. 8), the Radon projections on Fig. 10 are shown.
   Gaps in 00 and 1500 projections indicate horizontal gaps in the numbering plate image. These gaps
divide the projections into two large areas. This indicates that the gap is in the middle of the symbols.
Based on the obtained projections, discontinuities can be determined, if individual parts of the
projections are very small, then it can be argued that there is noise, as well as other symbol analyzes
can be carried out. At the same time, a small number of symbols are used and identification
algorithms are simplified thanks to a small database of standards.


                                                                                                      40
   To recognize the selected symbol, you can also apply the method of analyzing the extreme pixels
on the four sides of the symbol, as well as analyzing all six Radon projections and the FAI shape is
carried out. The result of this analysis is a decision on the shape of the symbol in the numbering plate.


Figure 10: Images of individual symbols of the numbering plate shown in Figure 8

   Both the Radon projections and the FAI set are described by the corresponding numeric arrays.
Each Radon projection can be represented as a numerical sequence, which is efficiently processed and
stored in the memory of standards. The area intersection function is also represented as an array of
numbers that describe a geometric figure in time, and each array of numbers has its own sequence,
which is also easily processed and stored in the electronic memory of standards.

4. Conclusion
    The paper presents a system for the identification of moving objects of railway transport, which
allows high-precision identification of moving objects. The use of parallel shift and Radon
transformation technology allows automatic identification of the driving train in real time. The
proposed method for identifying the image of an identifier and the experimental studies carried out
made it possible to formulate the main requirements and stages of effective preprocessing of
numbering plate images. Using the parallel shift technology, it is possible to select each symbol in the
numerical sequence of the identifier using simple methods. Radon transforms and PST allow you to
remove or account for the presence of various interferences, such as salt and pepper, breaks, and
intersymbol connections.
    As a result of experimental studies, it was found that the identification accuracy on average
corresponds to 99.3% for a rigidly fixed camera at a carriage speed of 70 km / h, as well as when the
angle of deflection of the video camera is 150. For the case of using a conventional video camera from
a mobile phone, the accuracy reaches 96.7 % at a deflection angle of 15 0. In addition, the system does


                                                                                                      41
not use a large number of standards, and also does not require large expenditures of time spent on
training and setting up the system (10 or more reference values are enough to display each symbol).
The software is implemented in such a way that the system can process 17 images per second. The
system is cost effective as it does not require the use of high definition video cameras.
    In further works, the author plans to conduct research work in the direction of identifying moving
objects belonging to other areas of human activity with complex images of identifiers.

5. References

[1] S. Yuzhakov, S. Bilan. Identification System for Moving Objects Based on Parallel Shift
     Technology. Handbook of Research on Intelligent Data Processing and Information Security
     Systems. Edited by Bilan, S. M., & Al-Zoubi, S. I. Hershey, USA: IGI Global(2019): 374 – 387
[2] R. L. Motornyuk, S. Bilan. The Moving Object Detection and Research Effects of Noise on
     Images Based on Cellular Automata With a Hexagonal Coating Form and Radon Transform.
     Handbook of Research on Intelligent Data Processing and Information Security Systems.
     Handbook of Research on Intelligent Data Processing and Information Security Systems. Edited
     by Bilan, S. M., & Al-Zoubi, S. I. Hershey, USA: IGI Global (2019): 330 – 359.
[3] Manish Bhuptani, Shahram Moradpur. RFID for your business = RFID Field Guide: Deploying
     Radio Frequency Identification Systems, Troitsky N . Moscow: "Alpina Publisher", (2007): 70-
     290
[4] M. Klems. RFID: Transport und Logistik an der Schwelle eines neuen Zeitalters, (German
     Edition), GBI-Genios Verlag (2005)
[5] Judith Symond, John Ayoade and David Parry. Auto-Identification and Ubiquitous Computing
     Applications. IGI-global (2009): 350.
[6] Vasin N.N., Baranov A.M. Video signal processing for identifying objects at a railway crossing.
     Computer Optics. (2005), Issue 28: 152-155
[7] Hiroaki Niitsiima, Tsutomu Maruyama Real-Time Detection of Moving Objects. FPL (2004),
     LNCS 3203: 1155-1157
[8] Hiroaki Niitsuma, Tsutomu Maruyama Real-Time Generation Of Three-Dimensional Motion
     Fields. FPL (2005): 179-184
[9] Ashit Talukder, Larry Maithies Real-time Detection of Moving Objects from Moving Vehicles
     using Dense Stereo and Optical Flow, Intelligent Robots and Systems, (2004). vol.4, (IROS
     2004): 3718 - 3725
[10] Sedat Doğan, Mahir Serhan, Temiz Sıtkı Külür Real Time Speed Estimation of Moving Vehicles
     from Side View Images from an Uncalibrated Video Camera. Sensors 2010, 10: 4805-4824
[11] Chunrong Yuan, Hanspeter A. Mallot Real-Time Detection of Moving Obstacles from Mobile
     Platforms, ICRA10 Workshop on Robotics and Intelligent Transportation System, (2010): 109-
     113
[12] D. Liya and L. Jilin. Intelligent freight train ID recognition system. In IEEE International
     Conference on Intelligent Transportation Systems, (2002): 417–422.
[13] W. Zhang, G. Zhou, and M. Jiang. “Convolutional neural network for freight train information
     recognition”. In International      Conference       on      Machine        Learning      and
     Computing (ICMLC), (2017-02): 167–171
[14] Z. Liu, Z. Wang, and Y. Xing. “Wagon number recognition based on the YOLOv3
     detector”. In IEEE International Conference on Computer and Communication Engineering
     Technology, (2019): 159–163
[15] Rayson Laroca, Alessander Cidra Boslooper, and David Menotti. “Automatic Counting and
     Identification of Train Wagons Based on Computer Vision and Deep Learning”
     arXiv:2010.16307v1 [cs.CV] 30 Oct 2020. URL: https://arxiv.org/pdf/2010.16307.pdf
[16] X. Zou, Y. Fu, and X. Li, “Image feature recognition of railway truck based on machine
     learning,” in IEEE Information Technology, Networking, Electronic and Automation Control
     Conference, Mar 2019: 1549–1555.
[17] Stepan Bilan. Models and hardware implementation of methods of Pre-processing Images based
     on the Cellular Automata, Advances in Image and Video Processing, Vol 2, No 5 (2014): 76-90

                                                                                                   42

</pre>