<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Weighted Rule-Based Model for File Forgery Detection: UA.PT Bioinformatics at ImageCLEF 2019</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>DETI / IEETA</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>University of Aveiro</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aveiro</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Portugal</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>joao.rafael.almeida</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>pedrofreire</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>olga.oliveira</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>jlog@ua.pt</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information and Communications Technologies, University of A Corun~a</institution>
          ,
          <addr-line>A Corun~a</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1957</year>
      </pub-date>
      <abstract>
        <p>With today's digital technology, disparate kinds of data can be easily manipulated. The forgery commonly hides information, by altering les' extensions, les' signatures, or by using steganography. Consequently, digital forensic examiners are faced with new problems in the detection of these forged les. The lack of automatised approaches to discover these infractions encourages researchers to explore new computational solutions that can help its identi cation. This paper describes the methodologies used in the ImageCLEFsecurity 2019 challenge, which were mainly rule-based models. The rules and all of their underlying mechanisms created for each task are described. For the third task, was used a random forest algorithm due to the poor performance of these rules.</p>
      </abstract>
      <kwd-group>
        <kwd>ImageCLEF</kwd>
        <kwd>File Forgery Detection</kwd>
        <kwd>Rule-based models</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The ImageCLEF [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] initiative launched a new security challenge, called
ImageCLEFsecurity, addressing the problem of automatically identifying forged les
and stego images [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This challenge is divided into three sub-tasks: 1) forged le
discovery, 2) stego image discovery and 3) secret message discovery. The forged
les discovery sub-task is the rst task of the challenge and it is independent
from the remaining two tasks. The goal of this task is to automatically detect
les whose extension and signature has been altered; more speci cally, to
identify the les with extension PDF that are, actually, image les (with extension
JPG, PNG, and GIF). The objective of the second sub-task is to identify the
images that hide steganographic content and the goal of the third task is to
retrieve these hidden messages.
      </p>
      <p>In this paper, we present the several approaches that we used to address this
challenge. The main solution is based on an orchestration of specialised
rulebased models. For each model, a set of rules was de ned with the purpose of
identifying a speci c le or message. Additionally, when there are insu cient
rules to provide a good result, other complementary strategies have been
combined, namely a random forest classi er.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Materials and Methods</title>
      <p>For each task, a training set and a test set were provided. The training set of the
rst task is composed of 2400 les, 1200 of which are PDF les. The remaining
les, despite having PDF extension, belong to one of three classes: JPEG, PNG,
GIF, each with 400 les. In the second and third tasks, the training sets include
1000 JPEG images, 500 of which are stego images and the others are clean
images. In the case of the third task, the stego images contain ve di erent text
messages. Regarding the test sets, the rst task is comprised of 1000 les and
the second and third tasks are composed of 500 images.</p>
      <p>In this section, we present the ve methods that were used to solve each task
of the challenge.
2.1</p>
      <sec id="sec-2-1">
        <title>Rule-Based Approach</title>
        <p>
          A typical rule-based system is constructed through a set of if-then rules [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]
which help identify conditions and patterns in the problem domain. However, the
use of simple conditions may not be enough to obtain the best results. Sometimes,
to accomplish a more accurate outcome, those rules need to be balanced, with
weights. The subject of rule weights in fuzzy rule-based models for classi cations
is not new, and its positive e ect has already been proven [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ].
        </p>
        <p>We propose a rule-based weighted system with a set of models ( gure 1),
which are specialised in classifying a speci c entry. Each model generates a
con dence score regarding the match of the received input with its conditions.
The orchestrator collects all the results and chooses the model that gives the
best score. When more than one models give similar good con dence scores for
di erent classes, the weights of the rules are readjusted and a new classi cation
cycle is performed to help separating the classes' scores. These readjustments
will, hopefully, allow the right output to stand out.</p>
        <p>The rules and the weight of the rules are speci c to each problem and
scenario. Therefore, we used this approach as our base method for all the tasks.
The rules and the methods to classify the rules are speci ed in section 3.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Image Distortion Pattern Matching</title>
        <p>
          Steganographic techniques permit hiding, within an image, information that
should be perceptually and statistically undetectable [
          <xref ref-type="bibr" rid="ref11 ref2">2,11</xref>
          ]. However, some of
these techniques, may not respect these two principles entirely, namely tools like
        </p>
        <p>
          Jsteg, Outguess, F5, JPHide, and Steghide. These tools use the least signi cant
bit (LSB) insertion technique and distort the delity of the cover image by
choosing the quantized DCT coe cients as their concealment locations [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>Our approach aims to identify aws of the used method by searching for a
common pattern among all the stego images.</p>
        <p>While scanning the training set of the second task for common patterns, it
was possible to identify that several stego images had a distortion pattern of 8x8
pixels size, that could easily be identi ed with the naked eye, as described in
gure 2 and gure 3.</p>
        <p>Taking gure 3 as reference, the identi ed pattern could be described by the
following relation between each pixel, where P(x, y) represents the mean value
of R, G, and B at position (x,y):</p>
        <p>P (x3; y3) = P (x3; y4) = P (x4; y3) = P (x4; y4)
P (x2; y3) = P (x2; y4) = P (x5; y3) = P (x5; y4)
P (x2; y2) = P (x2; y5) = P (x5; y2) = P (x5; y5)
P (x3; y3) &gt; P (x3; y2)
P (x3; y2) &gt; P (x2; y2)</p>
        <p>= P (x3; y2) = P (x4; y2) = P (x3; y5) = P (x4; y5)</p>
        <p>We created a function to scan an image for this pattern and to count the
number of occurrences. The function determines that a certain image had a
message if the number of patterns found is greater than the speci ed threshold.
Its output was used as a parameter into our weighted rule-base model.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Image Metadata Pattern Searcher</title>
        <p>Another approach used to assist the creation of rules for our rule-based model
was a pre-analysis of the le's metadata. This analysis aimed to discover patterns
that could be used as rules in the model. For instance, in the JPEG images of
the training set of the second task, a set of bits were detected which identi ed
the images with a hidden message.</p>
        <p>The pattern search was mainly done with the metadata, ignoring the image
bitstream. The rational was that the altered les could be signed in the metadata
to quickly identify which les are of interest. This simple signature would go
unnoticed and it would increase the decoding procedure.
2.4</p>
      </sec>
      <sec id="sec-2-4">
        <title>Random Forests for Rule De nition</title>
        <p>
          Random forest is a supervised learning algorithm developed by Breiman [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], who
was in uenced by the work of Amit and Geman [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. This algorithm constructs
a large number of decision trees, considering a random subset of features for
each split, and makes a prediction by combining the predictions of the di erent
decision trees. Caruana and Niculescu-Mizil [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] performed an empirical
comparison of supervised learning and concluded that random forest was one of the
algorithms that gave the best average performance.
        </p>
        <p>
          The random forest algorithm has several positive characteristics [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] for this
challenge, namely it can be used for high-dimensional problems and it gives an
estimation of the importance of variables. Moreover, it just needs two
parameters: the number of trees in the forest (ntree) and the number of features in the
random subset used in each split (mtry ).
        </p>
        <p>We used the random forest algorithm in order to help solve the third task
and our implementation is described in section 3.
2.5
In large data sets, manual classi cation is unrealistic. However, since the training
and test set are small, we decide to try a manual validation. This approach
consists mainly in a veri cation of the rule-based output, followed by manual
adjustments considered relevant. This method was, essentially, used in the second
task.</p>
        <p>When analysing the training set, the 8x8 pixel distortion pattern described
in 2.2 was identi ed. Using the rule-based model in the early stages, made it
possible to de ne rules to reach a precision of 1. However, the recall was low.
Therefore, we isolated the images not detected as forged and tried to identify
these distortions in the image manually. This procedure increased our recall
signi cantly.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>The described methods were combined and led to several submissions in the
di erent tasks. The performance of the submissions was evaluated using the
traditional metrics: precision, recall, and F1, in the rst two tasks and edit
distance in the third task.</p>
      <p>In the rst task, the precision was de ned as the quotient between the number
of altered images correctly detected and the total number of les identi ed as
changed. In its turn, the recall was the quotient between the number of altered
images correctly detected and the total number of les modi ed.</p>
      <p>For the second task, the de nition of precision and recall was similar. The
precision was the quotient between the number of images with hidden messages
correctly detected and the total number of images with hidden messages
identi ed. The recall was the quotient between the number of images with hidden
messages correctly detected and the total number of images with hidden
messages.</p>
      <p>Finally, the third task used the edit distance to measure the e ciency into
recovering the message. This distance is the minimum-weight count of edit
operations that transforms a string in another one.
3.1</p>
      <sec id="sec-3-1">
        <title>Task 1: Identify Forged Images</title>
        <p>
          Detecting the type of le is a process that can be done using three di erent le
characteristics: the le's extension, the magic number, and the le's content [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
The most straightforward technique to hide a le is changing the le's extension
and the magic number, which is a set of standard bytes that signs the le. With
this technique, the operation system is unable to open the le. Therefore, four
models are built, each one specialised in identifying a le type (PDF, PNG, JPG
or GIF). Each model produces a score re ecting the con dence that the analysed
le is of the given type. These scores are sent to the orchestrator, who classi es
the type of le based on the scores received.
        </p>
        <p>
          The initial approach considered standard ags in the le structure, such as
the last bytes or the number of occurrences of a set of bytes. For instance, a
JPEG le has the hexadecimal 0xFFDA at least once in its structure because
this is the ag that indicates where the image binary starts. Table 1 presents
the ags for the end of le for each le type. For this rst task, we used the rule
of identifying the end of le ag and obtained an F1 measure of 1.0.
In JPEG images there are two di erent stages of compression: lossy and lossless.
Embedding steganographic content inside images that uses lossy compression
increases the possibility of that content to be partly lost, which means that, it is
not feasible to hide a message on a lossy stage. Therefore, in the case of JPEG
les, the steganography should take place on the lossless stage. The DCT and
the quantization phase form part of the lossy stage, while the Hu man encoding
used to further compress the data is lossless. [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>After scanning the training set of the second task for common parts that
distinguishes stego images binary data from non-stego images, it was possible
to observe some patterns and specify the weights of the rules that identi es
stego images. The patterns and their weights are described on table 2. We could
observe that all the patterns appeared in the Hu man table sequences, i.e. after
the marker DHT. The sequences in a JPEG le are identi ed by a two-byte code
described in table 3.</p>
        <p>DHT
DHT
DHT</p>
        <p>This task was also solved using the rule based-model where the images with
a score equal to or greater than 0.70 were considered as altered. However, in
this case, strategies and di erent ways to extract information from the images
were combined. Initially, a metadata pattern searcher to compare the metadata
elds' content was developed. From this analysis, in all the images with a
hidden message, the set of bits represented in the rules displayed in table 2 were
found. However, these rules produced several false positives, achieving, using the
training set, a recall of 1.0 and a precision of 0.75.</p>
        <p>Due to the lack of precision, we attempt to identify the distortion pattern
in the images, the method described in section 2.2. Without the rules used in
the rst approach and using a threshold of at least one pixel-block with the
distortion, this method produced a precision of 0.53 and a recall of 0.60 in the
training set. It was also the best score obtained from all runs using this approach
isolated.</p>
        <p>Then, to increase the precision, the decision was made to combine the rules
of the rst approach with this analyser. The image analyser method was only
used when an image was classi ed through the rst approach as having a hidden
message. This decreased the recall to 0.604 and the precision remained in the
0.75, the best precision result so far. The decrease of the recall and the bad
results in isolated scenarios led to the abandonment of this approach and to
focusing only on ways to increase the quality of the rules.</p>
        <p>At this stage, a submission was made, obtaining an F1 measure of 0.933 and
a precision of 0.874.</p>
        <p>In the next attempts some manual recti cations were made in the output
retrieved from the rule-based system, by observing the images classi ed as having
a message. Some submission were made following only the rule-based approach
mixed with the manual validation, and the best result was 0.98, both for F1 and
precision.</p>
        <p>
          As a last attempt, the decision was made to re-run the metadata pattern
searcher to be more precise. Now, it analysed all the metadata as a binary,
ignoring which were the elds or its content. With these changes, the method
found a new pattern, which produced a new rule, represented in table 4, made
it possible to achieve F1 measure of 1 in the training and test sets.
The goal of the third task was to retrieve the hidden text messages from the
stego images. To address this task, initially freely available steganographic tools
were used, speci cally, Hide'N'Send, Image Steganography 3, QuickStego, SSuite
Picsel 4, Steghide 5, and SteganPEG [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. However, none of the steganographic
tools were able to retrieve the hidden text messages.
        </p>
        <p>In our second approach, the RGB matrix was analyzed. First, each colour
component was individually inspected, examining the least and the two least
signi cant bits, in order to detect if they could compose ASCII codes of letters
in the alphabet, more precisely, the ASCII character in the range from 65 to 90,
and from 97 to 122. As these procedures did not provide a pattern for the stored
messages, the decision was made to look to the pixel as a whole, inspecting the
three colour component combined. We, also, tried to use the two least signi cant
bits from the four pixels that are in the centre of each 8x8 pixel block.</p>
        <p>The second approach could not retrieve the hidden messages from the image
les and therefore an attempt to nd a pattern using the DCT matrix was made,
by inspecting the least and the two least signi cant bits from the value in the
rst cell of an 8x8 block. The change of the least or the two least signi cant bit
of these values would create a small change in the block brightness, which would
explain the distortion identi ed in the 8x8 pixel block.
3 https://incoherency.co.uk/image-steganography/
4 https://www.ssuiteo ce.com/software/ssuitepicselsecurity.htm
5 http://steghide.sourceforge.net/</p>
        <p>
          None of the procedures described so far could nd a pattern in the images
of the training set with the same hidden text message. Therefore, the random
forest algorithm in an attempt to nd a pattern in the binary of the image les
was used. The 500 stego images of the training set have, as hidden message,
one of ve messages. Consequently, the next step was to consider a multiclass
classi cation problem which consists in classifying each stego image into one of
the ve messages. Initially, for our rst model, we used as features the frequency,
in percentage, with which each ASCII character appears in the binary of the
image les. For the second model, in addition to the features used in the rst
model, the percentage of 0s and 1s in the binary of the image les were used.
To train the models we used the R package caret [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] and used cross-validation
the chose the optimal value of the parameters ntree and mtry. In what concerns
the performance of the models, this was evaluated using 10-fold cross-validation.
Table 5 presents the parameters used and the accuracy of each model.
        </p>
        <p>From the results, the random forest models were unable to nd a pattern
in the binary of the image les. In the absence of alternatives, the two random
forest models to classify the image les of the test set were used, despite the
fact that this task was not a classi cation problem. The rst step was to use
the rule based-model de ned in the second task to identify the stego images of
the test set and, subsequently, the random forest models to classify the images
identi ed as containing a hidden message. For the submissions, all the images
in the test set should have a string appointed and therefore an empty string to
the images identi ed as having no hidden message was assigned. Using the rst
model, an edit distance of 0.588 was obtained and using the second model an
edit distance of 0.587. Our best edit distance (0.598) was achieved by assigning
the string \name John Fraud " to all images we identi ed as stego images and
an empty string to the other images of the test set. These results re ect the fact
that we could correctly identify the images with no hidden messages.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>This paper presents the methodologies to identify forged les and stenographic
images used in the ImageCLEFsecurity challenge. These methods were developed
speci cally for the tasks of this challenge, which does not invalidate them being
used for other data sets. In the rst task, an F1 measure of 1 was obtained.
This excellent result was accomplished mainly because the changes done to the
les were only the traditional ones, and with simple rules, it was possible to
identify each type. The second task also had a submission with F1 measure of
1. In this case, we could identify a signature in the altered images. On the other
hand, in the third task, the best submission had an edit distance of 0.598, mainly
due to the success of identifying empty strings, i.e., images without a message.
The purposed methodology works if it is possible to de ne the right rules. The
problem in this task was the di culty to nd the stenographic algorithm used.</p>
      <p>This challenge allowed for the identi cation of problems in the developed
approach, and most importantly, ways to improve some of these issues. A future
work originated from this year's participation could be the creation of a rule
generator to fed the rule-based models. The message identi cation task may
be improved by creating a database of strategies used by stenographic
attackers, mixed with machine learning approach that look into neighbourhood pixel
colour.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work was supported by the projects NETDIAMOND
(POCI-01-0145-FEDER016385) and SOCA (CENTRO-01-0145-FEDER-000010), co-funded by Centro
2020 program, Portugal 2020, European Union.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amit</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Geman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Shape quantization and recognition with randomized trees</article-title>
          .
          <source>Neural computation 9(7)</source>
          ,
          <volume>1545</volume>
          {
          <fpage>1588</fpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Attaby</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ahmed</surname>
            ,
            <given-names>M.F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alsammak</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          :
          <article-title>Data hiding inside jpeg images with high resistance to steganalysis using a novel technique: Dct-m3</article-title>
          .
          <source>Ain Shams Engineering Journal</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Breiman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Random forests</article-title>
          .
          <source>Machine learning 45(1)</source>
          ,
          <volume>5</volume>
          {
          <fpage>32</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Caruana</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Niculescu-Mizil</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>An empirical comparison of supervised learning algorithms</article-title>
          .
          <source>In: Proceedings of the 23rd international conference on Machine learning</source>
          . pp.
          <volume>161</volume>
          {
          <fpage>168</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cutler</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cutler</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Random forests</article-title>
          .
          <source>In: Ensemble machine learning</source>
          , pp.
          <volume>157</volume>
          {
          <fpage>175</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gloe</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Forensic analysis of ordered data structures on the example of jpeg les</article-title>
          .
          <source>In: 2012 IEEE International Workshop on Information Forensics and Security (WIFS)</source>
          . pp.
          <volume>139</volume>
          {
          <fpage>144</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ionescu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , Muller, H.,
          <string-name>
            <surname>Peteri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cid</surname>
            ,
            <given-names>Y.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liauchuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovalev</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klimuk</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarasau</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abacha</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hasan</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Datla</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demner-Fushman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dang-Nguyen</surname>
            ,
            <given-names>D.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piras</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riegler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lux</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurrin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pelka</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Friedrich</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Herrera</surname>
            ,
            <given-names>A.G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kavallieratou</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>del Blanco</surname>
            ,
            <given-names>C.R.</given-names>
          </string-name>
          , Rodr guez, C.C.,
          <string-name>
            <surname>Vasillopoulos</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karampidis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chamberlain</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campello</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>ImageCLEF 2019: Multimedia retrieval in medicine, lifelogging, security and nature</article-title>
          . In:
          <article-title>Experimental IR Meets Multilinguality, Multimodality, and Interaction</article-title>
          .
          <source>Proceedings of the 10th International Conference of the CLEF Association (CLEF</source>
          <year>2019</year>
          ),
          <source>LNCS Lecture Notes in Computer Science</source>
          , Springer, Lugano,
          <source>Switzerland (September 9-12</source>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Ishibuchi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nakashima</surname>
          </string-name>
          , T.:
          <article-title>E ect of rule weights in fuzzy rule-based classi cation systems</article-title>
          .
          <source>IEEE Transactions on Fuzzy Systems</source>
          <volume>9</volume>
          (
          <issue>4</issue>
          ),
          <volume>506</volume>
          {
          <fpage>515</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Karampidis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papadourakis</surname>
          </string-name>
          , G.:
          <article-title>File type identi cation for digital forensics</article-title>
          .
          <source>In: International Conference on Advanced Information Systems Engineering</source>
          . pp.
          <volume>266</volume>
          {
          <fpage>274</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Karampidis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vasillopoulos</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuevas Rodrguez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>del Blanco</surname>
            ,
            <given-names>C.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kavallieratou</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
          </string-name>
          , N.:
          <article-title>Overview of the ImageCLEFsecurity 2019 task</article-title>
          .
          <source>In: CLEF2019 Working Notes. CEUR Workshop Proceedings (CEUR-WS.org)</source>
          ,
          <source>ISSN 1613-0073</source>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2380</volume>
          / (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Khalid</surname>
            ,
            <given-names>S.K.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deris</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohamad</surname>
            ,
            <given-names>K.M.:</given-names>
          </string-name>
          <article-title>A steganographic technique for highly compressed jpeg images</article-title>
          .
          <source>In: The Second International Conference on Informatics Engineering &amp; Information Science (ICIEIS2013)</source>
          . pp.
          <volume>107</volume>
          {
          <issue>118</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kuhn</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Building predictive models in r using the caret package</article-title>
          .
          <source>Journal of statistical software 28(5)</source>
          ,
          <volume>1</volume>
          {
          <fpage>26</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Kumari</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khare</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khare</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Jpeg compression steganography &amp; crypography using image-adaptation technique</article-title>
          .
          <source>journal of advances in information technology 1</source>
          (
          <issue>3</issue>
          ),
          <volume>141</volume>
          {
          <fpage>145</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gegov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cocea</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Rule-based systems: a granular computing perspective</article-title>
          .
          <source>Granular Computing</source>
          <volume>1</volume>
          (
          <issue>4</issue>
          ),
          <volume>259</volume>
          {
          <fpage>274</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>V.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Subramanyam</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>P.C.</given-names>
          </string-name>
          :
          <article-title>Steganpeg steganography+ jpeg</article-title>
          .
          <source>In: 2011 International Conference on Ubiquitous Computing and Multimedia Applications</source>
          . pp.
          <volume>42</volume>
          {
          <fpage>48</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>