Experiments in Example-based Image Filter Retrieval

Pedro Torres*, Simon Colton* and Stefan Rüger+
* Department of Computing, Imperial College, London, UK. {ptorres,sgc}@doc.ic.ac.uk
+ Knowledge Media Institute, The Open University, UK. s.rueger@open.ac.uk

Abstract. We present a novel image filter generation method and an image filter retrieval algorithm, and analyse their properties. Given an original image and a filtered version of it, the retrieval algorithm can find, with high probability, which filter from a large pre-defined list was applied to the original, without having to apply all the filters to the original image, which is usually a time-consuming task when the number of filters is large. This is achieved by pre-computing image annotations for a set of filtered images obtained by applying the pre-defined filters to a database of 50 images. Using standard image annotation techniques, we show that filter retrieval can be achieved by taking the images closest to the original from the database and analysing those known images instead. The retrieval algorithm has a set of parameters, and we present results of experiments with these values to maximise the probability of retrieving the correct filter.

1 Introduction

Image filtering [4] is the process whereby the pixel information of a digital image is mathematically manipulated to produce an altered version of the image. Image filtering is a commonplace process in graphic design, and modern software packages such as Adobe Photoshop include hundreds of image filtering techniques, ranging from simple edge detection to the production of artistic images simulating watercolour effects. Along with other graphics packages, Photoshop enables browsing through the range of filters and their parameterisations. We address here the question of searching through a database of image filters to find a match, given an image and a filtered version of the image, or given only the filtered version of the image. Potentially, this may enable a novice user to produce an image effect more easily, because they can supply an exemplar of the kind of effect they wish the image filter to produce and then experiment with the image filters returned from the search, rather than having to browse through sequences of filter applications. Another potential application of the retrieval algorithm described here is medical imaging [1]. Here, the system could be used to retrieve filters which enhance certain characteristics of an image under analysis, so that a particular aspect of the image becomes more prominent.

To determine good methods for searching for filters, we work with 1000 image filters of our own devising, as described in §2. In §3, we describe the retrieval methods we have experimented with, and in §4, we describe our experiments and the results. We conclude that, at least with the 1000 image filters we have been working with, it is possible to retrieve a filter given a single image produced by it, with a high degree of probability. We further discuss some future directions, including the application of machine learning techniques to this problem.

Fig. 1. Example image filter tree consisting of transforms in blue circles: A=Add Colour, C=Convolution, I=Inverse, M=Median and T=Threshold; and compositors in red squares: A=And, F=Fade, M=Min and O=Or. Image inputs are in green diamonds. An example original image and the filtered version are shown.
2 Image Filter Database

Our database consists of 1000 image filters, each of which can be represented as a tree of fundamental (unary) image transforms such as inverse, lookup, threshold, colour addition, median, etc., and (binary) image compositors such as add, and, divide, fade, max, min, multiply, or, subtract, xor, etc., as described in [5] and [7]. An example tree is provided in figure 1. We see that the overall filter uses seven transform steps and six compositor steps, and that the original image is input to the tree seven times. The 1000 filters were produced by randomly generating a tree structure subject to a limit on the number of nodes (usually 30), with each transform and compositor added to the tree with random parameter settings. For example, the median transform takes two parameters, the first of which determines the way in which the median RGB values of each pixel are calculated, and the second of which determines the extent of the neighbourhood taken into account when the median is calculated. Each time an image filter generated in this random way produced an interesting visual effect on one or more original images, it was added to the database.

The time taken to apply a filter is roughly proportional to the size of the input image and the size of the tree. Over the entire database of filters, the average number of nodes in a tree is 13.62, and the average time (measured on a Mac OS X machine running at 2.6 GHz) to apply a filter to an image of dimension 256 by 384 pixels (the size used for the experiments reported here) is 410 milliseconds. Hence it would take a little under 7 minutes to apply the entire set of filters to an image of this size. In practice, images tend to be larger than this, and hence the application of all the filters to an image for search purposes is infeasible.
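To make the tree representation concrete, the following Python sketch shows one way such filter trees could be modelled and randomly generated. It is a minimal illustration under stated assumptions, not the authors' implementation: the invert and fade functions are hypothetical stand-ins for the transforms and compositors listed above, and images are represented naively as nested lists of RGB tuples.

import random

# A filter tree node is one of:
#   ("input",)                       the original image fed into the tree
#   ("transform", f, subtree)        a unary transform applied to a subtree
#   ("compositor", g, left, right)   a binary compositor of two subtrees

def invert(img):
    # Hypothetical unary transform: invert each RGB channel.
    return [[tuple(255 - c for c in px) for px in row] for row in img]

def fade(a, b, alpha=0.5):
    # Hypothetical binary compositor: blend two images pixel-wise.
    return [[tuple(int(alpha * ca + (1 - alpha) * cb)
                   for ca, cb in zip(pa, pb))
             for pa, pb in zip(ra, rb)]
            for ra, rb in zip(a, b)]

TRANSFORMS = [invert]    # the paper also uses lookup, threshold, median, ...
COMPOSITORS = [fade]     # ... and add, and, divide, max, min, or, xor, ...

def random_tree(max_nodes):
    # Randomly grow a tree subject to a node limit (usually 30 in the paper).
    if max_nodes <= 1 or random.random() < 0.3:
        return ("input",)
    if random.random() < 0.5:
        return ("transform", random.choice(TRANSFORMS),
                random_tree(max_nodes - 1))
    half = (max_nodes - 1) // 2
    return ("compositor", random.choice(COMPOSITORS),
            random_tree(half), random_tree(half))

def apply_tree(node, image):
    # Evaluate the filter; the original image may be input many times.
    if node[0] == "input":
        return image
    if node[0] == "transform":
        return node[1](apply_tree(node[2], image))
    return node[1](apply_tree(node[2], image), apply_tree(node[3], image))

A filter is then used as tree = random_tree(30) followed by apply_tree(tree, image); as described above, a randomly generated filter was only added to the database when it produced an interesting visual effect on one or more original images.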
3 Retrieval Techniques

3.1 Definitions

An image is a function I : {0, ..., N} × {0, ..., M} → {0, ..., 255}^3, for some N and M. The set of all images will be denoted by I. A filter is a function F : I → I. The set of all filters will be denoted by F. The symbol F(I) will then stand for the application of filter F to image I. An annotation of dimension d is a function A : I → R^d; informally, an annotation is a vector of what are usually called features of an image, which in turn are real values computed from that image. The set of all annotations of dimension d is denoted by A_d, and the set of all annotations is defined as A = ∪_{d=1}^∞ A_d. Writing P_z = {(w_1, ..., w_z) ∈ R^z | ∀i. 0 ≤ w_i ≤ 1 ∧ Σ_{i=1}^z w_i = 1}, we define a set of weighted annotations to be a pair (A_0, w) such that A_0 ⊆ A and w ∈ P_{|A_0|}. The set of all weighted annotations is denoted by W.

Let I be an image, (A_0, w) a set of weighted annotations and A ∈ A_0 an annotation of dimension d. The R^d vector A(I) is called the annotation of I via A, or an A-annotated image. We can introduce metrics in the space of A-annotations using any existing metric for R^d. For example, we may define the distance between two images, I and J, as the weighted Euclidean distance between the pairs of real vectors A_k(I) and A_k(J) for each A_k ∈ A_0:

    D_(A_0,w)(I, J) = Σ_{k=1}^{|A_0|} w_k · d(A_k(I), A_k(J))    (1)

where d is the usual Euclidean distance. We will use the symbol 2^X, where X is a set, to denote the power set of X.

3.2 Problem Formulation

The problem we address here is as follows: given (i) an image I ∈ I, (ii) a set of filters F_0 and (iii) a filtered version of I, I' = F(I), for some F ∈ F_0, determine the filter F. A straightforward naïve algorithm to solve this problem is shown below: it simply applies all filters to the user image and checks which filter achieves the same end image.

    Filter(I_D : 2^I, F_D : 2^F) : 2^I
    1. I_F ← ∅
    2. for each I ∈ I_D
       1. for each F ∈ F_D
          1. I_F ← I_F ∪ {F(I)}
    3. return I_F

    Naïve(F_D : 2^F, I_U : I, I'_U : I) : F_D
    1. I_F ← Filter({I_U}, F_D)
    2. for each I' ∈ I_F
       1. if I' = I'_U
          1. F ← filter which produced I'
          2. return F

The problem with this naïve algorithm is that we are generally interested in the case where F_0 is a large set, and applying thousands of filters to an image is very time consuming. Moreover, we are interested in finding an efficient algorithm for the above problem for the case where the filter retrieval operation is to be performed repeatedly. So, we can concede some initial offline computation time if we gain fast retrieval thereafter.

3.3 Filter Retrieval Algorithm

We now present a general algorithm to perform such filter retrieval, described below in pseudo-code. We start with a database of images, I_D, and filters, F_D, a set of weighted annotations (A_0, w) and three integers which parameterise the algorithm. We begin by choosing a random sample of size O from the image database I_D and choose the N images of that sample which are closest to the user image, I_U. We denote this set of closest images by C. Note that the distance between two images I and J is measured here as the weighted Euclidean distance between their annotations obtained via A_0, as described by Equation (1). We then retrieve the T filters which, applied to the images in C, yield images closest to the user-given filtered image, I'_U. This set of filters, F_R, is then returned. Given an integer K, a set X and a function f : X → R, the set argmin^K_{x∈X} {f(x)} represents the K values of x which yield minimum values for f(x); if multiple values of x yield the same minimum value for f, we take the resulting set to be the union of all those values of x.

    Retrieve(I_D : 2^I, F_D : 2^F, (A_0, w) : W, (N, O, T) ∈ N^3) : 2^F
    1. S ← randomSample(O, I_D)
    2. C ← argmin^N_{I∈S} { Σ_{k=1}^{|A_0|} w_k · d(A_k(I), A_k(I_U)) }
    3. F_R ← argmin^T_{F∈F_D} { Σ_{I∈C} Σ_{k=1}^{|A_0|} w_k · d(A_k(F(I)), A_k(I'_U)) }
    4. return F_R

In the algorithm, given an image I, we write A_k(I) for the kth annotation of I. Note that, in this algorithm, all images in the database have been previously filtered, and both original and filtered images have been annotated beforehand. This means that at retrieval time both A_k(I) and A_k(F(I)) are known, and no calculations are needed other than Euclidean distances between vectors. The only annotations to compute at retrieval time are A_k(I_U) and A_k(I'_U).
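The following Python sketch renders the Retrieve pseudo-code together with the weighted distance of Equation (1). It is a sketch rather than the authors' code: the precomputed annotation tables ann_db and ann_filtered_db, and the annotate function, are assumptions standing in for the offline annotation step, and images and filters are assumed to be represented by hashable identifiers.

import math
import random

def euclidean(u, v):
    # The metric d in Equation (1).
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def weighted_distance(anns_i, anns_j, weights):
    # Equation (1): weighted sum of distances over the annotations A_k.
    return sum(w * euclidean(a, b)
               for w, a, b in zip(weights, anns_i, anns_j))

def retrieve(db_images, filters, ann_db, ann_filtered_db, weights,
             annotate, image_u, filtered_u, N, O, T):
    # Sketch of Retrieve(I_D, F_D, (A_0, w), (N, O, T)).
    # ann_db[I] and ann_filtered_db[(F, I)] hold the annotation vectors
    # precomputed offline; annotate(img) computes them for a new image.
    ann_u = annotate(image_u)        # A_k(I_U), computed at retrieval time
    ann_fu = annotate(filtered_u)    # A_k(I'_U)

    # Step 1: random sample of size O from the image database.
    sample = random.sample(db_images, O)

    # Step 2: the N sample images whose annotations are closest to I_U's.
    closest = sorted(
        sample,
        key=lambda I: weighted_distance(ann_db[I], ann_u, weights))[:N]

    # Step 3: the T filters whose precomputed results on those images
    # are nearest to the filtered user image I'_U.
    def filter_score(F):
        return sum(weighted_distance(ann_filtered_db[(F, I)], ann_fu, weights)
                   for I in closest)

    return sorted(filters, key=filter_score)[:T]

With the settings that perform best in the experiments below, one would call retrieve(..., N=3, O=50, T=10). Note that sorting breaks ties arbitrarily here, whereas the paper's argmin^K takes the union of all tied values.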
4 Experimental Results

The Retrieve algorithm can be assessed by choosing particular parameters and comparing the retrieval accuracy and retrieval time. For I_D, we hand-picked 50 images from the Corel Stock Photo Library illustrating a wide range of everyday images, including portraits and full-body pictures of men and women, cityscapes and countryside scenes, among others. For F_D, we took the set of 1000 filters described in §2. These filters represent a broad sample of the filter space F. The set of annotations A_0 chosen consists of four main modes of operation of the Annotate software, which has been used as an automated image annotation system using global features in [6]. These were: (i) gabor with s=4 and o=6, (ii) linear HSV histograms with 4 × 4 × 4 sampling, (iii) RGB histograms with 4 × 4 × 4 sampling and (iv) tamura with 3 × 3 tiling. The exact Annotate command-line options for these modes were: -a gabor-4-6, -a HSV lin-4x4x4-G, -a RGB-4x4x4-G, -a tamuraS-2-A.T3x3.

Having made these choices, the only remaining parameters of the Retrieve algorithm were N, O, T and the set of four weights, w. To study this space of parameters, we implemented the Retrieve algorithm and, for different choices of the parameters, measured the proportion of times that the correct filter was found in the set of filters returned by the algorithm. We started by studying the space of weights, [0, 1]^4, for the particular setting (N, O, T) = (3, 50, 10), in which we considered the whole database of images I_D (O = 50), chose the closest 3 images (N = 3) and retrieved 10 candidate filters (T = 10). Each test for a particular set of parameters (N, O, T, w) takes from several minutes to more than an hour. Hence, we decided to explore the space of weights non-exhaustively, by hand. From this exploration, we concluded that a good set of weights was w = (0.005, 0.465, 0.530, 0). This is not guaranteed to be the best choice of weights but, as we see below, with this choice the retrieval method finds the correct filter in a set of 10 retrieved filters nearly 93% of the time. Hence, these weights gave us a good starting point from which to explore the remaining three parameters. The weights chosen reflect almost identical weights for the RGB and HSV annotations and a small contribution from the gabor annotation; the inclusion of the tamura annotation did not seem to increase the probability of correct retrieval, but this can only be confirmed by a more exhaustive search of the weight space.

For this particular set of weights, w = (0.005, 0.465, 0.530, 0), we assessed how the remaining parameters affect the probability of correct retrieval, with the results given in figure 2. We carried out three different sets of experiments: (i) we fixed O = 10, 50 and computed the probability of correct retrieval for varying N = 1, 2, 3, 5, 10, 25, 50 and T = 1, 5, 10, 25, 50 (each different value of T produces one curve and each choice of O is shown in a different graph); (ii) we fixed T = 10, 50 and computed the probability of correct retrieval for varying N = 1, 2, 3, 5, 10, 25, 50 and O = 1, 2, 3, 5, 10, 25, 50 (each different value of O produces one curve and each choice of T is shown in a different graph); (iii) we fixed N = 2 and computed the probability of correct retrieval for varying O = 2, 3, 5, 10, 25, 50 and T = 1, 5, 10, 25, 50 (each different value of T produces one curve). In the sixth graph of figure 2, we show the retrieval times which correspond to the setting of experiment (i) with O = 10.

From the first set of experiments (fixed O), we see that there is a maximum for the parameter N and that it is generally attained for either N = 2 or N = 3. This is particularly clear in the O = 50 graph. So, we conclude that it is good to have more than one image to compare the user's image to, but that using more than 3 images degrades the subsequent comparison of annotations. We can also confirm that increasing T always improves retrieval success. This was expected, since returning more filters increases the probability that they contain the correct filter, but it shows us that the difference in results between taking T = 25 and T = 50 is small. Even for T = 10, the difference is still acceptable, and this choice seems to be a good compromise between the probability of correct retrieval and the number of filters that the user still has to choose from after the retrieval. For (N, O, T) = (3, 50, 10), the probability of the required filter being present in the 10 retrieved filters is 0.928.
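The evaluation just described, measuring the proportion of trials in which the correct filter appears in the returned set, can be sketched as follows. The paper does not spell out how trials were generated, so the loop below, which samples a test image and a true filter per trial, is an assumption; apply_filter and run_retrieve are hypothetical hooks into the surrounding experiment code.

import random

def success_rate(test_images, filters, apply_filter, run_retrieve, trials=500):
    # Estimate the probability that the correct filter appears in the
    # retrieved set for one (N, O, T, w) setting. The trial generation
    # (random test image, random true filter) is an assumption.
    hits = 0
    for _ in range(trials):
        image = random.choice(test_images)
        true_filter = random.choice(filters)
        filtered = apply_filter(true_filter, image)   # I'_U = F(I_U)
        if true_filter in run_retrieve(image, filtered):
            hits += 1
    return hits / trials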
From the second set of experiments (fixed T), we see that increasing O always achieves better results, which is expected since a greater O means a larger image database to work with, and therefore a greater likelihood of finding an image close to the image provided by the user. Once again, there is a maximum for N around 2 or 3, except for very small O, where the initial database is so small that it is in fact better just to keep the single closest image in the database. From the third experiment (fixed N), we see that for N = 2, increasing either T or O always increases retrieval success, as before.

In the sixth graph of figure 2, we present retrieval times corresponding to the first experiment with O = 10. We see that retrieval times lie in the range 10–40 ms, are hardly affected by T, and increase roughly linearly with N. For larger values of O (not shown), retrieval times increase considerably, but never rise above 300 ms. We will therefore not emphasise comparison of retrieval times for the different parameters, as for practical purposes of user retrieval the times are small (between 10 and 300 ms). We did not compute retrieval times for the Naïve algorithm, as it is clear from the algorithm's definition that they would be dominated by the filter application times which, over 1000 filters, add up to several minutes. In future, we will apply the Naïve algorithm to the filters returned by Retrieve, which will improve accuracy but only degrade efficiency slightly.

Finally, we considered the scenario N = O separately. This can be seen as equivalent to the case where the user only provides the Retrieve algorithm with the filtered image I'_U but not the original image I_U. If we do not know the original image, we can simply pick one or more random images from the database and compare their filtered versions to the filtered user image. This amounts to changing the second line of the Retrieve algorithm to C ← S. The results of this case are shown in the final graph of figure 2 for N = O = 1, 2, 3, 4, 5, 10, 20, 30, 40, 50 and T = 1, 5, 10, 25. We see that considering more than 5 images in the database usually does not add to the retrieval probability, and that for T = 10 the probability drops to around 0.727; if the user chooses from 50 retrieved filters (T = 50), however, the probability is 0.914.
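In code, this blind variant is a small change to the earlier retrieve sketch: the nearest-neighbour step disappears (C ← S) and filters are scored against the whole random sample. A minimal sketch, reusing the imports and hypothetical helpers from the earlier listing:

def retrieve_blind(db_images, filters, ann_filtered_db, weights,
                   annotate, filtered_u, O, T):
    # Variant for when only the filtered image I'_U is known: step 2 of
    # Retrieve becomes C <- S, so the whole random sample is used.
    ann_fu = annotate(filtered_u)
    sample = random.sample(db_images, O)   # N = O: keep every sampled image

    def filter_score(F):
        return sum(weighted_distance(ann_filtered_db[(F, I)], ann_fu, weights)
                   for I in sample)

    return sorted(filters, key=filter_score)[:T]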
5 Conclusions and Future Work

We have presented an image-filter retrieval algorithm which takes a user-given image and a filtered version of that image, and retrieves a set of 10 filters (from 1000) which contains the applied filter with a probability of 0.928, in a fraction of a second. The retrieval algorithm has 7 parameters, and we have shown how its behaviour is affected by changing these parameters. We found the best choice for 3 of the parameters and near-best choices for the remaining 4. We also showed that the method can retrieve the correct filter even when the original (unfiltered) image is not given, albeit with a smaller probability of success.

In future, we plan to study how different distance metrics in step 2 of Retrieve affect its performance. Moreover, we will study how the choice of image database may influence the probability of correct retrieval, by repeating the same experiments with different sets of images. We also plan to address the more general problem of learning a set of properties for a required filter (or filters) given a set of images and/or filters that the user has chosen through a browsing mechanism. In particular, we will use mathematical theory formation [3] and closed-loop learning [2] to approximate a user's aesthetic preferences for image filtering during a session. These preferences will then be used as a fitness function to search for and/or evolve image filters that may be of interest to the user. In addition, we will investigate the benefit of our approach to medical image filtering and, by replacing the compositors and transforms in our tree structure with Photoshop actions, we plan to show that AI techniques can be used to increase the benefit of graphic design software to designers and artists.

Acknowledgements

We wish to acknowledge the excellent work of Imperial College masters students George Lin and Chi-Tou Yip for their investigations into evolving image filters. We would also like to thank João Magalhães for his extensive input on the usage of the Annotate software. This research is funded by EPSRC grant EP/F067127.

References

1. C. Behrenbruch, S. Petroudi, S. Bond, J. Declerck, F. Leong, and J. Brady. Image filtering techniques for medical image post-processing: an overview. British Journal of Radiology, 77(2).
2. C. Bryant, S. Muggleton, C. Page, and M. Sternberg. Combining active learning with inductive logic programming to close the loop in machine learning. In Proceedings of the AISB'99 Symposium on AI and Scientific Creativity, 1999.
3. S. Colton. Automated Theory Formation in Pure Mathematics. Springer, 2002.
4. R. Gonzalez and R. Woods. Digital Image Processing. Prentice Hall, 2008.
5. Z. Lin. Evolving image filters. Master's thesis, Imperial College London, 2005.
6. A. Yavlinsky, E. Schofield, and S. Rüger. Automated image annotation using global features and robust nonparametric density estimation. In Proceedings of the International Conference on Image and Video Retrieval, pages 507–517. Springer, 2005.
7. C. Yip. Evolving image filters. Master's thesis, Imperial College London, 2004.

Fig. 2. Probability of correct retrieval for experiments (i), (ii) and (iii) (first five graphs) and retrieval times for O = 10 (sixth graph). The weights were set to w = (0.005, 0.465, 0.530, 0). Final graph: the probability of correct retrieval for N = O.