<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Concept-based Recommendations for Internet Advertisement</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dmitry I. Ignatov</string-name>
          <email>dignatov@hse.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergei O. Kuznetsov</string-name>
          <email>skuznetsov@hse.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Higher School of Economics, Department of Applied Mathematics Kirpichnaya 33/5</institution>
          ,
          <addr-line>Moscow 105679</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2008</year>
      </pub-date>
      <fpage>157</fpage>
      <lpage>166</lpage>
      <abstract>
        <p>The problem of detecting terms that can be interesting to the advertiser is considered. If a company has already bought some advertising terms which describe certain services, it is reasonable to find out the terms bought by competing companies. A part of them can be recommended as future advertising terms to the company. The goal of this work is to propose better interpretable recommendations based on FCA and association rules.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Introduction
First, we recall some basic notions from Formal Concept Analysis (FCA) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
Let G and M be sets, called the set of objects and attributes, respectively, and
let I be a relation I ⊆ G × M : for g ∈ G, m ∈ M , gIm holds iff the object
g has the attribute m. The triple K = (G, M, I) is called a (formal) context. If
A ⊆ G, B ⊆ M are arbitrary subsets, then the Galois connection is given by the
following derivation operators:
      </p>
      <p>A0 d=ef {m ∈ M | gIm for all g ∈ A},</p>
      <p>B0 d=ef {g ∈ G | gIm for all m ∈ B}.</p>
      <p>If we have several contexts derivative operator of a context (G, M, I) denoted
by (.)I .</p>
      <p>The pair (A, B), where A ⊆ G, B ⊆ M , A0 = B, and B0 = A is called a
(formal) concept (of the context K) with extent A and intent B (in this case we
have also A00 = A and B00 = B). For B, D ⊆ M the implication B → D holds if
B0 ⊆ D0.</p>
      <p>In data mining applications, an element of M is called an item and a subset
of M is called an itemset.</p>
      <p>
        The support of a subset of attributes (an itemset) P ⊆ M is defined as
supp(P ) = |P 0|. An itemset is frequent if its support is not less than a given
minimum support (denoted by min supp). An itemset P is closed if there exists
no proper superset with the same support. The closure of an itemset P (denoted
by P 00) is the largest superset of P with the same support. The task of frequent
itemset mining consists of generating all (closed) itemsets (with their supports)
with supports greater than or equal to a specified min supp. An association rule
is an expression of the form I1 → I2, where I1 and I2 are arbitrary itemsets
(I1, I2 ⊆ A), I1 ∩ I2 = ∅ and I2 6= ∅. The left side, I1 is called antecedent, the
right side, I2 is called consequent. The support of an association rule r : I1 → I2
1 is defined as: supp(r) = supp(I1 ∪ I2). The confidence of an association rule r:
I1 → I2 is defined as the conditional probability that an object has itemset I2,
given that it has itemset I1: conf (r) = supp(I1 ∪ I2)/supp(I1). An association
rule r with conf (r) = 100% is an exact association rule (or implication [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]),
otherwise it is an approximate association rule. An association rule is valid if
supp(r) ≥ min supp and conf (r) ≥ min conf . An itemset P is a generator if
it has no proper subset Q(Q ⊂ P ) with the same support. Let F CI be the set
of frequent closed itemsets and let F G be the set of frequent generators. The
informative basis for approximate association rules: IB = {r : g → (f \ g)|f ∈
F CI ∧ g ∈ F G ∧ g00 ⊂ f }.
3
      </p>
      <p>
        Initial Data and Problem Statement
For experimentation we used data of US Overture [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which were first
transformed in the standard context form. In the resulting context K = (G, M, I)
objects from G stay for advertising companies (advertisers) and attributes from
M stay for advertising terms (bids), gIm means that advertiser g bought term
m. In the context |G| = 2000, |M | = 3000, |I| = 92345.
      </p>
      <p>
        In our context, the number of attributes per object is bounded as follows:
13 ≤ |g0| ≤ 947. For objects per attribute we have 18 ≤ |m0| ≤ 159. From
1 In this paper we use absolute values, but the support of an association rule r is also
often defined as supp(r) = supp(I1 ∪ I2)/|O|.
this context one had to compute formal concepts of the form (advertisers, bids)
that represent market sectors. Formal concepts of this form can be further used
for recommendation to the companies on the market, which did not buy bids
contained in the intent of the concept. In other words, empty cell (g, m) of the
context can be considered as a recommendation to advertiser g to buy bid m,
if this advertiser bought other bids contained in the intent of any concept. This
can also be represented as association rules of the form “If an advertiser bought
bid a, then one can recommend this advertiser to buy term b” See [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for the use
of association rules in recommendation systems.
      </p>
      <p>We consider the following context: KF T = (F, T, IF T ), where F is the set
of advertising firms (companies), T is the set of advertising terms, or phrases,
f IF T t means that firm f ∈ F bought advertising term t ∈ T .</p>
      <p>For constructing recommendations we used the following approaches and
tools:
1. D-miner algorithm for detecting large market sectors as concepts;
2. Coron system for constructing association rules;
3. Construction of association metarules using morphological analysis;
4. Construction of association metarules using ontologies (thematic catalogs).
4
4.1</p>
      <p>Standard approach to rule mining</p>
    </sec>
    <sec id="sec-2">
      <title>Detecting large market sectors with D-miner.</title>
      <p>
        D-miner is a freely available tool [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] which constructs the set of concepts
satisfying given constraints on sizes of extents and intents (icebergs and dual
icebergs). D-miner takes as input a context and two parameters: minimal
admissible extent and intent sizes and outputs a “band” of the concept lattice:
all concepts satisfying constraints given by parameter values (|intent| ≥ m and
|extent| ≥ n, where m, n ∈ N, see table 1).
(A,B)
(C,D)
В полученном слое решетки мы можем выбрать понятия верхней границы слоя и понятия
из нижней, «склеить» их определенным образом и получить кластеры с некоторым числом
нулей внутри. При этом понятия нижней границы должны быть потомками выбранного
понятия верхнего слоя. Такие кластеры вполне пригодны для формирования
рекомендаDцmийit.rДyаIн.нIgуnюatиoдvе,юSeпrоgeяiснOя.еKтuрzиnсeуtнsоoкv 12.
      </p>
      <p>We give examples of intents of formal concepts for the case |L| = 53, where
L is a concept lattice.</p>
    </sec>
    <sec id="sec-3">
      <title>Hosting market.</title>
      <p>{ affordable hosting web, business hosting web, cheap hosting, cheap hosting site
web, cheap hosting web, company hosting web, cost hosting low web, discount
hosting web, domain hosting, hosting internet, hosting page web, hosting service, hosting
services web, hosting site web, hosting web }.</p>
    </sec>
    <sec id="sec-4">
      <title>Hotel market.</title>
      <p>{ angeles hotel los, atlanta hotel, baltimore hotel, dallas hotel, denver hotel, hotel
chicago, diego hotel san, francisco hotel san, hotel houston, hotel miami, hotel new
orleans, hotel new york, hotel orlando, hotel philadelphia, hotel seattle, hotel vancouver}</p>
    </sec>
    <sec id="sec-5">
      <title>Distance communication market.</title>
      <p>{ call distance long, calling distance long, calling distance long plan, carrier
distance long, cheap distance long, company distance long, company distance
long phone, discount distance long, distance long, cheap calling distance long,
distance long phone, distance long phone rate, distance long plan, distance long
provider, distance long rate, distance long service }</p>
    </sec>
    <sec id="sec-6">
      <title>Weight loss drug market.</title>
      <p>{ adipex buy, adipex online, adipex order, adipex prescription, buy didrex,
buy ionamin, ionamin purchase, buy phentermine, didrex online, ionamin
online, ionamin order, online order phentermine, online phentermine, order
phentermine, phentermine prescription, phentermine purchase }</p>
      <p>
        Using the Coron system (see [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) we construct the informative basis of association
rules [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. We have chosen the informative basis, since it proposes a compact and
effective way of representing the whole set of association rules. The results are
given in table 2.
min supp max supp min conf max conf number of rules
30 86 0,9 1 101 391
30 109 0,8 1 144 043
      </p>
      <p>Here are some examples of association rules.
– {evitamin} → {cvitamin} supp=31 [1.55%]; conf=0.861 [86.11%]
– {gif t graduation} → {anniversary gif t}, supp=41 [2.05%]; conf=0.820
[82.00%];</p>
      <p>The value supp = 31 of the first rule means that 31 companies bought phrases
“e vitamin” and “c vitamin”. The value conf = 0.861 means that 86,1%
companies that bought the phrase “e vitamin” also bought the phrase “c vitamin”.</p>
      <p>
        To make recommendations for each particular company one may use an
approach proposed in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. For company f we find all association rules, the
antecedent of which contain all the phrases bought by the company, then we
construct the set Tu of unique advertising phrases not bought by the company f
before. Then we order these phrases by decreasing of confidence of the rules
where the phrases occur in the consequences. If buying a phrase is predicted by
several rules (i.e., the phrase is in the consequences of several rules), we take the
largest confidence.
5
5.1
      </p>
      <p>Mining metarules</p>
    </sec>
    <sec id="sec-7">
      <title>Morphology-based Metarules</title>
      <p>Each attribute of our context is either a word or a phrase. Obviously, synonymous
phrases are related to same market sectors. The advertisers companies have
usually thematic catalogs composed by experts, however due to the huge number
of advertising terms manual composition of catalogs is a difficult task. Here we
propose a morphological approach for detecting similar bids.</p>
      <p>
        Let t be an advertising phrase consisting of several words (here we disregard
the word sequence): t = {w1, w2, . . . , wn}. A stem is the root or roots of a word,
together with any derivational affixes, to which inflectional affixes are added [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
The stem of word wi is denoted by si = stem(wi) and the set of stems of words
of the phrase t is denoted by stem(t) = S stem(wi), where wi ∈ t. Consider the
i
formal context KT S = (T, S, IT S ), where T is the set of all phrases and S is the
set of all stems of phrases from T , i.e. S = S stem(ti). Then tIs denotes that
i
the set of stems of phrase t contains s.
      </p>
      <p>In this context we construct rules of the form t → sIT S for all t ∈ T , where
i
(.)Its denotes the prime operator in the context KT S . Then the a of the context
KT S (we call it a metarule, because it is not based on experimental data, but
on implicit knowledge resided in natural language constructions) corresponds to
t −F−→T sIT S , an association rule of the context KF T = (F, T, IF T ). If the values
i
of support and confidence of this rule in context KF T do not exceed certain
thresholds, then the association rules constructed from the context KF T are
considered not very interesting.
the consequent with the set of stems being the same as the set of stems of the
antecedent. Third, we also propose to consider metarules of the form t1 −F−→T t2,
where tI2TS ⊆ tITS . These are rules with the consequent being sets of stems that
1
contain the set of stems of the antecedent.</p>
    </sec>
    <sec id="sec-8">
      <title>Example of metarules.</title>
      <p>– t −F−→T sITS</p>
      <p>i
{last minute vacation} → {last minute travel}</p>
      <p>Supp= 19 Conf= 0,90
– t −F−→T S sITS</p>
      <p>i
i
{mail order phentermine} → {adipex online order, adipex order, . . . ,
phentermine prescription, phentermine purchase, phentermine sale}
Supp= 19 Conf= 0,95
– t −F−→T (S si)ITS</p>
      <p>i
{distance long phone} → {call distance long phone, . . . ,
carrier distance long phone, distance long phone rate, distance long phone
service}
Supp= 37 Conf= 0,88</p>
      <p>F T
– t1 −−→ t2, tI2TS ⊆ tI1TS</p>
      <p>{ink jet} → {ink}, Supp= 14 Conf= 0,7
5.2</p>
    </sec>
    <sec id="sec-9">
      <title>Constructing ontologies and ontology-based metarules.</title>
      <p>Here we use simple tree-like ontologies, where the closeness to the root of a tree
defines generality of ontology concepts, which are advertisement phrases. For
example, we use a manually constructed WordNet-like ontologies of market
sectors. In our ontology of the pharmaceutical market the concept “pharmaceutical
product” is more general than that of “vitamin.” We introduce two operators
acting on the set of advertising words T . Generalization operator gi(.) : T → T
takes a concept to a more general concept i levels higher in the generality
order. Neighborhood operator n(.) : T → T takes a concept to the set of sibling
concepts.</p>
      <p>Now we define two types of metarules for ontology: a generalization rule
t → gi(t) and a neighborhood rule t → n(t). These rules can also be
considered as association rules of the context KF T = (F, T, IF T ), which allows one to
understand which of them are good supported by data.</p>
      <p>Examples of metarules for pharmaceutical market.</p>
      <p>Rule of the form t → n(t), where t = “B V IT AM IN 00.</p>
      <p>Experimental Validation
For validation of association rules we used an adapted version of cross-validation.
The training set was randomly divided into 10 parts, 9 of which were taken as
the training set and the remaining part was used as a test set. By A −t→r B we
denote an association rule generated on a training context. The confidence of
this association rule measured on the test set, i.e.,
conf (A −t−es→t B) = |AItest ∩ BItest |
|AItest |
shows the relative amount of companies that bought phrase B having bought
phrase A.</p>
      <p>We constructed 10 sets of association rules for 10 different training sets 1800
companies each (with min supp = 1, 5% and min conf = 90%. The aggregated
quality measure of the obtained rules is the average confidence:
average conf (Rulesi) = A→−B∈Rules</p>
      <p>|Rulesi|
P
conf (A −t−es→t B)
where Rulesi is the set of association rules obtained on the i-th training set. We
also considered rules with min conf ≥ 0.5 and computed averaged confidence,
n
P average conf(Rulesi)
which was again averaged over 10 cases, average conf = i=1 .
n
,</p>
      <p>The confidence of rules averaged over the test set is almost the same as the
min conf for the training set, i.e., (0, 9 − 0, 87)/0, 9 ≈ 0, 03.</p>
      <p>Rule type
t −F−→T sIT S</p>
      <p>i
t −F−→T Si siIT S
t −F−→T (S si)IT S</p>
      <p>i
t −F−→T ti, such that tiIT S ⊆ tIT S
t −F−→T S ti, such that tiIT S ⊆ tIT S
i</p>
      <p>Rule types
t −F−→T sIT S</p>
      <p>i
t −F−→T Si siIT S
t −F−→T (S si)IT S</p>
      <p>i
t −F−→T ti such that tiIT S ⊆ tIT S
t −F−→T S ti such that tiIT S ⊆ tIT S
i</p>
      <p>We used confidence measure also for validation of metarules. Support does
not have much importance here, since we do not look for large markets or
mostly sellable phrases, but stable dependencies of purchases. So, we
considered only rules with confidence larger than 0.8 (or 0.9). Confidence and support
for metarules are computed for the context KF T = (F, T, IF T ). We present the
values of confidence and support in the tables for morphology-based metarules.</p>
      <p>We set the minimal support 0,5 and compute the number of rules of each
group for which this threshold is exceeded. Table 5 shows that average conf of
these metarules is actually much higher (about 0,9).
purchases. The rules with low support and confidence may be tested against
recommendation systems such as Google AdWords, which uses the frequency
of queries for synonyms. For validation of ontological rules we used Google
service AdWords. 90% of recommendations (words) were contained in the list of
synonyms output by AdWords.
7</p>
      <p>Conclusion and further work
The obtained results show that a part of dependencies in databases for purchases
of advertisement phrases may be detected automatically, with the use of
standard means of computer linguistics. Along with methods of data mining, these
approaches allows one to improve recommendations and propose good means
of ranking, which is very important for making Top-N recommendations.
Another advantage of the approach consists in the possibility of detecting related
advertisement phrases not given directly in data. Results of FCA-based
biclusterization show the possibility of detecting relatively large advertisement markets
(with more than 20 participants) given by companies and advertising phrases.
To improve the proposed approach we plan to use well-developed ontologies like
WordNet for constructing ontology-based metarules.
8</p>
      <p>Acknowledgements
This work was supported by the Scientific Foundation of Russian State
University Higher School of Economics as a part of project 08-04-0022.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ganter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wille</surname>
          </string-name>
          , R.:
          <source>Formal concept analysis: mathematical foundations</source>
          . Springer, Berlin/Heidelberg (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Zhukov</surname>
            ,
            <given-names>L.E.</given-names>
          </string-name>
          :
          <article-title>Spectral clustering of large advertiser datasets</article-title>
          .
          <source>Technical report</source>
          , Overture R&amp;D (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Sarwar</surname>
            ,
            <given-names>B.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karypis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riedl</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Analysis of recommendation algorithms for e-commerce</article-title>
          .
          <source>In: ACM Conference on Electronic Commerce</source>
          . (
          <year>2000</year>
          )
          <fpage>158</fpage>
          -
          <lpage>167</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Besson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robardet</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boulicaut</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rome</surname>
          </string-name>
          , S.:
          <article-title>Constraint-based bi-set mining for biologically relevant pattern discovery in microarray data</article-title>
          .
          <source>Intelligent Data Analysis journal 9(1)</source>
          (
          <year>2005</year>
          )
          <fpage>59</fpage>
          -
          <lpage>82</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Besson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robardet</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boulicaut</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          :
          <article-title>Constraint-based mining of formal concepts in transactional data</article-title>
          . In Dai, H.,
          <string-name>
            <surname>Srikant</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Zhang, C., eds.
          <source>: PAKDD</source>
          . Volume
          <volume>3056</volume>
          of Lecture Notes in Computer Science., Springer (
          <year>2004</year>
          )
          <fpage>615</fpage>
          -
          <lpage>624</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Szathmary</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>CORON: A Framework for Levelwise Itemset Mining Algorithms</article-title>
          .
          <source>In: Suppl. Proc. of ICFCA '05</source>
          ,
          <string-name>
            <surname>Lens</surname>
          </string-name>
          , France. (
          <year>2005</year>
          )
          <fpage>110</fpage>
          -
          <lpage>113</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Szathmary</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Napoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>S.O.</given-names>
          </string-name>
          :
          <article-title>ZART : A Multifunctional Itemset Mining Algorithm</article-title>
          .
          <source>Research Report 00001271</source>
          ,
          <string-name>
            <surname>LORIA</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>David</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A dictionary of linguistics and phonetics</article-title>
          .
          <source>third edn</source>
          . Oxford: Blackwell Publishers (
          <year>1991</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>