<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Concept-based Recommendations for Internet Advertisement</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Dmitry</forename><forename type="middle">I</forename><surname>Ignatov</surname></persName>
							<email>dignatov@hse.ru</email>
						</author>
						<author>
							<persName><forename type="first">Sergei</forename><forename type="middle">O</forename><surname>Kuznetsov</surname></persName>
							<email>skuznetsov@hse.ru</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Higher School of Economics</orgName>
								<orgName type="department" key="dep2">Department of Applied Mathematics</orgName>
								<address>
									<addrLine>Kirpichnaya 33/5</addrLine>
									<postCode>105679</postCode>
									<settlement>Moscow</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">Dmitry I. Ignatov</orgName>
								<orgName type="institution">Sergei O. Kuznetsov</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Concept-based Recommendations for Internet Advertisement</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">57C6E459A77EB6338BA1378BB4D00D8C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T10:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The problem of detecting terms that can be interesting to the advertiser is considered. If a company has already bought some advertising terms which describe certain services, it is reasonable to find out the terms bought by competing companies. A part of them can be recommended as future advertising terms to the company. The goal of this work is to propose better interpretable recommendations based on FCA and association rules.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Contextual Internet advertising is a form of e-commerce. The largest revenues of the major players at this market, like search systems, are obtained from the so-called search sensitive advertisement, i.e, advertisement in a sense close to user queries. Here we consider the problem of detecting terms that can be interesting to an advertiser. Assume that a company F has already bought some advertising terms which describe certain services. As a rule, there are already competing companies at the market, therefore it is reasonable to find terms bought by them. These terms can be compared to those bought by F and part of them can be recommended as future advertising terms to F . The goal of this work is to propose well-interpretable recommendations based on FCA. The rest of the paper is organized as follows: First we recall main definitions from FCA and rule mining. Then we consider experimental data and the problem statement. Afterwards, we propose morphology-based and ontology-based metarules that can be derived without experimental data. We conclude the paper with experiments and their discussion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Main definitions</head><p>First, we recall some basic notions from Formal Concept Analysis (FCA) <ref type="bibr" target="#b0">[1]</ref>. Let G and M be sets, called the set of objects and attributes, respectively, and let I be a relation I ⊆ G × M : for g ∈ G, m ∈ M , gIm holds iff the object g has the attribute m. The triple K = (G, M, I) is called a (formal) context. If A ⊆ G, B ⊆ M are arbitrary subsets, then the Galois connection is given by the following derivation operators: In data mining applications, an element of M is called an item and a subset of M is called an itemset.</p><formula xml:id="formula_0">A def = {m ∈ M | gIm for all g ∈ A}, B def = {g ∈ G | gIm for all m ∈ B}.</formula><p>The support of a subset of attributes (an itemset) P ⊆ M is defined as supp(P ) = |P |. An itemset is frequent if its support is not less than a given minimum support (denoted by min supp). An itemset P is closed if there exists no proper superset with the same support. The closure of an itemset P (denoted by P ) is the largest superset of P with the same support. The task of frequent itemset mining consists of generating all (closed) itemsets (with their supports) with supports greater than or equal to a specified min supp. An association rule is an expression of the form I 1 → I 2 , where I 1 and I 2 are arbitrary itemsets (I 1 , I 2 ⊆ A), I 1 ∩ I 2 = ∅ and I 2 = ∅. The left side, I 1 is called antecedent, the right side, I 2 is called consequent. The support of an association rule r : I 1 → I 2<ref type="foot" target="#foot_0">1</ref> is defined as: supp(r) = supp(I 1 ∪ I 2 ). The confidence of an association rule r: I 1 → I 2 is defined as the conditional probability that an object has itemset I 2 , given that it has itemset I 1 : conf (r) = supp(I 1 ∪ I 2 )/supp(I 1 ). An association rule r with conf (r) = 100% is an exact association rule (or implication <ref type="bibr" target="#b0">[1]</ref>), otherwise it is an approximate association rule. An association rule is valid if supp(r) ≥ min supp and conf (r) ≥ min conf . An itemset P is a generator if it has no proper subset Q(Q ⊂ P ) with the same support. Let F CI be the set of frequent closed itemsets and let F G be the set of frequent generators. The informative basis for approximate association rules:</p><formula xml:id="formula_1">IB = {r : g → (f \ g)|f ∈ F CI ∧ g ∈ F G ∧ g ⊂ f }.</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Initial Data and Problem Statement</head><p>For experimentation we used data of US Overture <ref type="bibr" target="#b1">[2]</ref>, which were first transformed in the standard context form. In the resulting context K = (G, M, I) objects from G stay for advertising companies (advertisers) and attributes from M stay for advertising terms (bids), gIm means that advertiser g bought term m. In the context |G| = 2000, |M | = 3000, |I| = 92345.</p><p>In our context, the number of attributes per object is bounded as follows: 13 ≤ |g | ≤ 947. For objects per attribute we have 18 ≤ |m | ≤ 159. From this context one had to compute formal concepts of the form (advertisers, bids) that represent market sectors. Formal concepts of this form can be further used for recommendation to the companies on the market, which did not buy bids contained in the intent of the concept. In other words, empty cell (g, m) of the context can be considered as a recommendation to advertiser g to buy bid m, if this advertiser bought other bids contained in the intent of any concept. This can also be represented as association rules of the form "If an advertiser bought bid a, then one can recommend this advertiser to buy term b" See <ref type="bibr" target="#b2">[3]</ref> for the use of association rules in recommendation systems.</p><p>We consider the following context:</p><formula xml:id="formula_2">K F T = (F, T, I F T )</formula><p>, where F is the set of advertising firms (companies), T is the set of advertising terms, or phrases, f I F T t means that firm f ∈ F bought advertising term t ∈ T .</p><p>For constructing recommendations we used the following approaches and tools:</p><p>1. D-miner algorithm for detecting large market sectors as concepts; 2. Coron system for constructing association rules; 3. Construction of association metarules using morphological analysis; 4. Construction of association metarules using ontologies (thematic catalogs).</p><p>4 Standard approach to rule mining 4.1 Detecting large market sectors with D-miner. D-miner is a freely available tool <ref type="bibr" target="#b3">[4]</ref>, <ref type="bibr" target="#b4">[5]</ref> which constructs the set of concepts satisfying given constraints on sizes of extents and intents (icebergs and dual icebergs). D-miner takes as input a context and two parameters: minimal admissible extent and intent sizes and outputs a "band" of the concept lattice: all concepts satisfying constraints given by parameter values (|intent| ≥ m and |extent| ≥ n, where m, n ∈ N, see table <ref type="table" target="#tab_0">1</ref>).  We give examples of intents of formal concepts for the case |L| = 53, where L is a concept lattice.</p><p>Hosting market.</p><p>effective way of representing the whole set of association rules. The results are given in table <ref type="table" target="#tab_1">2</ref>. The value supp = 31 of the first rule means that 31 companies bought phrases "e vitamin" and "c vitamin". The value conf = 0.861 means that 86,1% companies that bought the phrase "e vitamin" also bought the phrase "c vitamin".</p><p>To make recommendations for each particular company one may use an approach proposed in <ref type="bibr" target="#b2">[3]</ref>. For company f we find all association rules, the antecedent of which contain all the phrases bought by the company, then we construct the set T u of unique advertising phrases not bought by the company f before. Then we order these phrases by decreasing of confidence of the rules where the phrases occur in the consequences. If buying a phrase is predicted by several rules (i.e., the phrase is in the consequences of several rules), we take the largest confidence.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Mining metarules</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Morphology-based Metarules</head><p>Each attribute of our context is either a word or a phrase. Obviously, synonymous phrases are related to same market sectors. The advertisers companies have usually thematic catalogs composed by experts, however due to the huge number of advertising terms manual composition of catalogs is a difficult task. Here we propose a morphological approach for detecting similar bids.</p><p>Let t be an advertising phrase consisting of several words (here we disregard the word sequence): t = {w 1 , w 2 , . . . , w n }. A stem is the root or roots of a word, together with any derivational affixes, to which inflectional affixes are added <ref type="bibr" target="#b7">[8]</ref>. The stem of word w i is denoted by s i = stem(w i ) and the set of stems of words of the phrase t is denoted by stem(t) = i stem(w i ), where w i ∈ t. Consider the formal context K T S = (T, S, I T S ), where T is the set of all phrases and S is the set of all stems of phrases from T , i.e. S = i stem(t i ). Then tIs denotes that the set of stems of phrase t contains s.</p><p>In this context we construct rules of the form t → s I T S i for all t ∈ T , where (.) Its denotes the prime operator in the context K T S . Then the a of the context K T S (we call it a metarule, because it is not based on experimental data, but on implicit knowledge resided in natural language constructions) corresponds to t</p><formula xml:id="formula_3">F T − − → s I T S i</formula><p>, an association rule of the context K F T = (F, T, I F T ). If the values of support and confidence of this rule in context K F T do not exceed certain thresholds, then the association rules constructed from the context K F T are considered not very interesting. Metarules of the following forms seem also to be reasonable. First, one can look for rules of the form t</p><formula xml:id="formula_4">F T − − → i s I T S i</formula><p>, i.e., rules, the consequent of which contain all terms containing at least one word with the stem common to a word in the antecedent term. Obviously, constructing rules of this type may result in the fusion of phrases related to different market sectors, e.g. "black jack" and "black coat". Second, we considered rules of the form t Here we use simple tree-like ontologies, where the closeness to the root of a tree defines generality of ontology concepts, which are advertisement phrases. For example, we use a manually constructed WordNet-like ontologies of market sectors. In our ontology of the pharmaceutical market the concept "pharmaceutical product" is more general than that of "vitamin." We introduce two operators acting on the set of advertising words T . Generalization operator g i (.) : T → T takes a concept to a more general concept i levels higher in the generality order. Neighborhood operator n(.) : T → T takes a concept to the set of sibling concepts. Now we define two types of metarules for ontology: a generalization rule t → g i (t) and a neighborhood rule t → n(t). These rules can also be considered as association rules of the context K F T = (F, T, I F T ), which allows one to understand which of them are good supported by data. Examples of metarules for pharmaceutical market. Rule of the form t → n(t), where t = "B V IT AM IN .</p><formula xml:id="formula_5">{B V IT AM IN } → {B COM P LEX V IT AM IN, B12 V IT AM IN, C V IT AM IN, . . . D V IT AM IN, DISCOU N T V IT AM IN, E V IT AM IN, M IN ERAL V IT AM IN, . . . M U LT I V IT AM IN, SU P P LEM EN T V IT AM IN, V IT AM IN } Rules of the form t → g 1 (t), where t = "B V IT AM IN , g 1 (t) = "V IT AM IN S . {B V IT AM IN } → {V IT AM IN S}.</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Experimental Validation</head><p>For validation of association rules we used an adapted version of cross-validation. The training set was randomly divided into 10 parts, 9 of which were taken as the training set and the remaining part was used as a test set. By A tr − → B we denote an association rule generated on a training context. The confidence of this association rule measured on the test set, i.e., conf (A</p><formula xml:id="formula_6">test −−→ B) = |A Itest ∩ B Itest | |A Itest |</formula><p>shows the relative amount of companies that bought phrase B having bought phrase A.</p><p>We constructed 10 sets of association rules for 10 different training sets 1800 companies each (with min supp = 1, 5% and min conf = 90%. The aggregated quality measure of the obtained rules is the average confidence:</p><formula xml:id="formula_7">average conf (Rules i ) = A− →B∈Rules conf (A test −−→ B) |Rules i | ,</formula><p>where Rules i is the set of association rules obtained on the i-th training set. We also considered rules with min conf ≥ 0.5 and computed averaged confidence, which was again averaged over 10 cases, average conf The confidence of rules averaged over the test set is almost the same as the min conf for the training set, i.e., (0, 9 − 0, 87)/0, 9 ≈ 0, 03.</p><formula xml:id="formula_8">= n P i=1 average conf (Rulesi) n .</formula><p>We used confidence measure also for validation of metarules. Support does not have much importance here, since we do not look for large markets or mostly sellable phrases, but stable dependencies of purchases. So, we considered only rules with confidence larger than 0.8 (or 0.9). Confidence and support for metarules are computed for the context K F T = (F, T, I F T ). We present the values of confidence and support in the tables for morphology-based metarules. We set the minimal support 0,5 and compute the number of rules of each group for which this threshold is exceeded. Table <ref type="table" target="#tab_4">5</ref> shows that average conf of these metarules is actually much higher (about 0,9). From tables 6 and 7one can easily see that most confident and supported rules are of the form t purchases. The rules with low support and confidence may be tested against recommendation systems such as Google AdWords, which uses the frequency of queries for synonyms. For validation of ontological rules we used Google service AdWords. 90% of recommendations (words) were contained in the list of synonyms output by AdWords.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusion and further work</head><p>The obtained results show that a part of dependencies in databases for purchases of advertisement phrases may be detected automatically, with the use of standard means of computer linguistics. Along with methods of data mining, these approaches allows one to improve recommendations and propose good means of ranking, which is very important for making Top-N recommendations. Another advantage of the approach consists in the possibility of detecting related advertisement phrases not given directly in data. Results of FCA-based biclusterization show the possibility of detecting relatively large advertisement markets (with more than 20 participants) given by companies and advertising phrases. To improve the proposed approach we plan to use well-developed ontologies like WordNet for constructing ontology-based metarules.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>If we have several contexts derivative operator of a context (G, M, I) denoted by (.) I . The pair (A, B), where A ⊆ G, B ⊆ M , A = B, and B = A is called a (formal) concept (of the context K) with extent A and intent B (in this case we have also A = A and B = B). For B, D ⊆ M the implication B → D holds if B ⊆ D .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. A concept lattice and its band output by D-miner.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>2 ⊆ t I T S 1 .i 2 ⊆ t I T S 1 {ink 7 5. 2</head><label>212172</label><figDesc>I T S , i.e., rules with 162 Dmitry I. Ignatov, Sergei O. Kuznetsov the consequent with the set of stems being the same as the set of stems of the antecedent. Third, we also propose to consider metarules of the form t 1 F T − − → t 2 , where t I T S These are rules with the consequent being sets of stems that contain the set of stems of the antecedent. {mail order phentermine} → {adipex online order, adipex order, . . . , phentermine prescription, phentermine purchase, phentermine sale} Supp= 19 Conf= 0,95 t F T − − → ( i s i ) I T S {distance long phone} → {call distance long phone, . . . , carrier distance long phone, distance long phone rate, distance long phone service} Supp= 37 Conf= 0,88 t 1 F T − − → t 2 , t I T S jet} → {ink}, Supp= 14 Conf= 0,Constructing ontologies and ontology-based metarules.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>Note that the use of morphology is completely automated and allows one to find highly plausible metarules without data onConcept-based Recommendations for Internet Advertisement</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>D-miner results.</figDesc><table><row><cell cols="3">Minimal extent Minimal intent Number of</cell></row><row><cell>size</cell><cell>size</cell><cell>concepts</cell></row><row><cell>0</cell><cell>0</cell><cell>8 950 740</cell></row><row><cell>10</cell><cell>10</cell><cell>3 030 335</cell></row><row><cell>15</cell><cell>10</cell><cell>759 963</cell></row><row><cell>15</cell><cell>15</cell><cell>150 983</cell></row><row><cell>15</cell><cell>20</cell><cell>14 226</cell></row><row><cell>20</cell><cell>15</cell><cell>661</cell></row><row><cell>20</cell><cell>16</cell><cell>53</cell></row><row><cell>20</cell><cell>20</cell><cell>0</cell></row></table><note>В полученном слое решетки мы можем выбрать понятия верхней границы слоя и понятия из нижней, «склеить» их определенным образом и получить кластеры с некоторым числом нулей внутри. При этом понятия нижней границы должны быть потомками выбранного понятия верхнего слоя. Такие кластеры вполне пригодны для формирования рекомендаций. Данную идею поясняет рисунок 12.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Properties of informative basis.</figDesc><table><row><cell cols="5">min supp max supp min conf max conf number of rules</cell></row><row><cell>30</cell><cell>86</cell><cell>0,9</cell><cell>1</cell><cell>101 391</cell></row><row><cell>30</cell><cell>109</cell><cell>0,8</cell><cell>1</cell><cell>144 043</cell></row><row><cell cols="4">Here are some examples of association rules.</cell><cell></cell></row><row><cell cols="5">-{evitamin} → {cvitamin} supp=31 [1.55%]; conf=0.861 [86.11%]</cell></row><row><cell cols="5">-{gif t graduation} → {anniversary gif t}, supp=41 [2.05%]; conf=0.820</cell></row><row><cell>[82.00%];</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>A toy example of context KF T for "long distance calling" market.</figDesc><table><row><cell>firm \ phrase</cell><cell>call</cell><cell>calling</cell><cell>calling</cell><cell>carrier</cell><cell>cheap</cell></row><row><cell></cell><cell cols="5">distance distance distance distance distance</cell></row><row><cell></cell><cell>long</cell><cell>long</cell><cell>long plan</cell><cell>long</cell><cell>long</cell></row><row><cell>f1</cell><cell>x</cell><cell></cell><cell>x</cell><cell></cell><cell>x</cell></row><row><cell>f2</cell><cell></cell><cell>x</cell><cell>x</cell><cell>x</cell><cell></cell></row><row><cell>f3</cell><cell></cell><cell></cell><cell></cell><cell>x</cell><cell>x</cell></row><row><cell>f4</cell><cell></cell><cell>x</cell><cell>x</cell><cell></cell><cell>x</cell></row><row><cell>f5</cell><cell>x</cell><cell>x</cell><cell></cell><cell>x</cell><cell>x</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>A toy example of context KT S for "long distance calling" market.</figDesc><table><row><cell>phrase \ stem</cell><cell cols="4">call carrier cheap distanc long plan</cell></row><row><cell>call distance long</cell><cell>x</cell><cell>x</cell><cell>x</cell></row><row><cell>calling distance long</cell><cell>x</cell><cell>x</cell><cell>x</cell></row><row><cell cols="2">calling distance long plan x</cell><cell>x</cell><cell>x</cell><cell>x</cell></row><row><cell>carrier distance long</cell><cell>x</cell><cell>x</cell><cell>x</cell></row><row><cell>cheap distance long</cell><cell>x</cell><cell>x</cell><cell>x</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5 .</head><label>5</label><figDesc>Results of cross-validation for association rules.</figDesc><table><row><cell></cell><cell cols="4">Number of Number of average conf Number of</cell><cell>average conf</cell></row><row><cell></cell><cell>rules</cell><cell>rules with</cell><cell></cell><cell cols="2">rules with (min conf=0.5)</cell></row><row><cell></cell><cell></cell><cell>sup &gt; 0</cell><cell></cell><cell>min conf=0.5</cell><cell></cell></row><row><cell>1</cell><cell>147170</cell><cell>73025</cell><cell>0,77</cell><cell>65556</cell><cell>0,84</cell></row><row><cell>2</cell><cell>69028</cell><cell>68709</cell><cell>0,93</cell><cell>68495</cell><cell>0,93</cell></row><row><cell>3</cell><cell>89332</cell><cell>89245</cell><cell>0,95</cell><cell>88952</cell><cell>0,95</cell></row><row><cell>4</cell><cell>107036</cell><cell>93078</cell><cell>0,84</cell><cell>86144</cell><cell>0,90</cell></row><row><cell>5</cell><cell>152455</cell><cell>126275</cell><cell>0,82</cell><cell>113008</cell><cell>0,90</cell></row><row><cell>6</cell><cell>117174</cell><cell>114314</cell><cell>0,89</cell><cell>111739</cell><cell>0,91</cell></row><row><cell>7</cell><cell>131590</cell><cell>129826</cell><cell>0,95</cell><cell>128951</cell><cell>0,96</cell></row><row><cell>8</cell><cell>134728</cell><cell>120987</cell><cell>0,96</cell><cell>106155</cell><cell>0,97</cell></row><row><cell>9</cell><cell>101346</cell><cell>67873</cell><cell>0,72</cell><cell>52715</cell><cell>0,92</cell></row><row><cell>10</cell><cell>108994</cell><cell>107790</cell><cell>0,93</cell><cell>106155</cell><cell>0,94</cell></row><row><cell>means</cell><cell>115885</cell><cell>99112</cell><cell>0,87</cell><cell>92787</cell><cell>0,92</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 6 .</head><label>6</label><figDesc>Average support and average confidence for morphology-based metarules.</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="4">Rule type</cell><cell>Average supp Average conf Number of</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>rules</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="3">F T − − → s I T S i</cell><cell>6</cell><cell>0,26</cell><cell>2389</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="2">F T − − →</cell><cell cols="2">S</cell><cell>s I T S i</cell><cell>6</cell><cell>0,24</cell><cell>456</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>i</cell></row><row><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="3">F T − − → (</cell><cell>S</cell><cell>si) I T S</cell><cell>12</cell><cell>0,40</cell><cell>1095</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>i</cell></row><row><cell></cell><cell>t</cell><cell cols="7">F T − − → ti, such that t I T S i</cell><cell>⊆ t I T S</cell><cell>15</cell><cell>0,49</cell><cell>7409</cell></row><row><cell>t</cell><cell cols="2">F T − − →</cell><cell>S</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>i</cell><cell>⊆ t I T S</cell><cell>11</cell><cell>0,36</cell><cell>2006</cell></row></table><note>i ti, such that t I T S</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 7 .</head><label>7</label><figDesc>Average supp and conf for morphological metarules for min conf = 0, 5.</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="3">Rule types</cell><cell>Average supp Average value of Number of</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>conf</cell><cell>rules</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="2">F T − − → s I T S i</cell><cell>15</cell><cell>0,64</cell><cell>454</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="2">F T − − →</cell><cell>S</cell><cell>s I T S i</cell><cell>15</cell><cell>0,63</cell><cell>75</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>i</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>t</cell><cell cols="3">F T − − → (</cell><cell>S</cell><cell>si) I T S</cell><cell>18</cell><cell>0,67</cell><cell>393</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>i</cell></row><row><cell></cell><cell>t</cell><cell cols="7">F T − − → ti such that t I T S i</cell><cell>⊆ t I T S</cell><cell>21</cell><cell>0,70</cell><cell>3922</cell></row><row><cell>t</cell><cell cols="2">F T − − →</cell><cell>S</cell><cell cols="5">ti such that t I T S i</cell><cell>⊆ t I T S</cell><cell>20</cell><cell>0,69</cell><cell>673</cell></row><row><cell></cell><cell></cell><cell></cell><cell>i</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">In this paper we use absolute values, but the support of an association rule r is also often defined as supp(r) = supp(I1 ∪ I2)/|O|.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1">Dmitry I. Ignatov, Sergei O. Kuznetsov</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_2">Concept-based Recommendations for Internet Advertisement</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8">Acknowledgements</head><p>This work was supported by the Scientific Foundation of Russian State University Higher School of Economics as a part of project 08-04-0022.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Formal concept analysis: mathematical foundations</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ganter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Wille</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
			<publisher>Springer</publisher>
			<pubPlace>Berlin/Heidelberg</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Spectral clustering of large advertiser datasets</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Zhukov</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
		<respStmt>
			<orgName>Overture R&amp;D</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical report</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Analysis of recommendation algorithms for e-commerce</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">M</forename><surname>Sarwar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Karypis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Konstan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Riedl</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM Conference on Electronic Commerce</title>
				<imprint>
			<date type="published" when="2000">2000</date>
			<biblScope unit="page" from="158" to="167" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Constraint-based bi-set mining for biologically relevant pattern discovery in microarray data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Besson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Robardet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>Boulicaut</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Rome</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Intelligent Data Analysis journal</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="59" to="82" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Constraint-based mining of formal concepts in transactional data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Besson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Robardet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>Boulicaut</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">PAKDD</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">H</forename><surname>Dai</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Srikant</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Zhang</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="volume">3056</biblScope>
			<biblScope unit="page" from="615" to="624" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">CORON: A Framework for Levelwise Itemset Mining Algorithms</title>
		<author>
			<persName><forename type="first">L</forename><surname>Szathmary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Napoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Suppl. Proc. of ICFCA &apos;05</title>
				<meeting><address><addrLine>Lens, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
			<biblScope unit="page" from="110" to="113" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">ZART : A Multifunctional Itemset Mining Algorithm</title>
		<author>
			<persName><forename type="first">L</forename><surname>Szathmary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Napoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">O</forename><surname>Kuznetsov</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
		<respStmt>
			<orgName>LORIA</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Research Report 00001271</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">A dictionary of linguistics and phonetics</title>
		<author>
			<persName><forename type="first">C</forename><surname>David</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1991">1991</date>
			<publisher>Blackwell Publishers</publisher>
			<pubPlace>Oxford</pubPlace>
		</imprint>
	</monogr>
	<note>third edn</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
