<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Formal Concept Analysis and Pattern Structures for Recommendation Systems</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Nyoman</forename><surname>Juniarta</surname></persName>
							<email>nyoman.juniarta@loria.fr</email>
							<affiliation key="aff0">
								<orgName type="institution" key="instit1">Université de Lorraine</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">Inria</orgName>
								<orgName type="institution" key="instit4">LORIA</orgName>
								<address>
									<postCode>F-54000</postCode>
									<settlement>Nancy</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Formal Concept Analysis and Pattern Structures for Recommendation Systems</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">B70D82659940DF548F579C764891281B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T08:42+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>biclustering</term>
					<term>FCA</term>
					<term>pattern structures</term>
					<term>recommendation</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This work focuses on the application of Formal Concept Analysis and pattern structures in the problem of recommendation. We focus on the collaborative recommendation, where we give a suggestion to a new user based on a database of previous users. Here we can look the pattern in the database using two approaches: interval pattern structures or gradual pattern mining.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>CrossCult (http://www.crosscult.eu) is a European project whose idea is to support the emergence of a European cultural heritage by allowing visitors in different cultural sites (e.g. museum, historic city, archaeological site) to improve the quality of their visit by using adapted computer-based devices and to consider the visit at a European level. Such improvement can be accomplished by studying, among others, the possibility to build a dynamic recommendation system. This system should be able to produce a relevant suggestion on which part of a cultural site may be interesting for a specific visitor. Here, our objective is to study a dynamic recommendation system for visitors in a museum. Given a new visitor V n , the task is to suggest a museum item that may be interesting for him/her. Based on how a suggestion is made to a new visitor V n , a recommendation system can be classified into one of the three following categories <ref type="bibr" target="#b0">[1]</ref>:</p><p>-Content-based recommendations: The system makes a suggestion based only on the previous visited items of V n . For example, if V n visited mostly the items from prehistoric era, then the system recommends another item from that era. -Collaborative recommendations: The system looks for previous users who have similar interest to V n , and makes a suggestion based on their visited items. For example, if many of V n 's similar users have visited item I, then the system recommends this item. -Hybrid approaches: The combination of content-based and collaborative approaches.</p><p>In this work, we explore the second category (collaborative recommendation) for museum visitors.</p><p>2 State-of-the-Art</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Formal Concept Analysis and Pattern Structures</head><p>Formal Concept Analysis (FCA) is a mathematical framework based on lattice theory and used for classification, data analysis, and knowledge discovery, introduced in <ref type="bibr" target="#b3">[4]</ref>. From a formal context, FCA detects all formal concepts, who can be arranged in a concept lattice. If an object g has an attribute m, then (g, m) ∈ I. An example of a formal context is shown in Figure <ref type="figure" target="#fig_0">1</ref>. From Figure <ref type="figure" target="#fig_0">1</ref>, we see that {g 3 , g 4 } = {m 2 , m 3 } and {m 2 , m 3 } = {g 3 , g 4 }. Therefore, ({g 3 , g 4 }, {m 2 , m 3 }) is a formal concept. It should be noticed that the extent and the intent of a concept (A, B) are closed sets, i.e. A = A and</p><formula xml:id="formula_0">m1 m2 m3 m4 g 1 × g 2 × × g 3 × × × g 4 × × ×</formula><formula xml:id="formula_1">B = B . A formal concept (A, B) is a subconcept of a formal concept (C, D) - denoted by (A, B) ≤ (C, D) -if A ⊆ C (or equivalently D ⊆ B).</formula><p>If (A, B) &lt; (C, D) and there is no (E, F ) such that (A, B) &lt; (E, F ) &lt; (C, D), then (A, B) is a cover of (C, D). A concept lattice can be formed using the ≤ relation which defines the partial order among concepts. For the context in Figure <ref type="figure" target="#fig_0">1</ref>, the formal concepts and their corresponding lattice are shown in Figure <ref type="figure">2</ref> (extent and intent is below and above each concept respectively).</p><p>FCA is restricted to specific datasets where each attribute is binary (e.g. has only yes/no value). For more complex values (e.g. numbers, strings, trees, graphs. . . ), FCA is then generalized into pattern structures <ref type="bibr" target="#b2">[3]</ref>. Definition 3. A pattern structure is a triple (G, (D, ), δ), where G is a set of objects, (D, ) is a complete meet-semilattice of descriptions, and δ : G → D maps an object to a description.</p><p>The operator is a similarity operator that returns the common elements to two descriptions. A description can be a set, a sequence, or other complex structure. In the binary case where descriptions are sets, corresponds to set intersection (∩), i.e. {a, b, c} {a, b, d} = {a, b}, and corresponds to subset inclusion (⊆). The order between any two descriptions is given by the subsumption relation:</p><formula xml:id="formula_2">d 1 d 2 ⇐⇒ d 1 d 2 = d 1</formula><p>Definition 4. The Galois connection for a pattern structure (G, (D, ), δ) is defined by: Interval pattern structures (ip-structures) is one of possible extensions of FCA to study more complex descriptions. In ip-structures, the description of an object is an interval for each attribute. Consider the dataset given in Table <ref type="table" target="#tab_0">1</ref> where G = {g 1 , g 2 , g 3 } is the set of objects and M = {m 1 , m 2 , m 3 , m 4 } is the set of attributes. Here the description of g 1 is δ(g 1 ) = <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b4">5]</ref>, <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b6">7]</ref>, <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b5">6]</ref> , while the description of g 4 is δ(g 4 ) = <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b3">4]</ref>, [9, 9], <ref type="bibr">[8,</ref><ref type="bibr">8]</ref> .</p><formula xml:id="formula_3">A = g∈A δ(g), A ⊆ G, d = {g ∈ G|d δ(g)}, d ∈ D. A pattern</formula><p>The similarity operator is the smallest interval containing both descriptions in each attribute. Therefore, δ(g 1 ) δ(g 4 ) = <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>, <ref type="bibr" target="#b6">[7,</ref><ref type="bibr">9]</ref>, <ref type="bibr" target="#b5">[6,</ref><ref type="bibr">8]</ref> . A description with larger interval is "subsumed by" ( ) descriptions with smaller ones. The corresponding Galois connection is illustrated as follows: Similar to formal concept, interval pattern concept (ip-concept) can also be arranged in a lattice. The ip-concept lattice for Table <ref type="table" target="#tab_0">1</ref> is illustrated in Figure <ref type="figure">3</ref>. For further background concerning ip-structures, see <ref type="bibr" target="#b4">[5]</ref>. Fig. <ref type="figure">3</ref>. Lattice of interval pattern concept for g1, g2, and g3 in Table <ref type="table" target="#tab_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Gradual Pattern Mining</head><p>In a numerical dataset, a gradual pattern is a pattern that is observed from at least two objects across at least two attributes. Usually this pattern is described as a correlation between two (or more) attributes: "the more/less A corresponds to the more/less B". This type of pattern was originally proposed in <ref type="bibr" target="#b1">[2]</ref> and studied by <ref type="bibr" target="#b6">[7]</ref> as an instrument to aid fuzzy controllers.</p><p>Consider again the matrix in Table <ref type="table" target="#tab_0">1</ref>. This numerical matrix can be transformed into a sign matrix as shown in Table <ref type="table">2</ref>. It contains information about the attribute comparison between any pair of objects. For example, pair (g 1 , g 2 ) has ' ' in m 1 because according to this attribute, g 1 &lt; g 2 . Then, from this sign matrix, we can find a coherent-sign-changes bicluster (for detailed explanation about biclustering, please refer to <ref type="bibr" target="#b5">[6]</ref>). One of such bicluster is ({p 3 , p 5 , p 6 , p 7 , p 8 , p 9 , p 10 }, {m 1 , m 2 , m 3 }). This bicluster represent the pattern "the more/less m 1 , the less/more m 2 , the less/more m 3 (the '=' can be regarded as either ' ' or ' ').</p><formula xml:id="formula_4">Pair m1 m2 m3 p1 = (g1, g2) p2 = (g1, g3) p3 = (g1, g4) p4 = (g1, g5) = p5 = (g2, g3) = p6 = (g2, g4) p7 = (g2, g5) = p8 = (g3, g4) = p9 = (g3, g5) = = p10 = (g4, g5)</formula><p>Table <ref type="table">2</ref>. Sign matrix for Table <ref type="table" target="#tab_0">1</ref>.</p><p>3 Proposed Approach</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Recommendation Based on Interval Pattern Structures</head><p>Table <ref type="table" target="#tab_0">1</ref> can be regarded as a rating matrix, where G is the set of users and M is the set of movies. Here we have a target user g t for which we will estimate his/her rating for movie m 3 . Based on this estimation, we can decide whether we recommend m 3 to g t .</p><p>To do that we can traverse the lattice in Figure <ref type="figure">3</ref> from the top node. Here we search the concept(s) with the most specific interval that contains m 1 : 4 and m 2 : 7. We will arrive at concept ({g 1 , g 3 , g 5 }, <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>, <ref type="bibr" target="#b6">[7,</ref><ref type="bibr">8]</ref>, <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref> ). Therefore, we can estimate that g t will likely give a rating between 5 and 6 for m 3 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Recommendation Based on Gradual Pattern Mining</head><p>For explaining this approach, we will also use the example in Table <ref type="table" target="#tab_0">1</ref> as usermovie rating matrix. The user g t has rated the movie m 1 and m 2 , and his/her rating for m 3 is to be estimated. Therefore, from the sign matrix in Table <ref type="table">2</ref>, we have to find the largest bicluster that contains m 1 and m 2 .</p><p>Suppose that the largest bicluster is ({p 3 , p 5 , p 6 , p 7 , p 8 , p 9 , p 10 }, {m 1 , m 2 , m 3 }) that represents the pattern R 1 "the more/less m 1 , the less/more m 2 , the less/more m 3 . Then, we have to compare the ratings from g t to the ratings from every other users, as shown in Table <ref type="table">3</ref>. Here the pairs that follow R 1 are p a , p c , and p d , and the estimation of m 3 's rating from g t is in the range <ref type="bibr" target="#b5">[6,</ref><ref type="bibr">8]</ref>.</p><formula xml:id="formula_5">Pairs m1 m2 m3 estimation pa = (g1, gt) = ≥ 6 p b = (g2, gt) - pc = (g3, gt) = ≤ 5 p d = (g4, gt) = ≤ 8 pe = (g5, gt) - Table 3.</formula><p>Comparison of gt and every other users in Table <ref type="table" target="#tab_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Methodology</head><p>Our research problem is to build a more sophisticated collaborative recommendation system for visitors in a museum. The problem is then formulated as rating prediction, i.e. predicting the given rating from a new user based on patterns studied from previous users. We focus on whether FCA and pattern structures are applicable to our problem. In literature, certain types of pattern structures have been studied to solve the task of recommendation. On the other hand, the applicability of interval pattern structures for this task is not yet explored, and this becomes our objective. Finally, the proposed recommendation system is evaluated based on the accuracy of recommended items. Given that the CrossCult project doesn't have any substantial visitor-item rating dataset, the approaches will be tested on a movie dataset.</p><p>Currently, the authors work on gradual pattern mining using coherent-signchanges biclustering. In order to take into account the '=' sign, we will apply the interval pattern structures.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Definition 1 .</head><label>1</label><figDesc>A formal context is a triple (G, M, I), where G is a set of objects, M is a set of attributes, and I is a binary relation between G and M, i.e. I ⊆ G × M .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 1 .Fig. 2 .Definition 2 .</head><label>122</label><figDesc>Fig. 1. A formal context. Fig. 2. Concept lattice for the formal context in Figure 1.</figDesc><graphic coords="2,318.82,305.72,155.62,136.27" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>{g 1 , g 4 }</head><label>14</label><figDesc>= δ(g 1 ) δ(g 4 ) =<ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>,<ref type="bibr" target="#b6">[7,</ref> 9],<ref type="bibr" target="#b5">[6,</ref> 8]    [4, 5],<ref type="bibr" target="#b6">[7,</ref> 9],<ref type="bibr" target="#b5">[6,</ref> 8]  = {g ∈ G|<ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>,<ref type="bibr" target="#b6">[7,</ref> 9],<ref type="bibr" target="#b5">[6,</ref> 8]   δ(g)} = {g 1 , g 4 }</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>A numerical dataset with 6 objects (one as a target object).</figDesc><table><row><cell>m1 m2 m3</cell></row><row><cell>g1 5 7 6</cell></row><row><cell>g2 6 8 4</cell></row><row><cell>g3 4 8 5</cell></row><row><cell>g4 4 9 8</cell></row><row><cell>g5 5 8 5</cell></row><row><cell>gt 4 7 ?</cell></row></table><note>concept is a pair (A, d), A ⊆ G and d ∈ D, where A = d and d = A.</note></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>The thesis of the author is financed by the Région Grand-Est and the European project CrossCult (http://www.crosscult.eu/), under the supervision of Amedeo Napoli, Chedy Raïssi, and Miguel Couceiro.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions</title>
		<author>
			<persName><forename type="first">G</forename><surname>Adomavicius</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tuzhilin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE transactions on knowledge and data engineering</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="734" to="749" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Gradual inference rules in approximate reasoning</title>
		<author>
			<persName><forename type="first">D</forename><surname>Dubois</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Prade</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Sciences</title>
		<imprint>
			<biblScope unit="volume">61</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="103" to="122" />
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Pattern structures and their projections</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ganter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">O</forename><surname>Kuznetsov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Conceptual Structures</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2001">2001</date>
			<biblScope unit="page" from="129" to="142" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Formal Concept Analysis: Mathematical Foundations</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ganter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Wille</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
			<publisher>Springer-Verlag</publisher>
		</imprint>
	</monogr>
	<note>2nd edn.</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Traitement de données numériques par analyse formelle de concepts et structures de patrons</title>
		<author>
			<persName><forename type="first">M</forename><surname>Kaytoue</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
		<respStmt>
			<orgName>Université de Lorraine</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Biclustering algorithms for biological data analysis: a survey</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Madeira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Oliveira</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="24" to="45" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Similarity rules and gradual rules for analogical and interpolative reasoning with imprecise data</title>
		<author>
			<persName><forename type="first">P</forename><surname>Subašić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hirota</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Fuzzy Sets and Systems</title>
		<imprint>
			<biblScope unit="volume">96</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="53" to="75" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
