<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">An Adaptive Faceted Search Interface for Structured Product Offers on the Web</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Alex</forename><surname>Stolz</surname></persName>
							<email>alex.stolz@unibw.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Universitaet der Bundeswehr Munich</orgName>
								<address>
									<postCode>D-85579</postCode>
									<settlement>Neubiberg</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Martin</forename><surname>Hepp</surname></persName>
							<email>martin.hepp@unibw.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Universitaet der Bundeswehr Munich</orgName>
								<address>
									<postCode>D-85579</postCode>
									<settlement>Neubiberg</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">An Adaptive Faceted Search Interface for Structured Product Offers on the Web</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">4950004A7DA4E08B4273786F1AC64775</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:29+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In the past few years, a growing amount of e-commerce information has been published online either as Linked Open Data or embedded as Microdata or RDFa markup inside HTML pages. Unfortunately, the usage of such data for product search and comparison is hampered by the products and services being themselves specific and heterogenous with regard to their relevant characteristics, and by the search process that involves learning about the option space. In this paper, we present an adaptive faceted search interface over product offers in RDF. Our search interface is directly based on the popularity of schema elements in the data and does not rely on a rigid conceptual schema with hardwired product features, thereby being suitable for arbitrary product domains and product evolution. Further it supports learning during the search process. As a proof of concept of our work, we provide two use cases, namely one with product offers from an automobile database, and a second one with real product data collected from the Web.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Linked Open Data (LOD) has become a popular paradigm for publishing and consuming data on the Web. In the last few years, there has been a complementing trend towards publishing structured e-commerce data as RDFa and Microdata markup embedded in HTML Web pages. Primarily based on the GoodRelations <ref type="bibr" target="#b0">[1]</ref> and schema.org<ref type="foot" target="#foot_0">1</ref> vocabularies, this markup can be easily converted into RDF and combined with LOD datasets, which forms a promising data source for novel Web applications and services.</p><p>Unfortunately, the usage of such data for product search and comparison remains an open challenge. Because products and services are specific and heterogeneous with regard to their relevant characteristics, the means for exploring the giant RDF graph of online product data are limited. On one hand, the search space is huge because of the multi-dimensionality of products, and on the other hand, the graph of product information is in many branches sparsely populated due to the limited adoption of such data publication practices. For users it is thus very difficult to formulate queries without knowing how well conceptual elements from the schemas are populated and how much they influence the size and characteristics of the result set.</p><p>Each search process involves a learning effect about the option space [2, p. 9], which should be accounted for by search interfaces. Traditional search models, however, fail to support exploratory searches within the product space: Keyword searches flatten multi-dimensional product descriptions to simple, onedimensional term matches; query formulation with SPARQL is too complex for the average user and lacks mediation between the conceptual models of the data and the mental models of human users; and other approaches suggested for browsing RDF data (e.g. Tabulator <ref type="bibr" target="#b2">[3]</ref>) are very low-level and hence hardly suitable for serious product search.</p><p>In this paper, we propose an instance-driven, adaptive faceted search interface to support the incremental nature and the learning aspect of product search over structured e-commerce data on the Web. In a prior publication, we have already evaluated the usability of this kind of search interface with regard to e-commerce <ref type="bibr" target="#b3">[4]</ref>. The present work describes the single components of the faceted search interface (i.e. the architecture and implementation of the user interface), and details about the iterative, incremental search strategy applied to the product domain. In this respect, we also introduce a novel, instance-based search filtering approach and highlight the role of user feedback in RDF environments. We then showcase our approach using (a) a homogeneous dataset derived from the automotive domain, and (b) some real e-commerce data that we have collected from the Web of Data.</p><p>The rest of this paper is structured as follows: Section 2 describes our faceted search interface over structured product offers; in Sect. 3, we present two use cases; and finally, Sect. 4 concludes our work and discusses future directions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Adaptive Faceted Search Interface for Product Offers</head><p>In the following, we describe an adaptive faceted search interface for product offers over RDF data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Overview of Faceted Search</head><p>Faceted search <ref type="bibr" target="#b4">[5]</ref>, a special form of exploratory search <ref type="bibr" target="#b5">[6]</ref>, is a well-established interaction paradigm both recognized in industry (e.g. eBay<ref type="foot" target="#foot_1">2</ref> and Amazon<ref type="foot" target="#foot_2">3</ref> ) and academia for guiding users through option spaces <ref type="bibr" target="#b6">[7]</ref>. While the term faceted search is sometimes equated with faceted browsing or faceted navigation (e.g. <ref type="bibr" target="#b6">[7]</ref> or [2, p. 95]), it is often understood as a combination of faceted navigation and keyword search functionality <ref type="bibr">[5, p. 24</ref>]. Substantial research related to faceted search has also been done in the field of dynamic taxonomies <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9]</ref>.</p><p>A faceted search interface is based on facet-value pairs. Facets can be compared to mutually orthogonal categories, whereas facet values (or terms) are instances of these categories. In the context of product search, facets represent product dimensions (e.g. the features "color" and "material"), whereas facet  values correspond to instances of these dimensions (e.g. "brown" and "wood"). Products are usually represented by multiple facets and facet values. The selection within facets of faceted search interfaces maps to boolean expressions for the filtering of the option space. While a choice of multiple facets generally leads to their conjunction (e.g. "color" is "brown" and "material" is "wood"), multiple facet values are mostly combined using disjunction (e.g. "color" is "brown" or "red") (cf. <ref type="bibr" target="#b9">[10]</ref>). In set-theoretic terms, conjunction corresponds to the intersection of items, and disjunction to their union.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Product Image</head><note type="other">Product Image</note><p>Faceted search interfaces dynamically adapt to user interaction. In other words, the facet views update on reducing or expanding the option space. Furthermore, the faceted navigation paradigm does normally not lead to dead ends <ref type="bibr" target="#b10">[11]</ref>, i.e. an empty result set, because a user is only shown facets for which instance data exists. Unlike parametric searches, where a user is forced into a sequential search order (e.g. first choose the camera type, then the focal length, after that the picture resolution, and finally the color), faceted search allows users to drill down the search space in any preferred order that best suits their individual information needs. Adaptive faceted search interfaces further adjust themselves (i.e. their structure) to the underlying data, sometimes leading to a repeated, dynamic reorganization of the filtering options.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Faceted Search User Interface</head><p>In Fig. <ref type="figure" target="#fig_1">1</ref>, we show a sketch of a general faceted search interface over e-commerce data. The faceted search interface combines the two interaction paradigms keyword search and faceted navigation. As illustrated in the graphic, the keyword The user interface layout in Fig. <ref type="figure" target="#fig_1">1</ref> served as the baseline for developing the faceted search interface depicted in Fig. <ref type="figure" target="#fig_2">2</ref>. It effectively integrates product details, product category information, and commercial properties related to product offers. The results displayed in the provided screenshot are based on toy data modeled using the Vehicle Sales Ontology (VSO) <ref type="foot" target="#foot_3">4</ref> . The most interesting facts such as product image, name, description, and price are summarized in the result list. Nonetheless, for every item in the result list, a link is provided that, when clicked, opens a window where the full product details are shown. If the size of the result set exceeds ten items, then the remaining results are outsourced to other result pages that can be accessed via pagination controls <ref type="bibr">[2, pp. 110-116]</ref>. This shall prevent the user from possible information overload. The part left to the result list is dedicated to product-related details and commercial properties. It mainly includes product features and prices, but also manufacturers, vendors, payment options, or business functions. The right part features a category fil-ter, along with additional filter configurations such as to exclude product offers without images or to revert the result order. The search interface further makes heavy use of tooltips that help users at better understanding the option space.</p><p>Our search interface is instance-driven and schema-agnostic with respect to the generation of the facet views. Instead of relying on a fixed schema, the data is the first-class citizen that specifies the appearance of the user interface. In other words, the search interface is directly based on the constraints imposed on the underlying RDF data and dynamically adapts with the availability of the data, such as the presence of product features or categories. Accordingly, our approach is able to flexibly cope with structured product data from diverse data sources, as long as it complies with the e-commerce meta-model implied by the GoodRelations vocabulary <ref type="bibr" target="#b0">[1]</ref>. This allows for useful guided navigation paths even in the absence of richly axiomatized products. Keyword Search. We incorporated two forms of keyword searches into our faceted search prototype: A first one for product offers, and a second one for searching within product categories. The keyword searches match terms within textual properties attached to objects, which includes the names and descriptions found in product offers, instances, and models, and the labels and comments for product categories. An autocomplete feature helps the user with term suggestions. It is based on a light-weight SPARQL query executed over the names and labels of products and product categories, respectively.</p><p>Simple keyword search functionality can be obtained using the SPARQL CONTAINS function <ref type="bibr" target="#b11">[12]</ref>, even though such queries are typically very costly for large datasets, as most implementations iterate over all relevant objects. Some SPARQL endpoints thus permit operations over optimized full-text indexes built from textual properties in the available data. Such complementary full-text search engines like Lucene <ref type="bibr" target="#b12">[13]</ref> support, in addition to exact term matches, wildcard queries or fuzzy string matches based on given threshold limits [13, pp. 99-101]. Our prototype relies on Lucene, as long as it is supported by the underlying SPARQL endpoint. Otherwise, we fall back to the much slower SPARQL CONTAINS function.</p><p>Faceted Navigation. The exploratory search capability is provided through the faceted navigation controls that complement the keyword searches. Faceted navigation uses boolean constraint filtering based on product dimensions, commercial properties of product offers, and product type information.</p><p>As the product features facet view in Fig. <ref type="figure" target="#fig_2">2</ref> suggests, product features are initially displayed in a compact form and expanded to their corresponding values as the user clicks on them. This mechanism is illustrated in Fig. <ref type="figure" target="#fig_3">3</ref>. A selection of multiple product features leads to their conjunction (logical and ), i.e. items to appear in the result list need to match every selected feature. By contrast, a disjunctive approach is used (logical or ) among several facet values. For example, imagine that a user adds a second payment option to a payment method that The possible facet values correspond to qualitative, quantitative, and datatype properties in OWL. Unlike qualitative or datatype values that may be implemented with checkboxes that a user can click on, quantitative values can cover an indefinite range for which checkboxes would be suboptimal. A popular technique is to group quantitative values into discrete classes given as range intervals (e.g. "$ 0-20", "$ 20-50", etc.). Our approach even uses a range slider as demonstrated on the price view in Fig. <ref type="figure" target="#fig_2">2</ref> and the total of gears facet in Fig. <ref type="figure" target="#fig_3">3</ref>, respectively. To build up the range slider, we need to generate a useful number of classes with each having the same width. In order to obtain a proper class width, the interval between the minimum and maximum value is divided by the number of classes, which is essentially the instance count with a specified upper limit (e.g. a maximum of 30 classes). The height of the single bars is calculated relative to the class with the highest frequency of instances. The scale of the bars' height is logarithmic with a fixed maximum height. Thanks to the frequency of values indicated for every class, a user is able to quickly gauge the possible outcomes of his filtering decisions on the result set. While this approach works well for point values, we need to rely on a heuristic for range intervals: For closed intervals, we use the lower boundary for the classifying of numeric values into classes, whereas for open intervals, we take the actual value that is available.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Implementation</head><p>From a conceptual point of view, the front-end of the application is backed by data from an RDF store accessible through a SPARQL endpoint, which can be configured as per the endpoint selection dropdown menu in Fig. <ref type="figure" target="#fig_2">2</ref>.</p><p>The front-end of the application was realized using HTML 5 for static content, JavaScript and JQuery for user interaction, and Twitter Bootstrap for page responsiveness. Wherever possible, human-readable labels are shown in the user interface instead of long and cryptic URIs for classes, instances, and properties. Provided that they have been loaded into the RDF store before, labels can be extracted from product vocabularies and instance data. The back-end is based on the Python programming language and the Jinja 2 templating engine for generating custom-tailored SPARQL queries, that are then submitted to the SPARQL endpoint. The application can be run on any general-purpose Apache Web server instance where the mod_python and mod_wsgi modules are active. If supported by the SPARQL endpoint, keyword searches execute over a full-text search engine such as Lucene <ref type="bibr" target="#b12">[13]</ref>.</p><p>Every facet view of the user interface is generated by executing its own, unique SPARQL SELECT query, which has to take into account the current set of constraints. It was not feasible to formulate one single, comprehensive query that would encompass the data to create all facet views. First, due to the fact that each facet view uses different aggregate functions over the data (e.g. group results by manufacturers vs. vendors). And second, because large and complex queries are very costly. Facet-value pairs of faceted search interfaces are represented in RDF by properties and instances (or values). Similarly, conjunction of facets and disjunction of facet values within a facet are realized in SPARQL using a sequence of triple patterns and UNION statements, respectively. In Listing 1.1, we outline a simplified SPARQL SELECT query to generate the facet view for product prices. Let us suppose that a user has chosen to select products that are manufactured either by Apple Inc. (e.g. URI http://www.apple.com/#company) or by Microsoft Corporation (e.g. URI http://www.microsoft.com/#company), i.e. he has checked these both organizations in the facet view for manufacturers. The resulting filter constraint is a UNION clause in SPARQL. Please note also that the number of affected instances is counted in the projection part of the query, which is further displayed to the user through the search interface.</p><p>As of writing this paper, we had successfully tested our prototype with three different SPARQL endpoints compliant with the W3C SPARQL 1.1 standard, namely Virtuoso Open Source Edition<ref type="foot" target="#foot_4">5</ref> , the Jena-TDB-based Fuseki SPARQL server <ref type="foot" target="#foot_5">6</ref> , and Stardog<ref type="foot" target="#foot_6">7</ref> .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Incremental Search Strategy</head><p>We understand product search as an iterative process (cf. <ref type="bibr" target="#b3">[4]</ref>). On every search step, the user is presented with dynamic facet and result views that represent the search space pruned according to the current selection. Product search supposedly ends as the user is satisfied with the results or not able to find a path leading him to better search results. Until then, the search process may move across multiple search paradigms. Based on the quality of the results, several iterations with possibly different search paradigms might be necessary. As illustrated in Fig. <ref type="figure">4</ref>, the user might start with keyword search, faceted navigation, or directly head to the product details page of an item from the initial result set. The search interface may also accept user feedback that is incorporated into the graph with product data, as we will explain in Sect. 2.6.</p><p>The task of presenting optimal facets to users is a key challenge towards increasing the search efficiency in an incremental search. A user would ideally be presented, in every search step, the options (facets and facet values) that yield the best possible partition of the search space (and create the highest utility to the user). At the time of writing this paper, the splitting strategy employed for our prototype was relying on presenting to the user the five most frequent facets and facet values in the data (cf. <ref type="bibr" target="#b13">[14]</ref>). Nevertheless, the authors are aware of several weaknesses that this methodology is suffering from. E.g., if all or only very few items exhibit a certain feature or feature-value pair, then the current algorithm tends to over-or underrate their value for the search progress. Similarly, the algorithm is misguided as multiple instances of the same feature belong to a single item. Superior approaches have been suggested in literature, e.g. <ref type="bibr" target="#b13">[14]</ref> mention some popular facet-pair suggestions strategies, namely relying on frequency, probability, and the information gain. The authors in <ref type="bibr" target="#b14">[15]</ref> further give an overview over different metrics appropriate for product search to help decide which facets shall be presented to the user. Investigating these alternative presentation strategies is planned as future work for our prototype.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Search Start</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Search Stop</head><p>Instance-based Search Filtering Keyword Search Faceted Navigation User Feedback Fig. <ref type="figure">4</ref>. Iterative product search across multiple search paradigms</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5">Instance-based Search Filtering</head><p>As a complement to faceted search filters over aggregated data, we herein present a novel idea for supporting the incremental search process, namely instance-based search filtering (see Fig. <ref type="figure" target="#fig_4">5</ref>). Occasionally, a user might be looking at the details of a product offer and discover features that he intends to consider for the next filtering steps in the search process. Now, by going back to the result list, the user would lose track of the respective features and only find them by chance in the list of displayed product features, namely if they are ranked high with respect to the current items in the option space. Of course, the same holds true for feature values and individuals. From a user interaction point of view, we can prevent this level of indirection by letting the user apply filters directly from the product details page. As a nice side effect, this solution facilitates the comparison among similar products via navigating across a collection of items that have some product features in common. Even though not entirely the same, this approach is in a way similar to the pivoting operation in user interfaces (e.g. <ref type="bibr">[16, pp. 83-84]</ref>, <ref type="bibr" target="#b16">[17]</ref>), where a concept can be found through shared features with another concept. In order for this to work well, the properties in the RDF graph need to be consolidated first, which is a research problem on its own that we are not going to further discuss in this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.6">User Feedback with Conceptual Consolidation</head><p>For product search with incremental learning, it is not only vital to assist in navigating and pruning the option space, but also to actively engage the user in the search process, to incorporate user feedback, and to exploit user intelligence. As already depicted in the mock-up in Fig. <ref type="figure" target="#fig_1">1</ref>, a viable approach is to integrate a user dialog in the search interface. This approach goes one step beyond the traditional relevance feedback method (e.g. <ref type="bibr">[18, pp. 178-179]</ref>, <ref type="bibr" target="#b9">[10]</ref>), where systems It is outside the scope of this paper to mention here all possibilities of providing explicit user feedback in search systems. Yet, a user interface could ask the user for help in consolidating conceptual correspondences, e.g. to approve a possible match of the two features gr:hasGTIN-14 and ex:productId (possibly by way of presenting an instance for each of the two properties), leading to the following equivalence axiom: gr:hasGTIN-14 owl:equivalentProperty ex:productId .</p><p>In the current state of our implementation, we accept user feedback in the form of a dialog window that pops up as there exist possibly interesting super-concepts with respect to the concept in regard (see Fig. <ref type="figure" target="#fig_5">6</ref>). On this account, a user is able to expand the search scope, e.g. going from an automobile (vso:Automobile) to its super-concept motorized road vehicle (vso:MotorizedRoadVehicle).</p><p>:automobile_instance1 a gr:MotorizedRoadVehicle . :automobile_instance2 a gr:MotorizedRoadVehicle .</p><p>Technically, this method offers a guided way to materialize inference rules. In an RDF environment, corresponding axioms can be easily added to the existing data as named RDF graphs -potentially managed on a per-user basis. The named graphs can then persist in the RDF store to reflect past user experiences, or otherwise be cleared before a user starts a new search.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Use Cases</head><p>During the development of our adaptive faceted search interface, we set up two slightly different use cases. The first one uses product offers and properties derived from a car listing Web site, and the second one is based on real e-commerce data from the Web. Both use cases have been published on the Web in a combined demo application <ref type="foot" target="#foot_7">8</ref> . We further made the respective (raw and random sampled) datasets available online <ref type="foot" target="#foot_8">9</ref> . A thorough user study of our application has already been conducted in a prior publication yielding promising results <ref type="bibr" target="#b3">[4]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Use Case with Product Data from a Car Listing Web Site</head><p>Our first use case serves to illustrate how the search interface works with densely populated product data and a homogeneous dataset, as it is mostly the case for vertical search engines. More precisely, we have crawled a random sample of automobile offers from the car listing Web site mobile.de (i.e. as much as we could collect from the visible part of the Web pages). Our random sample originally consisted of 875 automobile offers, but due to overly complex SPARQL queries for the restricted hardware at hand (dual core processor clocked at 2.6 GHz and with 16 MB cache size, 4 GB RAM), we had to reduce it to 25 instances. In particular, we selected 50 result pages at random (based on a URL template with a random page number, a lower bound, and an upper bound for the price) with 20 results each, summing up to 1000 automobile offers. Effectively, we could obtain 875 car offers. Of these 875 car offers we again picked a random sample of 25 car offers that we loaded into our demonstrator. Quite clearly, there exist more appropriate technologies (e.g. relational databases) than the Semantic Web to implement this specific use case in terms of efficiency and ease of use, but they complicate data integration (e.g. include information from DBPedia) and oppose schema flexibility during run-time. The most obvious finding in this use case is that due to the high data quality of such a Web site (i.e. almost every car offer is populated exhaustively), drilling down the option space based on feature values works quite well. However, as almost every automobile offer uses the same features, the discriminatory value of product features is low. The present use case can be accessed from the online demo by selecting the SPARQL endpoint with the "automobile dataset".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Use Case with Real Product Data From the Web</head><p>We set up another use case that operates over real e-commerce data from the Web. We collected product offer data by crawling Web shops with household appliances. More precisely, we selected shops from the Rakuten Deutschland platform<ref type="foot" target="#foot_9">10</ref> that were classified into categories related to the general topic household. By that we could obtain 23 Web shops, from which 17 shops contained GoodRelations <ref type="bibr" target="#b0">[1]</ref> markup. Thanks to earlier research <ref type="bibr" target="#b18">[19]</ref>, we have already been in the possession of high-quality, structured product model data by BSH <ref type="foot" target="#foot_10">11</ref> in the form of a BMEcat catalog. We used that one to augment the product offer data from the Web crawl with product features to enable deep product comparison. Furthermore, we executed cleansing and consolidation rules on the data such as to convert all price specifications to a common currency "Euros". For the purposes of this paper, we abstract from possible challenges related to this task. The interesting aspect of this use case is the integration of various, heterogeneous data sources in a single faceted search interface for products. Because the product features are mainly coming from a single BMEcat product catalog, the product features are homogeneous as well. However, because the product types are rather diversified (e.g. coffee machine vs. vacuum cleaner), the variety in product features is higher than in the previous use case. A live demonstration of this use case with real product data is available via the provided online tool by selecting the SPARQL endpoint that contains the "household crawl data".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions and Future Work</head><p>In this paper, we have proposed a data-driven, adaptive faceted search interface as a means to navigate the giant but often sparse graph of e-commerce data on the Web. Our prototype enables incremental, multi-parametric searches over RDF data based on distinguishing properties and attributes of product items, provides explicit support for user learning about the option space, and allows to enhance the conceptual consolidation during the search process. As the search interface does not rely on a rigid conceptual schema with hard-wired product features, it is suitable for arbitrary product domains and product evolution. We have demonstrated the viability of our approach by setting up two use cases with (a) data derived from a car listing Web site, and (b) real product data collected from the Web.</p><p>As future work, we envision to enhance our work by more accurate and context-sensitive user dialogs, personalization and diversification of facet and result views, and in particular to utilize metrics for improving the efficiency of the search process, i.e. to propose features and values that best possible partition the search space and promise the highest utility to a given information need. We will also try to incorporate machine-learning components.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Mock-up of a faceted search interface for e-commerce</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Screenshot of our faceted product search interface</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Iteratively expanded navigation controls</figDesc><graphic coords="6,370.38,119.37,106.68,169.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. 5 .</head><label>5</label><figDesc>Fig. 5. Product details window with instance-based search filtering</figDesc><graphic coords="10,137.19,118.26,340.97,238.09" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Fig. 6 .</head><label>6</label><figDesc>Fig. 6. User feedback dialog window</figDesc><graphic coords="11,169.35,115.83,276.68,162.32" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>Listing 1.1. Simplified SPARQL SELECT query to generate the facet view for prices with a constraint on the manufacturer</figDesc><table><row><cell>PREFIX gr: &lt;http://purl.org/goodrelations/v1#&gt;</cell></row><row><cell>SELECT (COUNT(DISTINCT ?offer) AS ?cnt) ?pspec ?price ?code</cell></row><row><cell>WHERE {</cell></row><row><cell># price-specific graph pattern</cell></row><row><cell>?offer a gr:Offering ;</cell></row><row><cell>gr:includes ?product ;</cell></row><row><cell>gr:hasPriceSpecification ?pspec .</cell></row><row><cell>?pspec a gr:UnitPriceSpecification ;</cell></row><row><cell>gr:hasCurrencyValue ?price ;</cell></row><row><cell>gr:hasCurrency ?code .</cell></row><row><cell># the next part of the graph pattern is common to all facet views</cell></row><row><cell>## filter constraint on manufacturer</cell></row><row><cell>{ ?product gr:hasManufacturer &lt;http://www.apple.com/#company&gt; } UNION</cell></row><row><cell>{ ?product gr:hasManufacturer &lt;http://www.microsoft.com/#company&gt; }</cell></row><row><cell>## additional filter constraints ...</cell></row><row><cell>}</cell></row><row><cell>GROUP BY ?pspec ?price ?code</cell></row><row><cell>ORDER BY ?price</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://schema.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://www.ebay.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.amazon.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">http://purl.org/vso/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">http://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">http://jena.apache.org/documentation/serving_data/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">http://stardog.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">http://www.ebusiness-unibw.org/tools/product-search/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_8">http://www.ebusiness-unibw.org/tools/product-search/datasets/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_9">http://www.rakuten.de/, a hosting platform for online shops.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_10">Bosch und Siemens Haushaltsgeräte GmbH, a manufacturer specializing on household appliances.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgments. The authors would like to thank Florian Ott for the kind support in setting up and hosting the infrastructure of our search prototype.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">GoodRelations: An Ontology for Describing Products and Services Offers on the Web</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hepp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2008)</title>
				<meeting>the 16th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2008)<address><addrLine>Acritezza, Italy; Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="329" to="346" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Search Patterns</title>
		<author>
			<persName><forename type="first">P</forename><surname>Morville</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Callender</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>O&apos;Reilly Media</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Tabulator: Exploring and Analyzing Linked Data on the Semantic Web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chilton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Connolly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dhanaraj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hollenbach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sheets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd International Semantic Web User Interaction Workshop (SWUI 2006)</title>
				<meeting>the 3rd International Semantic Web User Interaction Workshop (SWUI 2006)<address><addrLine>Athens, GA, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Adaptive Faceted Search for Product Comparison on the Web of Data</title>
		<author>
			<persName><forename type="first">A</forename><surname>Stolz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hepp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 15th International Conference on Web Engineering (ICWE 2015)</title>
				<meeting>the 15th International Conference on Web Engineering (ICWE 2015)<address><addrLine>Rotterdam, The Netherlands; Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="420" to="429" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Faceted Search. Synthesis Lectures on Information Concepts</title>
		<author>
			<persName><forename type="first">D</forename><surname>Tunkelang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Morgan &amp; Claypool</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Exploratory Search: From Finding to Understanding</title>
		<author>
			<persName><forename type="first">G</forename><surname>Marchionini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="41" to="46" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A Survey of Faceted Search</title>
		<author>
			<persName><forename type="first">B</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Feng</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Web Engineering</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="41" to="64" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Intelligent E-Store: Easy Interactive Product Selection and Comparison</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Sacco</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh IEEE International Conference on E-Commerce Technology (CEC 2005)</title>
				<meeting>the Seventh IEEE International Conference on E-Commerce Technology (CEC 2005)<address><addrLine>Munich, Germany</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2005">2005</date>
			<biblScope unit="page" from="240" to="248" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Dynamic Taxonomies and Faceted Search: Theory, Practice, and Experience</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Sacco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Tzitzikas</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Springer</publisher>
			<pubPlace>Berlin Heidelberg</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Finding the Flow in Web Site Search</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Hearst</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Elliott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>English</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sinha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Searingen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">P</forename><surname>Yee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="42" to="49" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Semantic Search: Reconciling Expressive Querying and Exploratory Search</title>
		<author>
			<persName><forename type="first">S</forename><surname>Ferré</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hermann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 10th International Semantic Web Conference (ISWC 2011)</title>
				<meeting>the 10th International Semantic Web Conference (ISWC 2011)<address><addrLine>Bonn, Germany; Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="177" to="192" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">SPARQL 1.1 Query Language</title>
		<author>
			<persName><forename type="first">S</forename><surname>Harris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Seaborne</surname></persName>
		</author>
		<ptr target="http://www.w3.org/TR/2013/REC-sparql11-query-20130321/(2013" />
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Mccandless</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Hatcher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Gospodnetic</surname></persName>
		</author>
		<title level="m">Lucene in Action</title>
				<imprint>
			<publisher>Manning Publications Co</publisher>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
	<note>2 edn</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Personalized Interactive Faceted Search</title>
		<author>
			<persName><forename type="first">J</forename><surname>Koren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th International World Wide Web Conference (WWW 2008)</title>
				<meeting>the 17th International World Wide Web Conference (WWW 2008)<address><addrLine>Beijing, China</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="477" to="485" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Facet Selection Algorithms for Web Product Search</title>
		<author>
			<persName><forename type="first">D</forename><surname>Vandic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Frasincar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Kaymak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013)</title>
				<meeting>the 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013)<address><addrLine>San Francisco, CA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="2327" to="2332" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">User Interface Design</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stefaner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ferré</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Perugini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Koren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Dynamic Taxonomies and Faceted Search</title>
				<editor>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Sacco</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Tzitzikas</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="75" to="112" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">From Overview to Facets and Pivoting for Interactive Exploration of Semantic Web Data</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Brunetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>García</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal on Semantic Web and Information Systems (IJSWIS)</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="20" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Baeza-Yates</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">A</forename><surname>Ribeiro-Neto</surname></persName>
		</author>
		<title level="m">Modern Information Retrieval: The Concepts and Technology behind Search</title>
				<imprint>
			<publisher>Addison-Wesley</publisher>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
	<note>2 edn</note>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Using BMEcat Catalogs as a Lever for Product Master Data on the Semantic Web</title>
		<author>
			<persName><forename type="first">A</forename><surname>Stolz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Rodriguez-Castro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hepp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 10th Extended Semantic Web Conference (ESWC 2013)</title>
				<meeting>the 10th Extended Semantic Web Conference (ESWC 2013)<address><addrLine>Montpellier, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="623" to="638" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
