=Paper=
{{Paper
|id=Vol-2467/paper-13
|storemode=property
|title=Portfolio Management: How to Find Your Standard Variants
|pdfUrl=https://ceur-ws.org/Vol-2467/paper-13.pdf
|volume=Vol-2467
|authors=Daniel Jeuken,Frank Dylla,Thorsten Krebs
|dblpUrl=https://dblp.org/rec/conf/confws/JeukenDK19
}}
==Portfolio Management: How to Find Your Standard Variants==
Portfolio Management: How to Find Your Standard Variants Frank Dylla and Daniel Jeuken and Thorsten Krebs 1 Abstract. necessarily), two trucks, i.e. spring mounted axles, and four wheels Product portfolio management is one of the most important tasks with bearings. Optional components may be sliptape, paintings, ris- for companies to secure their future competitiveness. A crucial as- ers, shock pads, nose/tail guards, etc. Consider that not all truck types pect for portfolio management decisions is the volume of products fit to each deck and that not all wheel/truck combinations fit. An in- sold and the sales numbers development over time – one could say: dividual composition of these components is sought by the customer What are your current or upcoming best selling products, often used – leading to very few skateboards that are sold with the exact same as “standard products” in sales? Especially for these products it is composition of deck, axes, wheels, and so on. worthwhile to take actions in reducing costs and improving revenue. From our experience, for new multi-variant products it is common Regarding discrete products the task is, simply said, looking for prod- that product managers guess which variants will be the top selling ucts with the highest quantities or profit sold or significant changes in ones in future, i.e. the decision is based on their gut feeling. Evalu- these quantities over a certain period of time (business intelligence). ation of the quality of their initial decision is barely feasible as only In contrast, this approach does not work satisfactorily with complex standard BI techniques are available. These techniques are not suffi- multi-variant products. An aggregated view on products, i.e. ignoring cient for portfolio planning of multi-variant products as they ignore the sales numbers of the variants with their individual features, does the structural information of the variants themselves. Standard tech- not give sufficient insights or may even lead to wrong decisions in niques typically analyze the list of sales over a certain period of time portfolio management. The recurring combination of features across and use product identifiers as the key to identify which one is sold the multiple types of products might be more important than the type of most and predict how this will change in future. But for variant-rich the product itself. In this paper we investigate differences in identify- products that are sold in lot size 1 the product identifier cannot be ing potential standard products in comparison to identifying potential used as a key criterion. It is rather important to compare characteris- standard variants of products. Thereon we derive a high-level frame- tics and their values. For example, comparing the product ID, which work how standard variants may be deduced from a given set of vari- identifies an individual composition, does not identify that a lot of ants described by characteristics and provide an algorithmic sketch skateboards use the same wheels. Thus we consider it is important to and discuss resulting challenges from a pragmatic perspective. use the configuration model - containing product data and rule sets - as an input for a new kind of algorithm that does not compare on the level of product identifiers but on the level of a set of product char- 1 Introduction acteristics, which supports better predictions of top-selling variants, Portfolio management is a dynamic decision process evaluating, i.e. what the market really is willing to pay for. prioritizing, reorganizing, cancelling, etc. products throughout their In order to support the step of evaluating past sales in comparison lifecycle [6]. As managers have to deal with uncertain and changing to original plannings on the level of characteristics and their values, information portfolio management is a complex task. One of the ma- we introduce the notion of central representative and propose a po- jor difficulties in product portfolio management is predicting what tential calculation thereof. We discriminate against the term ”stan- the customers are willing to pay for. This includes knowing the mar- dard product”, standard variants respectively, as this term describes ket, i.e. knowing the current customer demand and knowing how it products which were actually built many times. As you will see later will most likely change in future. Thus, product portfolio manage- a central representative does not need to have been built once. We are ment is complex already when considering simple products, but gets convinced that central representatives will help to recognize changes more complex when considering configurable and thus multi-variant in client behavior – or the market in general – over time and whether products. adaptions are reasonable in order to meet the goals of portfolio man- But what exactly is the challenging part of this task? Forecasts agement. are created in order to plan the supply chain and production capac- We start with introducing our understanding of product configura- ities. For simple products this is a rather straightforward task: one tion, which is constraint-based, and introduce diverse variant spaces can assign a sales forecast to the product identifiers, e.g. material for later use (Sec. 2.1). We consider definitions of discrete standard numbers, and use the bill of materials in order to get a list of compo- and basic products (Sec. 2.2) and elaborate how this relates to stan- nents that are required. For variant-rich products such as skateboards, dards for multi-variant products (Sec. 2.3). We sketch our approach however this is not that easy. In general necessary components of a in Section 3. To avoid misunderstandings with varying definitions of skateboard2 are the deck, i.e. a plank (in general wooden, but not ‘standard’ we introduce the term central representative of a given variant space described by characteristics (Sec. 3.1). In order to find 1 encoway GmbH, Germany, email: {dylla,jeuken,krebs}@encoway.de such a central representative a measure of dissimilarity needs to be 2 see en.wikipedia.org/wiki/Skateboard (retrieved 8.5.2019) Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). defined (Sec. 3.2). In Section 3.3 we exemplify how a representative In more detail, if an assignment contains multiple values for one can be computed and how a deviation can be derived thereon. We or more characteristics ki it contains at least two different atomic summarize our considerations in an algorithm sketch (Sec. 3.4). We solutions. For example, given an assignment where a characteristic discuss our approach from various pragmatic perspectives (Section contains a value, e.g. for deck A and deck B, this means that the 4). First, we revisit the choice of the set of product vectors P for customer at some configuration front-end can still decide for either which the central representative should be computed (Sec. 4.1). Fur- deck A or B resulting in a valid assignment definitely. In the re- thermore, in general data is not available in a well defined form in re- mainder of this paper we will use ki synonymously for referring to ality, i.e. not all characteristics and values are defined in a consistent the characteristic itself as well as for its evaluation, i.e. value assign- manner (Sec. 4.2). Additionally, multi-variant products are subject to ment, as the meaning becomes clear from the context in most cases. change such that older products may falsify the results that should In case of ambiguities we clarify the meaning. reflect the current state (Sec. 4.3). Finally, we consider derivation of In order to discuss notions of standard variant, we need to define parameters and further prerequisites necessary in order to apply the several solution spaces based on the CSP definition. algorithm presented to real data. (sec. 4.4). Definition 3. Theoretical Configuration Space (S ∅ ): This space contains all combinations of characteristics which are possible from 2 Theoretical background a mereological perspective, i.e. from all minimalist configurations to all maximum configurations containing all optional components, 2.1 What is product configuration? but ignoring further constraints. In terms of CSP this is reflected by ⟨K, D, ∅⟩. Felfernig et al. [8] base their understanding of configuration on a def- inition in [18]: configuration is a special case of design activity where Consider the skateboard example. The minimalist configurations the artifact being configured is assembled from instances of a fixed consist of a deck, two trucks, and four wheels as these components set of well defined component types which can be composed conform- are necessary to obtain a functional skateboard from a mereologi- ing to a set of constraints. A configuration task is the selection of the cal perspective. A configuration with two decks is not part of S ∅ , components and their properties to get a valid combination of the whereas configurations with different truck or wheel sizes are part product components, the outcome is also called product variant [4]. of S ∅ , although this may make the skateboard unusable. Maximum As a result the component types span a space of potential config- configurations consist of the above components plus all optional urations which are further restricted by constraints, which limit the components which can be installed in parallel. As risers and shock possibilities of how components can be combined. Practice shows pads are installed in the same place3 , there is no maximum config- that the restrictions may arise from technical feasibility, legal re- uration containing both components. As we are interested in valid quirements, product-design, or marketing purposes. In general, com- configurations in the end, we need to define a second configuration ponents or properties of a product are described by characteristics in space. formal product representations. There are additional notions to de- scribe properties of components like attributes or features. For rea- Definition 4. Valid Configuration Space or variant space (S): This sons of simplicity we will restrict to the term characteristics through- space contains only valid configurations, i.e. configurations that sat- out this paper. Based on this we can define a product characteristics isfy all constraints of the underlying configuration model. Therefore vector, product vector for short. it is given S ⊆ S ∅ . As valid configurations are also called variants, we will talk of ’variant space’ in the remainder of this paper. The Definition 1. Given a set of characteristics ki ∈ K with i ∈ variant space directly relates to the space of atomic solutions of a {0, . . . , N − 1} with values from domain Di ∈ D each, we define CSP. [k0 , k1 , . . . , kN −1 ] as the product (characteristics) vector p⃗. We note that N denotes the maximum number of possible charac- Taking the skateboard example again, configurations with differ- teristics. Especially, if a characteristic is optional, a specific domain ent wheel sizes are not part of S, whereas configurations with differ- value must be available defining that this characteristic is not chosen, ent wheel colors may be, depending whether such configurations are evaluated respectively. As combinations of domain values are not re- permissible with reference to the configuration model. stricted the product vector may reflect a product which is technically Other variant spaces may be defined on the ‘trading status’ of each not feasible. p⃗ contained, for example: Naturally, a configuration task can be considered as a constraint satisfaction problem (CSP), see e.g. [8]. Definition 5. Offered variant space (S O ) and sold variant space (S $ ): S O is defined as the space of all variants which have been Definition 2. Constraint Satisfaction Problem (CSP): ⟨K, D, C⟩: A quoted to customers. S $ contains only those variants which have CSP is defined as a set of variables ki ∈ K with i ∈ {0, . . . , N − 1} been sold. with values from domain Di ∈ D together with a set of constraints cj ∈ C and j ∈ {0, . . . , M −1} defining which combinations of values As in mass customization the variant space is rather large, in gen- are allowed or not. A solution of a CSP is a consistent evaluation to eral it can be assumed that not all variants were sold or offered. Nev- all variables (value assignment to all ki ), i.e. no constraint is vio- ertheless, in the very extreme case all possible variants have been lated. Otherwise the assignment is called inconsistent. Furthermore, offered and sold and thus S $ ⊆ S O ⊆ S. Based on the presented def- within an assignment values of ki do not need to be unique, i.e. that inition of variant space, an arbitrary number of variant spaces based ki may contain multiple valid values which can be considered as al- on relevant criteria can be defined for investigation and comparison. ternatives. Given a solution with a unique value per ki , it is called 3 between trucks and deck an atomic solution or according to variant management a variant. 2.2 Discrete standard and basic products efficiency of mass production or as stated in [8]: . . . is based on the idea of the customer-individual production of highly variant prod- In the context of discrete products a central term for entrepreneurial ucts under near mass production pricing conditions. In general, in considerations and decisions is standard product. this context products are multi-variant, i.e. there is more than one According to the Lexico dictionary (Oxford)4 on a general level option available. One important question for variant management is a standard is (a) a certain quality or attainment level reached or (b) how the variants can be compared in a reasonable manner. Buchholz something considered exemplary or as a measure or model according states that all variants need to be considered with respect to their to which others assess to (cf. benchmark, scale, guideline). product type and that relevant characteristics need to be selected for a Following information given by Wikipedia a technical standard is reasonable comparison [2]. Buchholz also discusses the relationship an established norm or requirement in regard to technical systems. between variants and standard. It is critically scrutinised whether a It is usually a formal document that establishes uniform engineering standard variant is the one with maximum quantity, some sort of av- or technical criteria, methods, processes, and practices. In contrast, erage or a yardstick for other variants. Nevertheless, it is specifically a custom, convention, company product, corporate standard, and so emphasized that a standard variant is something special compared to forth that becomes generally accepted and dominant is often called other variants. For comparison a measure of discrimination between a de facto standard.5 variants is necessary, but not all characteristics are important such Specifically considering discrete products a wide variety of def- that relevant characteristics need to be selected. In our notation this initions is available which take different aspects into account. For means, that the product vector K = [k0 , k1 , . . . , kN −1 ] is abstracted example, in the Gabler Wirtschaftslexikon standard product is de- to a reduced product vector K′ ⊂ K with N ′ < N . fined with a focus on quality: Products that have a generally agreed Buchholz also presents different views from literature whether (standardized) minimum quality. Product changes focus on quanti- such a standard variant needs to be part of the variant space itself ties, prices and times. Standard products can be traded on the stock or not. For example, according to Boysen a basic or standard product exchange.6 Other definitions base on the criteria whether they are may be a theoretical construct that has never been physically man- ready for batch production.7 ufactured [1]. Whether it needs to be manufacturable at all remains From our experience the term standard product is mainly used in unclear. For further details we refer to [2]. two different ways in manufacturing industry: On the one hand, to define a standard variant based on aggregated 1) Either as a label of a product which should be presented as a sales numbers over all variants of a variant space is unreasonable standard (defined before product is sold at all) from our perspective as it exactly ignores the possible differences of 2) or as a product which is established on the basis of different cri- the available variants. Such an approach could be rather considered teria e.g. it is sold the most within a given context, e.g. a region as a ’standard variant space’. On the other hand to only consider the or a specific type of customer. sales numbers of each variant individually bears problems as well, it even may lead to wrong interpretations. In general, the exact same In order to dissolve this ambiguity we speak of a predefined standard variant is not sold more than ’a few times’. For example, consider in case of 1) and a derived standard in case of 2). 100 skateboards of 96 different variants sold. This means that most Furthermore, a basic product – also called generic product – is variants were sold once and two may have been sold three times each. defined to realize the core benefit of the product. This implies that This also means that the standard variants may change within a few a basic product cannot be further reduced without losing the possi- new sales. Therefore, from our perspective it would not be useful to bility of intended product usage.8 In case of a skateboard this is the define these ”top selling” variants as standard variants. ability to ride on such a board with pushing oneself forward by foot. From the perspective of the product management and with the aim A basic product may not be saleable, e.g. due to legal restrictions. An of an efficient portfolio handling, it is also useful for multi-variant extended product is one which offers additional benefit to customers. products on the one hand to offer and place a standard variant in the In the context of manufacturing companies ... a basic product might market and on the other hand to analyze which product variant is sold be a rather simple good that experiences relatively consistent con- most or is never sold at all. sumer demand ....9 Sometimes a core product is differentiated from From our point of view the notion of a basic product can be di- the product: The core product of a book is information. It is not the rectly transferred to a basic variant: to cover the basic functionality book itself.10 The book itself is then the basic product. necessary characteristics must be set with corresponding values re- flecting a ”basic” quality. In case of a skateboard a deck, two trucks, 2.3 Multi-variant products and standard variants and four wheels each of rather low quality. In case only one compo- The term mass customization defines the challenge of anticipating nent (characteristic) is missing, it is no variant of a skateboard any- individualized products to be manufactured simultaneously with the more as it is non-functional. In addition top-level variants can be 4 given: variants with a maximum number of characteristics evaluated www.lexico.com/en/definition/standard (retrieved with corresponding values reflecting a high level of quality, i.e. based 2.8.2019) 5 en.wikipedia.org/wiki/Technical_standard (retrieved on the configuration model no further feature can be selected with- 2.5.2019) out deselecting at least one other feature. In some cases, depending 6 wirtschaftslexikon.gabler.de/definition/ on the context, it might appear that not more options are chosen in standardprodukte-42877 (retrieved 6.5.2019, in German) case of a professional board compared to a basic one, but components 7 e.g. www.lawinsider.com/dictionary/standard-products (retrieved 2.5.2019) of better quality, e.g. the material types of the deck or the wheels. In 8 wirtschaftslexikon.gabler.de/definition/ the end this must be reflected in the underlying metrics. produkt-42902 (retrieved 2.5.2019, in German) In Figure 1 we depict relations between basic (bi ), top-level (ti ), 9 www.businessdictionary.com/definition/ and ‘regular’ (pi ) product variants. Furthermore, each variant may basic-product.html (retrieved 8.5.2019) 10 www.marketing91.com/five-product-levels/ (retrieved also be computed or defined as a standard variant (marked with ☆ ). 8.5.2019) The level, i.e. the number of selected characteristics and ’rank’ of t0 t☆2 S ∅ , S, S O and S $ (see Sec. 2.1) we already defined specific P, i.e. sets where all p⃗j fulfill certain properties. As we are interested t1 in the ”best representative” of P we define a central representative p9 of P based on a measure of similarity or dissimilarity. p6 p10 Definition 6. Central representative r⃗P and deviation ν⃗P : r⃗P is the p5 product vector of a product space P which has the overall minimal p☆7 p8 dissimilarity to all p⃗j ∈ P considering a measure M. Furthermore, p0 we define the deviation ν⃗P to be the vector of the individual devia- tions νi of assigned values per characteristic ki (see Figure 2). p1 p3 p4 Simplified, one could say r⃗P is the average product of P regarding the measure M or more specific, the one that minimizes the dissimi- p2 larity to all pi ∈ P. The deviations νi can be defined in multiple ways. We detail this in Section 3.3. We note that, based on this definition, it is not necessary, that r⃗P ∈ P. Furthermore, as several solutions may have the same aggregated distance regarding pi ∈ P based on M, b0 b☆1 ⋯ bn−1 bn there may be no unique central representative r⃗P . We sketch how a measure M can be defined below. Figure 1. Schematic structure of basic (bi ), top-level (ti ), and standard (pi ) variants inbetween. Variants of any of these levels can be designated as a standard variant (☆ ). P ν0 r⃗P corresponding values is reflected by height. The edges depict that the variants differentiate in a single characteristic.11 Naturally, basic ν1 variants are rather at the bottom and top-level variants at the top of the figure. Nevertheless, it is possible to have feature combinations that aren’t separable and so basic as well as top-level variants can exist on different levels. But irrevocably basic variants must not have Figure 2. Schematic depiction of a variant space P with its central another connected variant ‘below’ them, top-level variants ‘above’ representative r⃗P and its deviation ν⃗ = [ν0 , ⋯, νn−1 ] with n = 2 respectively. All other variants inbetween have ‘smaller’ predeces- sors and ‘larger’ successors. Standard variants can be defined on any of these levels. Consider our skateboard example. We define a basic variant as standard skateboard for beginners, a mid-range skateboard as a standard for trained half pipe skaters and a top-level variant as a 3.2 Measure M: Dissimilarity of variants standard for skate competitions. M could be either a measure of similarity or dissimilarity. Although M can be defined arbitrarily, e.g. based on ∑, ∏, min, max or some 3 Approach complex aggregation function, we stick to a specific distance based measure, and thus dissimilarity, for reasons of simplicity. For future We believe that the availability of a standard variant in the sense of research a promising link is given by case-based reasoning (CBR) as an average product of the most selling variants is very helpful in port- the notion of similarity is central to this approach [9, 16, e.g.]. Nev- folio management. In order to prevent misunderstandings with other ertheless, although CBR has been applied to product configuration, definitions (see Sec. 2.2 and 2.3) we will talk of a central representa- to our knowledge specific product similarities have not been exten- tive of a variant space instead. One possibility to exploit the central sively investigated in the literature; exceptions are [12, 21, 20]. As- representative in portfolio management is to compare it with prede- pects of similarity have been studied in the context of CSP [7, 5, e.g.] fined standards and adapt them accordingly. In order to discuss the resulting in the need of Euclidian distance measures from a practical challenges in defining such a central representative in the context of perspective. In the following of this section we summarize aspects of multi-variant products, we need to give some formal definitions re- similarity measures relevant to our approach. garding configuration spaces (Section 3.1). We define a measure M A Euclidian distance measure δ for some entities o, p and q is (Sec. 3.2) for computation of a central representative (Sec. 3.3). We reflexive: δ(p, p) = 0, symmetric: δ(p, q) = δ(q, p), and transitive: close this section with an algorithmic sketch, integrating definitions δ(o, q) ≤ δ(o, p) + δ(p, q). For reasons of simplicity, we will talk of from preceding subsections (Sec. 3.4). distance in the remainder of this paper. In order to define a distance measure M consider a variant space, 3.1 Definition of a central representative of a e.g. S, and a subset thereof, e.g. S $ (S $ ⊆ S). This implies that p⃗ ∈ S variant space and q⃗ ∈ S $ contain the same characteristics kx with x ∈ {0, ⋯, N −1} in the same order. First, we need a distance between values from the In Section 2.1 we introduced the notion of a product (configuration) same characteristic δx for all x ∈ {0, ⋯, N − 1}, for example: vector p⃗, which holds all characteristics which define a certain prod- uct. Let P = {⃗p0 , p⃗1 , . . . , p⃗P −1 } be a set of P product vectors. With δx (kxp , kxq ) = ∣kxp − kxq ∣ (1) 11 For reasons of simplicity we neglect that connected variants may differ in with kxp denoting the value of the x-th characteristic of product more than one characteristic as they are inseparable due to the rule set. vector p⃗, kxq of q⃗ respectively. Depending on the type of scale of the characteristic (i.e. nominal, ordinal, interval or ratio scale) cer- N −1 tain calculations may not be possible, e.g. subtraction or addition on r, p⃗i ) with p⃗i ∈ S $ r⃗S $ = argmin ∑ ∆(⃗ (5) ⃗∈S r i=0 nominal scale is not reasonable. On nominal scale only the equality between values can be determined, i.e. are two values the same or In the first case (Eq. 4) r⃗ has been sold itself as r⃗ ∈ S $ , whereas not. If a level of similarity is required at least an ordinal scale for the in the second case (Eq. 5) r⃗ is a general technically feasible variant values must be available, i.e. a linear order for the values for the def- r ∈ S). One could even relax that the representative not even needs (⃗ inition of a median. For interval or ratio scale a mean can be defined. to be technically feasible, and thus select r⃗ ∈ S ∅ (cf. 2.3). This results in a distance vector of distances per characteristic In conjunction with the central representative it is also of interest ’how large’ or ’how widespread’ the set is, which it represents. For ⎡ δ0 (k0p , k0q ) ⎤ ⎡ d0 ⎤ ⎢ ⎥ ⎢ ⎥ this we need a notion of deviation, diameter, or variance. For now, ⃗ p , q⃗) = ⎢ δ(⃗ ⎢ ⋮ ⎥ ⎢ ⎥=⎢ ⋮ ⎥ ⎥ (2) ⎢ ⎥ ⎢ ⎥ we stick with the notion of average deviation per characteristic (νi ) ⎢δN −1 (kN p −1 , kN −1 ) q ⎥ ⎢dN −1 ⎥ for all p⃗j ∈ P as it suffices our needs. ⎣ ⎦ ⎣ ⎦ The next step is to aggregate these individual distances into a sin- 1 P −1 gle distance value describing the distance between two product vec- νi = ∑ δi (ri , ki ) j (6) P j=0 tors. It needs to be reflected that not all characteristics are equally important. Therefore a weighting factor wi needs to be integrated Then ν⃗ = [ν0 , . . . , νN −1 ] denotes a vector of all deviations per char- for each characteristic. If characteristic ki should not be considered, acteristic. the corresponding wi needs to be set to zero. Furthermore, not all It is not beneficial if a central representative covers a ’too wide distances for individual characteristics may have the same range and range’ of variants, i.e. one or several νi are rather high for some thus, one characteristic may dominate others, therefore a normaliz- characteristics ki , as it would not give much help for portfolio op- ing factor vi is necessary. For example, consider a distance vector timization, especially if members of the set of product vectors are with N = 3 where d0 represents a binary distance (d0 ∈ {0, 1}), not distributed uniformly. Consider the case depicted in Figure 4. d1 represents a distance between zero and five (d1 ∈ [0, 5]), and d2 Products were sold in two rather distant regions of the variant space. represents a distance between zero and thousand (d2 ∈ [0, 1000]). Considering them as one set would lead to a representative which In most cases d2 would dominate or overrule d1 , which in turn also does not reflect the situation at hand (orange space). We need to look dominates d0 . Therefore, it is import that all value ranges of the ki for separate subsets, i.e. clusters, instead, to come to a result depicted are normalized, e.g. to values between zero and one. This results in a by the two separate regions S0$ and S1$ (light blue). As we have de- distance between two product vectors p⃗ and q⃗. fined a central representative and a deviation thereof, various cluster 1 N −1 analysis methods are applicable, e.g. centroid-based or density based p , q⃗) = ∆(⃗ ∑ wi vi di (3) clustering. For an overview of existing clustering methods we refer N i=0 to [13, 23, 17, e.g.]. The adequate selection of a clustering method We give a schematic impression of a distance ∆ between two will be a crucial task for the successful application of the approach product vectors r⃗S and r⃗S $ in Figure 3. Nevertheless, it still remains proposed. open how central representatives like r⃗S and r⃗S $ can be determined For pragmatic reasons we restrict our considerations to clustering based on ∆. parameters (assuming a clustering method given) to maximum de- viation per characteristic and a minimum number of members per cluster. Therefore, a vector of thresholds θ⃗ = [θ0 , . . . , θN −1 ] for the S corresponding characteristics ki and θ# for the minimum number needs to be given. r⃗S ∆ r⃗S $ S $ S r⃗S S $ r⃗S $ r⃗S $ 1 S1$ r⃗S $ 0 S0$ Figure 3. Schematic depiction of a general variant space (S, dark blue) rS ) and sales variant space (S $ , light blue) and its central representative (⃗ rS $ ). The ∆ depicts the difference also with its central representative (⃗ between r⃗S and r⃗S $ . Figure 4. Schematic depiction of cluster splitting due to high variance in single cluster consideration. 3.3 Calculation of central representatives We defined the central representative r⃗P as a variant which minimizes the overall dissimilarity (cf. Definition 6). Furthermore, it is not a 3.4 An algorithm sketch requirement that r⃗P is itself an element of P. Consider these two We summarize the parts of how to find adequate representatives for definitions of central representatives of S $ . a given set of product vectors P (e.g. sold variants S $ ) out of an- N −1 other given set of product vectors Q (e.g. the overall variant space r⃗S $ = argmin ∑ ∆(⃗ r, p⃗i ) with p⃗i ∈ S $ (4) S) in Algorithm 1. In the beginning only the single cluster P exists ⃗∈S $ r i=0 for which central representative r⃗P and deviation ν⃗P is calculated. If there is any deviation νi which is above its defined threshold θi P Maimon state that a focus on relevant characteristics has several ad- needs to be splitted in two clusters.12 In the next iteration at least two vantages [3]. For example, removal of irrelevant characteristics im- clusters need to be considered. At some point clusters with only few proves efficiency as well results are more conclusive and easier to members are computed (< θ# ). We ignore these clusters from fur- interpret due to the focus on key features. Nevertheless, a too lim- ther consideration in this iteration. We continue with increasing the ited choice of characteristics leads to information loss and reduces number of clusters until we obtain a set of clusters with each con- the quality of the results. For further information on feature selection taining a central representative with each deviation per characteristic methods we refer to [22]. If a characteristic is considered irrelevant below the given threshold (∀i νi ≤ θi ). We note, that we increase the for an evaluation at hand wi (cf. Eq. 3) should be set to zero in the number of clusters iteratively and start the cluster splitting from the calculations. For all characteristics with wi > 0 the relative relevance original set P on purpose. If not doing so the order of considering needs to considered very carefully as slight changes may lead to sig- pi ∈ P might have an effect and thus, would lead to different results nificant changes in the classification of the data. For example, if the if pi are represented in a different order. results are designed for adapting standard products a slight change in the parameters might lead to a different variant. Input: P, Q, θ,⃗ M, θ# Result: S ∶= set of central representatives for P out of Q no of clusters ∶= 1 ; S = {P}; 4.2 Data preparation R ∶= calculate list of representatives from Q for all sj ∈ S based on M; Θ ∶= calculate list of all deviations for corresponding rj and sj Practice shows that within companies often master data is not co- based on M; ordinated. In general, this leads to multiple characteristics contain- while ∃i, j with νij ∈ Θ > θi for any sj ∈ S do ing the same information, potentially represented differently, e.g. us- no of clusters ∶= no of clusters +1 ; ing different text strings, numbers, or different units. As products S ∶= clusterSplitting(P, M, no of clusters); are subject to permanent change, the inconsistency of data increases delete all sj ∈ S from S where ∣sj ∣ < θ# ; over time. In order to ease and automatize analysis in the long run, R ∶= calculate list of representatives from Q for all sj ∈ S data synchronization is inevitable. Nevertheless, considering given based on M; data, data cleansing is essential to prevent bad decisions based on Θ ∶= calculate list of all deviations for corresponding rj and bad analysis results [24]. Maletic described the data preparation as sj based on M; a multistep procedure comprising (1) definition of error types, (2) end finding instances of these errors, and (3) correction of them [11]. He Algorithm 1: Algorithmic sketch for deducing central representa- emphasizes that each of these steps is a complex task in itself. tives out of the variant space Q based on the variants given by the To give an idea of the effort that needs to be taken, we present a variant space P. non-exhaustive list of different error types in (master) data below. A common error type is conditioned by different notions or represen- tations, i.e. characteristics and values holding the same information, but represented with different spellings. These errors often arise from 4 Pragmatic considerations inconsistent usage of blanks, hyphens, prefixes, suffixes or abbrevia- Not all characteristics of product vectors must be considered as rele- tions. Different units may also be used, e.g. due to different intended vant information might be covered by other characteristics (Sec. 4.1). usage. Characteristics holding complex information, i.e. connected In general, data provided by companies needs some preparation as information, are problematic as well as further processing might be this data is often not consistent concerning characteristics’ and val- limited. A common example is a combined string representation of ues’ denomination (Sec. 4.2). We consider temporal restriction of length, width, and height (sometimes without a given unit) instead of data and how observations over time can be derived (Sec. 4.3). Be- having individual numerical characteristics for each of them. A tricky fore Algorithm 1 can be applied value ordering and weighting factors type of errors comprises misleading value specifications, e.g. frame for each characteristic must be available 4.4. sizes termed with numerical values which have to be interpreted in a specific manner so that naive calculation is not possible. Consider frame sizes 5, 8, and 12 which reflect three consecutive frame sizes. 4.1 Contentual evaluation The physical difference in size cannot be calculated from these val- In order to support a business question a contentual focus on data is ues, instead other data like length, width, and height of certain com- necessary. Simplified, two levels of contentual constraints can be dif- ponents need to be considered. Furthermore, the conceptual distance pi ) can be considered. ferentiated. First, the context of each variant (⃗ cannot be calculated from these ’values’: as the categories are con- Context can be defined on different perspectives, e.g. in which shop secutive the distance is 1 and not 3 and 4. In order to prevent trim- or region the variant has been generated, by whom, whether it has ming of leading zeros, such terms may be even stored as strings. been sold, only offered, or never even offered (cf. S, S O , S $ in Sec. Elimination of errors of this type requires very specific semantic 2.1), or for which application, domain respectively, it was bought if knowledge, which makes it not only hard to spot these errors, but this information is available. Second, the relevance of each charac- also to correct them. For further information on data cleansing and teristic should be checked as consideration of all characteristics may data quality we refer to [14, 15]. block the view on relevant information, for example, the color of the As a result of data preparation we get a set P of product vectors trucks or some non-visible strings on some component. Chizi and p⃗i with consistent [k0 , k1 , . . . , kN −1 ], i.e. with comparable informa- tion stored in the same characteristic with the same value for every 12 How this is actually done depends on the clustering algorithm chosen. product variant. 4.3 Temporal evaluation interval and ratio scale data naturally a distance is given – assum- ing the characteristic is not misinterpreted as such and is ’only’ on Products are subject to permanent change. They are designed, devel- ordinal scale (cf. Sec. 4.2). For ordinal data this is not the case, a oped, sold, and refined, potentially several times. Such refinements linear ordering has to be defined manually. Although, an ordering of and changes in expectations of the market may result in changes of terms like ”basic”, ”advanced”, ”expert”, and ”professional” might central representatives. Therefore, regardless whether from technical be considered trivial in the first place, it is a tricky, currently man- or sales perspective, it is not reasonable to consider outdated data, ual and time consuming task and thus, also error prone. Looking at which leads to the application of methods from time series analysis. the terms ”expert” and ”professional” the question is whether ”ex- Furthermore, as sales numbers for the products of interest may vary pert” is before or after ”professional” or equal in the end as they significantly over time, consideration of single time points (or rather relate to completely different aspects of the product. It may be pos- small time intervals only) may show varying results for each of these sible that a reasonable distance between terms like ”basic” and ”ad- time points. vanced” is definable, i.e. how far is ”basic” from ”advanced”, ”ad- One applicable method in order to generate smoothed results is vanced” from ”expert” and so forth. We refrain from this as the re- the sliding window approach (SWA), see for example [10]. The ba- sulting costs would not be in a reasonable cost-benefit relation for sic idea is to evaluate overlapping intervals, so called windows, to an industrial company. For a start an equidistant conceptual distance get smoother and more consistent results. We depict relevant param- measure should suffice, i.e. all preceding and succeeding terms in a eters for the SWA in Figure 5. Let d denote the overall period under linear order have the same distance. review (one year in the given example). The window size is denoted In business intelligence it is common to not only consider the num- by w (three month) with w ≪ d and the corresponding step size by ber of sold units, but also profit or the number of sold units per quote s (1 month) with s ≤ w. Analysis is then performed for data in each is part of the analysis for example. On the one hand a pragmatic way window separately. without changing the algorithm is to modify the original set by re- The choice of specific values for d, w and s is very crucial and ducing or multiplying the number of equal product vectors in P. On must be considered carefully, especially if conclusions on future de- the other hand an additional weighting factor per pi could be intro- velopments are drawn. For example, if d is chosen too small the duced, which would be much more efficient regarding run-time of corresponding data set may be too small to generate significant re- the algorithm. sults. Statistical or learning methods support a reasonable choice, [19, e.g.]. Algorithm 1 can be extended in such a way that not only a sin- 5 Summary and Outlook gle time point is considered (P), but subsequent sets, i.e. subsequent windows. On this basis developments of the central representatives To support portfolio management for multi-variant products we ex- and their corresponding deviations can be observed: how they ’won- amined definitions of ’standard’ for discrete and multi-variant prod- der around’ and how the number of clusters increases or decreases. ucts. To differentiate from these definitions we introduced the term central representative of a variant space. We derived an algorithmic d sketch based on a measure M to calculate representatives for clusters with reasonable size. Finally, we discussed tasks necessary before the algorithm can be applied to real data. As the work on central representatives for a variant space is in an early stage many tasks and questions remain open. The straightfor- jan feb mar apr may jun jul aug sep oct nov dec ward next step is to experiment with large scale real data instead of few small toy examples. Furthermore, the determination of weighting w factors wi is a challenging task. We need to investigate to what ex- s w tent learning methods, either supervised or unsupervised, may ease the task. Once real data is available it will be a worthwhile task to reconsider alternative definitions of distance functions, e.g. investi- Figure 5. Sliding window approach with d denoting the overall period gating the impacts of choosing ∏, min, max or some other function considered, w denoting the window size and s the step size. as aggregation operators. In theory it is possible that multiple central representatives are available. If this case also appears with real data, we need to investigate how to deal with it. 4.4 Weighting factors and value ordering Acknowledgement The approach is significantly based on the definition of the measure We thank the anonymous reviewers for critically reading the M containing the distances δ and ∆, which in turn contains weight- manuscript and providing helpful comments for clarification and im- ing factors wi for each characteristic. First experiments have shown provement of the manuscript. that distance measures on nominal data very much influences the re- sults significantly as the distance can be only either zero or one. A rather low weighting factor for these characteristics compared to the REFERENCES other ones may be a solution, but must be evaluated further in future. For now we tend to ignore these characteristics as dissimilarity is in [1] Nils Boysen, Variantenfließfertigung, volume 49, Deutscher Univer- sitätsverlag, 2005. most cases reflected in other characteristics as well. Our gut feeling, [2] M. Buchholz, Theorie der Variantenvielfalt: Ein produktions- und ab- but without proof, tells us that similar effects may be the case for satzwirtschaftliches Erklärungsmodell, SpringerLink : Bücher, Gabler integrating ordinal scale data with interval and ratio scale data. For Verlag, 2012. [3] Barak Chizi and Oded Maimon, ‘Dimension reduction and feature se- [23] Rui Xu and Donald C. Wunsch II, ‘Survey of clustering algorithms’, lection’, in Data Mining and Knowledge Discovery Handbook, 2nd ed., IEEE Trans. Neural Networks, 16(3), 645–678, (2005). eds., Oded Maimon and Lior Rokach, 83–100, Springer, (2010). [24] Marcus Zwirner, ‘Datenbereinigung zielgerichtet eingesetzt zur perma- [4] Bjørn Christensen and Thomas D. Brunoe, ‘Product configuration in the nenten Datenqualitätssteigerung’, in Daten- und Informationsqualität: eto and capital goods industry: A literature review and challenges’, in Auf dem Weg zur Information Excellence, chapter 6, 101–120, Springer Customization 4.0, eds., Stephan Hankammer, Kjeld Nielsen, Frank T. Fachmedien Wiesbaden, (06 2018). (in German). Piller, Günther Schuh, and Ning Wang, pp. 423–438, Cham, (2018). Springer International Publishing. [5] Jean-François Condotta, Souhila Kaci, Pierre Marquis, and Nicolas Schwind, ‘A syntactical approach to qualitative constraint networks merging’, in Logic for Programming, Artificial Intelligence, and Rea- soning - 17th International Conference, LPAR-17, Yogyakarta, Indone- sia, October 10-15, 2010. Proceedings, eds., Christian G. Fermüller and Andrei Voronkov, volume 6397 of Lecture Notes in Computer Science, pp. 233–247. Springer, (2010). [6] Robert Cooper, Scott Edgett, and Elko Kleinschmidt, ‘Portfolio man- agement - fundamental to new product success’, The PDMA Toolbook for New Product Development, (01 2002). [7] Frank Dylla, Jan Oliver Wallgrün, and Jasper van de Ven, ‘Merging qualitative information: Rationality and complexity’, in QUAC2015: Workshop on Qualitative Spatial and Temporal Reasoning: Computa- tional Complexity and Algorithms, (September 2015). [8] Alexander Felfernig, Lothar Hotz, Claire Bagley, and Juha Tiihonen, Knowledge-based Configuration: From Research to Business Cases, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1 edn., 2014. [9] Gavin Finnie and Zhaohao Sun, ‘Similarity and metrics in case-based reasoning’, Information Technology papers, 17, (03 2002). [10] Yupeng Hu, Cun Ji, Ming Jing, Yiming Ding, Shuo Kuai, and Xueqing Li, ‘A continuous segmentation algorithm for streaming time series’, in Collaborate Computing: Networking, Applications and Workshar- ing - 12th International Conference, CollaborateCom 2016, Beijing, China, November 10-11, 2016, Proceedings, eds., Shangguang Wang and Ao Zhou, volume 201 of Lecture Notes of the Institute for Com- puter Sciences, Social Informatics and Telecommunications Engineer- ing, pp. 140–151. Springer, (2016). [11] Jonathan I. Maletic and Andrian Marcus, Data Cleansing: A Prelude to Knowledge Discovery, 19–32, Springer US, 07 2010. [12] Hiroya Inakoshi, Seishi Okamoto, Yuiko Ohta, and Nobuhiro Yugami, ‘Effective decision support for product configuration by using CBR’, in International Conference on Case-Based Reasoning, (01 2001). [13] Leonard Kaufman and Peter J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, John Wiley & Sons, 1990. [14] Lukasz A. Kurgan and Petr Musilek, ‘A survey of knowledge discov- ery and data mining process models’, Knowl. Eng. Rev., 21(1), 1–24, (March 2006). [15] Ohbyung Kwon, Namyeon Lee, and Bongsik Shin, ‘Data quality man- agement, data usage experience and acquisition intention of big data an- alytics’, International Journal of Information Management, 34(3), 387 – 394, (2014). [16] Michael M. Richter and Rosina O. Weber, Case-Based Reasoning - A Textbook, Springer, 2013. [17] Lior Rokach, ‘A survey of clustering algorithms’, in Data Mining and Knowledge Discovery Handbook, 2nd ed., eds., Oded Maimon and Lior Rokach, 269–298, Springer, (2010). [18] D Sabin and R Weigel, ‘Product configuration frameworks-a survey’, Intelligent Systems and their Applications, IEEE, 13, 42 – 49, (08 1998). [19] Hela Sfar and Amel Bouzeghoub, ‘Dynamic streaming sensor data seg- mentation for smart environment applications’, in Neural Information Processing - 25th International Conference, ICONIP 2018, Siem Reap, Cambodia, December 13-16, 2018, Proceedings, Part VI, eds., Long Cheng, Andrew Chi-Sing Leung, and Seiichi Ozawa, volume 11306 of Lecture Notes in Computer Science, pp. 67–77. Springer, (2018). [20] Sara Shafiee, Katrin Kristjansdottir, and Lars Hvam, ‘Automatic identi- fication of similarities across products to improve the configuration pro- cess in eto companies’, International Journal of Industrial Engineering and Management, 8(3), 167–176, (2017). [21] Hwai-En Tseng, Chien-Chen Chang, and Shu-Hsuan Chang, ‘Applying case-based reasoning for product configuration in mass customization environments’, Expert Syst. Appl., 29(4), 913–925, (2005). [22] Cen Wan, Hierarchical Feature Selection for Knowledge Discovery, Advanced Information and Knowledge Processing, Springer Interna- tional Publishing, 2019.