=Paper=
{{Paper
|id=Vol-1679/paper6
|storemode=property
|title=Complements and Substitutes in Product Recommendations: The Differential Effects on Consumers' Willingness-to-pay
|pdfUrl=https://ceur-ws.org/Vol-1679/paper6.pdf
|volume=Vol-1679
|authors=Mingyue Zhang,Jesse Bockstedt
|dblpUrl=https://dblp.org/rec/conf/recsys/ZhangB16
}}
==Complements and Substitutes in Product Recommendations: The Differential Effects on Consumers' Willingness-to-pay==
                  Complements and Substitutes in Product
                Recommendations: The Differential Effects on
                     Consumers’ Willingness-to-pay
                          Mingyue Zhang                                                       Jesse Bockstedt
                         Tsinghua University                                                    Emory University
                            Beijing, China                                                        Atlanta, GA
         zhangmy.12@sem.tsinghua.edu.cn                                                  bockstedt@emory.edu
ABSTRACT                                                                 product on its own webpage, consumers are often exposed to
Product recommendations have been shown to influence                     additional relevant products as recommendations, such as on
consumers’ preferences and purchasing behavior. However,                 Amazon.com. It is an open research question whether consumer’s
empirical evidence has yet to be found illustrating whether and how      purchase decisions such as willingness-to-pay for the focal product
the recommendations of other products affect a consumers’                would be affected by the display of ‘other products’ and the type of
economic behavior for the focal product. In many e-commerce              information presented with these recommendations. Some work
websites, a product is presented with co-purchase and co-view            has studied this in offline settings [31], but little work addresses this
recommendations which potentially contain complement and                 issue in the online recommendation context.
substitute products, respectively. Very little research has explored     Particularly, the types of ‘other products’ in a recommendation set
the differential effects of complementary and substitutable              may vary, but they can be generally categorized into substitutes and
recommendations. In this study, we are interested in how the type        complements [19]. Substitutes are products that can be purchased
of recommendations of other products impact the consumers’               instead of each other, while complements are products that
willingness-to-pay for the focal product, and additionally how the       experience joint demand. For example, when a user is evaluating a
recommendations’ price and the consumers’ decision stage                 cellphone, it’s reasonable to recommend other phones to better
moderate this effect. We conducted a 2x2x2 randomized                    match his/her needs, but it also makes sense to recommend batteries,
experiment to examine how the consumers’ willingness-to-pay is           chargers, or cases, which commonly make up a bundle
affected by these factors. Experimental results provide evidence         recommendation [34]. Research has shown that consumers factor
that there is no significant main effect difference between              into consideration the source or type of information when making
complementary and substitutable recommendations. But we                  their purchase decisions [26]. Economic theory suggests that
observed a significant interaction effect between recommendation         complements increase demand for the focal product because of
type and decision stage, which highlights the importance of timing       increasing the possibility of users finding added value for the focal
in recommender systems. Other findings include that consumers            product [31]. With the increased demand, the market price will
are willing to pay more for a specific product when the price of a       increase accordingly, leading to higher individual willingness-to-
recommended product is high, as well as when they are in later           pay. Whereas, substitutes decrease demand for the focal product
decision stages. These findings have significant implications for the    due to competition, which leads to a lower market price and
design and applications of recommender systems.                          individual willingness-to-pay. Despite the extensive literature
                                                                         about complements and substitutes in economics, little research
Keywords                                                                 studied their differential effects in online recommendation settings.
recommender systems; complements; substitutes; willingness-to-
pay; price; decision stage                                               In this study, we are interested in how the type of recommendations
                                                                         of other products impact the focal product, and additionally how
1.   INTRODUCTION                                                        the recommendations’ price and consumers’ decision stage
Recommender systems (RS) are becoming integral to how                    moderate this effect. The price of a recommended product can be
consumers discover new products and have a strong influence on           perceived as a contextual reference point. Along with the nature of
what consumers buy and view. For instance, 60% of Netflix content        cross-price elasticity of demands between complementary/
consumption originates from recommendations, and 35% of                  substitutable goods [20], the research question of how the price
Amazon sales are attributed to recommendations [11]. With their          interacts with recommendation type is of great value. Additionally,
utmost importance for retailers, some studies have been conducted        when shopping online consumers tend to have a two-stage decision
to explore the behavioral effects of recommender systems on              making process [9]: first, screening a large set of available products
consumers [1][2]. Specifically, prior studies found that consumers’      to identify a subset of the most promising alternatives; and second,
preferences and willingness-to-pay for a product can be influenced       evaluating the latter in more depth to make a final purchase
by the values of personalized recommendations. This provides             decision. Zheng et al. (2009) [33] argue that consumers prefer
evidence that consumers’ behavior is vulnerable toward                   different recommendations in each stage because they are driven by
recommendation agents. However, there is still space for                 different goals in each stage, i.e., in stage 1 they are comparing
researching the behavioral effects of detailed recommendation            alternatives, whereas in stage 2 they are reviewing candidates. For
features. For example, when evaluating the information of a focal        this reason, we are also interested in the moderation role of decision
                                                                         stages, which has not been previously studied.
 IntRS 2016, September 16, 2016, Boston, MA, USA.
 Copyright remains with the authors and/or original copyright holders.
In the following sections, we firstly introduce the theoretical         protector or other accessories. Figure 1 and Figure 2 provide an
underpinnings of this research, based on which five hypotheses are      example to this extent with the co-purchase and co-view
proposed. Then we discuss the design of a randomized experiment,        recommendations on Amazon.com when the focal product is ‘Dell
which measures consumers’ willingness-to-pay across different           Inspiron 15 i5558-5718SLV’.
recommendation scenarios. We present the results of our analysis
and discuss the implications for online retail practices and research
involving recommender systems. The investigation provides a new
angle for understanding the behavioral aspect of recommender
systems, as well as guidelines to further improve the design of
recommendation agents.
2.   THEORETICAL FRAMEWORK
In this section, we discuss the relevant theoretical foundations for
our research questions in terms of three dimensions:                       Figure 1. Amazon co-purchase product recommendation
recommendation type, price, and consumers’ decision stage. Based
on our primary goal of uncovering the differential effects between
different recommendation types, we firstly review research about
complementary/substitutable goods in the marketing literature, as
well as related empirical studies in the recommender system
context. Second, since the basic relationship between
complements/substitutes is their cross-price elasticity of demands,
we discuss theories describing how one product’s price might
influence consumers’ willingness-to-pay for another product.
Finally, we discuss the related literature on consumer decision
making processes and how this interacts with the recommender
systems.                                                                      Figure 2. Amazon co-view product recommendation
2.1   Complements and Substitutes in                                    By definition in microeconomics, if product A and B are
                                                                        complements, increased demand for product A should be
Recommendation                                                          associated with increased demand for product B [33]. This
The study of complements and substitutes has long been a central        complementary product effect leads to the co-purchase network.
subject in the marketing literature. Generally, products are            On the other hand, substitute products have an inverse demand
considered complements (substitutes) if lowering (raising) the price    relationship. This leads to the co-view network because consumers
of one product leads to an increase in sales of another [31].           tend to view and compare substitutes before making final purchase
Research shows that consumer choice is easily influenced by             decisions. Given that recommendations of both complements and
context and the set of alternatives available at the time of decision   substitutes are often presented along with the focal product, it is of
[24], thus, there is significant demand interrelationship among         significant practical interest to understand their differential effects.
substitutable and complementary goods [20]. Generally, two              Based on economic theory, complements increase demand for the
moderate or strong substitutes should be offered separately,            focal product, and the market price will increase accordingly,
whereas two complements should be offered as a bundle [32], in          leading to higher individual willingness-to-pay. Whereas,
order to maximize the profit. This is because the introduction of a     substitutes decrease demand for the focal product due to
complement may increase the possibility of buyers finding new           competition, which leads to a lower market price and individual
uses or added value for existing products [31], whereas the             willingness-to-pay. Thus, we put forth the following hypothesis:
substitutes can be consumed or used in place of one another.
                                                                        H1: Consumers tend to have higher willingness-to-pay for the focal
In the online recommendation scenario, a focal product is often             product when it is displayed together with a complementary
presented with several related items as the recommendations. Take           recommendation as compared to being displayed with a
Amazon.com as an example, each product is featured on its own               substitutable recommendation.
designated webpage, along with additional relevant products as
recommendations. Hence, a visible directed product network is           Note that this initial hypothesis is intended to test a main effect of
created whereby products are explicitly connected by hyperlinks         recommendation type and is price agnostic. We address the
[16]. Some studies [22][23][5] have examined the behavioral             moderating effects of price in the next section.
impacts of recommendation networks, with these studies primarily
focusing on the co-purchase recommendation network. Many e-             2.2   Pricing
commerce websites provide recommendations from two product              Researchers in marketing and economics have long recognized that
networks: co-view and co-purchase product networks. Only                pricing decisions sometimes incorporate more than one product.
recently have their differential effects been considered [16]. More     This is because consumers tend to respond to price relative to some
interestingly, co-purchase and co-view networks can be used to          reference price [25], such as the other prices in the store at the point
implicitly represent two recommendation strategies, that is,            of purchase. Both prospect theory and mental accounting suggest
recommending complementary and substitutable products,                  that consumers make decisions based on losses or gains relative to
respectively. Although not always the case, it is common that co-       a reference point. When consumers compare the actual price of the
purchased items contain complementary products while co-viewed          focal product with other reference prices, incidental price learning
products contain substitutable products. For example, a consumer        [21] occurs without any explicit intention to memorize them. In
buy a laptop computer may view several laptops, but purchase only       offline physical stores, retailers can attempt to influence positively
one laptop along with a complementary mouse, software, screen           the degree to which the sales of one item affect sales of other items
through in-store product locations and shelf space allocations, for      shopping goals in the first stage of a shopping process. Thus, their
example, locating two complements together. This is very similar         thinking is more abstract when in the first stage. Shopping goals
to the online recommendations where other products are co-               become concrete when they are closer to the final purchase point in
displayed with prices along with the focal product on the webpage.       the second stage. Therefore, marketing promotions for similar
When consumers evaluate the focal product, the prices of                 products (i.e., substitutes) are more effective in influencing
recommended products are expected to affect their purchase-              consumer’s spending when their goals are less concrete [10][14].
related decisions for the focal product. Contrast effect theory [30]     Researchers have also studied the behavioral effects of
suggests that the perceived value of the focal product’s price is        recommendations beyond standard substitute recommendations.
decreased (increased) when the recommendations presented along           For instance, Zheng et al. (2009) [33] argued that customers prefer
with have relatively higher (lower) price. That is, consumers are        different types of recommendations in different purchase stages. In
willing to pay more (less) when the product seems cheaper (more          the first stage of an online purchase process, customers are
expensive) relative to other products. Thus, we have the following       navigating webpages to compare a large set of similar products.
hypothesis about the main effect of price on willingness-to-pay:         Whereas in the second stage, customers already have a clear
                                                                         candidate set through which to make a purchase decision. In the
H2: The price of a recommended product has a positive influence          second stage, substitutive recommendations likely have little
    on consumers’ willingness-to-pay for the focal product, that         impact and recommendations of complement products may be
    is, consumers are willing to pay more when the price of the          preferred since they introduce items that can add value to the
    recommended product is higher than the price of focal                purchase of the focal product. Hence, we hypothesize the following
    product.                                                             interaction effect:
There are also significant cross-relationships among the sales of        H3A: There is an interaction effect between the stage and type of
substitutable and complementary products [20]. In particular, lower         recommendations, such that the stage has a positive effect on
price or promotion of one product can stimulate sales of a
                                                                            willingness-to-pay for the focal product when the
complement, whereas supplant sales of other substitutes. That is to         recommendation type is complement and it has a negative
say, when the prices of complementary goods go up, the purchase             effect on willingness-to-pay for the focal product when
likelihood for the other complementary good may go down, while              recommendation type is substitute.
if the price of one of the substitute goods goes up, the purchase
likelihood for others will go up [12]. Furthermore, by influencing       3.   EXPERIMENTS
the demand of a complementary/ substitutive product through its          Recommendations on Amazon.com and other platforms generally
price, the consumers’ willingness-to-pay for that product will be        fall into the complement and substitute product types through the
influenced as well [18]. This cross-relationship nature of substitutes   co-purchase and co-view lists. However, this is not always the case,
and complements indicates that the effect of price is stronger when      and other contextual factors and user self-selection can impact the
the competition between two products is high, which is common            effect and content of these recommendations. Therefore, to
among substitutes. Therefore, we hypothesize the following:              eliminate these confounding factors and conditions that naturally
H2A: There is an interaction effect between the price and type of        occur in the field, we designed a randomized controlled experiment
   recommendations, such that the positive influence of                  so that the recommendation type, recommendation price, and
   recommendation price on willingness-to-pay for the focal              decision stage can be cleanly manipulated. This controlled and
   product is stronger when the recommendation type is                   randomized treatment approach allowed us to test our hypotheses
   substitute as compared to complement.                                 and make causal inferences.
2.3   Consumers’ Two-Stage Decision Making                               3.1   Experiment Design and Participants
As illustrated in the previous literature [9][17][29], consumers are     Our hypotheses express the main effects of each of three main
often not capable of evaluating all available alternatives in great      factors (recommendation type, price of recommended product, and
depth, and this results in a two-stage decision making process. In       decision stage) as well as two two-way interaction effects (price x
the first stage, consumers usually browse a large set of available       type and stage x type) on willingness-to-pay for a focal product.
options and identify a small subset of candidates for further            Since the focus of the study is on the effects of complementary
consideration. In the second stage, they tend to thoroughly evaluate     versus substitutable recommendations, we did not hypothesize the
the candidates and make a final purchase decision. In the second         interaction between recommendation price and decision stage.
decision stage consumers’ motivation and determination to make           Additionally, since the three-way interaction among these factors
purchases are increased, thus having higher willingness-to-pay for       is complex and no prior theory provides insights to this regard, this
selected alternatives. Hence, we hypothesize the following main          interaction was also not hypothesized.
effect of decision stage:                                                A factorial experiment was used to test our hypotheses efficiently.
H3: The decision stage has a positive influence on consumers’            Specifically, a 2 (types of recommendation: complements vs.
    willingness-to-pay for the focal product, that is, consumers         substitutes) x 2 (recommended products’ price relative to the focal
    are willing to pay more when they are in the second stage.           product’s price: low vs. high) x 2 (decision stage: stage 1 vs. stage
                                                                         2) full-factorial design was used, which results in 8 treatment
Given this multistage mental process, Ge et al. (2012) [8] argued        conditions. Although, the full factorial provides the opportunity to
that the manner in which information is processed differs                test the three-way interaction and the price x stage interaction, our
systematically between the two decision stages. Their experimental       analysis focuses only on the main effects and interactions identified
results reveal that the timing of the presentation of specific pieces    in our hypotheses. The advantage of factorial experiment designs
of information about an alternative across shopping stages has a         over randomized controlled trails (RCTs) is that they provide more
great impact on consumers’ choice. This difference can be                statistical power with fewer participants. Generally, the objective
attributed to the shopping goals theory [14] and construal level         of RCT is to compare the individual experimental conditions to
theory [15]. Specifically, consumers are less certain of their           each other directly, while in a factorial experiment the
combinations of experimental conditions are compared, i.e., the         (i.e., consider-then-choose) adapted directly from [8], which the
main effects and interactions.                                          participants followed prior to measuring dependent variables.
                                                                        Participants were randomly assigned to one of two decision stage
We manipulated the three factors between subjects, who were             manipulations: complete the first and second stage of the shopping
undergraduates from a business school in a large public university      procedure before being shown the focal product page or complete
in North America. Subjects received extra credit for their
                                                                        only the first stage of the shopping procedure before being shown
participation in the experiment. We performed a power analysis          the focal product page. In all treatment groups, the focal product
with the assumption that the effect size of our model will be           and its posted price and description remained the same.
medium, i.e., 𝐶𝑜ℎ𝑒𝑛	
  𝑓 ( = 0.15 (Cohen, 1988). To achieve power
(1-𝛽) of 0.80 and a medium effect size, as well as maintain a           3.2   Stimuli and Procedures
significance level (α) at 0.05, the minimum sample size for a model     Participants were first instructed with a cover story that they were
with three main effects and two interactions is 92 (the calculation     participating in research focusing on consumers’ preferences and
was made by using the package ‘pwr’ in R).                              purchase behavior. They were also told that there were no right or
We published a web link for our online experiment to a large            wrong answers. Following these initial instructions, participants
undergraduate class containing approximately 400 students. 261          were randomly assigned to one of the eight treatment groups.
students clicked on the link to initiate the study. Participants were   Before the main task of the experiment, participants were asked to
randomly assigned to one of the eight treatment conditions. The         answer some basic questions about their opinions on electronic
median time of completion is 12 minutes. We dropped observations        products and were asked to rate several different electronic product
for 126 participants for the following reasons: not completing the      categories. Participants were told that their answers to these
study, completing the experiments in an extremely short time (e.g.,     questions would be used later by our system to predict their
less than 4 minutes), completing the study in very long time (e.g.,     preferences and make personalized recommendations for them.
more than 4 hours indicating the study was started, stopped, and        This pre-experiment task was used to eliminate their doubt about
started again later), and not passing manipulation checks.              the basis of recommendations in later steps.
Participants were informed that multiple manipulation checks
would be used to determine if they took the study seriously, which      In the main task of the experiment, subjects were asked to shop for
would then impact whether they received extra credit for their          a computer mouse and make a purchase decision. We implemented
participation. Since we collected the data as an online survey          the two-stage shopping decision process based on the methodology
instead of bringing students to a laboratory, it is a common            used in the marketing literature, (e.g., [8]). In the first stage process,
phenomenon that response and completion rates are relatively low        participants were presented with descriptions of 12 computer mice
[3]. As a result, 135 valid observations were left. The distribution    as search results on the e-commerce store. They were asked to
of the valid observations across treatment groups is shown in Table     browse and evaluate all the product information presented. The
1. As can be seen in Table 1, the randomly assigned treatments are      ‘Next Page’ button appeared only after 30 seconds had elapsed, as
evenly distributed among participants.                                  means of preventing participants from moving ahead too quickly,
                                                                        without reviewing the stage 1 products. Manipulation check
   Table 1. Experimental design and sample sizes per group              questions were also asked to check their impressions about these
                              complements      substitutes              initial 12 mice. In the second stage, we narrowed down the choice
                                                                        set to 2 mice and participants were asked to evaluate the two items
         Stage 1     Low           16               17                  and pick one of them as their final purchase choice. Participants
         Stage 1     High          17               18                  who were randomly assigned to the groups with condition ‘stage 1’
         Stage 2     Low           17               16                  would only go through the first stage (i.e., browsing information).
         Stage 2     High          17               17                  Comparably, those who were randomly assigned to groups with
                                                                        condition ‘stage 2’ would go through both the first and the second
Experiment participants were put in the scenario of purchasing a        stage.
new computer mouse on an e-commerce site like Amazon.com. We
chose a mouse because it is very common for consumers to
purchase electronic products online, and a computer mouse has a
large number of potential complements and substitutes with both
low and high prices.
For the first manipulation factor (i.e., type of recommendation),
participants were randomly assigned to one of two different
shopping interfaces: the focal product page with recommendations
of complementary products, or the focal product page with
recommendations of substitute products. The pages included
product descriptions directly from Amazon.com. We omitted brand
information in the descriptions to eliminate any brand bias. Note
that both the complementary and substitutable goods derived from
real recommendations from the website. For the second
manipulation factor, (i.e., price of the recommended products),
participants were randomly assigned to either a high or low price
condition. In the high price condition, the recommended products         Figure 3. Example Screenshot for the experimental interface
were higher in price than the focal product and in the low price                (i.e., substitutes recommendation with low price)
condition the opposite was true. For the third manipulation factor
(i.e., decision stage), we designed a two-stage shopping procedure
After the stage manipulation, participants viewed a specific focal       (1) Do you think the products in the section titled ‘We think you
mouse product page. Along with the focal mouse,                          may also like these items’ are complements to the mouse you
recommendations were presented according to the random                   evaluated? and (2) Do you think the products in the section titled
treatment condition. Figure 3 provides example screenshots of the        ‘We think you may also like these items’ are substitutes to the
recommendation interface. In the groups with ‘low (high) price’          mouse you evaluated?. In terms of the decision stage manipulation
condition, all the recommendations’ prices were slightly lower           check, we did not directly ask subjects’ perceptions about decision
(higher) than the price of the focal mouse. Subjects could click on      stage because this may be an incomprehensible terminology.
the recommendation to view a detailed description. The number of         Instead, we asked them ‘In the previous task you just finished,
clicks and duration on each webpage were also recorded.                  which procedure(s) have you been through?’, and provided the
                                                                         following possible responses: (1) Evaluating a large set of
After viewing the focal product page based on their treatment            alternative products as if you were gathering information in early
condition, participants were asked to provide their willingness-to-      stages of shopping and (2) Evaluating a small set of alternative
pay for the focal product. Upon completing the shopping task,            products as if you were trying to choose a final one to purchase.
participants responded to a set of manipulation check questions and
                                                                         An additional question was used to check participants’ perception
completed a short survey with demographic questions that we use          about the relative price: ‘What do you think of the price level of the
as control variables in our analysis (age, gender, level of education,   mouse you just evaluated?’.
computer experience, web experience, e-commerce experience,
familiarity with and attitudes toward recommender systems).              First, to check if participants consciously distinguished between
                                                                         complement and substitute recommendations, we compared their
3.3   Dependent Measure                                                  responses toward the two manipulation check questions about
Willingness-to-pay is the maximum amount an individual is willing        recommendation type. They responded with the following 5 claims:
to sacrifice to procure a product. Here we adopted the method used       “Definitely yes” (coded as 5), “Probably yes” (coded as 4),
by Rucker & Galinsky (2008), Rucker et al. (2014) and Kim & Gal          “Maybe” (coded as 3), “Probably not” (coded as 2), and “Definitely
(2014) [13][27][28] to measure willingness-to-pay. Participants          not” (coded as 1). As expected, participants in the complements
indicated their willingness-to-pay using a sliding scale where they      group perceived the recommendations as complements
could choose from 0% to 120% of the retail price. The interval (i.e.,    ( 𝑀01234525678 = 4.15, 𝑀8;<8=7;758 = 1.61, 𝑡 133 = 16.17, 𝑝 < 0.001 ),
0%-120%) is used to reduce the amount of response variance and           and not as substitutes ( 𝑀01234525678 = 1.42, 𝑀8;<8=7;758 =
to guard against outliers. We are interested in relative changes in      4.68, 𝑡 133 = −28.87, 𝑝 < 0.001). The extremely low p-values of
willingness-to-pay due to treatment effects and not in estimating        these tests help guard against any potential multiple comparison
point estimates of willingness-to-pay for specific products, thus the    issues. These results support the validity of our manipulation for
interval willingness-to-pay metric is sufficient. Furthermore,           recommendation types. Further, for the stage check question,
because the market price for the focal product was given, it’s not       participants in different stage conditions correctly perceived their
realistic to use the Becker–DeGroot–Marschak approach [7] or             decision stages ( 𝑀87GH5I = 1.19, 𝑆𝐷 = 0.39, 𝑀87GH5( = 1.97, 𝑆𝐷 =
second-price auctions to elicit willingness-to-pay. Figure 4 shows       0.17, 𝑡 133 = −14.81, 𝑝 < 0.001 ).         Finally,  a     successful
the interface for entering willingness-to-pay:                           manipulation check was observed for the price. Due to the contrast
                                                                         effect, people in the high price recommendation condition felt the
                                                                         price of focal product is lower than those assigned in the low price
                                                                         recommendation condition ( 𝑀M=HM = 2.96, 𝑆𝐷 = 0.57, 𝑀41N =
                                                                         3.33, 𝑆𝐷 = 0.68, 𝑡 133 = −3.45, 𝑝 < 0.001).
                                                                         4.2   Main Results
                                                                         Table 3 shows the mean and standard deviation of the willingness-
              Figure 4. Entering willingness-to-pay                      to-pay, measured as a percentage (0%-120%) of the focal product’s
                                                                         original price, in each of the eight treatment groups.
4.   RESULTS
Table 2 provides summary statistics on the demographic items               Table 3. Mean (SD) Willingness-to-pay (%) in Each Group
collected in our post-experiment survey.                                  Decision stage      Price     complements         substitutes
           Table 2. Demographic Summary Statistics                        Stage 1             Low       68.19 (15.86)       79.29 (17.66)
Control variables                        Summary                          Stage 1             High      87.65 (13.50)       93.78 (21.81)
        Age              Mean: 21.6, SD: 3.35                             Stage 2             Low       88.29 (14.29)       81.63 (16.04)
      Gender             50.37% --- female                                Stage 2             High      93.94 (10.87)       89.53 (13.14)
 Primary language        88.89% --- native English speaker               To test the proposed hypotheses, we need to make comparisons
 Experience with         88.15% --- spend more than 4 hours per          between combinations of groups – the main and interaction effects.
      Internet           day on the Internet                             Since there are three manipulated factors in the experiment, we
Experience with e-       77.04% --- browse e-commerce websites           started by conducting a three-factor Analysis of Variance
     commerce            more frequently than once a week                (ANOVA) and the results are presented in Table 4. The results
Familiarity with RS      85.19% --- familiar with RS                     reveal that the main effect of stage and price, as well as the
Attitude toward RS       82.22% --- RS is helpful for finding            interaction between recommendation type and stage are significant.
                         relevant items                                  We also conducted orthogonal contrast analysis, which provided
                                                                         consistent results and is omitted due to space constraints. Details
4.1   Manipulation Checks                                                can be obtained by contacting the authors directly.
In order to check the saliency of our recommendation type, two
questions were asked of participants in the post-experiment survey:
             Table 4. Results of Three-factor ANOVA
                  Df     SSE      MSE        F value        Pr(>F)
      Type         1     73        73        0.279          0.5981
      Price        1    4670      4670       17.786      4.66e-05 ***
      Stage        1    1200      1200       4.570         0.0345 *
  Type : Price     1      9        9         0.036          0.8505
  Type : Stage     1    1688      1688       6.430         0.0124 *
  Price : Stage    1     872      872        3.321         0.0708 +
   Residuals      127   33343     263
                                                                           Figure 6. Interaction effects (Left: type × price; Right: type × stage)
Significance levels: + 𝑝 ≤ 0.1, * 𝑝 ≤ 0.05, ** 𝑝 ≤ 0.01, *** 𝑝 ≤ 0.001.
                                                                           Furthermore, to get the effect size of our model as well as
Figure 5 displays the average willingness-to-pay under the                 coefficients of each factor, we estimated a sequential linear model.
combined conditions. Specifically, there is no significant difference      Firstly, we regressed the willingness-to-pay on a set of control
between groups with complement and groups with substitute                  variables, including gender, preference to the focal product,
recommendations ( 𝑀01234525678 = 84.76, 𝑆𝐷 = 16.67, 𝑀8;<87=7;758 =         experience with e-commerce, familiarity with and attitude toward
86.24, 𝑆𝐷 = 18.85, 𝐹 = 0.279, 𝑝 = 0.598). However, when the prices         recommender systems. After that, we included the five independent
of recommended products are relatively high, participants’                 variables of interests to the model. The type factor has two levels,
willingness-to-pay is much higher than that in relatively low prices       either complements (0) or substitutes (1), the price factor is either
condition( 𝑀41N = 79.48, 𝑆𝐷 = 17.48, 𝑀M=HM = 91.26, 𝑆𝐷 = 15.75, 𝐹 =        low (0) or high (1), and the stage factor is either stage 1 (0) or stage
17.786, 𝑝 < 0.001), which supported H2. Similarly, the difference          2 (1). The regression results of these two models are shown in Table
between conditions in stage 1 and stage 2 was in the expected              5, and the R-square increased 0.149 after including these five
directions ( 𝑀87GH5I = 82.60, 𝑆𝐷 = 19.99, 𝑀M=HM = 88.45, 𝑆𝐷 = 14.26,       variables. Consistent with the ANOVA results, we got significant
𝐹 = 4.570, 𝑝 < 0.05), thus supporting the hypothesis that consumers        positive coefficients for price and stage, indicating that consumers
are willing to pay more when they are in the second decision stage         have higher willingness-to-pay in the high price condition
(H3). Further, we calculated Cohen’s d to capture the effect size.         (compared to low price) and stage 2 condition (compared to stage
Cohen’s d is known as the difference of two population means and           1). In addition, the interaction effect between type and stage is also
divided by the standard deviation from the data. The effect size for       marginally significant at level α = 0.1. The coefficients in the table
price and stage factors are 0.669 and 0.372, which indicate                suggest that consumers shown a recommended product with high
medium-to-large and small-to-medium effect, respectively.                  price reported 10.353% higher willingness-to-pay in terms of the
                                                                           retail price. Similarly, consumers in the second stage reported
                                                                           10.832% higher willingness-to-pay in terms of the retail price than
                                                                           those in the first stage.
                                                                                      Table 5. Results of the Linear Regression Models
                                                                                                      Dependent variable: Willingness-to-pay (%)
                                                                                                Model 1: Control                   Model 2: Full model
                                                                                                  (𝑅 ( = 0.1699)                      (𝑅 ( = 0.3189)
                                                                                         Coefficient        T       P-      Coefficient         T       P-
                                                                                            (SE)        statistic value         (SE)        statistic  value
                                                                           Intercept       53.166         5.088   1.25e-       44.589         4.521   1.42e-
                                                                                          (10.450)                06***        (9.862)                05***
                                                                            Type                                                5.572         1.154   0.251
                                                                                                                               (4.828)
                                                                            Price                                              10.353         2.747   0.007
                                                                                                                               (3.769)                  **
Figure 5. Average Willingness-to-pay in Combined Conditions                 Stage                                              10.832         2.703   0.008
                                                                                                                               (4.008)                  **
For the interaction effects, Figure 6 demonstrates the difference of        Type *                                              2.279         0.427   0.670
mean values for complements and substitutes groups under                     Price                                             (5.336)
different price levels and decision stages, respectively. The left          Type *                                            -10.729        -1.885   0.062
                                                                            Stage                                              (5.691)                   +
figure shows no interaction between type and price, since both
                                                                          Preference        7.141         4.640   8.45e-        6.232         4.178   5.50e-
complements and substitutes groups have higher willingness-to-                             (1.539)                06***        (1.492)                05***
pay in high relative price conditions ( 𝐹 = 0.036, 𝑝 = 0.851 ). The        Gender           3.255         1.056   0.292         3.803         1.309   0.193
crossing lines in right figure indicate significant interaction effect                     (3.081)                             (2.905)
between type and stage ( 𝐹 = 6.430, 𝑝 < 0.05 ). Particularly, the         Experience       -1.126        -0.697   0.487        -1.054        -0.690   0.492
effect of decision stage is much stronger when the                                         (1.616)                             (1.527)
                                                                          Familiarity       0.600         0.388   0.699         1.412         0.947   0.345
recommendations are complements ( 𝑀87GH5I = 78.21, 𝑆𝐷 =
                                                                                           (1.548)                             (1.491)
17.62, 𝑀87GH5( = 91.12, 𝑆𝐷 = 12.28, 𝑡 65 = −3.29, 𝑝 < 0.001 ), while       Attitude        -6.362        -2.098 0.0378         -6.924        -2.467   0.015
it’s not significant under substitute conditions ( 𝑀87GH5I =                               (3.032)                   *         (2.807)                   *
86.74, 𝑆𝐷 = 21.18, 𝑀87GH5( = 85.70, 𝑆𝐷 = 15.14, 𝑡 66 = 0.23, 𝑝 =           Significance levels: + 𝑝 ≤ 0.1, * 𝑝 ≤ 0.05, ** 𝑝 ≤ 0.01, *** 𝑝 ≤ 0.001.
0.41 ). Therefore, our hypothesis of interaction effect H3A is
partially supported, and H2A is not supported.                             The effect size of our sequential multiple regression model is
                                                                                                                                      V
                                                                                                                                     STU WSTV
                                                                           calculated by Cohen’s 𝑓 ( . It is defined as 𝑓 ( =            V      , where 𝑅X(
                                                                                                                                      IWSTU
                                                                                                                                              (
                                                                           is the variance accounted for by a set of control variables 𝐴 and 𝑅XZ
is the combined variance accounted by 𝐴 and another set of                recommendations against different purchase stages, as well as
independent variables of interest 𝐵. Here in our model, we have           highlighting the importance of timing in recommender systems.
𝑅X( = 0.1699 and 𝑅XZ (
                       = 0.3189 , resulting in a medium-to-large          The price of recommended products was also found to have
effect size of 𝑓Z( = 0.219 . We also conducted a post-hoc power           significant effects on willingness-to-pay. Serving as a reference
analysis and with the 135 observations and the calculated effect          point, the prices of recommendations may be compared with the
size, the power of our model is 0.993 while maintaining the               retail price of the focal product, which could cause consumers to
significance level at 0.05. This provides evidence that the null          adjust their willingness-to-pay through incidental price learning.
effects are true and not the result of a lack of power.                   Under the condition with high recommendation prices, consumers
                                                                          tend to have higher willingness-to-pay for the focal product and
Since we measured the willingness-to-pay by restricting                   vice versa. This positive effect is significant no matter the type of
participants’ choice from 0% to 120% of the stated retail price, it       recommendation for other products.
may result in censored and non-normal data. Therefore, we
performed robustness check that removed the normality                     5.2   Theoretical Contributions
assumption. We conducted two non-parametric tests and ran a               Our research offers important theoretical contributions in the
Tobit regression model using both dummy coding (0,1) and effect           following ways. First, studies on product recommendations have
coding (-1,1). The coefficients and significance levels are               focused on the consumers’ different preferences and behaviors for
consistent with our baseline analysis. Due to space constraints,          one products in the presence of recommendations [1][2]. This paper
details of the robustness check are omitted.                              extends the behavioral research on recommender systems by
                                                                          studying the question whether recommending ‘other products’ on
5.   DISCUSSION AND CONCLUSIONS                                           the same webpage had an effect on consumers’ willingness-to-pay
5.1   Summary of Findings                                                 for the focal product. Second, prior research has not paid much
In this paper, we conducted a randomized experiment to examine            attention to different types of recommendations. Deriving from
the impact of complement and substitute recommendations on                economics literature, two typical relationships between products
consumers’ willingness-to-pay for the focal product. Table 6              are examined, that is, complements and substitutes. Third, our
summarizes our findings which corresponds to the proposed                 research is one of the few studies that examine the detailed
hypotheses.                                                               recommendation features, i.e., price of recommended products as
                                                                          well as consumers’ decision stage. Integrating the consumers’
                 Table 6. Hypotheses and Results
                                                                          decision process, we have a better understanding of the behavioral
                         Hypotheses                             Results   aspects of recommendations in online purchases.
 H1: Consumers tend to have higher willingness-to-pay         Not
 about the focal product when it is displayed together        Supported   5.3   Implications for Practice
 with a complementary recommendation as compared to                       Beyond contributing to advancing the academic literature, our
 being displayed with a substitutable recommendation.
                                                                          findings also have significant practical implications and may guide
 H2: The price of a recommended product has a positive        Supported   the platform’s recommendation strategy. The vulnerability of
 influence on consumers’ willingness-to-pay for the
 focal product, that is, consumers are willing to pay more
                                                                          consumers’ willingness-to-pay indicates the importance of ‘other
 when the price of the recommended product is higher                      products’ in recommender systems. This suggests new possibilities
 than the price of focal product.                                         for influencing product sales by manipulating the contextual
 H2A: There is an interaction effect between the price        Not         information of recommendations. Another important implication is
 and type of recommendations, such that the positive          supported   about the timing of recommendations, i.e., complementary
 influence of recommendation price on willingness-to-                     recommendations should be delayed to the second decision stage.
 pay for the focal product is stronger when the
 recommendation type is substitute as compared to                         5.4   Future Work
 complement.                                                              The main limitation of this study is that we are not observing real
 H3: The decision stage has a positive influence on           Supported   world purchases. In contrast, however, an advantage is that our
 consumers’ willingness-to-pay for the focal product,
                                                                          controlled randomized experiment allows us to make causal
 that is, consumers are willing to pay more when they are
 in the second stage.
                                                                          inferences, and thus trading external validity for identification.
 H3A: There is an interaction effect between the stage        Partially   Future research can be developed by exploring other factors
 and type of recommendations, such that the stage has a       Supported   associated with recommended products, such as average ratings,
 positive effect on willingness-to-pay for the focal                      quality, pictures of complements/substitutes and so on.
 product when the recommendation type is complement                       Additionally, we can use observational data to empirically validate
 and it has a negative effect on willingness-to-pay for the               the findings of our experiment. By examining the relationships
 focal product when recommendation type is substitute.                    between recommendation network properties and products’ sales,
Experimental results provide evidence that there is no significant        we will have additional support for the influence of complementary
main effect difference between complementary and substitutable            and substitutable product recommendations on consumers’
recommendations on consumers’ willingness-to-pay for the focal            economic behavior from an aggregate level.
product. We further investigated two factors that commonly present
with recommendations: decision stage and the price of
                                                                          6.   REFERENCES
                                                                          [1]   Adomavicius, G., Bockstedt, J.C., Curley, S.P. and Zhang, J.
recommended products. We found that consumers are willing to
                                                                                2014. Suggest or Sway? Effects of Online Recommendations
pay more for a specific product as decision stage increases. An
                                                                                on Consumers’ Willingness to Pay. (2014). Working paper.
interesting finding is the interaction between recommendation type
and decision stage. The positive effect of stage vanished when the        [2]   Adomavicius, G., Bockstedt, J.C., Curley, S.P. and Zhang, J.
recommendation is substitute to the focal product, while it is very             2013. Do recommender systems manipulate consumer
significant with complementary recommendations. This is                         preferences? A study of anchoring effects. Information
consistent with previous findings that customers prefer different               Systems Research. 24, 4 (2013), 956–975.
[3]   Baruch, Y. and Holtom, B.C. 2008. Survey response rate                 Conference on Knowledge Discovery and Data Mining
      levels and trends in organizational research. Human                    (2015).
      Relations. 61, 8 (2008), 1139–1160.                              [20]  Mulhern, F.J. and Leone, R.P. 1991. Implicit Price Bundling
[4]   Cohen, J. 1988. Statistical Power Analysis for the Behavioral          of Retail Products: A Multiproduct Approach to Maximizing
      Sciences (second ed.). Lawrence Erlbaum Associates.                    Store Profitability. Journal of Marketing. 55, 4 (1991), 63–
[5]   Dhar, V., Geva, T., Oestreicher-singer, G. and Sundararajan,           76.
      A. 2014. Prediction in Economic Networks. Information            [21]  Nunes, J.C. and Boatwright, P. 2004. Incidental Prices and
      Systems Research. 25, 2 (2014), 264–284.                               Their Effect on Willingness to Pay. Journal of Marketing
[6]   Donaldson, C., Jones, A.M., Mapp, T.J. and Olson, J.A.                 Research. 41, 4 (2004), 457–466.
      1998. Limited dependent variables in willingness to pay          [22]  Oestreicher-Singer, G. and Sundararajan, A. 2012a.
      studies: applications in health care. Applied Economics. 30, 5         Recommendation networks and the long tail of electronic
      (1998), 667–677.                                                       commerce. MIS Quarterly. 36, 1 (2012), 65–83.
[7]   Frederick, S. 2012. Overestimating Others’ Willingness to        [23]  Oestreicher-Singer, G. and Sundararajan, A. 2012b. The
      Pay. Journal of Consumer Research. 39, 1 (2012), 1–21.                 visible hand? Demand effects of recommendation networks
[8]   Ge, X., Häubl, G. and Elrod, T. 2012. What to Say When:                in electronic markets. Management Science. 58, 11 (2012),
      Influencing Consumer Choice by Delaying the Presentation               1963–1981.
      of Favorable Information. Journal of Consumer Research.          [24]  Payne, J.W., Bettman, J.R. and Johnson, E.J. 1992.
      38, 6 (2012), 1004–1021.                                               Behavioral decision research: a constructive processing
[9]   Häubl, G. and Trifts, V. 2000. Consumer decision making in             perspective. Annual Reviews of Psychology. 43, 1 (1992),
      online shopping environments: The effects of interactive               87–131.
      decision aids. Marketing Science. 19, 1 (2000), 4–21.            [25]  Rajendran, K.N. and Tellis, G.J. 1994. Contextual and
[10]  Ho, S.Y., Bodoff, D. and Tam, K.Y. 2011. Timing of                     temporal components of reference price. Journal of
      Adaptive Web Personalization and Its Effects on Online                 Marketing. 58, 1 (1994), 22–34.
      Consumer Behavior. Information Systems Research. 22, 3           [26]  Rao, A.R. and Sieben, W. a. 1992. The Effect of Prior
      (2011), 660–679.                                                       Knowledge on Price Acceptability and the Type of
[11]  Hosanagar, K., Fleder, D., Lee, D. and Buja, A. 2014. Will             Information Examined. Journal of Consumer Research. 19, 2
      the Global Village Fracture Into Tribes? Recommender                   (1992), 256–270.
      Systems and Their Effects on Consumer Fragmentation.             [27]  Rucker, D.D. and Galinsky, A.D. 2008. Desire to Acquire:
      Management Science. 60, 4 (2014), 805–823.                             Powerlessness and Compensatory Consumption. Journal of
[12]  Jin, R.K.-X., Parkes, D.C. and Wolfe, P.J. 2007. Analysis of           Consumer Research. 35, 2 (2008), 257–267.
      bidding networks in eBay: Aggregate preference                   [28]  Rucker, D.D., Hu, M. and Galinsky, A.D. 2014. The
      identification through community detection. AAAI Workshop              Experience versus the Expectations of Power: A Recipe for
      - Technical Report. WS-07-09, (2007), 66–73.                           Altering the Effects of Power on Behavior. Journal of
[13]  Kim, S. and Gal, D. 2014. From Compensatory Consumption                Consumer Research. 41, August (2014), 381–396.
      to Adaptive Consumption: The Role of Self- Acceptance in         [29]  Russo, J.E. and Leclerc, F. 1994. An eye-fixation analysis of
      Resolving Self-Deficits. Journal of Consumer Research. 41,             choice processes for consumer nondurables. Journal of
      August (2014), 526–542.                                                Consumer Research. 21, September (1994), 274–290.
[14]  Lee, L. and Ariely, D. 2006. Shopping goals, goal                [30]  Sherif, M. and Hovland, C.I. 1961. Social judgment:
      concreteness, and Conditional Promotions. Journal of                   Assimilation and contrast effects in communication and
      Consumer Research. 33, June (2006), 60–71.                             attitude change. (1961).
[15]  Liberman, N., Trope, Y. and Wakslak, C. 2007. Construal          [31]  Shocker, A., Bayus, B. and Kim, N. 2004. Product
      level theory and consumer behavior. Journal of Consumer                Complements and Substitutes in the Real World: The
      Psychology. 17, 2 (2007), 113–117.                                     Relevance of “Other Products.” Journal of Marketing. 68, 1
[16]  Lin, Z., Goh, K.-Y. and Heng, C.-S. 2015. The demand                   (2004), 28–40.
      effects of product recommendation networks: an empirical         [32]  Venkatesh, R. and Kamakura, W. 2003. Optimal Bundling
      analysis of network diversity and stability. MIS Quarterly.            and Pricing under a Monopoly: Contrasting Complements
      (2015). Forthcoming.                                                   and Substitutes from Independently Valued Products. The
[17]  Liu, Q. and Arora, N. 2011. Efficient Choice Designs for a             Journal of Business. 76, 2 (2003), 211–231.
      Consider-Then-Choose Model. Marketing Science. 30, 2             [33]  Zheng, J., Wu, X., Niu, J. and Bolivar, A. 2009. Substitutes
      (2011), 321–338.                                                       or Complements: Another Step Forward in
[18]  Loomis, J., Gonzalez-Caban, A. and Gregory, R. 1994. Do                Recommendations. Proceedings of the 10th ACM conference
      Reminders of Substitutes Influence Contingent Valuation                on Electronic commerce. (2009), 139–145.
      Estimates? Land Economics. 70, 4 (1994), 499–506.                [34]  Zhu, T., Harrington, P., Li, J. and Tang, L. 2014. Bundle
[19]  McAuley, J., Pandey, R. and Leskovec, J. 2015. Inferring               recommendation in ecommerce. Proceedings of the 37th
      Networks of Substitutable and Complementary Products.                  international ACM SIGIR conference on Research &
      Proceedings of the 21th ACM SIGKDD International                       development in information retrieval - SIGIR ’14. (2014),
                                                                             657–666.