=Paper=
{{Paper
|id=Vol-3651/DARLI-AP_paper9
|storemode=property
|title=In Time Recommendations through an Associative Classifier and LookBackApriori: a Case Study
|pdfUrl=https://ceur-ws.org/Vol-3651/DARLI-AP-9.pdf
|volume=Vol-3651
|authors=Anna Dalla Vecchia,Niccolò Marastoni,Elisa Quintarelli
|dblpUrl=https://dblp.org/rec/conf/edbt/VecchiaMQ24
}}
==In Time Recommendations through an Associative Classifier and LookBackApriori: a Case Study==
<pdf width="1500px">https://ceur-ws.org/Vol-3651/DARLI-AP-9.pdf</pdf>
<pre>
                                In time recommendations through an Associative Classifier
                                and LookBackApriori: a case study
                                Anna Dalla Vecchia1,*,† , Niccolò Marastoni1,† and Elisa Quintarelli1,†
                                1
                                    Dept. of Computer Science, University of Verona, Verona, Italy


                                                                       Abstract
                                                                       Recommender systems are becoming essential tools in many scenarios, as they help users extract hidden knowledge and useful
                                                                       insights from datasets. In many real domains, the temporal order between events, combined with their contextualization,
                                                                       improves the accuracy of provided suggestions. In this paper, we introduce a framework designed to mine personalized,
                                                                       in time, contextual, and explainable sequential rules useful to provide recommendations for a predefined target parameter.
                                                                       Specifically, this framework is composed of the 𝐿3 Associative Classifier and LookBackApriori, a modification of Apriori
                                                                       algorithm. Our proposal takes historical data and contextual information as input and generates two sets of rules: the first set
                                                                       comprises rules that allow enhancement of the target parameter, and the second makes it worse. The proposed technique is
                                                                       applied to a real-world scenario involving data collected by Fitbit wearable devices to improve the user’s sleep score after
                                                                       performing fitness activities in different contexts. The idea has been evaluated on two real datasets, and the results confirm
                                                                       the positive effects of the combination of 𝐿3 with LookBackApriori.

                                                                       Keywords
                                                                       Recommendations, Associative classifier, Explanation, Data Mining


                                1. Introduction                                                                                        conditions in the user location and holidays.
                                                                                                                                          More in detail, our use case is based on data gathered
                                The widespread popularity of sensors and wearable de- with Fitbit and focuses on suggesting the intensity of
                                vices, like smartwatches and fitness trackers, has in- physical activities and rest periods to carry out during
                                creased the amount of available data that can be lever- the current day to sleep better. This is done by mining
                                aged to monitor and enhance various aspects of their the historical contextualized physical activities during a
                                users’ well-being. Such devices are often equipped with specified temporal window, which represents the number
                                intuitive apps for activity tracking that mainly provide of consecutive observation days taken into account.
                                aggregate parameters and trend analysis, thus leaving                                                     We use two datasets consisting of activity logs from
                                room for more personalized and insightful suggestions to Fitbit wearable devices: PMDataset [1] and a Custom
                                raise the end-users’ awareness about what affects certain dataset. The latter has been collected from four will-
                                monitored parameters and habits.                                                                       ing participants in the past two years to integrate more
                                   To achieve advanced insights, historical data, possi- specific information about the user context.
                                bly integrated with external information describing the                                                   The main aim of this paper is the construction of a
                                user context, needs to be analyzed for each user to offer novel recommender system that combines the strengths
                                tailored and context-aware suggestions to improve their of two algorithms: the 𝐿3 associative classifier [2, 3] and
                                life beyond generic recommendations.                                                                   LookBackApriori (LBA) [4, 5, 6]. The first one allows us
                                   In this work, we propose a framework that aims to to predict a specific target parameter based on associative
                                give personalized, and in time, contextual suggestions to classification. It takes as input all the historical physical
                                a specific user to improve a target parameter (e.g., sleep activity and the related contextual information (i.e., the
                                quality) along with an explanation about the provided context at the time the physical activity was performed)
                                suggestions. To achieve this goal we integrate monitored and outputs the predicted sleep score. We leverage the
                                data with contextual information, e.g., current weather second algorithm to provide an explainable recommen-
                                                                                                                                       dation about what activity to do and what to avoid to
                                Published in the Proceedings of the Workshops of the EDBT/ICDT 2024 increase the predicted sleep quality and not decrease it.
                                Joint Conference (March 25-28, 2024), Paestum, Italy                                                      With this framework, we overcome the limitations of
                                *
                                  Corresponding author.
                                †                                                                                                      the two algorithms and, in particular, the state explo-
                                  These authors contributed equally.
                                $ anna.dallavecchia@univr.it (A. Dalla Vecchia);                                                       sion problems of LBA when managing wide temporal
                                niccolo.marastoni@univr.it (N. Marastoni);                                                             windows are less severe in L3 . In addition, LBA allows
                                elisa.quintarelli@univr.it (E. Quintarelli)                                                            the production of explainable recommendations; indeed,
                                 0000-0001-7026-5205 (A. Dalla Vecchia); 0000-0001-6988-1203                                          since LBA is based on Apriori, it mines sequential rules
                                (N. Marastoni); 0000-0001-6092-6831 (E. Quintarelli)                                                   that contain in their antecedent the explanation of the
                                          © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License
                                    CEUR
                                          Attribution 4.0 International (CC BY 4.0).
                                          CEUR Workshop Proceedings (CEUR-WS.org)
                                    Workshop
                                    Proceedings
                                                  http://ceur-ws.org
                                                  ISSN 1613-0073
                                                                                                                                       provided sleep score present in the consequent.


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings
                                                       Prediction
Historical Data
                              Associative                   OutC                                                      R+
       +                                                                                     Ordering
                               Classifier
Contextual Data                                                                                and
                                                                                             Splitting                R-

 Contextual Data
                                                      Rule Generaor               R
  at time zero
                                                                                           Explanation
                                  Data
                             at time zero

Figure 1: The main components in our framework


   The document is organized as follows: Section 2 out-       LookBackApriori The second part of our framework
lines the architecture of our recommender system, while       uses the LBA algorithm, which was initially proposed
Section 3 presents the case study. Section 4 reports the      in [4] and extended in [5, 6]. It is an algorithm based
validation of the proposed framework, and Section 5 ex-       on Apriori that mines totally ordered sequential rules
plores related work. Finally, Section 6 summarizes the        and provides timely and explainable recommendations.
main contributions and outlines future directions.            More in detail, LBA mines association rules where the
                                                              antecedent is a sequence of itemsets that represent past
                                                              events or actions by the user with an explicit relative
2. Architecture                                               order w.r.t. the current itemset (representing current day
                                                              events and actions).
The framework proposed in this paper combines an as-
                                                                 For our specific scenario related to wearable devices, a
sociative classifier, 𝐿3 , to predict the value of a target
                                                              rule can be formalized as follows:
parameter (e.g., the sleep score for the current day, the
stress level) and an algorithm based on Apriori, called          𝑟 : 𝐼−(𝜏𝑤 −1) ∧ · · · ∧ 𝐼−2 ∧ 𝐼−1 ∧ 𝐼0𝑓 → 𝐼0𝑠 [𝑠𝑖 , 𝑐𝑖 ]
LookBackApriori (LBA), to generate timely and explain-
able contextual recommendations helpful to suggest what The rule 𝑟 consists of a sequence of itemsets, each rep-
to do to improve the predicted value (i.e. the fitness ac- resenting either fitness activities (𝐼 𝑓 ), sleep quality (𝐼 𝑠 ),
tivities to undertake in the current day to increase the or both (𝐼), for a specific day, where 0 represents the
sleep score, whenever it is possible).                        current day. The parameter 𝜏𝑤 defines the temporal win-
                                                              dow, that is, the number of consecutive days that the
Associative classifier The first part of our framework algorithm can consider and thus may be present in a rule.
comprises the 𝐿3 associative classifier [2, 3]. 𝐿3 uses The antecedent can contain all the itemsets up to 𝐼−1
a technique of lazy pruning to discard those rules that (the day before the current one) or some of them (i.e., the
classify training data incorrectly.                           sequence may be incomplete), while only the physical
    Then, the classification of unlabeled data is executed activity data is present for the current day. Indeed, this
in two steps: first, by considering a subset of high-quality itemset, 𝐼0𝑓 , represents the physical activity suggested to
rules for the classification process, and second, by adding the user by our framework to improve their sleep score
a larger set of rules when it fails to find rules for certain for the same night, represented in the consequent as 𝐼0𝑠 .
data points.                                                     The associative classifier 𝐿3 is used for historical data
    In the green section of Fig. 1, we show the classifier processing and prediction to address memory-related is-
component, which is employed for predicting the value sues faced by LBA that stem from the size of the input.
of a target parameter, represented in our scenario by the After the sleep score prediction step, LBA is used to pro-
sleep score for the current day. It takes historical data vide explainable positive and negative recommendations
(i.e., the user’s log of their fitness activities and sleep thanks to the form of the mined rules. To this end, the
scores) as input and integrates it with past contextual antecedent contains the sequence of events that will lead
information, including the context of the current day at to the sleeping score in the consequent; thus, it provides
the time of the prediction.                                   an explanation for the recommendation.
                                                                 Three possible rules mined by the LBA Algorithm are
                                                              the following:
                                                                From both datasets, we make use of the logs about
                                                             "light", "medium", and "heavy" activity, along with rest
𝑟1 : {𝐻𝐴 : 3, 𝐿𝐴 : 2}−1 ∧{𝐻𝐴 : 3, 𝐿𝐴 : 2}0 → {𝑆𝐿 : 1}0
                                                             periods and the sleep score for each day. Fitbit records
    𝑟2 : {𝐻𝐴 : 2, 𝑅 : 3}−2 ∧ {𝐿𝐴 : 1}−1 → {𝑆𝐿 : 3}0          these features as minutes spent in each activity type; thus,
    𝑟3 : {𝐻𝐴 : 2, 𝑅 : 3}−2 ∧ {𝑀 𝐴 : 1}0 → {𝑆𝐿 : 3}0          we  discretize them to obtain categorical data as described
                                                             in [4]. During this discretization process, the activity
 𝑟1 states that if yesterday the user performed a high levels and sleep scores are further split into three sub-
level of heavy physical activity (𝐻𝐴 : 3) and a medium levels according to set thresholds, e.g., a heavy activity
level of light activity (𝐿𝐴 : 2), and today they perform (𝐻𝐴) can be encoded into three possible labels: 𝐻𝐴 :
the same activities, the resulting sleep score will have a 1, 𝐻𝐴 : 2, and 𝐻𝐴 : 3. These represent, respectively, a
low value (𝑆𝐿 : 1).                                          low level, medium level, and high level of heavy activity,
𝑟2 is an incomplete rule since it does not contain informa- all decided by the amount of time spent undertaking the
tion about the physical activity the user should perform specific physical activity during the day.
during the current day. Although the rule is valid, it is       Regarding the context, for both datasets, we also have
not helpful for making a recommendation to improve information on whether a day falls on a weekend (𝑊 𝐸)
sleep quality, as it does not have any itemset labeled 0 in or not (𝑊 𝐷).
the antecedent.                                                 In addition, for the Custom dataset, we integrate the
𝑟3 is another incomplete rule, but it gives us information information about the user’s vacations (𝑉 𝐴/𝑛𝑜𝑉 𝐴) and
about the physical activity the user should do during the the weather conditions. For this last aspect, we have
current day to sleep well; thus, it can be used to provide simplified its representation as follows: if there has been
a recommendation.                                            rain, snow, fog, or other bad weather conditions, the label
   In the orange part of Fig. 1, we show the contribution is 𝐵𝑎𝑑. In all the other cases, it is 𝐺𝑜𝑜𝑑. Regarding the
of LBA to our framework. Firstly, it mines a set of rules temperature, we use the average daily temperature as a
𝑅 using the Rule Generator, which takes as input the feature, and the result is labeled into 𝐶𝑜𝑙𝑑 or 𝐻𝑜𝑡, using
current context, also used by 𝐿3 for the prediction step, the yearly average temperature as a threshold.
and data at the time of the prediction. Secondly, taking
advantage of the label produced by the classifier, the rules
generated are split into two sets: those that improve the 3.1. Practical example
target parameter value w.r.t. the predicted one is labeled
𝑅+ , and those that do not are labeled 𝑅− . Then, the rules
in both sets are ordered according to the completeness of
the rule antecedent, confidence, and support. The rules
                                                                         t-3        t-2        t-1        t0
recommended to the user are the most complete ones
with the highest confidence and support.
                                                                                                                              Associative
                                                                                                                               Classifier           t0
                                                                             t-3         t-2        t-1       t0
3. A case study
Our experiments focus on wearable device data: their                                    Rule Generator                 R                                  t0
logs contain information about daily physical activity                  t0                                                                  t0

levels and sleep scores. Whenever possible (i.e., when                                                             Training

                                                                                                                                       R which
we have enough data about the user), we integrate such                                                                                 improve       t0
logs with additional information, e.g., holidays, day of            R
                                                                                   t0
                                                                                               Ordering and Splitting

the week, and weather conditions related to the user’s                                                                                 R which
                                                                                                                                      deteriorate    t0
location, to better contextualize the gathered fitness and
sleep quality data.
                                                                Figure 2: An intuitive workflow of our framework
   We consider two datasets for this domain: PMdata [1]
and Custom. PMdata consists of logs from 16 users, 13
male and 3 female, all aged 23 to 60 years old. The data          Fig. 2 shows the workflow of the framework in the
was collected for 149 days between November 2019 and            specific case of study of this paper.
March 2020.                                                       First, we need to decide the dimension of the temporal
   The Custom dataset was collected from 4 users specifi-       window to consider to make the recommendation. The
cally for this study, the earliest of which started recording   temporal window shown in Fig. 2 is 4 days, i.e., the three
in August 2021 and ended in September 2022. The partic-         previous days are considered along with the current one.
ipants are evenly split between males and females; their          All the information about the past days is fed to the
ages vary from 16 to 55.                                        associative classifier 𝐿3 along with the context. The
                                                           The relevance of historical data
                                                                                                                Only last timeslot data
                                                                                                                Historical data
           0.4


           0.3
Accuracy


           0.2


           0.1


           0.0         p01 p02 p03 p04 p05 p06 p07 p08 p09 p10 p13 p14 p15 p16 USER1USER2USER3USER4
                                                         User

 Figure 3: The importance of historical data for each user


 classifier then returns a sleep score for the current day.      • testing the efficacy of the proposed framework
    The rules generated during the training phase state             on 𝐿3 w.r.t. to its ability to predict the value of
 correlations that are only related to the current day, and         the predefined target parameter.
 are in the form shown in the purple rectangle, i.e., con-
 textual information together with physical activity in the    For all the experiments, we have reserved the first
 antecedent of the rule and the sleep score in the conse-   80%  of the Fitbit logs of each user as a training set and
 quent.                                                     the remaining 20% of the data for testing. Due to the
    Some examples of mined rules are the following:         sequential nature of the problem, we cannot randomize
                                                            the sampling of the two sets. Thus, we maintain the
   𝑟1 : {𝐶𝑜𝑙𝑑, 𝐺𝑜𝑜𝑑, 𝑊 𝐷, 𝐻𝐴 : 1, 𝐿𝐴 : 2}0 → {𝑆𝐿 : 1}0      sequential order based on the timestamp of the logs and
       𝑟 : {𝐵𝑎𝑑, 𝑊 𝐷, 𝐿𝐴 : 2, 𝑀 𝐴 : 3} → {𝑆𝐿 : 3}           select the last 20% of the dataset for the tests.
                 2                                0             0

  Rule 𝑟1 tells us that on a weekday with clear weather, if
 the user performs low levels of heavy activity (𝐻𝐴 : 1)                  4.1. Relevance of historical data
 and medium levels of light activity (𝐿𝐴 : 2), their sleep                To conduct this part of the experiments, we use the 𝐿3
 score for the same night will be low. Whereas rule 𝑟2                    associative classifier to predict the sleep label related to
 states that, in the case of a stormy weekday, the user will              the current day 𝑡_0.
 sleep well after performing medium levels of light activity                 First, we perform the prediction by selecting only the
 (𝐿𝐴 : 2) and high levels of medium activity (𝑀 𝐴 : 3).                   physical activity and context of the current day as input
    Thanks to the sleep score obtained by the associative                 to the classifier without considering historical data. An
 classifier, it is possible to split the rules into those that            example of input data for a Custom user is:
 increase or decrease the predicted sleep score. At this
 stage, the user can explain why their sleep quality may                      (𝐺𝑜𝑜𝑑_𝑡0, 𝐻𝑜𝑡_𝑡0, 𝑊 𝐷_𝑡0, 𝑉 𝐴_𝑡0, 𝐿𝐴_3_𝑡0,
 improve or not by looking at the antecedent of the rules.                     𝑀 𝐴_1_𝑡0, 𝐻𝐴_1_𝑡0, 𝑅_2_𝑡0)

                                                                          This can be interpreted as: on clear weather and hot
 4. Evaluation                                                            weekdays during a holiday, the user performs a high
 To test the validity of our framework, we have performed                 level of light physical activity, low levels of both medium
 several experiments on the following aspects:                            and heavy activity, and a medium level of rest.
                                                                             Then, we add the historical data (i.e., physical activity,
                 • verifying the relevance of historical data and their   context, and sleep score of the past days) to the input
                   context in the ability to predict the value of a       used for the first set of experiments.
                   target parameter, i.e., the sleep score.                  Fig. 3 depicts the recorded accuracies of these two
                 • testing the performance of the two algorithms          experiments for each user in both PMdata and Custom
                   used by the framework w.r.t. execution time and        datasets. These results show that for 77% of users, using
                   memory consumption.
  historical data instead of only using data from the current                                          Fig. 6 and Fig. 7 depict the elapsed time and the mem-
  day improves the accuracy of the classifier.                                                      ory consumption of the framework when using data from
     Fig. 4 shows that, regardless of the presence of con-                                          one of the users of the Custom dataset, showing the im-
  textual information, having historical data improves the                                          portant contribution of 𝐿3 . During the experiments, the
  accuracy of most users. The performance of the algo-                                              input is the physical activity log and the available con-
  rithm decays quickly as the temporal window increases,                                            textual information of the chosen user. We maintain the
  especially in the absence of contextual data. Thus, it                                            same support and confidence and gradually increase the
  seems clear that sleep quality does not depend on data                                            temporal windows, i.e., increase the historical data given
  that is temporally distant from the current day.                                                  in input. LBA shows an exponential trend for memory
                                                                                                    consumption and time elapsed, until its memory alloca-
     Percentage of users where the historical data is better than the only last timeslot data
                                                                            Without context
                                                                                                    tion fails when the temporal window reaches value 4.
         75
         70
                                                                            With context            On the other hand, 𝐿3 can manage at most a temporal
         65                                                                                         window of 6, maintaining a relatively constant trend.
Percentage


         60
         55                                                                                                                  User: USER2
         50                                                                                                                                             L^3
         45                                                                                                                                             LBA
                     2                 3              4
                                               Temporal Window
                                                                    5                   6                       300


                                                                                                  Time (s)
  Figure 4: Percentage of users that obtain higher accuracy with                                                200
  the use of historical data, with different lengths of temporal
  windows, w.r.t. the use of data of the current day
                                                                                                                100

     Additionally, the accuracy value obtained by some       0
  users increases gradually in the presence of contextual       1        2         3          4     5        6
                                                                                 Temporal window
  information as the length of the temporal window in-
  creases. One example is shown in Fig. 5, where we can Figure 6: Execution time with different temporal windows
  also confirm that sleep does not depend on the activi- for USER2 of Custom dataset.
  ties performed six days earlier. We can also observe that
  contextual information improves the final result.
                                                                                                                              User: USER2
                     Accuracy of the classifier for user p03 with different temporal windows                3500                                        L^3
             0.450           Without Context                                                                3000                                        LBA
             0.425           With Context
                                                                                                            2500
                                                                                                  Memory (MB)


             0.400
             0.375                                                                                          2000
Accuracy


             0.350                                                                                          1500
             0.325                                                                                          1000
             0.300                                                                                           500
             0.275                                                                                             0 1
             0.250
                                                                                                                      2        3          4      5        6
                         2                 3            4               5                     6                              Temporal window
                                                 Temporal Window
                                                                                                    Figure 7: Memory usage with different temporal windows
  Figure 5: Accuracy trend when we add more historical data,                                        for USER2 of Custom dataset.
  comparison with and without context


                                                                                                    4.3. Evaluation of the framework
  4.2. Time and memory performance
                                                                                                    The last set of experiments performed is on the complete
  Despite LBA’s intrinsic capability to provide explainable                                         framework, as explained in Sec. 3.1. The idea is to val-
  recommendations for achieving a better sleep score, we                                            idate, on real user logs, the prediction of 𝐿3 enriched
  still employ the associative classifier 𝐿3 to process his-                                        with the recommendation produced by LBA. In order to
  torical data. Thanks to the associative classifier, we can                                        validate the results of the whole framework, the best ap-
  process significantly larger volumes of data, e.g., longer                                        proach would be to ask for inputs directly from the users.
  sequences of data with their contextual information, with-                                        Due to time constraints, the evaluation of the framework
  out the risk of memory errors. Additionally, 𝐿3 is signifi-                                       is strictly empirical.
  cantly faster in obtaining the results.
                                               Accuracy of L^3 associative classifier

           0.4


           0.3
Accuracy


           0.2


           0.1


           0.0   p01 p02 p03 p04 p05 p06 p07 p08 p09 p10 p13 p14 p15 p16 USER1USER2USER3USER4
                                                   User

 Figure 8: Accuracy of the 𝐿3 associative classifier


    As before, we use the first 80% of the dataset to train        5. Related Work
 both 𝐿3 and LBA. The first step is setting the length
 of the temporal window, which is done empirically by              With the spread of smart devices and the availability of
 analyzing the experiments in the previous case study.             their large datasets, we have the possibility to extract
 The input of the classifier is composed of the physical           both explicit and implicit knowledge about monitored
 activities, contextual information, and sleep score of the        parameters. For this reason, there are many intelligent
 past days and the contextual information of the current           techniques proposed in the literature to improve the cus-
 day. The classifier then predicts the sleep score on the          tomization of data exploitation. Recommender Systems
 current day.                                                      (RS) offer suggestions on items, services, or news that
    Separately, the rule generator produces a set of rules         may interest users and affect their decisions based on
 correlating the physical activities and contextual infor-         their profile, history, and preferences [7]. For instance, in
 mation for the current day (antecedent) and the related           [8], the authors develop an RS that can suggest activities
 sleep score (consequent).                                         targeted to specific users to improve their health condi-
    At this point, a trained classifier and a set of rules exist   tions starting from data collected by a Fitbit wearable
 for each user. To test the complete framework, we take            device. The physical activity information collected by
 from the remaining 20% of the dataset a temporal window           Fitbit is also used in [9] to correlate daily physical activity
 of observation at a time: all the past data, together with        levels with predictions of sleep quality. Neither of the
 the current context (i.e., the contextual information of          mentioned works considers contextual information.
 the current day), are used by the associative classifier to          In the literature, there are many methodologies for
 predict the sleep score.                                          sleep prediction. In particular, [10] introduces an explain-
    In Fig. 8, we report both the accuracy of the classifier,      able sleep model that exploits the correlation between
 without any knowledge about physical activity for the             daily activities and sleep quality, providing recommenda-
 current dayIt can be noted that the accuracy of 𝐿3 for            tions to improve sleep quality. While the outcome of this
 most users is less than 0.45.                                     framework aligns closely with our proposal, the approach
    The output of 𝐿3 produced is used as a threshold to            does not account for sequences of events that occurred
 separate the rules mined by the Rule Generator in the             in the days leading up to the prediction intended for the
 two sets 𝑅+ and 𝑅− of positive and negative recommen-             user. Furthermore, the model presented fails to incor-
 dations, respectively. Due to the nature of the problem           porate external contextual information beyond sensed
 at hand, it would not be accurate to use historical data to       humidity and temperature. As highlighted in Subsection
 check whether the recommendation given by the frame-              4.1, historical data are important to improve the quality
 work will actually result in a change in sleep score.             of provided predictions.
                                                                      In the state of the art, contextual information is often
                                                                   integrated into RS to improve the precision of recommen-
                                                                   dations. In general, user preferences may vary depending
                                                                   on the environment and the situation in which they are
acting [11, 12]. Therefore, Context-Aware Recommender         and how to improve it when fitness and sleep parameters
Systems (CARS) use contextual information, such as time,      are monitored through wearable devices. As future work,
location, and social situation, to add knowledge during       we are extending the proposal to other domains, like the
the recommendation process, thus improving the per-           correlation of fitness activities with blood glucose levels.
sonalization and the relevance of the suggestion [13]. A
systematic literature review is proposed in [14], where
the authors describe the integration of the context in RS,    References
the main categories of contextual features, and the vali-
                                                          [1] V. Thambawita, S. A. Hicks, H. Borgli, H. K. Stens-
dation mechanism with respect to datasets, properties,
                                                              land, D. Jha, M. K. Svensen, S.-A. Pettersen, D. Jo-
metrics, and evaluation protocols.
                                                              hansen, H. D. Johansen, S. D. Pettersen, et al., Pm-
   In a lot of real scenarios, the history of events and
                                                              data: a sports logging dataset, in: Proceedings
their order is important and can be leveraged in the pro-
                                                              of the 11th ACM Multimedia Systems Conference,
cess of knowledge extraction. Sequential pattern mining
                                                              2020, pp. 231–236.
aims to extract hidden sequential patterns in sequential
                                                          [2] E. Baralis, P. Garza, A lazy approach to pruning
databases. These patterns can be used as a first step to
                                                              classification rules, in: 2002 IEEE International Con-
extract frequent sequential rules, usually formalized as
                                                              ference on Data Mining, 2002. Proceedings., 2002,
𝑋 → 𝑌 [15, 16, 17]. These algorithms propose partially
                                                              pp. 35–42. doi:10.1109/ICDM.2002.1183883.
ordered sequential rules where the antecedent 𝑋 is com-
                                                          [3] E. Baralis, S. Chiusano, P. Garza, A lazy approach
posed of events that happened before the consequent
                                                              to associative classification, IEEE Transactions on
𝑌 and do not consider the relative order of the events.
                                                              Knowledge and Data Engineering 20 (2008) 156–
For this reason, we develop our approach where the an-
                                                              171. doi:10.1109/TKDE.2007.190677.
tecedent is a sequence of events with a relative temporal
                                                          [4] N. Marastoni, B. Oliboni, E. Quintarelli, Explainable
label.
                                                              recommendations for wearable sensor data, in: In-
   In general, when using algorithms based on Apriori to
                                                              ternational Conference on Big Data Analytics and
predict a target parameter, the dimension of sequences
                                                              Knowledge Discovery, Springer, 2022, pp. 241–246.
one can manage is limited. For this purpose, an analysis
                                                          [5] A. Dalla Vecchia, N. Marastoni, B. Oliboni,
of classifiers is mandatory. In the literature, there are
                                                              E. Quintarelli, The synergies of context and data
several associative classifiers, such as CBA [18], DeEPs
                                                              aging in recommendations, in: International Con-
[19], CMAR [20], CPAR [21], and iCAEP [22]. However,
                                                              ference on Big Data Analytics and Knowledge Dis-
these present two main drawbacks: either a loss of useful
                                                              covery, Springer, 2023, pp. 80–87.
information due to an overpruned set of rules or, con-
                                                          [6] A. Dalla Vecchia, N. Marastoni, S. Migliorini, B. Oli-
versely, a significant growth of the rule set in case of
                                                              boni, E. Quintarelli, Mining totally ordered sequen-
limited rule pruning. For these reasons, we decided to
                                                              tial rules to provide timely recommendations, in:
use the 𝐿3 associative classifier [2, 3], which addresses
                                                              European Conference on Advances in Databases
both problems.
                                                              and Information Systems, Springer, 2023, pp. 197–
   It has also been recognized that when users under-
                                                              207.
stand why certain items have been recommended, the
                                                          [7] F. Ricci, L. Rokach, B. Shapira, Introduction to rec-
suggestion becomes more persuasive. Hence, in recent
                                                              ommender systems handbook, in: Recommender
years, explainable RSs have been applied in real-world
                                                              systems handbook, Springer, 2011, pp. 1–35.
scenarios [23]. However, explanation techniques are of-
                                                          [8] A. Calero Valdez, M. Ziefle, K. Verbert, A. Felfernig,
ten not compatible with some machine learning or deep
                                                              A. Holzinger, Recommender systems for health in-
learning approaches due to their black-box nature [24].
                                                              formatics: state-of-the-art and future perspectives,
Although there exist some studies on the explainabil-
                                                              Machine Learning for Health Informatics: State-of-
ity, interpretability, and trustworthiness of deep learning
                                                              the-Art and Future Challenges (2016) 391–414.
techniques [25, 26], data mining techniques remain more
                                                          [9] A. Sathyanarayana, S. Joty, L. Fernandez-Luque,
effective in offering comprehensive explanations for the
                                                              F. Ofli, J. Srivastava, A. Elmagarmid, T. Arora,
nature of the rules they are able to infer.
                                                              S. Taheri, Sleep quality prediction from wear-
                                                              able data using deep learning, JMIR mHealth and
6. Conclusion                                                 uHealth 4 (2016) e125.
                                                         [10] V. Pandey, D. D. Upadhyay, N. Nag, R. C. Jain, Per-
In this paper, we have combined the 𝐿3 associative clas-      sonalized user modelling for context-aware lifestyle
sifier and the LBA algorithm to provide explainable rec-      recommendations to improve sleep., in: HealthRec-
ommendations. Such an approach helps give insights to         Sys@ RecSys, 2020, pp. 8–14.
users who desire to know what affects their sleep score [11] G. D. Abowd, A. K. Dey, P. J. Brown, N. Davies,
     M. Smith, P. Steggles, Towards a better under- [24] M. Alshammari, O. Nasraoui, S. Sanders, Mining
     standing of context and context-awareness, in:           semantic knowledge graphs to add explainability
     H. Gellersen (Ed.), Handheld and Ubiquitous Com-         to black box recommender systems, IEEE Access 7
     puting, First International Symposium, HUC’99,           (2019) 110563–110579.
     Karlsruhe, Germany, September 27-29, 1999, Pro- [25] R. Guidotti, A. Monreale, S. Ruggieri, F. Turini, F. Gi-
     ceedings, volume 1707 of Lecture Notes in Computer       annotti, D. Pedreschi, A Survey of Methods for
     Science, Springer, 1999, pp. 304–307.                    Explaining Black Box Models, ACM Comput. Surv.
[12] G. Adomavicius, R. Sankaranarayanan, S. Sen,             51 (2018). doi:10.1145/3236009.
     A. Tuzhilin, Incorporating contextual information [26] R. Meo, R. Nai, E. Sulis, Explainable, Interpretable,
     in recommender systems using a multidimensional          Trustworthy, Responsible, Ethical, Fair, Verifiable
     approach, ACM Trans. Inf. Syst. 23 (2005) 103–145.       AI... What’s Next?, in: In Proceedings of 26th
[13] G. Adomavicius, B. Mobasher, F. Ricci, A. Tuzhilin,      European Conference on Advances in Databases
     Context-aware recommender systems, AI Mag. 32            and Information Systems, ADBIS, 2022, pp. 25–34.
     (2011) 67–80.                                            doi:10.1007/978-3-031-15740-0_3.
[14] N. M. Villegas, C. Sánchez, J. Díaz-Cely, G. Tamura,
     Characterizing context-aware recommender sys-
     tems: A systematic literature review, Knowledge-
     Based Systems 140 (2018) 173–200.
[15] P. Fournier-Viger, J. C.-W. Lin, R. U. Kiran, Y. S. Koh,
     A survey of sequential pattern mining, Data Science
     and Pattern Recognition 1 (2017) 54–77.
[16] J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto,
     Q. Chen, U. Dayal, M. Hsu, Mining sequential pat-
     terns by pattern-growth: The prefixspan approach,
     IEEE Trans. Knowl. Data Eng. 16 (2004) 1424–1440.
[17] R. Srikant, R. Agrawal, Mining sequential patterns:
     Generalizations and performance improvements,
     in: P. M. G. Apers, M. Bouzeghoub, G. Gardarin
     (Eds.), Advances in Database Technology - EDBT’96,
     volume 1057 of Lecture Notes in Computer Science,
     Springer, 1996, pp. 3–17.
[18] B. Ma, B. Liu, Y. Hsu, Integrating classification
     and association rule mining, in: Proceedings of
     the fourth international conference on knowledge
     discovery and data mining, volume 50, 1998, pp.
     55–00.
[19] J. Li, G. Dong, K. Ramamohanarao, L. Wong, Deeps:
     A new instance-based lazy discovery and classifica-
     tion system, Machine learning 54 (2004) 99–124.
[20] W. Li, J. Han, J. Pei, Cmar: Accurate and efficient
     classification based on multiple class-association
     rules, in: Proceedings 2001 IEEE international con-
     ference on data mining, IEEE, 2001, pp. 369–376.
[21] X. Yin, J. Han, Cpar: Classification based on pre-
     dictive association rules, in: Proceedings of the
     2003 SIAM international conference on data min-
     ing, SIAM, 2003, pp. 331–335.
[22] X. Zhang, G. Dong, K. Ramamohanarao,
     Information-based classification by aggregating
     emerging patterns, in: International Conference
     on Intelligent Data Engineering and Automated
     Learning, Springer, 2000, pp. 48–53.
[23] Y. Zhang, X. Chen, Explainable recommendation:
     A survey and new perspectives, Found. Trends Inf.
     Retr. 14 (2020) 1–101. doi:10.1561/1500000066.

</pre>