<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Engaging Learners in an Enterprise L&amp;K System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Wesley M. Gifford</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ashish Jagmohan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yi-Min Chee</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anshul Sheopuri IBM Research Yorktown Heights</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>John Ambrose</institution>
          ,
          <addr-line>Sue Rodeman, Shota Aki Skillsoft Nashua, NH</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <abstract>
        <p>We describe a system being designed for a leading provider of enterprise learning solutions, to improve engagement among learners. The system consists of an engagement timing component which estimates a learner's level of engagement and likely preferred interaction times, and a recommendation component which generates personalized content recommendations. We summarize early results from a recently initiated pilot deployment.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>The problem of interest is improving learner engagement
in an enterprise learning system, by utilizing consumption
data captured by the learning platform. The existing
learning platform records each content launch, tracking user and
content ID and launch time and duration, among other data.
The platform also defines an expert-curated hierarchy of
content, wherein assets are grouped into a forest of
assetfolders on the basis of subject matter. We have developed an
engagement system consisting of two major components: 1)
an engagement timing component that is responsible for
estimating both a learner’s level of engagement, and preference
to interact at certain days and times; and 2) a
recommendation component that generates personalized
recommendations for each learner, based on historical learner activity. In
an initial email-based pilot, these components have
demonstrated significant improvements in user response compared
to industry benchmarks.</p>
    </sec>
    <sec id="sec-2">
      <title>ENGAGEMENT TIMING</title>
      <p>The goal of the engagement system is to improve the
level of engagement of its learners. The engagement timing
component helps the system with proper timing of actions,
based on each learner’s current level of engagement. We
observed that learners often exhibit “bursty” or self-excitation
behavior, where a learner’s interactions frequently occur in
clusters. We model these interactions as arrivals from a
stochastic process that captures the temporal dependencies
in learner behavior; the typical homogeneous Poisson
process is not capable of doing so.</p>
      <p>For users with sufficient interaction histories, we consider
a Hawkes’ process, which can be viewed as a counting
process whose time-varying intensity function adheres to a
specific structure that enables capturing of temporal
dependencies. The original structure considered by Hawkes’ was[Hawkes
t
1971]: λ(t) = μ+R−∞ g(t−u; θ)dN (u), where N (u) is an
appropriate point process. Hawkes’ specifically considered the
case where g(t) = PiP=1 αi exp{−βit}, t &gt; 0. This function
states that the intensity at the current time consists of
decayed contributions from prior events. If only a short period
of time has elapsed since the learner’s last action, the
intensity function is impacted by these recent events and hence
captures the fact that the learner is more likely to reengage.
If a long period has elapsed since the last action, the process
behaves more like a homogeneous Poisson process with rate
μ until the next action. Similar models have been used to
model stock market trades and earthquake aftershocks.</p>
      <p>The system estimates the level of engagement for each
learner by considering their reengagement probability in a
time window given their prior interaction history. For
learners with sufficient history, the parameters of the model above
can first be determined using maximum likelihood
estimation. This was done using numerical maximization of the
likelihood, whose expressions are available in [Ozaki 1979]1.
Then, the probability that a particular learner reengages
in the next s days, given their prior interaction history, is
equivalent to the event that there is at least one arrival in
the time period of interest from the underlying stochastic
process (details omitted for brevity).</p>
      <p>In addition to knowing a learner’s engagement level, it
is also important to know the best time of day and day of
week to contact individual learners. This is derived from a
learner’s prior interactions under the assumption that prior
interaction times are indicative of preferred interaction times.
Each day of the week is divided into n uniform duration
bins, giving a multinomial distribution with a total of 7n
categories. In many cases, estimation of the category
probabilities suffers from sparsity due to limited interactions.
This problem is solved by using Bayesian estimation with
a Dirichlet prior that incorporates the aggregate preferences
of the entire population. Results based on a test across
multiple customers are shown in Figure 1. The preferences
estimated in this test use an exponential weighting scheme
(with parameter γ) to place more weight on recent
activity. The plot indicates that the perfromance saturates for γ
greater than 24 months. For this value of γ the estimated
distribution significantly outperforms a naive model.
1For learners with fewer prior interactions, one promising
strategy is to aggregate their inter-arrival times and fit an
aggregate model.</p>
      <p>1.8
e
c
an1.6
m
frr
o
ep1.4
d
e
z
il
ram1.2
o
N
1.0
0.8 5
3.</p>
    </sec>
    <sec id="sec-3">
      <title>RECOMMENDATION ENGINE</title>
      <p>The recommendation engine seeks to improve learner
engagement by generating personalized recommendations for
which the learner will likely have high preference and, hence,
high consumption likelihood.</p>
      <p>The recommendation engine utilizes a blended ensemble
[Koren 2009], wherein several baseline recommenders are
combined to yield a final set of recommendations. We use
three groups of baseline recommenders, each group
containing multiple individual recommenders. The first group
consists of popularity-based recommenders which use
several metrics to measure popularity, including temporal
recency, and launch and duration information. The second
group consists of content-based recommenders, wherein the
learner’s historical consumption of certain asset-types, as
determined by the expert-curated hierarchy, is leveraged
to generate new recommendations. These include
recommenders based on generative Bayesian models of the learner’s
type-preferences, and based on tfidf type metrics over the
content hierarchy. The third group consists of
collaborativefiltering recommenders, which leverage the implicit feedback
information [Hu et al. 2008] manifested in each user’s
historical asset consumption activity; individual recommenders
include some based on matrix factorization, and others based
on separate user-user and asset-asset based filtering.</p>
      <p>The recommender ensemble described above is combined
to generate a final set of recommendations (typically 5-10)
for each learner. Activity data was temporally split into
training and validation data sets. We used several metrics
to quantify recommender goodness, including metrics based
on discounted cumulative gain and precision, and
predictive metrics quantifying the number of trained
recommendations which were consumed in the validation set. The
metrics yielded largely consistent results. We tried
multiple blending techniques including gradient-boosted decision
trees and random-forests; random forests were found to yield
best performance. Figure 2 shows, for one enterprise, a
comparison of the blended recommender to the best popular
recommender, as the number of recommendations varies. The
performance is normalized to that of the best popular
recommender. Note that this comparison is aggregated over all
learners, including a significant number with no prior
historical activity, for whom the popular recommender is best.
6</p>
      <p>7 8
Number of recommendations
9
10</p>
      <p>We created visualizations to help learners understand why
they were receiving specific recommendations. One type of
visualization shows the relative strength of each
recommendation along each of the three broad recommender groups
described above. Another set of visualizations compares
the strength of the recommendations along a single
dimension using their relative ranks from a specific recommender
group. Anecdotal evidence indicates that users found these
visualizations to be useful in helping to determine which of
the recommendations might be of interest.</p>
    </sec>
    <sec id="sec-4">
      <title>4. PRELIMINARY EVALUATION</title>
      <p>Pilot deployments of the described engagement solution
have been recently initiated. Learners receive emails at
engagement times determined as in Section 2, containing
personalized recommendations as described in Section 3. Some
preliminary quantitative indications of the efficacy of the
solution have been gleaned by examining initial email
interaction metrics. After the first set of emails was sent to all
participants, the click-through and click-to-open rates (ie.
the fraction of participants who clicked upon one of the
recommendations after opening the email) were tracked. The
overall click-through rate was 5.6%, while the click-to-open
rate was 31.6%. These metrics were compared to
industry benchmarks for email campaigns in the education
industry, as reported in [Silverpop 2014]. The comparison shows
that the both rates are significantly higher than the
industry median (2.8% and 14.3% respectively). These metrics
give some preliminary confirmation of the promise of the
proposed engagement approach.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Hawkes 1971] Alan G Hawkes.
          <year>1971</year>
          .
          <article-title>Point spectra of some mutually exciting point processes</article-title>
          .
          <source>JRSS. Series B</source>
          (
          <year>1971</year>
          ),
          <fpage>438</fpage>
          -
          <lpage>443</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Hu et al. 2008]
          <string-name>
            <given-names>Yifan</given-names>
            <surname>Hu</surname>
          </string-name>
          , Yehuda Koren, and
          <string-name>
            <given-names>Chris</given-names>
            <surname>Volinsky</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Collaborative Filtering for Implicit Feedback Datasets</article-title>
          .
          <source>In Proc. ICDM</source>
          <year>2008</year>
          .
          <volume>263</volume>
          -
          <fpage>272</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [Koren 2009]
          <string-name>
            <given-names>Yehuda</given-names>
            <surname>Koren</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>The BellKor Solution to the Netflix Grand Prize</article-title>
          . (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Ozaki 1979]
          <string-name>
            <given-names>T</given-names>
            <surname>Ozaki</surname>
          </string-name>
          .
          <year>1979</year>
          .
          <article-title>Maximum likelihood estimation of Hawkes' self-exciting point processes</article-title>
          . Ann. Inst. Stat. Math.
          <volume>31</volume>
          ,
          <issue>1</issue>
          (
          <year>1979</year>
          ),
          <fpage>145</fpage>
          -
          <lpage>155</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>[Silverpop 2014] Silverpop</source>
          .
          <year>2014</year>
          .
          <article-title>2014 Silverpop email marketing metrics benchmark study</article-title>
          . (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>