<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Revealing and Interpreting Crowd Stories in Online Social Environments</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chris Kiefer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matthew Yee-King</string-name>
          <email>m.yee-king@gold.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark d'Inverno</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computing</institution>
          ,
          <addr-line>Goldsmiths</addr-line>
          ,
          <institution>University of London</institution>
        </aff>
      </contrib-group>
      <fpage>47</fpage>
      <lpage>52</lpage>
      <abstract>
        <p>The underlying patterns in large scale social media datasets can reveal valuable information for interaction designers and researchers, both as part of realtime interactive systems and for post-hoc analysis. Music Circle is a social media platform aimed at researching the role of community feedback in online learning environments. A large dataset was collected when the platform was used as part of a Massive Open Online Course (MOOC). We developed a novel analysis technique for observing global patterns in the behaviour of students. The technique employs network theory techniques to view student activity as an interconnected complex system, and observes the temporal dynamics of network metrics to create timelines which are clustered into groups using unsupervised learning methods. This approach highlighted global trends and groups of outliers that needed further attention or intervention.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Online social activity has become a fundamental part of many
interactive systems, either explicitly as part of their intended
design, or implicitly as part of external and pervasive social
media networks. With social activity comes large or
massive scale data, which describes interactions between
individuals mediated through a variety of possible formats. These
datasets can reveal stories about individuals and groups that
may be of high significance to stakeholders; for interaction
designers, this data can show aspects of behaviour that reveal
design problems, suggest design solutions and highlight
directions for future iterations. For researchers, this data can
give us a broad understanding of trends in behaviour within
the context of specific technological environments. Large
datasets also present significant challenges in analysis, with
the scale of raw data often making direct human
interpretation an intractable task. However, by bringing computational
analysis into the loop, we can attempt to sculpt the raw data
into new forms that, while not necessarily giving absolute
answers, present data in a suitable format for further (human)
interpretation and criticism.</p>
      <p>Music Circle is an online social platform, aimed at
exploring ways of understanding and enhancing learning through
community feedback. For six weeks in the summer of 2014 it
was used to support a Coursera MOOC, ’Creative
Programming for Digital Media and Mobile Apps’1. A substantial
and detailed log of student interactions was collected. While
there were specific questions that could be asked of the data,
such a large and complex set of interactions could most likely
hold some interesting and unexpected results, and it seemed
pertinent to follow a bottom-up approach to data analysis, by
letting patterns emerge rather than imposing them. To this
purpose, a set of techniques was developed that attempted to
elucidate the broad patterns and temporal dynamics of crowd
behaviour that occurred during the period of the MOOC, to
transform the raw data into a format that would give the
research and design teams a deeper understanding of student
interactions within the Music Circle environment.</p>
      <p>A novel approach was developed, which leveraged network
analysis and machine learning techniques to cluster
temporal data. We outline the development of this technique and
present and critique the results. The following sections
address the research questions that were encountered during this
development process: how can social media data be encoded
into a human readable form that describes temporal patterns
in actor behaviour? How can network analysis techniques
enhance this encoding? What are good ways to present this data
for interpretation by stakeholders?</p>
      <p>We present this research as a technique for eliciting
information from large datasets for analysis by stakeholders and
domain experts, rather than as a process which will supply
absolute answers concerning student behaviour. In this light,
we do not attempt to provide a quantitative evaluation of the
effectiveness of the findings, but try to show, through
crosschecking of results in a post-hoc analysis of the Music Circle
MOOC data, the potential strengths of our method for use in
future projects.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>Our approach is rooted in a network theory perspective.
Jiawei et. al [Han et al., 2012] review data mining in this
context, proposing that we can extract much more valuable
1https://www.coursera.org/course/
digitalmedia
information from a database by viewing it as a
heterogeneous information network rather than a homogenous data
repository. We draw on techniques highlighted by Holme
and Sarama¨ki [Holme and Sarama¨ki, 2012]; they review the
emerging field of temporal networks, looking at techniques
for analysing how network topology changes over time, and
how temporal information flows. There have been varied
approaches to social network analysis, for example, Gottron
and Pickhardt[Gottron et al., 2013] explore techniques for
temporal analysis of social data, Gilbert et. al. use
statistical methods to analyse Pinterest [Gilbert et al., 2013], and
Diya et. al. [Yang et al., 2013] look for causes of student
dropouts in MOOCs using a network theory approach. Rowe
et. al. [Rowe et al., 2013] outline their technique for
modelling and analysing the behaviour of users in online
communities. They focus on defining individual role categories, and
look at how the global composition of these roles changes
over time. Chao et. al [Chau et al., 2011] look at the
intersection between HCI and data mining, investigating interactive
machine learning approaches, and sensemaking.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Music Circle</title>
      <p>Our system (pictured in figure 1) allows students to share
and discuss creative work, and acts as a research platform for
studying the role of social media in learning [Brenton et al.,
2014]. The key feature is the Social Timeline, an online
environment for annotating and discussing time-based media.
The Social Timeline allows students to highlight and
comment on sections of time-based media, and to further discuss
these comments. The website has been used in a variety of
scenarios to explore creative feedback [d’Inverno and Still,
2014] between students, including jazz piano tuition and as a
rehearsal support tool for musical ensembles.</p>
      <p>Over the course of a MOOC in the summer of 2014, the
website was employed for students to discuss, share feedback
on and peer assess videos of their coursework pieces. During
the six week course, the students were required to submit a
piece of coursework every two weeks. Each coursework brief
asked them to program a software application, and to submit
a video demo of their application to the Music Circle website
for peer assessment. Students were also asked to peer assess
three other peoples’ work for each submission. As part of the
peer assessment process, they could discuss other students’
work through the website.</p>
      <p>To give an overview of the course statistics, during a six
week period, 3716 users registered with the website. Of these,
3558 viewed one or more videos, 827 made one or more
comments, and 258 made one or more replies to comments. 2898
videos were submitted for three separate assessments, and
were viewed a total of 112,189 times. 7370 comments were
made, along with 978 replies. Detailed log data was collected,
including timestamped records of all discussions and all
media viewing activity.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Encoding Stories</title>
      <p>Having collected the data, we needed to present it in a format
that was both interpretable by humans, and cluster-able by a
computer. We built networks of data relating to singular
concepts, and observed several network metrics as they evolved
over time. In this way, we could use network measurements
that put an individual’s actions into a global interconnected
context, rather than observing them in local scope.</p>
      <p>Two sets of networks were built, that separately
represented commenting and viewing activity. Each set consisted
of networks that evolved in two hour windows over the
period of the MOOC, giving 503 networks in each set. This two
hour period was chosen as a compromise between time
resolution and practical limitations in data processing capacity.
The networks had directed and weighted connections; each
node represented a student, with weighted links representing
the numbers of comments or views made from one student to
another.</p>
      <p>In this analysis, a set of four metrics were chosen for
observation from each network. The first two were simple metrics
which sum the (a) incoming and (b) outgoing weights of each
node. These could also be calculated without using a
network. The next was (c) betweenness centrality. This metric
was chosen as it provides interesting representations of each
user’s importance within the global context of the network,
based on how much information flows through their node. It
shows the extent that the actor is positioned on the shortest
path between other pairs of nodes in the network
[Leydesdorff, 2007]. Betweenness centrality is calculated based on
link direction and weight, thereby using the full information
available in the networks we constructed. The last metric was
a calculation of each node’s (d) HITS authority. This was
calculated with the HITS algorithm [Kleinberg et al., 1999],
which gives a measurement of the importance of a node based
on link structure. More specifically the algorithm gives each
node two co-dependent scores; a hub score based on the
authority of nodes that link to it, and an authority score based
on how the degree to which the nodes that point to it are hubs.</p>
      <p>These four metrics were observed for each two hour
iteration of the networks, giving each student a set of timelines,
one for each metric. The analysis provided a rich data set for
further exploration. Network analysis was carried out using
iPython with the NetworkX library.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Discovering Themes</title>
      <p>Having collected the sets of timelines for each user, the next
step was to cluster these timelines into similar groups to
reveal underlying patterns. An exploratory approach was taken,
searching for interesting features in the data set by creating
both clusters of single features and clusters of compound
features in order to reveal correlations between groups of
metrics. Useful clustering results were obtained using two
methods: k-means alone, and k-means with unsupervised
pretraining using Restricted Boltzmann Machines. In the
latter case, RBMs were used to find sparse, low dimensional
representations of the salient features in the data, before
kmeans clustered these features. We used the Extended RBM
from the Oger toolbox [Verstraeten et al., 2012], with
gaussian visible units for our continuous valued data. The RBMs
were trained with guidance from [Hinton, 2010].</p>
    </sec>
    <sec id="sec-6">
      <title>Visualising Crowd Behaviour</title>
      <p>Having calculated clusters, we visualised them in two ways.
Simple graphs of means and variances for each cluster group
(e.g. figure 2) gave an easily interpretable summary. A
more complex view showing members of individual clusters
was presented in sets of polar graphs, where each graph
displayed every individual timeline in a cluster, superimposed
with semi-transparent colouring to show patterns of density
(e.g. figure 3). Upon identifying a cluster of interest, a set
of graphs was generated to show the cluster mean for each
metric, compared to the global means (e.g. figure 4).
7</p>
    </sec>
    <sec id="sec-7">
      <title>Results</title>
      <p>An exploratory approach was taken to analysing the data,
visualising features separately and in combination to find
clusters of possible interest. The following examples describe
salient outcomes of this process.
7.1</p>
      <sec id="sec-7-1">
        <title>Example A: Betweenness Centrality</title>
        <p>Figure 2 shows the means of 5 clusters of betweenness
centrality timelines from the views network, generated by
training an RBM with 503 visible units and 5 hidden units, the
output of which was clustered with k-means. In this context, we
could consider betweenness centrality to indicate the extent to
which a student is engaged in a community of other students
who are active in viewing each others’ work. A large group of
low activity users is shown in cluster 2, which is what would
be expected in a social network dataset. A smaller group of
high activity users is highlighted in cluster 4 (shown in more
detail in figure 3). This timing of this higher viewing
activity correlates with an incentive being offered to students
for engaging in forum activity. Cluster 3’s value is dropping
while other clusters are rising, indicating that this cluster may
include students who need attention in some way. Further
analysis shows that the number of comments received by this
group is below the global average (see figure 4),
strengthening the case that this group may need some sort of help or
motivation.
7.2</p>
      </sec>
      <sec id="sec-7-2">
        <title>Example B: HITS Authority</title>
        <p>Clusters of the HITS authority timelines for the last 40% of
the course were clustered using k-means (shown in figure 5).
The graph shows that the students in cluster 4 have a steadily
declining authority, which may indicate the students in this
group have low activity and are therefore becoming less
important community members. The concern is verified when
looking at their viewing timelines; they are significantly
below the global mean for betweenness centrality of views.</p>
        <p>Figure 6 shows 10 clusters of a compound feature, made
with k-means. The first half is a betweenness centrality from
the views network, and the second half is the same metric
from the comments network. By joining features in this way,
the clustering process can pick out potentially interesting
correlations or contrasts between the sources. In this example,
we can see that the students in cluster 5 have a relatively
high value for views but a very low value for comments. This
could indicate students who are potentially active, but are part
of low activity peer communities, and would perhaps benefit
from being introduced to new peers.
8</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Discussion</title>
      <p>By exploring and clustering the sets of timelines describing
behaviour on Music Circle, it was possible to reveal global
patterns of crowd behaviour in the specific context of the
network metrics we observed. The clusters also highlighted
groups of outliers; understanding these small groups can be
of great value in trying to understand a complex social
system, both in terms of isolating problems, and understanding
positive deviance [Ramalingam, 2013]. The technique can
be used both post-hoc and for live analysis. Post-hoc
analysis can help us to understand crowd behaviour in order to
improve the design of future iterations. Live analysis has a
number of possible uses in the context of our platform;
outlying groups may predict students who are likely to drop out
or disengage, and need some contact, reward or support from
peers and teachers. Outliers may also highlight successful
groups who we may try to link with new peers to strengthen
the overall community of learning. A further use is to give
students live feedback of their status in terms of these metrics,
in order to aid their learning or motivate them. For example, a
live timeline of centrality in the comments network, together
with a summary based on cluster membership, could provide
a good motivator to increase commenting activities.</p>
      <p>The first two examples in particular demonstrate the
potential strengths of our analysis technique. Both highlight groups
of interest, which are validated by patterns in other metrics
on other domains. e.g. in example A, the betweenness
centrality clusters for viewing behaviour highlight a group that
may need attention. Further investigation of the
commenting network reveals that this group is less active at making
comments, compared to the global mean. Examples A and B
also demonstrate the values of analysing our data from a
network perspective, viewing user activity as a complex
evolving system of interconnected nodes. In example A, the
clusters highlight a group whose betweenness centrality is
dropping progressively. Observing non-network based metrics for
this group, i.e. the number of views made and received, the
timelines for this group do not differ greatly from the global
average, so this group would not show up in any clusters.
However, the highlighting of the group is validated by their
inactivity in commenting. In example B, a group is revealed
whose HITS authority for comments is dropping. Again, this
would be difficult to spot from this group’s number of
comments made and received, which are close to the global
average, but the choice is validated by revealing their low
viewing activity. Overall, network analysis algorithms such as
betweenness centrality and HITS evaluate each node in the
wider context of a complex system. This means these
metrics are much more sensitive to global events in the network,
and can reveal dynamics that locally scoped measurements
may fail to. The results show the merits of temporal analysis
of these network metrics; the clusters of interest were
highlighted by discovering anomalies in temporal dynamics, and
reveal more detailed information compared to instantaneous
analysis.</p>
      <p>A challenge of using this system is in interpretation. To
fully interpret a graph of clusters, it is necessary to
understand the network analysis metric being presented, along with
its meaning in the context of the network and in the wider
context of the source domain of the data. For example, to
understand betweenness centrality of commenting activity, we
need to understand the concept of this measurement along
with the network theory that supports it, and we also need to
understand how people are connected by comments on Music
Circle and the affordances of the interface that allow activity
to propagate though the network of students. It’s also a
challenge to present the clusters in an optimal way. The means
and variance give a good idea of general trends but miss some
details. The graphs of superimposed timelines can become
dense and difficult to compare, but do give much more detail.
Conducting analysis with both of these perspectives seems a
good compromise, but ideally an interactive tool would be
very useful.
9</p>
    </sec>
    <sec id="sec-9">
      <title>Conclusions</title>
      <p>The motivation for this project was to reveal patterns of global
crowd behaviour based on a large scale database of social and
educational activity. Our approach was to look at simple
information through the perspective of network analysis. We
observed how a variety of network analysis measurements
vary over time, and then undertook an exploratory analysis
of these timelines though clustering. The clusters highlighted
interesting global behaviours of groups of users, and also
revealed smaller groups of outliers that may need some sort
of intervention or attention. The strength of this technique
is demonstrated in examples where the highlighted clusters
were shown to need attention though cross checking with
other data sources. The possibilities of our technique were
demonstrated through post-hoc analysis of forum data. The
next step would be to apply this technique on a live forum,
and observe the effects of any pedagogical interventions that
are made based on the analysis of the resulting data.
10</p>
    </sec>
    <sec id="sec-10">
      <title>Future Work</title>
      <p>The analysis highlights two areas which could benefit from
further development. Firstly, the development of presentation
tools to aid human analysis of the clusters. Secondly, the
network analysis algorithms employed here have been successful
in highlighting clusters but also add an extra layer of
interpretation. It would be interesting to investigate the development
of domain specific networks measurements, whose output is
closely matched to the semantics of the forum behaviour
being observed.</p>
    </sec>
    <sec id="sec-11">
      <title>Acknowledgments</title>
      <p>The work reported in this paper is part of the PRAISE
(Practice and Performance Analysis Inspiring Social
Education) project which is funded under the EU FP7
Technology Enhanced Learning programme, grant agreement number
318770.</p>
      <p>Holme and Jari</p>
      <p>Physics reports,</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Brenton et al.,
          <year>2014</year>
          ]
          <string-name>
            <given-names>Harry</given-names>
            <surname>Brenton</surname>
          </string-name>
          , Matthew Yee-King,
          <article-title>Andreu Grimalt-Reynes, Marco Gilles, Maria Krivenski, and Mark d'Inverno. A social timeline for exchanging feedback about musical performances</article-title>
          .
          <source>In Proceedings of the 28th International BCS Human Computer Interaction Conference on HCI 2014-Sand</source>
          , Sea and
          <string-name>
            <surname>Sky-Holiday</surname>
            <given-names>HCI</given-names>
          </string-name>
          , pages
          <fpage>281</fpage>
          -
          <lpage>286</lpage>
          . BCS,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Chau et al.,
          <year>2011</year>
          ]
          <string-name>
            <given-names>Duen</given-names>
            <surname>Horng</surname>
          </string-name>
          <string-name>
            <surname>Chau</surname>
          </string-name>
          , Aniket Kittur,
          <string-name>
            <surname>Jason I Hong,</surname>
          </string-name>
          and
          <string-name>
            <given-names>Christos</given-names>
            <surname>Faloutsos</surname>
          </string-name>
          .
          <article-title>Apolo: making sense of large network data by combining rich user interaction and machine learning</article-title>
          .
          <source>In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems</source>
          , pages
          <fpage>167</fpage>
          -
          <lpage>176</lpage>
          . ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>[d'Inverno and Still</source>
          , 2014]
          <article-title>Mark d'Inverno and Arthur Still. Creative feedback: a manifesto for social learning</article-title>
          .
          <source>In Proceedings of the Workshops held at Educational Data Mining</source>
          <year>2014</year>
          conference,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Gilbert et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Eric</given-names>
            <surname>Gilbert</surname>
          </string-name>
          , Saeideh Bakhshi,
          <string-name>
            <given-names>Shuo</given-names>
            <surname>Chang</surname>
          </string-name>
          , and Loren Terveen.
          <article-title>”i need to try this”?: A statistical overview of pinterest</article-title>
          .
          <source>In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '13</source>
          , pages
          <fpage>2427</fpage>
          -
          <lpage>2436</lpage>
          , New York, NY, USA,
          <year>2013</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [Gottron et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Gottron</surname>
          </string-name>
          , Olaf Radcke, and
          <string-name>
            <given-names>Rene</given-names>
            <surname>Pickhardt</surname>
          </string-name>
          .
          <article-title>On the temporal dynamics of influence on the social semantic web</article-title>
          .
          <source>In Semantic Web and Web Science</source>
          , pages
          <fpage>75</fpage>
          -
          <lpage>87</lpage>
          . Springer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Han et al.,
          <year>2012</year>
          ] Jiawei Han, Yizhou Sun,
          <article-title>Xifeng Yan, and Philip S Yu. Mining knowledge from data: An information network analysis approach</article-title>
          .
          <source>In Data Engineering (ICDE)</source>
          ,
          <year>2012</year>
          IEEE 28th International Conference on, pages
          <fpage>1214</fpage>
          -
          <lpage>1217</lpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Hinton</source>
          , 2010]
          <string-name>
            <given-names>Geoffrey</given-names>
            <surname>Hinton</surname>
          </string-name>
          .
          <article-title>A practical guide to training restricted boltzmann machines</article-title>
          .
          <source>Technical Report 1</source>
          , University of Toronto,
          <year>August 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [Holme and Sarama¨ki, 2012]
          <article-title>Petter Sarama¨ki. Temporal networks</article-title>
          .
          <volume>519</volume>
          (
          <issue>3</issue>
          ):
          <fpage>97</fpage>
          -
          <lpage>125</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [Kleinberg et al.,
          <year>1999</year>
          ]
          <string-name>
            <surname>Jon</surname>
            <given-names>M Kleinberg</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Ravi</given-names>
            <surname>Kumar</surname>
          </string-name>
          , Prabhakar Raghavan,
          <article-title>Sridhar Rajagopalan, and Andrew S Tomkins. The web as a graph: measurements, models, and methods</article-title>
          .
          <source>In Computing and combinatorics</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . Springer,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>[Leydesdorff</source>
          , 2007]
          <string-name>
            <given-names>Loet</given-names>
            <surname>Leydesdorff</surname>
          </string-name>
          .
          <article-title>Betweenness centrality as an indicator of the interdisciplinarity of scientific journals</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>58</volume>
          (
          <issue>9</issue>
          ):
          <fpage>1303</fpage>
          -
          <lpage>1319</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[Ramalingam</source>
          , 2013]
          <string-name>
            <given-names>Ben</given-names>
            <surname>Ramalingam</surname>
          </string-name>
          .
          <article-title>Aid on the edge of chaos: rethinking international cooperation in a complex world</article-title>
          . Oxford University Press,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [Rowe et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Mattew</given-names>
            <surname>Rowe</surname>
          </string-name>
          , Miriam Fernandez, and
          <string-name>
            <given-names>Harith</given-names>
            <surname>Alani</surname>
          </string-name>
          .
          <article-title>Modelling and analysis of user behaviour in online communities: Ieee computer society special technical community on social networking e-letter</article-title>
          . IEEE Computer Society Special Technical Community on Social Networking E-Letter,
          <volume>1</volume>
          (
          <issue>2</issue>
          ), May
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Verstraeten et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>David</given-names>
            <surname>Verstraeten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Benjamin</given-names>
            <surname>Schrauwen</surname>
          </string-name>
          , Sander Dieleman, Philemon Brakel, Pieter Buteneers, and
          <string-name>
            <given-names>Dejan</given-names>
            <surname>Pecevski</surname>
          </string-name>
          .
          <article-title>Oger: modular learning architectures for large-scale sequential processing</article-title>
          .
          <source>The Journal of Machine Learning Research</source>
          ,
          <volume>13</volume>
          (
          <issue>1</issue>
          ):
          <fpage>2995</fpage>
          -
          <lpage>2998</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [Yang et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Diyi</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Tanmay</given-names>
            <surname>Sinha</surname>
          </string-name>
          , David Adamson, and Carolyn Penstein Rose´. Turn on, tune in, drop out:
          <article-title>Anticipating student dropouts in massive open online courses</article-title>
          .
          <source>In Proceedings of the 2013 NIPS Data-Driven Education Workshop</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>