<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Virtual Teaching Assistant for Personalized Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Extended Abstract</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Luca Benedetto Politecnico di Milano Milan</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Manuel Parenti Politecnico di Milano Milan</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Paolo Cremonesi Politecnico di Milano Milan</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this extended abstract, we propose an intelligent system that can be used as a Personalized Virtual Teaching Assistant (PVTA) to improve the students' learning experience both for online and on-site courses. We show the architecture of such system, which is composed of an instance of IBM's Watson Assistant and a server, and present an initial implementation, consisting in a chatbot that can be questioned about the content and the organization of the RecSys course, an introductory course on recommender systems.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Intelligent systems are extensively used in many domains, and they
can bring some relevant advantages in education as well: indeed,
they ofer the opportunity to improve the learning experience and
the quality of teaching, both in the case of online and on-site courses.
So far, diverse applications have been explored: for instance, some
research focused on the usage of recommender systems for
suggesting new learning content [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and to perform students’ performance
prediction [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Predicting students’ performance is particularly
important for e-learning, in order to improve retention and completion
rates, which are one of the biggest limitations of online learning
[
        <xref ref-type="bibr" rid="ref4 ref7">4, 7</xref>
        ]. Several works, such as [
        <xref ref-type="bibr" rid="ref1 ref3">1, 3</xref>
        ], discussed the possibility of
using virtual teaching assistants (VTA) in order to reduce
professors’ workloads: indeed, VTAs can make education much more
scalable since students can solve most of their problems without
asking the professors for help. This extended abstract lies in this
last branch of research: we propose a Personalized Virtual Teaching
Copyright © CIKM 2018 for the individual papers by the papers'
authors. Copyright © CIKM 2018 for the volume as a collection
by its editors. This volume and its papers are published under
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>IBM’s Watson Assistant</title>
      <p>The IBM’s Watson Assistant1 (called “Assistant” from now on)
is ofered by IBM as part of the Watson suite, an AI engine that
provides several NLP services. It is able to “understand
naturallanguage input and use machine learning to respond to customers
in a way that simulates a conversation between humans” and can be
used to build virtual assistants. IBM does not share with customers
the details of the core NLP model, thus Assistant can be used only
as a black-box; however, we can adapt it to any desired application
domain by feeding it with additional data in order to enlarge the
training set and fine-tune the model. The training data we can
feed the model with is made of objects belonging to two classes:
intents and entities: intents identify the goals that we expect a user
to have while interacting with the system, while entities afect the
way in which Assistant reacts to a specific intent by giving it a
context. Once defined intents and entities, it is necessary to create
the dialog flow in order to teach Assistant how it should answer
diferent requests.
2</p>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORK</title>
      <p>
        In relation to this project, the most important work is the
introduction of Jill Watson (JW) by A. Goel et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], a VTA somewhat
similar to the PVTA proposed in this document. However, although
the authors showed the possible applications of JW, they never
presented the details of the implementation and always
considered it as a black-box; we aim at filling that gap, focusing on the
architecture of our PVTA and explaining the role of the diferent
components it is made of. Also, there are some diferences between
the two systems: JW aimed at completely replacing human TAs,
thus it had to deal with situations outside of the educational domain,
which caused some problems that still have to be addressed [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Our PVTA, instead, focuses on helping students in relation to the
contents and the structure of the course, therefore we will not have
1https://console.bluemix.net/docs/services/conversation
to deal with that kind of issues. Lastly, JW did not provide any kind
of personalization, while we are building a personalized system.
      </p>
    </sec>
    <sec id="sec-4">
      <title>3 SYSTEM ARCHITECTURE</title>
      <p>As shown in Figure 1, the PVTA is made of three main components:
the front-end, a server and an instance of Assistant. The server
contains the data about the course and the students, while the
instance of Assistant contains the intents and entities we defined.</p>
    </sec>
    <sec id="sec-5">
      <title>3.1 Front-end</title>
      <p>The front-end of the PVTA is a simple application that enables the
user to interact with Assistant and does not perform any operations
on the input data; for the first implementation we used Slack 2.</p>
    </sec>
    <sec id="sec-6">
      <title>3.2 Server</title>
      <p>It is involved in diferent phases of the process: at the moment it
performs up to three tasks for each question but this number will
increase in the future, as we add new services to the PVTA.
3.2.1 Preprocessing. It is necessary if the server knows some
information that Assistant is unaware of. Watson has no long-term
memory to store the context of the interaction with a student and
we must leverage the server in order to do so. As an example, if a
student asks a question using the pronoun “it” referring to
something he said in a previous question, the server has to modify the
sentence in order for Assistant to understand such reference.
3.2.2 Post-processing. This is the analogous of preprocessing; for
instance, when a student asks for the date of an exam, Assistant
is able to understand what the user is interested in but it does not
have access to the schedule (which is stored in the server). Thus,
it sends an incomplete answer to the server, which fills the gap by
adding the date and time of the exam.
3.2.3 Students’ modeling. The server contains some intelligence
as well: it collects data about students’ behavior and clusters them
in diferent groups using as similarity the intents and the entities
they searched for.
3.2.4 Interaction with a human TA. Figure 1 shows that the server
can interact with a human TA: Assistant assigns a confidence level
to each answer and, if that is too low, the server forwards the
question and the proposed answer to the human TA. He checks
the proposed answer, possibly corrects it and sends it back to the
server, which forwards it to the student and inserts the correct
question-response pair in the training set (stored in Assistant).</p>
    </sec>
    <sec id="sec-7">
      <title>3.3 Assistant</title>
      <p>It is responsible for the NLP-related tasks of the VTA: it receives the
question (possibly preprocessed) and provides an answer (which
might require post-processing). In order to build a system capable
of working in the educational domain, we had to define the entities
and the intents related to such domain, as well as all the technical
terms specific to the RecSys course. Each entity does not represent
a unique concept, but a group of concepts; also, each concept might
be referred to with diferent synonyms. So far, we defined 50 intents
and 20 entities (more than 170 diferent concepts) but this number
is likely to increase in the future, as we add new functionalities.</p>
    </sec>
    <sec id="sec-8">
      <title>4 CONCLUSION AND FUTURE WORK</title>
      <p>This document proposed an architecture for a PVTA capable of
providing several services to students of online and on-site courses,
moving towards the goal of “assisted learning”. Also, we introduced
an initial implementation of such system, consisting in a chatbot
capable of answering the questions of students enrolled in a course
about recommender systems. This chatbot is only a small part of the
PVTA we propose, and we are working on new services to be
implemented in the system. We are working on ways to continuously
enrich the set of intents, entities and rules by monitoring the
interactions between students and the PVTA: by means of knowledge
extraction algorithms we aim to keep fine-tuning and improving the
model even after deployment. We also plan to implement student
engagement: looking at students’ behavior, the PVTA can
understand which are the students at risk of dropping-out and the ones
not satisfied with the course; then, it can proactively intervene or
send a warning to human TAs. Another aspect we are focusing on
is analyzing whether it is possible to reduce the usage of the IBM’s
Watson Assistant and perform a bigger part of the NLP-related
tasks in the server. Indeed, we have two objectives in mind:
implementing some NLG (natural language generation) in the server in
order to overcome the biggest limitation of Assistant, which is the
impossibility to generate answers, and - as a second step -
exploring the generation of personalized answers. Lastly, further work
is focusing on the possibility of personalizing learning material,
recommending diferent contents depending on the student and its
interactions with the PVTA.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>B. du Boulay.</surname>
          </string-name>
          <year>2016</year>
          .
          <article-title>Artificial Intelligence as an Efective Classroom Assistant</article-title>
          .
          <source>IEEE Intelligent Systems 31, 6 (Nov.-Dec</source>
          .
          <year>2016</year>
          ),
          <fpage>76</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Eicher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Polepeddi</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Goel</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Jill Watson Doesn't Care if You're Pregnant: Grounding AI Ethics in Empirical Studies</article-title>
          .
          <source>In AAAI/ACM Conference on Artificial Intelligence</source>
          , Ethics, and
          <string-name>
            <surname>Society</surname>
          </string-name>
          (AIES'18) .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Goel</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Polepeddi</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Jill Watson: A Virtual Teaching Assistant for Online Education</article-title>
          .
          <source>Retrieved July 16</source>
          ,
          <year>2018</year>
          from https://smartech.gatech.edu/ handle/1853/59104
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hlosta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zdrahal</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zendulka</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Ouroboros: Early identification of at-risk students without models based on legacy data</article-title>
          .
          <source>In Proceedings of the Seventh International Learning Analytics and Knowledge Conference (LAK'17)</source>
          . ACM,
          <fpage>6</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Klasnja-Milicevic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ivanovic</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Nanopoulos</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Recommender systems in e-learning environments: a survey of the state-of-the-art and possible extensions</article-title>
          .
          <source>Artificial Intelligence Review</source>
          <volume>44</volume>
          ,
          <issue>4</issue>
          (
          <year>December 2015</year>
          ),
          <fpage>571</fpage>
          -
          <lpage>604</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Thai-Nghe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Drumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krohn-Grimberghe</surname>
          </string-name>
          , and L.
          <string-name>
            <surname>Schmidt-Thieme</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Recommender Systems for Predicting Students Performance</article-title>
          .
          <source>Procedia Computer Science</source>
          <volume>1</volume>
          ,
          <issue>2</issue>
          (
          <year>January 2010</year>
          ),
          <fpage>2811</fpage>
          -
          <lpage>2819</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Wolf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zdrahal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Pantucek</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Improving retention: predicting at-risk students by analysing clicking behaviour in a virtual learning environment</article-title>
          .
          <source>In Proceedings of the Third International Learning Analytics and Knowledge Conference (LAK'13)</source>
          . ACM,
          <volume>145</volume>
          -
          <fpage>149</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>