<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>N. Pirttinen);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Coverage of Course Topics in Learnersourced SQL Exercises</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nea Pirttinen</string-name>
          <email>nea.pirttinen@helsinki.fi</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arto Hellas</string-name>
          <email>arto.hellas@aalto.fi</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juho Leinonen</string-name>
          <email>juho.2.leinonen@aalto.fi</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aalto University</institution>
          ,
          <addr-line>Espoo</addr-line>
          ,
          <country country="FI">Finland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Helsinki</institution>
          ,
          <addr-line>Helsinki</addr-line>
          ,
          <country country="FI">Finland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>Learnersourcing is a common task in modern computing classrooms, where it is used, for example, for the creation of educational resources such as multiple-choice questions and programming exercises. One less studied type of learnersourced artefact is SQL exercises. In this work, we explore how well diferent SQL topics are covered by learnersourced SQL exercises. Covering most course topics would allow students to practice the full content of the course by completing learnersourced exercises. Our results suggest that learnersourcing can be used to create a large pool of SQL exercises that cover most of the topics of the course.</p>
      </abstract>
      <kwd-group>
        <kwd>learnersourcing</kwd>
        <kwd>crowdsourcing</kwd>
        <kwd>SQL</kwd>
        <kwd>databases</kwd>
        <kwd>topic coverage</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Tool and Context</title>
      <sec id="sec-2-1">
        <title>2.1. SQL Trainer</title>
        <p>SQL Trainer is a learnersourcing system for practicing SQL queries [7]. When using the system,
a student is presented with a list of topics in the order of appearance in the course material.
Students can choose to either create exercises or practice a specific topic. If the student chooses
to create an exercise for a particular topic, the system will ask which database the student wishes
to use (selection is made from a pre-defined list of databases created by the course instructor).
The student is then required to give a name for the exercise, an exercise description, and a
sample solution for that exercise. Once the exercise is completed, it is added to the pool of
exercises for that topic. If the student chooses to practice a topic, they will be given a randomly
selected exercise that they have not completed yet from the pool of created exercises.</p>
        <p>A more detailed description of SQL Trainer and its usage, as well as the description of the
teacher view, is presented in [7].</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Context and Research Questions</title>
        <p>For the present study, we use data collected from three introductory database courses ofered
by the University of Helsinki, where SQL trainer was used to support SQL practice. In the
courses, SQL trainer had 11 topics (briefly outlined in Table 1). Approximately 10% of the course
grade was based on completing at least four exercises per course topic and creating at least a
single exercise per course topic. Students were free to complete and create more exercises than
required for the points, however.</p>
        <p>Our research questions are as follows:
RQ1. Which topics do students create exercises for?
RQ2. How well do the exercises created by students cover the course topics?</p>
        <p>To answer research question 1 and 2, we partially replicate earlier work by Denny et al.
[8] and Purchase et al. [9] by studying to what extent the exercises that students complete
and create cover the course topics. In particular, we study how many exercises are created by
students per course topic, and how the used SQL concepts match the topics.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results and Discussion</title>
      <sec id="sec-3-1">
        <title>3.1. Created Exercises</title>
        <p>In total, 1569 students entered the system. Out of these, a total of 1187 created at least one
exercise of their own. In total, students created 11247 exercises for 11 diferent instructor-defined
topics. The number of created exercises for each topic can be found in Table 2 (topic number
references listed in Table 1).</p>
        <p>From Table 2, we can see that there is a clear downwards trend in the number of created
exercises per topic. One possible reason for this is that the latter topics come later in the course
and thus fewer students actively participate at that point in the course. Another potential
explanation for the trend is that the latter topics are related to somewhat more dificult topics
and thus students might not be as inclined to create exercises for those. One clear outlier is
topic 5, which was related to “other types of joins”. Why this topic elicited fewer learnersourced
exercises requires further research.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Topic Coverage</title>
        <p>Figure 1 presents the most common SQL concepts for each topic (topic number references
listed in Table 1). Only concepts that were relatively common (occurred in over 10% of created
exercises in at least a single topic) are included. In the vast majority of cases, students are able
to identify relevant concepts for the topic, and utilize them when creating an exercise.</p>
        <p>Interestingly, based on the SQL concepts used in topic 5, students seem to majorly prefer
creating exercises related to left joins compared to right or inner joins (which occurred rarely
enough that they were not included in the figure). Similarly, for topic 6, we can see that students
are more inclined to create exercises related to creating tables than dropping tables.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Future Work</title>
        <p>Altogether, these preliminary results support the use of learnersourcing for creating a large
exercise pool with a good coverage of course topics. This provides the opportunity for students
to practice a wide variety of SQL concepts using the learnersourced exercises. In our future
work, we are interested in extending this research to similarly examine how much students
practice the diferent concepts using the system. In addition, we are keen to study whether the
exercises created by students cover varying levels of dificulty and if the exercises are efective.
Lastly, we are exploring whether there are diferences between demographic groups in to what
extent they participate in learnersourcing activities.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments References</title>
      <p>We are grateful for the doctoral research grant awarded by Jenny and Antti Wihuri Foundation
to the first author.
[1] P. Denny, J. Hamer, A. Luxton-Reilly, H. Purchase, Peerwise: students sharing their multiple
choice questions, in: Proceedings of the fourth international workshop on computing
education research, 2008, pp. 51–58.
[2] S. Moore, H. A. Nguyen, J. Stamper, Examining the efects of student participation and
performance on the quality of learnersourcing multiple-choice questions, in: Proceedings
of the Eighth ACM Conference on Learning@ Scale, 2021, pp. 209–220.
[3] A. Singh, C. Brooks, Y. Lin, W. Li, What’s in it for the learners? evidence from a randomized
ifeld experiment on learnersourcing questions in a mooc, in: Proceedings of the Eighth
ACM Conference on Learning@ Scale, 2021, pp. 221–233.
[4] H. Khosravi, K. Kitto, J. J. Williams, Ripple: A crowdsourced adaptive platform for
recommendation of learning activities., Journal of Learning Analytics 6 (2019) 91–105.
[5] N. Pirttinen, V. Kangas, I. Nikkarinen, H. Nygren, J. Leinonen, A. Hellas,
Crowdsourcing programming assignments with crowdsorcerer, in: Proceedings of the 23rd Annual
ACM Conference on Innovation and Technology in Computer Science Education, 2018, pp.
326–331.
[6] P. Denny, A. Luxton-Reilly, E. Tempero, J. Hendrickx, Codewrite: supporting student-driven
practice of java, in: Proceedings of the 42nd ACM technical symposium on Computer
science education, 2011, pp. 471–476.
[7] J. Leinonen, N. Pirttinen, A. Hellas, Crowdsourcing content creation for sql practice, in:
Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer
Science Education, 2020, pp. 349–355.
[8] P. Denny, A. Luxton-Reilly, J. Hamer, H. Purchase, Coverage of course topics in a student
generated mcq repository, ACM SIGCSE Bulletin 41 (2009) 11–15.
[9] H. Purchase, J. Hamer, P. Denny, A. Luxton-Reilly, The quality of a peerwise mcq repository,
in: Proceedings of the Twelfth Australasian Conference on Computing Education - Volume
103, ACE ’10, Australian Computer Society, Inc., AUS, 2010, p. 137–146.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>