=Paper= {{Paper |id=Vol-1746/paper-16 |storemode=property |title=Student Assessment by Optimal Questionnaire Design |pdfUrl=https://ceur-ws.org/Vol-1746/paper-16.pdf |volume=Vol-1746 |authors=Melisa Aruci,Giuseppina Lotito,Giuseppe Pirlo |dblpUrl=https://dblp.org/rec/conf/rtacsit/AruciLP16 }} ==Student Assessment by Optimal Questionnaire Design== https://ceur-ws.org/Vol-1746/paper-16.pdf
                Student Assessment by Optimal Questionnaire Design

       Melisa Aruci                        Giuseppina Lotito                       Giuseppe Pirlo
Departmenti i Informatikës             “Gorjux-Tridente-Vivante”            Dipartimento di Informatica
 Fakulteti i Shkencave të                   70125 Bari, Italy                Univ. di Bari, 70125 Bari,
 Natyrës, Tiranë, Albania              {giuseppina.lotito@istruzio                     Italy
{melisa.aruci@gmail.com}                         ne.it}                     {giuseppe.pirlo@uniba.it}


                       Abstract                              The organization of the paper is the following. Section 2
                                                             presents the problem of item evaluation by IRT. The
    In this paper a new technique is presented for           problem of optimal questionnaire design is formally
    automatic design of optimal questionnaires.              described in Section 3. Section 4 presents the genetic
    The technique, that is based on the Item                 algorithm used for automatic questionnaire design.
    Response Theory, performs multiple-choice                Section 5 presents the experimental results. Section 6
    item selection by a Genetic Algorithm. The               reports the conclusion.
    experimental results demonstrate the validity
    of the proposed approach to adjust the                   2. Item Evaluation by IRT
    characteristics of the questionnaire to the
    abilities of the student class.                          IRT states that responses to a set of items can be
                                                             explained by the existence of one or more latent traits,
1. Introduction                                              named abilities [Van der Linen1997; Fraley2000]. A
Computer-based student assessment is now considered          main objective in item response modelling is to
a fundamental service of Learning Management                 characterize the relation between a latent trait, , and
Systems [Amelung2011; Dimauro2003; Romero2008;               the probability of item endorsement. This relation is
Greco2006b]. Although several types of computer-             typically referred to as the Item Characteristic Curve
based systems for student’s assessment have been             (ICC) and can be defined as the (nonlinear) regression
proposed so far Multiple-choice Item on-line                 line that represents the probability of endorsing an item
Questionnaires (MIQs) is the most diffuse approach           (or an item response category) as a function of the
[Lan2011; Romero2010] since they can be easily               underlying trait [Fraley2000]. For the purpose of this
integrated into computer-based assessment systems            work, the Two-Parameter Logistic Model (2PLM)
[Kuechler2003; Romero2009]. When a MIQ is                    [Birnbaum1968] is considered. In this case, given the
considered, students are asked to select the best            set of items T={t1, t2,…, tj…, tM}, the probability that
possible answer from the choices provided on a list          an individual with trait level i will endorse item tj is
[Kuechler2003]. Data from MIQs can be used for               defined as a function [Birnbaum1968]:
providing      personalized       learning   suggestions
[Chu2006], for the analysis of individual targets                                                 1
[Yamanishi2001], for discovering the individual needs                      Pj ( i )            j ( i   j )
                                                                                                                     (1)
of the students [Pechenizkiy2008], for discovering rule                                  1 e
patterns [Chen2009]. Unfortunately, the design of a          where j and j are the item discrimination parameter
questionnaire is a complex task that requires the            and the item difficulty parameter, respectively. The
selection of the set of items most advantageous for          difficulty parameter j represents the level of the latent
assessing the skill level of a student [Lan2011].            trait necessary for an individual to have a 50%
In this paper a new approach for optimal questionnaire       probability of endorsing the item; the item
design is proposed, based on the Item Response Theory        discrimination parameter j represents an item’s ability
(IRT). A questionnaire is considered as an entity that       to differentiate between people with contiguous trait
must be tailored according to the specific                   levels. Of course, items are not equally informative
characteristics of the group of students to be assessed.     across the entire range of the trait . In fact, an item
The proposed approach uses a two-steps strategy. In          yields the most information when i equals j.In the
the first step the system estimates item difficulty for a    IRT, an item is considered difficult if a high level of
given student class with specific abilities. In the second   ability or knowledge is required to answer it correctly.
step a Genetic Algorithm (GA) is used to determine the       Therefore, the difference Pj(max)-Pj(min) can be used
best set of items to be included in the questionnaire.
to estimate the extent to which item tj is effective to        [Michalewicz1996; Goldberg1989]. The genetic
assess students in the range [min, max]: the greater the     approach is based on the following phases
difference Pj(max)-Pj(min) the better the item tj.           [Baeck1996].        The        initial   –     population
                                                               Pop=Npopof random individuals was
                                                               created. In our tests Npop has been set to 20. since some
                                                               preliminary experiments have shown Npop=20 is a good
                                                               trade-off between convergence speed of the genetic
                                                               algorithm and its capability to escape from local
                                                               extrema. In our approach, each individual (that is a
                                                               MIQ) is represented by a vector kh h hj
                                                               h, where each gene hj was a Boolean value: hj=0
                                                               means that j-th item of T (i.e. the item tj) was not
                                                               included in MIQ; hj=1 means that j-th item of T (i.e.
   Figure 1. Item Effectiveness Estimation by ICCs             the item tj) was included in Q. Of course, since P items
                                                               must be included into the questionnaire Q, the
Figure 1 shows the ICCs of two items t1 and t2. In this        following normalization procedure was performed for
case, the results indicated that t1 is better than t2 for      each individual k. In particular, let be
assessing the students in the range [min, max], since        P’=h1+h2+…+hM, if P’>P then select randomly (P’-P)
P1(max)-P1(min) > P2(max)-P2(min).                         genes equal to 1 and set them to 0; if P’ < P then select
                                                               randomly (P-P’) genes equal to 0 and set them to 1.
3. A Theoretical Approach to Optimal                           Successively, the fitness function was computed for
MIQ Design by GA                                               each individual k of the population, according to eq.
In this paper the problem of optimal MIQ design is             (2).
considered as an optimization process in which - from          From the initial - population, the following four genetic
the set of M items T - the subset of N items (N