=Paper= {{Paper |id=Vol-1746/paper-16 |storemode=property |title=Student Assessment by Optimal Questionnaire Design |pdfUrl=https://ceur-ws.org/Vol-1746/paper-16.pdf |volume=Vol-1746 |authors=Melisa Aruci,Giuseppina Lotito,Giuseppe Pirlo |dblpUrl=https://dblp.org/rec/conf/rtacsit/AruciLP16 }} ==Student Assessment by Optimal Questionnaire Design== https://ceur-ws.org/Vol-1746/paper-16.pdf

Student Assessment by Optimal Questionnaire Design

Melisa Aruci Giuseppina Lotito Giuseppe Pirlo
Departmenti i Informatikës “Gorjux-Tridente-Vivante” Dipartimento di Informatica
Fakulteti i Shkencave të 70125 Bari, Italy Univ. di Bari, 70125 Bari,
Natyrës, Tiranë, Albania {giuseppina.lotito@istruzio Italy
{melisa.aruci@gmail.com} ne.it} {giuseppe.pirlo@uniba.it}

Abstract The organization of the paper is the following. Section 2
presents the problem of item evaluation by IRT. The
In this paper a new technique is presented for problem of optimal questionnaire design is formally
automatic design of optimal questionnaires. described in Section 3. Section 4 presents the genetic
The technique, that is based on the Item algorithm used for automatic questionnaire design.
Response Theory, performs multiple-choice Section 5 presents the experimental results. Section 6
item selection by a Genetic Algorithm. The reports the conclusion.
experimental results demonstrate the validity
of the proposed approach to adjust the 2. Item Evaluation by IRT
characteristics of the questionnaire to the
abilities of the student class. IRT states that responses to a set of items can be
explained by the existence of one or more latent traits,
1. Introduction named abilities [Van der Linen1997; Fraley2000]. A
Computer-based student assessment is now considered main objective in item response modelling is to
a fundamental service of Learning Management characterize the relation between a latent trait, , and
Systems [Amelung2011; Dimauro2003; Romero2008; the probability of item endorsement. This relation is
Greco2006b]. Although several types of computer- typically referred to as the Item Characteristic Curve
based systems for student’s assessment have been (ICC) and can be defined as the (nonlinear) regression
proposed so far Multiple-choice Item on-line line that represents the probability of endorsing an item
Questionnaires (MIQs) is the most diffuse approach (or an item response category) as a function of the
[Lan2011; Romero2010] since they can be easily underlying trait [Fraley2000]. For the purpose of this
integrated into computer-based assessment systems work, the Two-Parameter Logistic Model (2PLM)
[Kuechler2003; Romero2009]. When a MIQ is [Birnbaum1968] is considered. In this case, given the
considered, students are asked to select the best set of items T={t1, t2,…, tj…, tM}, the probability that
possible answer from the choices provided on a list an individual with trait level i will endorse item tj is
[Kuechler2003]. Data from MIQs can be used for defined as a function [Birnbaum1968]:
providing personalized learning suggestions
[Chu2006], for the analysis of individual targets 1
[Yamanishi2001], for discovering the individual needs Pj ( i )   j ( i   j )
(1)
of the students [Pechenizkiy2008], for discovering rule 1 e
patterns [Chen2009]. Unfortunately, the design of a where j and j are the item discrimination parameter
questionnaire is a complex task that requires the and the item difficulty parameter, respectively. The
selection of the set of items most advantageous for difficulty parameter j represents the level of the latent
assessing the skill level of a student [Lan2011]. trait necessary for an individual to have a 50%
In this paper a new approach for optimal questionnaire probability of endorsing the item; the item
design is proposed, based on the Item Response Theory discrimination parameter j represents an item’s ability
(IRT). A questionnaire is considered as an entity that to differentiate between people with contiguous trait
must be tailored according to the specific levels. Of course, items are not equally informative
characteristics of the group of students to be assessed. across the entire range of the trait . In fact, an item
The proposed approach uses a two-steps strategy. In yields the most information when i equals j.In the
the first step the system estimates item difficulty for a IRT, an item is considered difficult if a high level of
given student class with specific abilities. In the second ability or knowledge is required to answer it correctly.
step a Genetic Algorithm (GA) is used to determine the Therefore, the difference Pj(max)-Pj(min) can be used
best set of items to be included in the questionnaire.
to estimate the extent to which item tj is effective to [Michalewicz1996; Goldberg1989]. The genetic
assess students in the range [min, max]: the greater the approach is based on the following phases
difference Pj(max)-Pj(min) the better the item tj. [Baeck1996]. The initial – population
Pop=Npopof random individuals was
created. In our tests Npop has been set to 20. since some
preliminary experiments have shown Npop=20 is a good
trade-off between convergence speed of the genetic
algorithm and its capability to escape from local
extrema. In our approach, each individual (that is a
MIQ) is represented by a vector kh h hj
h, where each gene hj was a Boolean value: hj=0
means that j-th item of T (i.e. the item tj) was not
included in MIQ; hj=1 means that j-th item of T (i.e.
Figure 1. Item Effectiveness Estimation by ICCs the item tj) was included in Q. Of course, since P items
must be included into the questionnaire Q, the
Figure 1 shows the ICCs of two items t1 and t2. In this following normalization procedure was performed for
case, the results indicated that t1 is better than t2 for each individual k. In particular, let be
assessing the students in the range [min, max], since P’=h1+h2+…+hM, if P’>P then select randomly (P’-P)
P1(max)-P1(min) > P2(max)-P2(min). genes equal to 1 and set them to 0; if P’ < P then select
randomly (P-P’) genes equal to 0 and set them to 1.
3. A Theoretical Approach to Optimal Successively, the fitness function was computed for
MIQ Design by GA each individual k of the population, according to eq.
In this paper the problem of optimal MIQ design is (2).
considered as an optimization process in which - from From the initial - population, the following four genetic
the set of M items T - the subset of N items (N