=Paper=
{{Paper
|id=Vol-1746/paper-16
|storemode=property
|title=Student Assessment by Optimal Questionnaire Design
|pdfUrl=https://ceur-ws.org/Vol-1746/paper-16.pdf
|volume=Vol-1746
|authors=Melisa Aruci,Giuseppina Lotito,Giuseppe Pirlo
|dblpUrl=https://dblp.org/rec/conf/rtacsit/AruciLP16
}}
==Student Assessment by Optimal Questionnaire Design==
Student Assessment by Optimal Questionnaire Design Melisa Aruci Giuseppina Lotito Giuseppe Pirlo Departmenti i Informatikës “Gorjux-Tridente-Vivante” Dipartimento di Informatica Fakulteti i Shkencave të 70125 Bari, Italy Univ. di Bari, 70125 Bari, Natyrës, Tiranë, Albania {giuseppina.lotito@istruzio Italy {melisa.aruci@gmail.com} ne.it} {giuseppe.pirlo@uniba.it} Abstract The organization of the paper is the following. Section 2 presents the problem of item evaluation by IRT. The In this paper a new technique is presented for problem of optimal questionnaire design is formally automatic design of optimal questionnaires. described in Section 3. Section 4 presents the genetic The technique, that is based on the Item algorithm used for automatic questionnaire design. Response Theory, performs multiple-choice Section 5 presents the experimental results. Section 6 item selection by a Genetic Algorithm. The reports the conclusion. experimental results demonstrate the validity of the proposed approach to adjust the 2. Item Evaluation by IRT characteristics of the questionnaire to the abilities of the student class. IRT states that responses to a set of items can be explained by the existence of one or more latent traits, 1. Introduction named abilities [Van der Linen1997; Fraley2000]. A Computer-based student assessment is now considered main objective in item response modelling is to a fundamental service of Learning Management characterize the relation between a latent trait, , and Systems [Amelung2011; Dimauro2003; Romero2008; the probability of item endorsement. This relation is Greco2006b]. Although several types of computer- typically referred to as the Item Characteristic Curve based systems for student’s assessment have been (ICC) and can be defined as the (nonlinear) regression proposed so far Multiple-choice Item on-line line that represents the probability of endorsing an item Questionnaires (MIQs) is the most diffuse approach (or an item response category) as a function of the [Lan2011; Romero2010] since they can be easily underlying trait [Fraley2000]. For the purpose of this integrated into computer-based assessment systems work, the Two-Parameter Logistic Model (2PLM) [Kuechler2003; Romero2009]. When a MIQ is [Birnbaum1968] is considered. In this case, given the considered, students are asked to select the best set of items T={t1, t2,…, tj…, tM}, the probability that possible answer from the choices provided on a list an individual with trait level i will endorse item tj is [Kuechler2003]. Data from MIQs can be used for defined as a function [Birnbaum1968]: providing personalized learning suggestions [Chu2006], for the analysis of individual targets 1 [Yamanishi2001], for discovering the individual needs Pj ( i ) j ( i j ) (1) of the students [Pechenizkiy2008], for discovering rule 1 e patterns [Chen2009]. Unfortunately, the design of a where j and j are the item discrimination parameter questionnaire is a complex task that requires the and the item difficulty parameter, respectively. The selection of the set of items most advantageous for difficulty parameter j represents the level of the latent assessing the skill level of a student [Lan2011]. trait necessary for an individual to have a 50% In this paper a new approach for optimal questionnaire probability of endorsing the item; the item design is proposed, based on the Item Response Theory discrimination parameter j represents an item’s ability (IRT). A questionnaire is considered as an entity that to differentiate between people with contiguous trait must be tailored according to the specific levels. Of course, items are not equally informative characteristics of the group of students to be assessed. across the entire range of the trait . In fact, an item The proposed approach uses a two-steps strategy. In yields the most information when i equals j.In the the first step the system estimates item difficulty for a IRT, an item is considered difficult if a high level of given student class with specific abilities. In the second ability or knowledge is required to answer it correctly. step a Genetic Algorithm (GA) is used to determine the Therefore, the difference Pj(max)-Pj(min) can be used best set of items to be included in the questionnaire. to estimate the extent to which item tj is effective to [Michalewicz1996; Goldberg1989]. The genetic assess students in the range [min, max]: the greater the approach is based on the following phases difference Pj(max)-Pj(min) the better the item tj. [Baeck1996]. The initial – population Pop=Npopof random individuals was created. In our tests Npop has been set to 20. since some preliminary experiments have shown Npop=20 is a good trade-off between convergence speed of the genetic algorithm and its capability to escape from local extrema. In our approach, each individual (that is a MIQ) is represented by a vector kh h hj h, where each gene hj was a Boolean value: hj=0 means that j-th item of T (i.e. the item tj) was not included in MIQ; hj=1 means that j-th item of T (i.e. Figure 1. Item Effectiveness Estimation by ICCs the item tj) was included in Q. Of course, since P items must be included into the questionnaire Q, the Figure 1 shows the ICCs of two items t1 and t2. In this following normalization procedure was performed for case, the results indicated that t1 is better than t2 for each individual k. In particular, let be assessing the students in the range [min, max], since P’=h1+h2+…+hM, if P’>P then select randomly (P’-P) P1(max)-P1(min) > P2(max)-P2(min). genes equal to 1 and set them to 0; if P’ < P then select randomly (P-P’) genes equal to 0 and set them to 1. 3. A Theoretical Approach to Optimal Successively, the fitness function was computed for MIQ Design by GA each individual k of the population, according to eq. In this paper the problem of optimal MIQ design is (2). considered as an optimization process in which - from From the initial - population, the following four genetic the set of M items T - the subset of N items (N