<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Quantitative Conceptual Model Analysis for Evaluating Simple Class Diagrams made by Novices</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mizue Kayama</string-name>
          <email>kayama@cs.shinshu-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shinpei Ogata</string-name>
          <email>ogata@cs.shinshu-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David K. Asano</string-name>
          <email>david@cs.shinshu-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Masami Hashimoto</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Shinshu University</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>6</fpage>
      <lpage>13</lpage>
      <abstract>
        <p>In this paper, we aim to propose criteria for evaluating conceptual modeling errors made by university freshmen. We quantitatively analyzed class diagrams made by novice learners. Based on the results of three types of experiments, we propose 12 criteria, which are divided into 4 types, for evaluating class diagrams made by novices.</p>
      </abstract>
      <kwd-group>
        <kwd>conceptual modeling</kwd>
        <kwd>class diagram</kwd>
        <kwd>criteria</kwd>
        <kwd>quantitative analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Page 6</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        These days, educational methods or learning courses related to conceptual modeling
have been explored in many educational institutes, academic conferences and
academic journals [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
        ]. The learners who were subjects of previous research were
mainly third or fourth-year undergraduate students, graduate students and/or young
engineers in computer science (CS). They already had finished many specialized
classes related to programming, object-oriented analysis and design, databases and so
on. Especially, almost all of the students were in a CS program as their major[
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ]. A
few studies whose subjects had little prerequisite knowledge, for example, high
school students (pre-university), undergraduate students in non-CS programs and CS
freshmen in their first semester, also reported their teaching experience or teaching
methods related to modeling education [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6-8</xref>
        ]. However, quantitative evaluation
results were not shown in their reports. Especially for students in pre-university,
NonCS and CS freshmen, there are no quantitative studies about model based thinking
and subsequent curricula of conceptual modeling.
      </p>
      <p>
        In this paper, we aim to propose criteria for evaluating conceptual modeling
errors made by university freshmen. To achieve this research goal, we quantitatively
analyzed class diagrams made by students. During this analysis, we asked ourselves
“What kind of criteria are suitable for novice learners when they create conceptual
models? ” and “Are there any differences between the scores of novice learners with
and without programming knowledge? ” There are two differences between previous
research and our research. The first difference is our subjects. We focus on
university freshmen and pre-university students, who have not taken any kind of CS specific
courses. The other difference is the empirical and continual research method. We
have been engaged in this research since 2010 [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>Research Methods</title>
      <sec id="sec-3-1">
        <title>Overview</title>
        <p>
          We define conceptual modeling as a way of thinking to solve problems using
engineering methodology. The learning objectives of this course are to develop 3 types of
capabilities [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]:
1. Conceptual modeling: The ability to sketch a model diagram correctly according to
a certain notation.
2. Requirements analysis: The ability to create the diagram so as to satisfy the
requirements represented as sentences.
3. Appropriate abstraction: The ability to avoid defining unnecessary or inadequate
classes and attributes for a target domain.
        </p>
        <p>Capability 1 is concerned with the ability to form concepts for designing visual
models. If learners lack this ability, they cannot read the given models correctly
based on requirements or cannot appropriately detect the differences between the
given model and the requirements. Capability 2 is related to the ability to capture the
essentials of software requirements. If learners lack this ability, they cannot create
suitable models for the requirements. Capability 3 is the same as the ability to
abstract fundamental features and/or significant entities from an object or service. If
learners lack this ability, they cannot control the abstraction level in model reading
and creation.
2.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Simple Class Diagram</title>
        <p>We use a class diagram which is a simplified standard class diagram defined using
UML2.x. Our simple class diagram has the minimum essential elements for
conceptual modeling. For each class, a name and some attributes are listed, while no
attribute types, method names, arguments, return types or visibilities are used. For each
association, two names and two multiplicities with four types (0..1, 1, 1..*, *) are used,
while no role, inheritance, aggregation, composition or dependency are used. The
only association used in this diagram is a simple association between two peer classes.
This association represents a pure structural relationship between two peers. Both
classes are conceptually at the same level, neither being more important than the other.</p>
        <p>In general modeling using object oriented methodology, classes in different
levels are used in one diagram. However, novice learners tend not to control abstraction
level appropriately. They often assign a system name to a class name or the name of
a concrete value to an attribute name. Therefore, we only used a subset of the
notation of class diagrams from the original UML2.x notation.
2.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Subjects</title>
        <p>Our subjects were 174 university students who were novices at conceptual
modeling. They were divided into two groups based on their computer science knowledge.
The members of the 11T group were 86 sophomores. They already had some
computer science knowledge. Our experiment was held in their second semester in
second year during one of their elective courses. On the other hand, the subjects in the
12T group were 88 freshmen. They had not taken any CS related courses. Our
experiment for this group was held in their first semester in first year during one of their
required courses.</p>
        <p>All subjects were required to answer the questions individually. They were not
allowed to discuss the questions with each other or to solve the problems in groups.
2.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Experimental Procedure</title>
        <p>When humans acquire a new notation or concept, the typical first step is to read or
observe some appropriate samples. By doing this, some features of the notation or
concept can be captured. In the next step, the given notation or concept can be used
to draw or describe some product. Our experiment expands on this method by using
three tests: a model reading test, a model creation test and a model modification test.</p>
        <p>Before these tests, the instructor asked his students the names of the essential
elements in a simple class diagram to confirm their level of understanding. Two
instructors were engaged in the course management. They planned the learning
contents of this course and gave our subjects lectures. Then, we analyzed the subjects’
answers and discussed the results.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Results</title>
      <sec id="sec-4-1">
        <title>Model Reading Test</title>
        <p>The goal of this test is to check the conceptual modeling capability of students. In
this test, students point out the differences between a given diagram and the problem
(P) statements. This test includes four problems. Each problem is related to classes,
attributes, associations and multiplicities. Among the choices, some statements are
not true for the given class diagram.</p>
        <p>correct incorrect correct incorrect
P4 69.8% 30.2% P4 72.7% 27.3%
P3
P2
P1
0%
32.6%
67.4%
46.6%
53.4%
74.4%</p>
        <p>25.6%
91.9%
difference. So, the level of understanding about “attribute” is much lower than for
other elements (class, association, multiplicity) for both groups.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Model Creation Test</title>
        <p>The goal of this test is to check the requirement analysis capability and the
appropriate abstraction capability. The 12T group (38.6%) has a higher score than the 11T
group (14.0%). However, the average total scores and variances of these two groups
show a significant difference. At first, we categorized the answers which had some
errors based on the three error types from previous research. As a result, we found a
new error type: class related error. In total, we extracted four types of errors:
syntactic errors, attribute related errors, association related errors and class related errors.
Figure 2 shows the percentage of the four error types that occurred in the model
creation test. For the 11T students, the number of incorrect answers was 74. For the 12T
students, the number of incorrect answers was 54.</p>
        <p>100%
90% 97.3% 96.3% 11T 12T
80%
70%
60%
50%
40%
3200%% 33.3%
10% 6.8% 25.7%
0%
5.4% 11.1%
Syntactic
errors</p>
        <p>Attribute
related
errors</p>
        <p>Association
related
errors</p>
        <p>5.6%
Class
related
errors</p>
        <p>In this experiment, we found that attribute related errors are the most common
type of error made. In both groups, over 95% of the incorrect answers had this type
of error. For the 11T group, which has programming knowledge, the percentage of
class related errors is relatively higher. On the other hand, 12T group, which has no
programming knowledge, shows a higher percentage of association related errors.</p>
        <p>We analyzed these three in four types of errors in more detail.</p>
        <p>The class related error has two detailed subcategories:
(a) There are some classes which have different abstraction levels in one diagram.
(b) There are more than two classes whose names or attributes have the same
meaning in one diagram.</p>
        <p>The attribute related errors had six detailed error categories:
(a) A class does not have any attributes (No attribute). This error also is included
the syntactic error.
(b) Two or more classes have the same set of attributes (Same attribute). “Same”
means that each attribute has the same range of values.
(c) An attribute is defined as not “name” but “value” (Value attribute).
(d) Attributes which are actions or methods are listed (Behavioral attribute).
(e) The meaning of both an attribute and the multiplicity of an association is
overlapped (Overlapped property).
(f) Duplicated attributes are used (Duplicated attribute).</p>
        <p>The association related error type includes some class diagrams which have no
association name or multiplicity and have inadequate association name or multiplicity.
This type has four detailed error categories.</p>
        <p>(a) There are no association names. This error also is included the syntactic error.
(b) Inadequate association name is given.
(c) There are not two multiplicities for one association. This error also is included
the syntactic error.
(d) Inadequate multiplicity is given.</p>
        <p>In the 6 subcategories of the attribute related error, the “Value attribute” errors
and the “Same attribute” errors have relatively high occurrence percentages for both
groups. For the 11T, which has programming knowledge, the percentage of the “No
attribute” errors and the “Behavioral attribute” errors is about 20%. The “Duplicated
attribute” errors occurred only in the 12T, which has no programming knowledge.
3.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Model Modification Test</title>
        <p>Overview. The goal of this test is to check the ability of conceptual modeling,
requirement analysis and appropriate abstraction. In all five problems, students need to
point out the mistakes in each class diagram and describe why they are incorrect.
Then, they are asked to modify the class diagram to correct the mistakes. P1 has
association related errors, which are inadequate multiplicity and duplicate association
names. P2 has an attribute related error, where the attribute name is defined as a
value instead of a property. P3 has an association related error, which is inadequate
multiplicity. P4 has an association related error, whi ch is the lack of association
names. P5 has a syntactic error, which is redundant multiplicity.</p>
        <p>Results. Figure 4 shows the percentage of questions answered correctly and
incorrectly in the model modification test for the two groups. The trend of the percentage
of questions answered correctly is the same for both groups. The highest percentage
of correctly corrected errors was for the syntactic error (P1) and the association
related error (P4). The lowest percentage was for the association related error (P3). Their
level of understanding decreases as follows: P1 &gt; P4 &gt; P2 &gt; P5 &gt; P3. Both P1 and P4
are lacking necessary elements in the diagram. P2, P5 and P3 have inadequate
elements in the given diagrams. This means that the “inadequate description” error is
more difficult to modify than the “lack of necessary element” error. The average total
scores and variances of these two groups are statistically the same. Only the P3
scores of these two groups show a significant difference.</p>
        <p>correct incorrect correct incorrect
P5 80.2% 19.8% P5 84.0% 16.0%
76.6%
91.5%
88.3%
94.7%</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <sec id="sec-5-1">
        <title>Question 1: What kinds of criteria are suitable for novice learners when they create conceptual models with simple class diagrams?</title>
        <p>We propose 12 criteria, which are divided into 4 types, for evaluating simple
class diagrams made by novices for conceptual modeling based on the results we
mentioned above. Table 1 shows the proposed criteria. The frequency of occurrence
is different for each item. However, by using these items we can check the level of
understanding for conceptual modeling of novice learners. Therefore, conceptual
modeling instructors can develop their course for novices with these criteria.
4.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Question 2: Are there any differences between the programming-known group and the not-known group in terms of their level of understanding of conceptual modeling?</title>
        <p>In the model reading test, there are no significant differences between the 11T
group and the 12T group in terms of the percentage of questions answered correctly.
Especially, the trend of the percentage of problems answered correctly is statistically
the same for both groups. The average of the percentage of questions answered
correctly by the 12T group is higher than the average of the 11T group. However, there
are no statistically significant differences between any scores.</p>
        <p>In the model creation test, there is a statistically significant difference between
scores of the 11T group and the 12T group.</p>
        <p>(a) The percentage of problems answered correctly by the 12T group which has no
programming knowledge is higher than the 11T group which has programming
knowledge.
(b) About the percentage of the four error types that occurred in this test, for the
11T group the percentage of class related errors is high. On the other hand, the
12T group shows a high percentage of association related errors.
(c) About the percentage of attribute related error types in this test, for the 11T
group, the percentage of no attribute errors and behavioral attribute errors is
about 20%. Duplicated attribute errors occurred only in the 12T group.</p>
        <p>About (a), though these two groups were given the same contents and the same
length of lectures about conceptual modeling, the 12T group which has no
programming knowledge showed a higher score in creating models based on the given
requirements. About (b), whereas the association related errors are relatively superficial
mistakes, the class related errors are quite essential mistakes in conceptual modeling
using class diagrams. These types of errors are concerned with abstraction level
control. This fact means that programming knowledge has no effect on the ability to
control abstraction levels. About (c), the behavioral attribute errors occurred only in
the 11T group. We think this fact is caused by structured programming knowledge
which includes functions. If they draw a class diagram with methods, students in the
11T group would get a higher score on this test. Therefore, it is better to teach
conceptual modeling with this notation before programming. However, the total trend of
our 13 criteria seems to be the same for both groups.</p>
        <p>The results of the model modification test are the same as the model reading test.
There are no significant differences between the 11T group and the 12T group in
terms of the percentage of questions answered correctly. Especially, the trend of the
percentage of problems answered correctly is statistically the same for both groups.
The average of the percentage of questions answered correctly by the 12T group is
higher than the average of the 11T group. However there are no statistically
significant differences between any scores. Overall, based on our experiments,
programming knowledge seems to not directly affect conceptual modeling ability. If so,
conceptual modeling education in this notation for university freshmen is reasonable. In
this case, the instructors should consider our 12 criteria listed above.
5</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Our research questions are “What kind of criteria are suitable for novice learners
when they create conceptual models?” and “Are there any differences between the
scores of novice learners with and without programming knowledge?”</p>
      <p>In this paper, we propose criteria for evaluating conceptual modeling errors made
by novices based on the results of three experiments. We found that there is no
relation between programming knowledge and conceptual modeling ability for the
notation used in our experiments. We used real world objects in our models, not abstract
objects in this study. Moreover, we asked students to solve each problem individually,
without discussion with other students.</p>
      <p>The effects of these matters for the proposed conclusions need to be considered
in future work. Also, we need to discuss the relation between diagram notation and
education timing more carefully.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgement References</title>
      <p>This work was supported by JSPS KAKENHI Grant Number 22300286 &amp; 16H03074.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>J.</given-names>
            <surname>Bezivin</surname>
          </string-name>
          , et al, “Teaching Modeling: Why, When, What?”,
          <string-name>
            <surname>MODELS</surname>
          </string-name>
          <year>2009</year>
          ,
          <volume>55</volume>
          -
          <fpage>62</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Börstler</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Michiels, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Fjuk</surname>
          </string-name>
          , “ECOOP 2004 Workshop Report: Eighth Workshop on Pedagogies and
          <article-title>Tools for the Teaching and Learning Object-Oriented Concepts”</article-title>
          ,
          <source>ECOOP 2004 Workshop Reader</source>
          ,
          <fpage>36</fpage>
          -
          <lpage>48</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Kramer</surname>
          </string-name>
          ,J. “Is Abstraction The Key To Computing?”, CACM,
          <volume>50</volume>
          (
          <issue>4</issue>
          ),
          <fpage>37</fpage>
          -
          <lpage>42</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>N.</given-names>
            <surname>Bolloju</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.S.K.</given-names>
            <surname>Leung</surname>
          </string-name>
          , “
          <article-title>Assisting Novice Analysts in Developing Quality Conceptual Models with UML”</article-title>
          ,
          <source>CACM</source>
          ,
          <volume>49</volume>
          (
          <issue>7</issue>
          ),
          <fpage>108</fpage>
          -
          <lpage>112</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>H.C.</given-names>
            <surname>Cham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.H.</given-names>
            <surname>Teo</surname>
          </string-name>
          and
          <string-name>
            <given-names>X.H.</given-names>
            <surname>Zeng</surname>
          </string-name>
          , “
          <article-title>An evaluation of Novice End-User Computing Performance : Data Modeling, Query Writing</article-title>
          and Comprehension”,
          <source>J. of the American Society for Information Science and Technology</source>
          ,
          <volume>56</volume>
          (
          <issue>8</issue>
          ),
          <fpage>843</fpage>
          -
          <lpage>853</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>J.</given-names>
            <surname>Niere</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Schulte</surname>
          </string-name>
          , “
          <article-title>Avoiding anecdotal evidence : An experience report about evaluating an object-oriented modeling course”</article-title>
          ,
          <source>MODELS 2005 Educator's Symposium</source>
          ,
          <volume>63</volume>
          -
          <fpage>70</fpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>C.</given-names>
            <surname>Schulte</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Niere</surname>
          </string-name>
          , “
          <article-title>Thinking in Object Structures: Teaching Modeling in Secondary Schools”</article-title>
          ,
          <source>6th ECOOP Workshop on PTLOOC</source>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>S.</given-names>
            <surname>Akayama</surname>
          </string-name>
          et al, “
          <article-title>Development of a Modeling Education Program for Novices using Model-Driven Development”</article-title>
          , Workshop on Embedded and
          <string-name>
            <surname>Cyber-Physical Systems Education</surname>
          </string-name>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hara</surname>
          </string-name>
          . et al, “
          <article-title>A Basic Study of an Educational Environment for Modeling and Abstraction”</article-title>
          ,
          <source>ESS</source>
          <year>2011</year>
          , pp.
          <fpage>16</fpage>
          -
          <lpage>1</lpage>
          -16-
          <issue>5</issue>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. M.kayama et al, “
          <article-title>A Practical Conceptual Modeling Teaching Method Based on Quantitative Error Analyses for Novices Learning to Create Error-Free Simple Class Diagrams”</article-title>
          ,
          <source>3rd ICAAI</source>
          ,
          <fpage>616</fpage>
          -
          <lpage>622</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>S.J.</given-names>
            <surname>Merror</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.C.</given-names>
            <surname>Balcer</surname>
          </string-name>
          , “
          <string-name>
            <surname>Executable</surname>
            <given-names>UML</given-names>
          </string-name>
          :
          <article-title>A Foundation for Model Driven Architecture”</article-title>
          ,
          <string-name>
            <surname>Addison-Wesley Professional</surname>
          </string-name>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>