<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ORCID:</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Towards a Valid and Reliable Checklist Argumentative Essays Composed by ChatGPT</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Danijela Ljubojević</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Djordje M. Kadijevich</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nikoleta Gutvajn</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Educational Research</institution>
          ,
          <addr-line>11/III Dobrinjska, 11000 Belgrade</addr-line>
          ,
          <country>Republic of Serbia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Since its launch a year ago, ChatGPT has sparked many concerns in education, especially when it comes to writing. Many students enjoy the benefits of getting generated text for their homework assignments; however, this behaviour impacts profoundly the writing process and the development of critical thinking skills. Among these assignments that are particularly important to critical skills development are so-called argumentative essays, which require the student to investigate a topic, collect, generate, and evaluate evidence, and establish a position on the topic in a concise manner. To assess these essays in a thoughtful way, this paper presents a checklist whose indicators focus on main aspects of essay organisation and higher-order critical thinking skills. The checklist was developed for both machine and human responses by using relevant theoretical framework (the Classical model of Argumentation and Paul-Elder critical thinking framework), the five-paragraph approach, and Cambridge English Qualifications scales at level C1 of the CEFR. As this assessment tool was applied in evaluating ChatGPT-composed argumentative essays, apart from the validity of the tool, this paper also presents its inter-rater reliability. Suggestions for research and practice are included.</p>
      </abstract>
      <kwd-group>
        <kwd>1 ELT</kwd>
        <kwd>argumentative essay</kwd>
        <kwd>writing</kwd>
        <kwd>ChatGPT</kwd>
        <kwd>checklist</kwd>
        <kwd>critical thinking</kwd>
        <kwd>reliability</kwd>
        <kwd>validity</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Artificial intelligence (AI) has become an integral part of learning and teaching in many fields and
English language teaching (ELT) is not an exception. Recent studies have recognised the importance
of using chatbots, such as ChatGPT, in ELT [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] [2]. ChatGPT generates human-like text based on the
input it receives and up to now users have been very satisfied with it. However, essay writing in
English is about not only generating written content but also demonstrating good essay organization
and paragraph structuring.
      </p>
      <p>This pilot study set out to analyse argumentative essays generated through ChatGPT and,
consequently, to come up with a reliable and valid instrument to assess students writing tasks in
English. Furthermore, it aims to determine what can ChatGPT generate in terms of developing critical
thinking skills. To assess the extent to which writing proficiency has been achieved, an appropriate
checklist with good validity and reliability needs to be applied. Hence, this study examined validity
and reliability of the developed checklist.</p>
      <p>The findings of this research hold implications for teaching writing and the integration of AI in
English language classrooms. A positive answer to the question of validity and reliability could
contribute to the improvement concerning the missing instrument that assesses the promotion of
critical thinking skills and the use of AI-generated content.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Theoretical Framework</title>
      <p>Argumentative essays are types of essays which require the student to investigate a topic, collect,
generate, and evaluate evidence, and establish a position on the topic in a concise manner. They are
very important for developing critical thinking skills. Providing sufficient and sound arguments in the
argumentative essays is essential to their success. It is not enough just to support the idea with enough
details and examples; there are some more aspect that should be covered.</p>
      <p>Developing good argumentation is vital to argumentative essays. There are different models of
argumentation that be used: the Classical Model of Argumentation, the Toulmin Model, Rogerian
Argumentation Model, etc. One of the most applied models in ELT is the Classical. It is also called
Aristotelian because it was first mentioned in Aristotle’s work Rhetoric. Aristotle’s central idea is that
persuasion comes about through arguments, i.e. by proving that something is the case. The classical
argument is made up of five components, which are commonly composed in the following order [3]:
introduction, narration, confirmation, refutation, and conclusion. When using this model, the writer
should start with a clear, concise, and defined thesis statement that occurs in the first paragraph of the
essay. Each paragraph should develop only one idea (paragraph unity) which must be supported by
sufficient supporting details. What is important for this model is the use of “opposing” point of view:
argumentative essays should also consider and explain differing points of view regarding the topic
and discuss conflicting opinions on the topic. It is also important to use clear and logical transitions
between the introduction, body, and conclusion, because without logical progression of thought, the
reader will be unable to follow the essay’s argument, and the structure will collapse.</p>
      <p>A common outline for writing an argumentative essay is the five-paragraph approach (also known
as the “hamburger essay,” the “one-three-one essay,” and the “three-tier essay.”). It consists of an
introductory paragraph, three body paragraphs with evidence that include discussion of opposing
views, and a conclusion.</p>
      <p>Writing argumentative essays is crucial for the development of critical thinking (CT) skills with
students. CT refers to the ability to analyse and evaluate arguments or evidence. The National Council
for Excellence in Critical Thinking defined it as ‘the intellectually disciplined process of actively and
skillfully conceptualizing, applying, analyzing, synthesizing, and/or evaluating information gathered
from, or generated by, observation, experience, reflection, reasoning, or communication, as a guide to
belief and action” [4]. However, it is a skill that cannot be measured directly; instead, intellectual
standards are used to determine the quality of reasoning [5]. One of the critical thinking models that
can be adopted in improving argumentation skills is the Paul-Elder critical thinking framework [6]
[7]. The intellectual standards proposed by the framework are clarity, precision, accuracy, depth,
breadth, logic, significance, relevance, and fairness and they are used for the checklist in this study.</p>
      <p>When designing a checklist, the items addressed all five Aristotelian components and Paul-Edler’s
intellectual standards.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>The aim of this study was to determine if a reliable and valid checklist can be applied in assessing
ChatGPT generated essays. It set out to explore inter-rater agreement between the two reviewers who
applied the checklist developed. Furthermore, it explored if ChatGPT offers some possibilities for
language teaching and learning.</p>
      <p>In order to do this a comprehensive checklist for assessing essay structure and intellectual
standards was designed. Two independent secondary school teachers graded the essays using the
proposed checklist. The reviewers were not aware of the fact that they were assessing
computergenerated essays.</p>
      <p>The following instructions were given as prompts to ChatGPT:</p>
      <p>Higher education increases the chances of employment. Agree or disagree with this statement.
Support your opinion with reasons and examples. Write an essay in around 240 - 280 words.</p>
      <p>The first researcher in this study generated the essays. The essays were then sent to the reviewers
for assessment.</p>
      <p>Essay organization was assessed using 14-item instrument, whose indicators were derived from the
five-paragraph approach, the Classical model, and Cambridge English Qualifications scales at level
C1 of the CEFR. The checklist was meticulously designed based on the previous research by the first
author [8]. Each item gets different marks. These indicators are listed in Table 1. Ranging from 0
(lowest) to 5 (highest) and precise instructions were given for the band in the right column.
Table 1</p>
      <sec id="sec-3-1">
        <title>Checklist for Essay Structure</title>
        <sec id="sec-3-1-1">
          <title>Indicator Items for assessment</title>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Does the essay have an introduction, a body, and a conclusion?</title>
      </sec>
      <sec id="sec-3-3">
        <title>Is the response of appropriate length?</title>
        <p>Max. no.
of points
3
1</p>
        <sec id="sec-3-3-1">
          <title>Introduction</title>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>Do the general statements give background information?</title>
      </sec>
      <sec id="sec-3-5">
        <title>Is it a funnel introduction?</title>
      </sec>
      <sec id="sec-3-6">
        <title>Does the thesis statement state a clearly focused main idea for the whole essay?</title>
        <sec id="sec-3-6-1">
          <title>Body</title>
        </sec>
      </sec>
      <sec id="sec-3-7">
        <title>Are there arguments expressing the writer's point of view?</title>
      </sec>
      <sec id="sec-3-8">
        <title>Are there arguments expressing the opposing point of view?</title>
      </sec>
      <sec id="sec-3-9">
        <title>Does each body paragraph have a clearly stated topic sentence with a main (controlling) idea?</title>
      </sec>
      <sec id="sec-3-10">
        <title>Does each body paragraph have good development with sufficient supporting details (facts, examples, and quotations)?</title>
      </sec>
      <sec id="sec-3-11">
        <title>Does each body paragraph have unity (one idea per paragraph, there are no sentences that are "off the topic")?</title>
      </sec>
      <sec id="sec-3-12">
        <title>Does each body paragraph have coherence (logical organization, transition words, and consistent pronouns)?</title>
        <sec id="sec-3-12-1">
          <title>Conclusion</title>
        </sec>
      </sec>
      <sec id="sec-3-13">
        <title>Does the conclusion restate your thesis or summarize your main points? What kind of conclusion does the essay have? Is it summary of the main points or restatement of the thesis?</title>
        <p>Does the conclusion give writer’s personal opinion about the topic?</p>
        <sec id="sec-3-13-1">
          <title>Language (choose only one)</title>
        </sec>
      </sec>
      <sec id="sec-3-14">
        <title>Uses a (wide) range of vocabulary, including less common lexis, effectively and precisely.</title>
      </sec>
      <sec id="sec-3-15">
        <title>Uses a wide range of simple and complex grammatical forms with full control, flexibility and sophistication.</title>
      </sec>
      <sec id="sec-3-16">
        <title>Errors, if present, are related to less common words and structures, or occur as slips.</title>
        <p>Performance shares features of Bands 3 and 5.</p>
      </sec>
      <sec id="sec-3-17">
        <title>Uses a range of vocabulary, including less common lexis, appropriately.</title>
      </sec>
      <sec id="sec-3-18">
        <title>Uses a range of simple and complex grammatical forms with control and flexibility.</title>
      </sec>
      <sec id="sec-3-19">
        <title>Occasional errors may be present but do not impede communication.</title>
        <p>Performance shares features of Bands 1 and 3.</p>
      </sec>
      <sec id="sec-3-20">
        <title>Uses a range of everyday vocabulary appropriately, with occasional inappropriate use of less common lexis. Uses a range of simple and some complex grammatical forms with a good</title>
        <p>1
1
1
2
2
3
3
3
3
1
1
5
5
4
3
2
1
degree of control.</p>
      </sec>
      <sec id="sec-3-21">
        <title>Errors do not impede communication.</title>
        <p>The attainment of critical thinking skills was examined using a 9-item instrument (Table 2), whose
indicators were derived from the above-mentioned Paul-Elder CT Model [9] with clarification as
given by Inoshita et al. [10]. These indicators are listed in Table 2. Each item gets marks from 0
(lowest) to 3 (highest).</p>
        <p>Table 2</p>
      </sec>
      <sec id="sec-3-22">
        <title>Checklist for Intellectual Standards</title>
        <p>Intellectual Standards
(Paul – Elder Model) [7]
Indicator 15 Clarity
Could you elaborate?
Could you illustrate what you mean?
Could you give me an example?
Indicator 16 Accuracy
How could we check on that?
How could we find out if that is true?
How could we verify or test that?
Indicator 17 Precision
Could you be more specific?
Could you give me more details?
Could you be more exact?
Indicator 18 Relevance
How does that relate to the problem?
How does that bear on the question?
How does that help us with the issue?
Indicator 19 Depth
What factors make this difficult?
What are some of the complexities of
this question?
What are some of the difficulties we
need to deal with?</p>
        <p>Clarification [10]
An essay is clear, it’s understandable and communicates
information to readers with ease. None of the statements are
confusing or ambiguous. There aren’t areas within the essay
where the meaning is lost due to exaggerated narrative or
forced and unnatural word choice. When an essay is clear,
readers can follow the path that the writer is communicating.</p>
        <p>They can read smoothly without stopping to ponder what a
word or even an entire sentence means.</p>
        <p>Is it correct? Is it true? Accuracy, not only when it comes to
spelling, punctuation, and word usage, but also grammar,
syntax, and conducting research within and outside of the
respective disciplines.</p>
        <p>Precision within writing demands that words are not only
spelled correctly but that their meanings are also clear and that
the words are not overused. Punctuation needs to be used in a
manner that follows standard rules, and ideas must be
expressed in ways that are direct while still allowing for the
writer to perform with skill and artistry.</p>
        <p>Is it essential to the main idea? If paragraphs in an essay are
relevant, they are related to the main topic and help support
the main idea with additional, related, relevant details and
evidence. If paragraphs are irrelevant, a reader might think,
“Wait. What? How is this on topic?”
Does this point help readers understand the main issue?
Does this essay focus on the assignment question or prompt?
Does it answer the main question?
If this paragraph is slightly off-topic, what can be done to
refocus it so that it does its job in supporting the main idea in
the thesis statement?
If a point is confusing readers who don’t understand how it’s
related to the main idea, does it belong in this essay?
Is it sufficiently complex? How deeply does this essay go into its
topic?
Is it detailed enough?
Did it go far enough into the research and reviews of other
texts to demonstrate a deep knowledge about the subject?
How thoroughly have specific subtopics within a major been
researched?
Indicator 20 Breadth
Do we need to look at this from
another perspective?
Do we need to consider another point
of view?
Do we need to look at this in other
ways?
Indicator 21 Logic
Does all of this make sense together?
Does your first paragraph fit in with
your last one?
Does what you say follow from the
evidence?
Indicator 22 Significance
Is this the most important problem to
consider?
Is this the central idea to focus on?
Which of these facts are most
important?
Indicator 23 Fairness
Is my thinking justifiable in context?
Am I taking into account the thinking of
others?
Is my purpose fair given the situation?
Am I using my concepts in keeping with
educated usage, or am I distorting
them to get what I want?</p>
        <p>Are all views considered? - a writer must consider not only one
point of view, but all the multiple major perspectives about an
issue.</p>
        <p>Is the content of an essay sufficiently comprehensive enough to
cover a wide range of perspectives and angles on a given topic?
Is anything missing that should be included in the scope of the
topic and which would help the essay achieve enough breadth?
Has the opposing view (i.e., the “naysayer’s” perspective) been
explored so as to strengthen the writer’s own argument? (This
consideration is particularly key in the development of a fully
supported and wisely composed persuasive or argumentative
essay.)
What has not yet been considered to make this idea or essay
complete?
Does a sentence, paragraph, or argument make sense?
Does one point follow another point with reason and
connected ideas and transitions, rather than jumping from
point A to point Z without sufficient explanation of how they
are related?
Are all assertions fortified by sufficient evidence?
Does all data collected, whether determined as relevant or not,
follow a logical approach?
Is the information important enough to include?
Does the information answer the question asked in an
assignment?
What key points are most important for writers to include and
for readers to consider?
Is this topic important in relation to other topics in the same
subject area?
What’s the most important thing to focus on?
Is this topic worth the writer’s time researching and the
reader’s time reading?
Would there be enough readers interested in this topic?
Does the writer of this essay exhibit the ability to fairly assess
the viewpoints of others, even opposing viewpoints?
Are there any fallacies, such as ad hominems that unfairly label
opponents rather than speak directly and precisely about the
opposing argument or point of view itself?
Does the writer or speaker have a conflict of interest?</p>
        <p>Does that conflict of interest appear as bias in the text?</p>
        <p>For each essay (there were 23 instances), the average Spearman correlations among the reviewers’
score was calculated. These values were 0.885, 0.733, 0.754, and 0.918, respectively. As being above
0.60, these figures evidence a good agreement among the four raters, especially for the first and the
last essay. Note that in many cases different measures of inter-rater reliability result in the same
conclusion [11], meaning that a standard reliability threshold (&gt;0.60) used for the kappa measure [12]
could be applied.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>When the prompts were given to ChatGPT, it generated the following two essays:
Essay 1</p>
      <p>In today's rapidly evolving job market, the role of higher education in increasing employment prospects is a
topic of considerable debate. While some argue that a college degree significantly enhances one's chances of
securing a job, others contend that it is not always a guarantee of employment. In this essay, I will argue that
higher education does indeed increase the chances of employment, but its effectiveness depends on various
factors.</p>
      <p>To begin with, higher education equips individuals with valuable skills and knowledge that are often
required in the modern workforce. Many professions, such as engineering, medicine, and law, demand
specialized training and qualifications that can only be acquired through higher education. Without these
credentials, individuals may find it challenging to access certain career paths or secure well-paying jobs.</p>
      <p>Moreover, a college degree demonstrates to employers a candidate's commitment, discipline, and ability to
complete a long-term educational program. It serves as a signal of a person's willingness to invest time and
effort in self-improvement, qualities highly valued in the job market.</p>
      <p>However, it's essential to acknowledge that the relationship between higher education and employment is not
always straightforward. Factors such as the choice of major, the prestige of the institution, and the individual's
networking skills also play significant roles. For example, a degree in computer science from a top-tier
university may open doors to lucrative job opportunities, while a less specialized degree from a less-known
institution might not yield the same results.</p>
      <p>In conclusion, higher education undeniably enhances the chances of employment by providing essential
skills, demonstrating commitment, and opening doors to specialized professions. However, the extent of its
impact varies depending on the field of study, the institution, and the individual's networking abilities.
Therefore, while higher education is a valuable asset in the pursuit of employment, it should be considered as
part of a broader strategy that includes factors beyond academic qualifications.</p>
      <p>Word count: 312
Essay 2</p>
      <p>I strongly agree that higher education increases the chances of employment. This belief is grounded in the
fact that higher education equips individuals with valuable skills, knowledge, and qualifications that are highly
sought after by employers.</p>
      <p>Firstly, higher education provides students with specialized skills and expertise that are often essential for
many professions. For example, a degree in engineering equips graduates with the technical knowledge required
for jobs in the engineering field. Similarly, a medical degree is necessary to become a doctor. Employers often
prioritize candidates with such specialized knowledge and skills, making higher education a key factor in
securing employment.</p>
      <p>Secondly, higher education institutions offer students opportunities for personal and professional growth.
Students learn critical thinking, problem-solving, communication, and teamwork skills that are transferable
across various industries. These skills not only make individuals more employable but also help them succeed
and advance in their chosen careers.</p>
      <p>Furthermore, statistics consistently show a strong correlation between higher education attainment and
employment rates. According to data from the U.S. Bureau of Labor Statistics, individuals with higher levels of
education tend to have lower unemployment rates and higher earning potential compared to those with only a
high school diploma.</p>
      <p>For instance, in the United States, the unemployment rate for individuals with a bachelor's degree is
significantly lower than for those with only a high school diploma. This demonstrates that higher education not
only increases the chances of finding employment but also provides better job security.</p>
      <p>In conclusion, higher education is a significant factor in increasing the chances of employment. It equips
individuals with specialized skills, fosters personal and professional growth, and is strongly correlated with
lower unemployment rates and higher earning potential. While there are exceptions, overall, pursuing higher
education is a wise investment in one's future career prospects.</p>
      <p>Word count: 296 words</p>
      <p>Reliability of this checklist was examined using inter-rater reliability based upon Spearman's
correlation. This correlation suitable for ordinal data was determined using an online calculator
available at https://www.socscistatistics.com/tests/spearman/default2.aspx). For the first essay, this
correlation was 0.90. For the second essay, the correlation was 0.78. These figures evidence a good
agreement between the two raters, especially for the first essay. Note that in many cases different
measures of inter-rater reliability result in the same conclusion [11], meaning that a standard
reliability threshold (&gt;0.60) used for the kappa measure [12] could be applied.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>The present study was designed to determine if a reliable and valid checklist can be applied in
assessing ChatGPT generated essays, to explore inter-rater agreement between the two reviewers who
applied the checklist developed and explore the potential of using ChatGPT generated essays in the
classroom.</p>
      <p>It was shown that ChatGPT can produce argumentative essays that are given high marks in almost
every aspect regarding the requirements set in the checklist. The results for reliability of the given
instrument clearly show that the applied checklist had good psychometric features, which answers the
applied research question in a positive way. It can be thus said that this checklist successfully
measures one underlying construct and thus it can confidently be used in further research. Hence, the
outcome of this study contributes to developing an instrument that assesses the promotion of critical
thinking with the use of ChatGPT, which has been a neglected research area so far, to the authors’
readings.</p>
      <p>The results for reliability of the given instrument were affirmative, meaning that it enables a
consistent, reliable assessment. Hence, the outcome of this study contributes to developing a valid and
reliable instrument that assesses the promotion of critical thinking in a broader context (both for a
human and machine generated responses), which has been a neglected research area so far, to the
authors’ readings.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Closing Remarks</title>
      <p>This study set out to determine if a reliable and valid instrument can be used to assess
argumentative essays in terms of essay organization and the elements of critical thinking. These
essays were ChatGPT generated. The checklist was designed for this purpose comprising the
requirements stemming from: the five-paragraph approach, the Classical model of argumentation, and
Paul-Elder critical thinking framework. The findings have shown that the checklist is both reliable
and valid so it can be used within ELT classrooms. Implications for ELT are numerous: teachers can
use this checklist not only to evaluate students’ argumentative essays but also to benefit from it by
analysing together with students ChatGPT generated essays and thus, focus on the promotion of
critical thinking skills.</p>
      <p>Limitations of this study can be regarded in terms of the sample of essays used and the number of
teachers who participated as reviewers. This limitation means that study findings need to be
interpreted cautiously.</p>
      <p>Up to now, no studies were undertaken longitudinally because ChatGPT is a new technology.
Further research should be undertaken to find good learning models with the help of ChatGPT and
how to implement them within educational settings.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Acknowledgements</title>
      <p>The authors of this paper would like to thank the following teachers for their participation in the
research as reviewers: Nana Shavishvili, Professional English Instructor, Georgian American
University, Georgia; and Biljana Pipović, English Language Teacher, Grammar School “Stevan
Jakovljević” Vlasotince, Serbia.</p>
      <p>This research was funded by the Ministry of Science, Technological Development and Innovation
of the Republic of Serbia (Contract No. 451-03-47/2023-01/ 200018).
8. References</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W. C. H.</given-names>
            <surname>Hong</surname>
          </string-name>
          , “
          <article-title>The impact of ChatGPT on foreign language teaching and learning: Opportunities in education and research</article-title>
          ,
          <source>” Journal of Educational Technology and Innovation(JETI)</source>
          , vol.
          <volume>5</volume>
          , no.
          <issue>1</issue>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>W.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. F.</given-names>
            <surname>Hew</surname>
          </string-name>
          and
          <string-name>
            <given-names>L. K.</given-names>
            <surname>Fryer</surname>
          </string-name>
          , “
          <article-title>Chatbots for language learning-Are they really useful? A systematic review of chatbot-supported language learning</article-title>
          ,
          <source>” Journal of Computer Assisted Learning</source>
          , vol.
          <volume>38</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>237</fpage>
          -
          <lpage>257</lpage>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Aristotel</surname>
          </string-name>
          , Retorika, Beograd: Štampar Makarije,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>National Council for Excellence in Critical Thinking, “The Foundation for Critical Thinking</article-title>
          ,”
          <year>1987</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>[Online]. Available: https://www.criticalthinking.org/pages/defining-critical-thinking/766.</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Z. S.</given-names>
            <surname>Nakrowi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Ansori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mulyati</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Setyaningsih</surname>
          </string-name>
          , “
          <article-title>The use of intellectual standards to assess the quality of students' argumentative writings</article-title>
          ,
          <source>” LITERA</source>
          , vol.
          <volume>22</volume>
          , no.
          <issue>2</issue>
          , pp.
          <fpage>200</fpage>
          -
          <lpage>212</lpage>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>L.</given-names>
            <surname>Elder</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Paul</surname>
          </string-name>
          ,
          <article-title>The Thinker's Guide to Analytic Thinking: How to Take Thinking Apart and What to Look for When You Do, 2nd Edition</article-title>
          , Tomales: Rowman &amp; Littlefield Publishers / The Foundation for Critical Thinking,
          <year>2016</year>
          , p.
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>E.</given-names>
            <surname>Linda</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Paul</surname>
          </string-name>
          , “Critical Thinking:
          <article-title>Intellectual Standards essential to Reasoning Well Within Every Domain of Thought,”</article-title>
          <source>Journal of Developmental Education</source>
          , vol.
          <volume>36</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>34</fpage>
          -
          <lpage>35</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>D.</given-names>
            <surname>Ljubojevic</surname>
          </string-name>
          ,
          <article-title>Developing academic writing skills in English as L2 by means of collaborative elearning tools</article-title>
          , University of Belgrade, Faculty of Philology,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>R.</given-names>
            <surname>Paul</surname>
          </string-name>
          and L. Elder, “Critical Thinking:
          <article-title>Intellectual Standards Essential to Reasoning Well Within Every Domain of Human Thought, Part Two</article-title>
          ,
          <source>” Journal of Developmental Education</source>
          , vol.
          <volume>37</volume>
          , no.
          <issue>1</issue>
          , p.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Inoshita</surname>
          </string-name>
          , Garland,
          <string-name>
            <given-names>K. K.</given-names>
            <surname>Sims</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Keuma</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. K.</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Williams</surname>
          </string-name>
          , English Composition - Connect, Collaborate, Communicate, Honolulu: University of Hawai, OER ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>A. de Raadt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Warrens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Bosker</surname>
            and
            <given-names>H. A. L.</given-names>
          </string-name>
          <string-name>
            <surname>Kiers</surname>
          </string-name>
          , “
          <article-title>A Comparison of Reliability Coefficients for Ordinal Rating Scales</article-title>
          ,
          <source>” Journal of Classification</source>
          , vol.
          <volume>38</volume>
          , p.
          <fpage>519</fpage>
          -
          <lpage>543</lpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>J.</given-names>
            <surname>Flo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Landmark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. E.</given-names>
            <surname>Hatlevik</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Fagerström</surname>
          </string-name>
          , “
          <article-title>Using a new interrater reliability method to test the modified Oulu Patient Classification instrument in home health care,” Nursing Open</article-title>
          ., vol.
          <volume>5</volume>
          , p.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>