<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Y. Tseng);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Contextualizing Problems to Student Interests at Scale in Intelligent Tutoring System Using Large Language Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gautam Yadav</string-name>
          <email>gyadav@andrew.cmu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ying-Jui Tseng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaolin Ni</string-name>
          <email>xiaolinn@andrew.cmu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Large Language Models, Mass Production, Student Interests, Intelligent Tutoring System</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Carnegie Mellon University</institution>
          ,
          <addr-line>5000 Forbes Ave Pittsburgh PA 15213</addr-line>
          ,
          <country country="US">United States</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Empowering Education with LLMs - the Next-Gen Interface and Content Generation</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>9</fpage>
      <lpage>0009</lpage>
      <abstract>
        <p>Contextualizing problems to align with student interests can significantly improve learning outcomes. However, this task often presents scalability challenges due to resource and time constraints. Recent advancements in Large Language Models (LLMs) like GPT-4 [1]ofer potential solutions to these issues. This study explores the ability of GPT-4 in the contextualization of problems within CTAT [2], an authoring tool for the intelligent tutoring system, aiming to increase student engagement and enhance learning outcomes. Through iterative prompt engineering, we achieved meaningful contextualization that preserved the dificulty and original intent of the problem, thereby not altering values or overcomplicating the questions. To evaluate the efectiveness of these newly generated questions, we conducted focus groups and interviews with instructional designers. The positive assessment from the instructional designers signifies that these questions are suitable for implementation, potentially saving significant time spent on manual contextualization. Despite these promising findings, we acknowledge current limitations of our research, particularly with geometry problems, and emphasize the need for ongoing evaluation and research. Future work includes systematic studies to measure the impact of this tool on students' learning outcomes and enhancements to handle a broader range of problems.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <sec id="sec-1-1">
        <title>Research has demonstrated that integrating problem contextualization with student interests</title>
        <p>
          can significantly enhance learning outcomes in algebra, resulting in increased proficiency in
problem-solving, improved accuracy, and the ability to transfer to future learning [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Teachers, who intimately comprehend their students’ interests, often find the task of con</title>
        <p>textualizing problems according to these interests challenging, since the scalability of such
task is often met with resource and time constraints. However, recent developments in Large</p>
      </sec>
      <sec id="sec-1-3">
        <title>Language Models (LLMs) may provide an opportunity to lessen the strains associated with the personalization of learning context for students.</title>
      </sec>
      <sec id="sec-1-4">
        <title>This research aims to explore the capability of LLMs in contextualizing problems to align with student interests at a large scale within CTAT [2], an authoring tool for the intelligent tutoring system. In this study, we perform experiments using one of the most advanced LLMs</title>
        <p>
          nEvelop-O
currently accessible, the GPT-4 [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], obtained via the OpenAI API. Our hypothesis suggests that
the application of LLMs for problem contextualization, based on student interests, could result
in increased student engagement and enhanced learning outcomes.
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Prior Work</title>
      <sec id="sec-2-1">
        <title>2.1. Context Personalization</title>
        <p>
          The works of Walkington [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] introduced the concept of contextualizing algebraic questions
based on students’ interests. This innovative methodology, featuring student-created
”algebra stories,” aimed to boost engagement, cultivate ownership, and enhance understanding of
algebraic principles. Interest has been identified as a pivotal factor in learning, impacting
attention, persistence, and motivation. Personalized learning that mirrors individual interests
has demonstrated a capacity to elicit positive emotional responses, enhance appreciation for
instructional content, and leverage existing knowledge. [
          <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
          ]
        </p>
        <p>The eficacy of context personalization was investigated using both qualitative and
quantitative research methods, demonstrating a positive association between these ’algebra stories’
and improved student engagement and performance. Despite possible implementation
obstacles due to the diversity of learners’ interests, the use of digital tools has been proposed as a
facilitative means for this personalization process. In its totality, contextual personalization has
the potential to enhance learning efectiveness and accuracy, decrease the practice required for
mastery, and foster transferable skills applicable to various scenarios.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Mass Production in Intelligent Tutoring Systems</title>
        <sec id="sec-2-2-1">
          <title>Mass Production in Intelligent Tutoring Systems (ITS) is a technique that enables authors to</title>
          <p>parameterize previously authored problem-specific content, which can then be instantiated to
suit a multitude of diferent problems. This technique essentially permits authors to manually
generalize Example-Tracing expert models (known as behavior graphs) to accommodate all
problems that share isomorphic solution structures.</p>
          <p>
            The Mass Production technique in Intelligent Tutoring Systems (ITS) is leveraged to generate
a multitude of diverse problems from a single, parameterized structure, enhancing mastery
learning by allowing repeated practice in various contexts[
            <xref ref-type="bibr" rid="ref6">6</xref>
            ] . In our work, we utilized this
principle within the CTAT platform, adjusting only the contextual ’cover stories’ to individualize
problems in alignment with student interests. This personalized approach, derived from mass
production, could potentially foster increased student engagement, improved problem-solving
skills, and enhanced subject matter understanding.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Instruction Generation with Large Language Models</title>
        <sec id="sec-2-3-1">
          <title>Previous research involving large language models has explored their application in educational</title>
          <p>
            settings, such as the use of models like GPT for generating questions or providing
hints/explanations to students [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ]. Empirical evaluations of these applications and their impact on student
outcomes suggested that students perform better on human-generated content compared to
these models [
            <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
            ].
          </p>
        </sec>
        <sec id="sec-2-3-2">
          <title>It is evident that large language models hold great potential in enhancing learning experiences, making them a promising tool for future educational endeavors. However, as our work proposes, a step further in personalized learning can be taken by leveraging these models in a more contextaware manner, which could further improve student engagement and outcomes.</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. System Design</title>
      <sec id="sec-3-1">
        <title>To test the potential of GPT-4 in problem contextualization, we developed a prototype tool</title>
        <p>integrated with CTAT, illustrated by a few problem examples. This paper primarily focuses
on outlining our approach and tool design. However, we recognize the necessity for rigorous
evaluation. Future work includes systematic studies such as randomized controlled trials to
comprehensively assess the tool’s eficacy and its impact on students’ learning outcomes.</p>
        <sec id="sec-3-1-1">
          <title>3.1. Iterative Prompt Engineering in GPT-4</title>
          <p>
            Our iterative prompt engineering was executed over four diferent problem sets of algebra
questions in TutorShop [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ], with the objective of achieving meaningful contextualization
aligned with diverse interests. A few-shot learning approach was used, drawing on examples
from a range of contexts and interests to foster coverage and generalization. Throughout this
process, we gradually refined and added rules based on testing until our output could aptly
accommodate novel interests such as TikTok and NBA, as demonstrated in Tables 1 and 2.
3.1.1. Prompt
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>We used the following prompt:</title>
        <p>• Your task is to change context based on interest for a problem, for example:
– Input Problem 1:</p>
      </sec>
      <sec id="sec-3-3">
        <title>Chaz and Nikki are standing in a long line to buy rock concert tickets. Nikki is 8 feet ahead of Chaz in the line. Let’s compare Chaz’s distance to Nikki’s distance from the front of the line.</title>
      </sec>
      <sec id="sec-3-4">
        <title>When Nikki is 20 feet from the front of the line, how far away is Chaz?</title>
      </sec>
      <sec id="sec-3-5">
        <title>When Nikki is 16 feet from the front of the line, how far away is Chaz?</title>
      </sec>
      <sec id="sec-3-6">
        <title>In the row labeled ”Expression”, define a variable for Nikki’s distance and use that variable to write an expression that will allow you to calculate Chaz’s distance.</title>
      </sec>
      <sec id="sec-3-7">
        <title>Output Problem 1 based on interest ”Video Games”:</title>
      </sec>
      <sec id="sec-3-8">
        <title>In a video game, two players, Mario and Luigi, are standing at diferent points in a</title>
        <p>level. Luigi is 8 units ahead of Mario in the game. Let’s compare Mario’s distance to</p>
      </sec>
      <sec id="sec-3-9">
        <title>Luigi’s distance from the level’s end.</title>
      </sec>
      <sec id="sec-3-10">
        <title>When Luigi is 20 units from the end of the level, how far away is Mario?</title>
      </sec>
      <sec id="sec-3-11">
        <title>When Luigi is 16 units from the end of the level, how far away is Mario?</title>
      </sec>
      <sec id="sec-3-12">
        <title>In the row labeled ”Expression”, define a variable for Mario’s distance and use that variable to write an expression that will allow you to calculate Luigi’s distance.</title>
      </sec>
      <sec id="sec-3-13">
        <title>Output Problem 1 based on interest ”basketball”:</title>
      </sec>
      <sec id="sec-3-14">
        <title>During a basketball game, two players, Jordan and Kobe, are standing at diferent</title>
        <p>positions on the court. Jordan is 12 feet ahead of Kobe on the court. Let’s compare</p>
      </sec>
      <sec id="sec-3-15">
        <title>Jordan’s distance to Kobe’s distance from the basket.</title>
      </sec>
      <sec id="sec-3-16">
        <title>When Kobe is 20 feet away from the basket, how far away is Jordan from the basket?</title>
      </sec>
      <sec id="sec-3-17">
        <title>When Kobe is 16 feet away from the basket, how far away is Jordan from the basket?</title>
      </sec>
      <sec id="sec-3-18">
        <title>In the row labeled ”Expression”, define a variable for Kobe’s distance and use</title>
        <p>that variable to write an expression that will allow you to calculate Jordan’s distance.
– Input Problem 2:</p>
      </sec>
      <sec id="sec-3-19">
        <title>You are a product inspector for a company that produces light bulbs. You find that two out of every 300 bulbs are defective: they don’t work properly.</title>
      </sec>
      <sec id="sec-3-20">
        <title>Output Problem 2 based on interest ”World of Warcraft”:</title>
      </sec>
      <sec id="sec-3-21">
        <title>You enjoy playing World of Warcraft on your computer. You notice that two out of every 300 times you defeat a monster, the monster has an epic item: a treasure that you want to collect.</title>
        <p>– Input Problem 3:
y = 80 - 6x
If x = 10, what is y?
If x = 7, what is y?
If y = 8, what is x?
Write a story that could go along with the equation y = 80 - 6x.</p>
      </sec>
      <sec id="sec-3-22">
        <title>Output Problem 3 based on interest ”Video Games”:</title>
        <p>You are playing your favorite war game on the Xbox 360. When you started playing
today, there were 80 enemies left in the locust horde. You kill an average of 6 enemies
every minute.
(a) How many enemies are left after 10 minutes?
(b) How many enemies are left after 7 minutes?
(c) Write an algebra rule that represents this situation using symbols.
(d) If there are only 8 enemies left, how long have you been playing today?</p>
      </sec>
      <sec id="sec-3-23">
        <title>Now give output for</title>
      </sec>
      <sec id="sec-3-24">
        <title>Some rules to follow:</title>
      </sec>
      <sec id="sec-3-25">
        <title>1. don’t change values</title>
        <p>– input problem: 2x+3=15
– Interest: [The interest that the problem needs to be contextualized for.]</p>
      </sec>
      <sec id="sec-3-26">
        <title>2. we want to have deeper contextualization not surface details based on Using Adaptive</title>
      </sec>
      <sec id="sec-3-27">
        <title>Learning Technologies to Personalize Instruction to Student Interests: The Impact</title>
        <p>of Relevant Contexts on Performance and Learning Outcomes</p>
      </sec>
      <sec id="sec-3-28">
        <title>3. output question should ask same thing as input question, don’t ask any additional question or complicate the info by adding unnecessary details</title>
      </sec>
      <sec id="sec-3-29">
        <title>This strict adherence to rules ensures that we maintain consistency in problem dificulty and preserve the problem’s original intent. This methodology respects the principle of not altering values or over-complicating the question by adding unnecessary details as observed in our earlier iterations.</title>
        <sec id="sec-3-29-1">
          <title>3.2. CTAT Implementation</title>
        </sec>
      </sec>
      <sec id="sec-3-30">
        <title>In this section, we propose a novel interaction design for contextualizing problems in Intelligent</title>
        <p>Tutoring Systems using CTAT and GPT-4 that emphasizes problem-authoring control. Teachers
or instructional designers could contextualize existing problems simply by adding interest in
the “Contextualized by Interest” tab in the Mass Production feature (Figure 1). After the user
click the contextualize problem, the system will use GPT-4 and the prompt we mentioned in
the prompt engineering section to generate variations of the problem for each interest. They
can also preview and edit the contextualized result in the student-facing interface on the right
panel to make sure whether they are satisfied with the generation result (Figure 2).</p>
        <p>
          The application of mass production in CTAT ofers significant value, as it facilitates the
creation of a vast array of distinct problems using the same underlying structure. This contributes to
mastery learning, allowing learners to practice similar problems in various contexts, ultimately
strengthening their grasp of the subject [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. In our approach, we utilized this principle, where
only the contextual ’cover stories’ were varied for the problem within the CTAT platform. This
delivers similar problem-solving opportunities to students, yet personalizes these scenarios to
align with their individual interests. The implications of this mass production approach based on
interests are manifold; it can potentially increase student engagement, improve problem-solving
abilities, and promote a better understanding of the subject matter.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Future Work and Limitations</title>
      <sec id="sec-4-1">
        <title>While our approach benefits significantly from established pedagogical and technological</title>
        <p>foundations, it notably leverages the Mass Production facility to enhance problem diversity
within existing problem sets. By utilizing mass production, we can incorporate existing cover
stories and contextualized hints as examples in our few-shot learning approach with GPT-4,
providing a coherent and contextual learning experience.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Systematic studies to measure the impact of this tool on students’ learning outcomes remain</title>
        <p>a crucial part of our future work. This includes investigating improvements in initial response
accuracy, learning eficiency enhancements, and the rate of progress towards proficiency.</p>
        <p>However, we acknowledge certain limitations in our current model, particularly in handling
geometry problems involving graphs, tables, or when diagrams would be beneficial. Despite
its proficiency in algebra, GPT-4’s present capabilities fall short in generating images that
align with the problem text, hindering its efectiveness in scenarios where visual aids could
significantly contribute to a student’s comprehension of concepts. Future work will focus on
overcoming these limitations to better facilitate student understanding across diverse problem
types.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <sec id="sec-5-1">
        <title>We extend our sincere gratitude to Prof. Vincent Aleven, whose expert guidance was indispensable to the success of this research. His profound wisdom and unwavering support enriched this work immeasurably.</title>
        <p>Interest
Danny and the Algebraics are recording their first demo album. They plan
to send 15 CDs to record companies to try to land a recording contract.
They are wondering how many additional CDs they should make. They
have a fixed budget of $1000 and it costs $2.50 to make a CD. They use
this formula to see how much they will have left if they make a diferent
number of CDs.</p>
        <p>Let  be the number of CDs they make after the initial 15 CDs.
The amount of money they will have left = 1000 - 2.50 ( + 15)
1. How much money is left if they make 85 additional CDs?
2. How much money is left if they make 125 additional CDs?
3. How much money is left if they make 250 additional CDs?
4. How much money is left if they make 385 additional CDs?
An upcoming TikTok creator, Danny and his creative team, are planning
to promote their first viral video. They plan to run a paid advertising
campaign to 15 prominent TikTok influencers to try to gain more followers.
They are wondering how many more influencers they should reach out
to. They have a fixed budget of $1000 and it costs $2.50 to run an ad per
influencer. They use this formula to see how much they will have left if
they promote to a diferent number of influencers. Let  = the number
of influencers they target after the initial 15 influencers. The amount of
money they will have left = 1000-2.50 ( + 15)
1. How much money is left if they reach out to 85 additional
influencers?
2. How much money is left if they reach out to 125 additional
influencers?
3. How much money is left if they reach out to 250 additional
influencers?
4. How much money is left if they reach out to 385 additional
influencers?
The Lakers are planning their season’s training sessions. They have a
budget of $1000 to buy basketballs for practice. Each basketball costs
$2.50. They are wondering how many additional basketballs they should
buy. They already plan to buy 15 basketballs to start of. They use
this formula to see how much they will have left if they buy a diferent
number of basketballs. Let  = the number of basketballs they buy after
the initial 15 basketballs. The amount of money they will have left =
1000-2.50( + 15)
1. How much money is left if they buy 85 additional basketballs?
2. How much money is left if they buy 125 additional basketballs?
3. How much money is left if they buy 250 additional basketballs?
4. How much money is left if they buy 385 additional basketballs?
Original
TikTok
NBA
2x + 3 = 15
In the realm of TikTok, you’re working on a new viral challenge. You
have found a way to get 2 new followers every time you post a video.
Additionally, you already have 3 followers to start with. Your aim is to
have 15 followers. Create an equation to show how many TikTok videos
’x’ you need to post to reach your follower goal.</p>
        <p>During an NBA game, a player earns points for their team by scoring
baskets. Each 2-point field goal adds 2 points and every free throw adds
a single point to the team’s total. Imagine a situation where a player,
LeBron, makes a certain number of 2-point field goals and 3 successful
free throws, resulting in 15 points for his team. Write an equation that
would help determine the number of 2-point field goals LeBron made.
Use ’x’ to denote the number of 2-point field goals.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1] OpenAI, Gpt-4
          <source>technical report</source>
          ,
          <year>2023</year>
          . arXiv:
          <volume>2303</volume>
          .
          <fpage>08774</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>V.</given-names>
            <surname>Aleven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>McLaren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sewall</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Van Velsen</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Popescu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Demi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ringenberg</surname>
            ,
            <given-names>K. R.</given-names>
          </string-name>
          <string-name>
            <surname>Koedinger</surname>
          </string-name>
          ,
          <article-title>Example-tracing tutors: Intelligent tutor development for non-programmers</article-title>
          ,
          <source>International Journal of Artificial Intelligence in Education</source>
          <volume>26</volume>
          (
          <year>2016</year>
          )
          <fpage>224</fpage>
          -
          <lpage>269</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Walkington</surname>
          </string-name>
          ,
          <article-title>Using adaptive learning technologies to personalize instruction to student interests: The impact of relevant contexts on performance and learning outcomes</article-title>
          .,
          <source>Journal of educational psychology 105</source>
          (
          <year>2013</year>
          )
          <fpage>932</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hidi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Renninger</surname>
          </string-name>
          ,
          <article-title>The four-phase model of interest development</article-title>
          ,
          <source>Educational psychologist 41</source>
          (
          <year>2006</year>
          )
          <fpage>111</fpage>
          -
          <lpage>127</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Goldstone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Son</surname>
          </string-name>
          ,
          <article-title>The transfer of scientific principles using concrete and idealized simulations</article-title>
          ,
          <source>The Journal of the learning sciences 14</source>
          (
          <year>2005</year>
          )
          <fpage>69</fpage>
          -
          <lpage>110</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>C. J. MacLellan</surname>
            ,
            <given-names>K. R.</given-names>
          </string-name>
          <string-name>
            <surname>Koedinger</surname>
          </string-name>
          ,
          <article-title>Domain-general tutor authoring with apprentice learner models</article-title>
          ,
          <source>International Journal of Artificial Intelligence in Education</source>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Elkins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kochmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Cheung</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serban</surname>
          </string-name>
          ,
          <article-title>How useful are educational questions generated by large language models?</article-title>
          ,
          <source>arXiv preprint arXiv:2304.06638</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E.</given-names>
            <surname>Prihar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hopman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kalai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vempala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Wickline</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Hefernan</surname>
          </string-name>
          ,
          <article-title>Comparing diferent approaches to generating mathematics explanations using large language models</article-title>
          ,
          <source>in: Proceedings of the AIED2023 Conference</source>
          ,
          <year>2023</year>
          . To be published.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Z. A.</given-names>
            <surname>Pardos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bhandari</surname>
          </string-name>
          ,
          <article-title>Learning gain diferences between chatgpt and human tutor generated algebra hints</article-title>
          ,
          <source>arXiv preprint arXiv:2302.06871</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>V.</given-names>
            <surname>Aleven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sewall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Popescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xhakaj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Baker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Siemens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rosé</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gasevic</surname>
          </string-name>
          ,
          <article-title>The beginning of a beautiful friendship? intelligent tutoring systems and moocs</article-title>
          ,
          <source>in: Artificial Intelligence in Education: 17th International Conference, AIED 2015</source>
          , Madrid, Spain, June 22-26,
          <year>2015</year>
          . Proceedings 17, Springer,
          <year>2015</year>
          , pp.
          <fpage>525</fpage>
          -
          <lpage>528</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>