=Paper= {{Paper |id=Vol-3862/paper2 |storemode=property |title= |pdfUrl=https://ceur-ws.org/Vol-3862/paper2.pdf |volume=Vol-3862 |authors=Juuso Eronen,Saeun Lee |dblpUrl=https://dblp.org/rec/conf/lacatoda/EronenL24 }} ==== https://ceur-ws.org/Vol-3862/paper2.pdf
                         Improving English Education in Japan: Leveraging Large
                         Language Models for Personalized and Skill-Diverse
                         Learning⋆
                         Juuso Eronen1,*,† , Saeun Lee1,†
                         1
                             Prefectural University of Kumamoto, 3 Chome-1-100 Tsukide, Higashi Ward, Kumamoto, Japan, 862-0920


                                       Abstract
                                       The introduction of advanced generative AI has significantly impacted higher education, particularly in language
                                       learning. In Japan’s test-oriented and teacher-centered English education system, students have limited opportu-
                                       nities to leverage generative AI for developing critical skills like analysis, evaluation, and creativity. This learning
                                       is often passive, focusing on memorization and testing rather than fostering higher-order thinking skills (HOTS).
                                       To address these challenges, the authors propose an AI-enhanced language teaching method that integrates HOTS
                                       with lower-order thinking skills (LOTS), based on Bloom’s revised taxonomy. This approach encourages deeper
                                       engagement, critical thinking, and active learning through AI-supported activities like synthesizing information
                                       and problem-solving. The proposed method employs large language models, such as LLaMA 3, to deliver per-
                                       sonalized learning experiences tailored to students’ English proficiency and personal interests. Pre-prompts are
                                       used to focus learning on specific goals, such as vocabulary, grammar, and pronunciation. Additionally, plans to
                                       integrate the system with an AI assistant robot aim to further enhance interactivity and immersion in the learning
                                       process. Despite concerns about reduced human interaction, privacy issues, algorithmic bias, and accessibility
                                       challenges, AI-powered language learning may offer significant benefits in Japan’s higher education context. By
                                       providing a safe, non-judgmental environment, generative AI can help overcome communication barriers, build
                                       student confidence, and support the digitalization of Japan’s education system, ultimately contributing to the
                                       development of global human resources.

                                        Keywords
                                        English Education in Japan, Large Language Models (LLMs), Personalized Learning, Critical Thinking Skills




                         1. Introduction
                         In the field of language education, the integration of advanced AI technology presents a compelling
                         opportunity to revolutionize traditional teaching methods. Imagine a future where language learning is
                         tailored to the unique needs of each student, offering personalized experiences that enhance engagement
                         and learning outcomes. However, the journey towards AI-driven language education is not without
                         its challenges, particularly within the context of Japan’s education system, which values tradition and
                         passive learning focused on memorization.
                            In this study, we explore the potential of language teaching enhanced by generative AI while
                         addressing the practical issues faced in Japan’s educational landscape. The introduction of AI technology
                         in language education holds the promise of individualized learning experiences that cater to diverse
                         learning styles and preferences. By leveraging generative AI, language learning platforms can analyze
                         the students’ strengths and weaknesses, providing targeted support and feedback tailored to their
                         specific needs. Additionally, AI-driven content delivery systems can adapt in real-time to students’
                         progress, ensuring that lessons remain engaging and relevant.
                            By leveraging established educational principles, namely the revised Bloom’s taxonomy, our research
                         seeks to enhance language education through the integration of generative AI. By equipping students

                         The 9th Linguistic and Cognitive Approaches to Dialog Agents Workshop, November 19, 2024, Kyoto University, Kyoto, Japan
                         (co-located with PRICAI 2024)
                         *
                           Corresponding author.
                         †
                           These authors contributed equally.
                         $ eronenj@pu-kumamoto.ac.jp (J. Eronen); saeunlee@pu-kumamoto.ac.jp (S. Lee)
                          0000-0001-9841-3652 (J. Eronen)
                                       © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings

                                                                                                              17
with both lower-order and higher-order thinking skills, our methodology aims for a holistic learning
experience that prepares students for success in the digital age. However, the implementation of
generative AI in language education is not without its challenges, as the Japanese education system has
long prioritized standardized testing and teacher-centered instruction, requiring a shift in pedagogical
approaches and a re-evaluation of traditional teaching paradigms. Through a systematic examination
of both the opportunities and obstacles, this research aims to provide practical insights for educators
and policymakers.
   To explore this potential, a survey was conducted with 84 university students, focusing on their use
of ChatGPT as a learning tool. Building on these insights, a generative AI-powered learning system
has been developed as a pilot project, leveraging Large Language Models (LLMs), namely LLaMA 3.
This system offers personalized educational experiences by understanding and generating human-
like language. The chatbot’s use of pre-prompts ensures focused learning on specific goals, such as
vocabulary and grammar. A planned integration the chatbot with an AI assistant robot to create a more
immersive learning environment.


2. Background
2.1. English education in Japan
Japan’s education system is known for its traditional approach, emphasizing rote memorization and stan-
dardized testing [1]. In the context of English education, this often results in a focus on passive learning
and comprehension rather than the development of critical thinking and creativity. Students typically
encounter limited opportunities for personalized learning experiences tailored to their individual needs
and preferences [2].
   The system is facing a multitude of issues that hinder its ability to fully meet the evolving needs
of students in the 21st century. Foremost among these challenges is the pressure placed on students,
stemming from the system’s heavy reliance on standardized testing as the primary measure of academic
success [3]. This emphasis on exam performance not only fosters a hyper-competitive atmosphere,
contributing to heightened stress levels, but also interferes with the development of critical thinking,
problem-solving, and creativity among students. Furthermore, the teachers within this system often
contend with overwhelming workloads, including administrative burdens that detract from their
capacity to deliver engaging and personalized instruction [4]. Also, opportunities for developing and
applying innovative teaching methods and tailored learning experiences are indeed constrained by
various factors inherent in the conservative nature and strict hierarchy of the Japanese education
system [5]. The entrenched traditionalism within the system often resists change and experimentation,
favoring stability and conformity over innovation and flexibility. These systemic challenges underscore
the urgent need for comprehensive reforms and innovative solutions to ensure that Japan’s education
system remains relevant and effective in preparing students for the complexities of the modern world.

2.2. Artificial intelligence in education
In recent years, the integration of generative artificial intelligence (AI) models, particularly OpenAI’s
ChatGPT, into education has garnered significant attention. Various studies have highlighted both the
potential benefits and challenges of this technology in enhancing learning outcomes. The integration
of AI technology, specifically large language models like ChatGPT, into the education system presents
an opportunity to address these limitations and enhance the learning experience for students.
   ChatGPT offers personalized tutoring by tailoring educational content to meet individual student
needs, thereby fostering a more customized learning experience. Studies by Baidoo-Anu and Owusu
Ansah [6] emphasize its ability to provide ongoing feedback and promote self-directed learning, which
is critical in improving students’ engagement and motivation. Similarly, Su and Yang [7] argue that
ChatGPT’s application in creating virtual tutors can provide real-time, personalized feedback, making
it a valuable tool in both formal and informal learning settings.



                                                    18
   AI-powered chatbots, including ChatGPT, have been utilized to create interactive learning environ-
ments that support self-paced education. In a study by Adiguzel et al. [8], the authors highlight the
potential of ChatGPT to generate interactive and engaging learning scenarios, which promote deep
thinking and problem-solving. Another key advantage of using ChatGPT in education is its ability to
handle routine tasks such as grading and administrative work, which allows teachers to focus more
on instructional design and student support. This frees up more time for personalized instruction and
student interaction. This allows teachers to focus on facilitating meaningful learning experiences and
fostering critical thinking skills in their students [9]. Studies suggest that ChatGPT can automate essay
grading, providing accurate assessments of student work, which can save educators considerable time
[6], [7].
   Despite its benefits, the implementation of ChatGPT in education is not without challenges. One
major issue is the risk of perpetuating biases embedded in the data used to train these models. Baidoo-
Anu and Owusu Ansah [6] caution that generative AI may unintentionally amplify existing biases,
which can affect the fairness and equity of educational tools. Similarly, Mhlanga [10] raises concerns
about privacy and the ethical use of generative AI in educational settings, calling for transparency and
fairness to be prioritized when integrating such technologies into classrooms. Additional concerns
include for example, ethical issues related to academic dishonesty and the potential displacement of
educators [11].
   However, the current solutions mostly rely on ChatGPT without considering the needs of individual
students, highlighting a critical gap in the application of generative AI in education. By leveraging
AI-powered tools and platforms, personalized learning pathways can be developed based on students’
abilities, interests, and learning styles. AI algorithms can provide valuable insights into students’
strengths and weaknesses, enabling educators to tailor instruction accordingly. Moreover, generative AI
can facilitate adaptive learning environments where content delivery is adjusted in real-time based on
students’ progress and comprehension levels, ultimately promoting a more effective and individualized
educational experience. This ensures that students receive the appropriate level of challenge and
support, promoting deeper engagement and understanding [12], [13].
   Overall, the integration of generative AI has the potential to revolutionize Japan’s education system
by promoting personalized learning, fostering critical thinking skills, and enhancing overall educational
outcomes. Our goal is to create a solution that utilizes generative AI to not only adapt to individual learn-
ing needs but also to cultivate a more dynamic, interactive and student-centered learning environment
based on Bloom’s revised taxonomy. By incorporating elements from all levels of Bloom’s taxonomy
we aim to enhance students’ critical thinking abilities and empower both students and educators to
engage in a more meaningful educational experience. We seek to develop higher-order thinking skills,
ensuring that students are equipped to navigate complex problems and contribute effectively in their
academic and professional endeavors and to better prepares them for global success.

2.3. Bloom’s taxonomy
One potential attempt to challenge the circumstance of the prevailing traditional language learning
approach in Japan is to deconstruct and reconstruct the learning process using Bloom’s taxonomy.
Bloom’s taxonomy is widely acknowledged for its role in mastering learning [14], as it delineates the
knowledge and skills necessary to facilitate learners’ mental processes [15],[16], [17], [18], [19], [20],
[21], [22]. Introduced by Benjamin Samuel Bloom in 1956, this taxonomy aims to investigate learners’
cognitive skills to enhance learning and to develop teaching strategies that progress from lower-order
cognitive skills to higher-order cognitive skills [23].
  The Bloom’s framework was later revised by Anderson and Krathwohl [24] to include six components,
which are described below:

    • Remembering: Retrieving, recognizing, and recalling relevant knowledge from long-term memory.

    • Understanding: Constructing meaning from oral, written, and graphic messages through inter-
      preting, exemplifying, classifying, summarizing, inferring, comparing, and explaining.



                                                     19
    • Applying: Carrying out or using a procedure through executing or implementing.

    • Analyzing: Breaking material into constituent parts, determining how the parts relate to one
      another and to an overall structure or purpose through differentiating, organizing, and attributing.

    • Evaluating: Making judgements based on criteria and standards through checking and critiquing.

    • Creating: Putting elements together to form a coherent or functional whole; reorganizing elements
      into a new pattern or structure through generating, planning, or producing.

   Bloom’s taxonomy serves as a valuable tool for fostering the advancement of higher-order thinking
skills, which are crucial qualifications for both life and careers among students. Additionally, instruc-
tional lessons formulated using Bloom’s taxonomy tend to transition towards a more student-centered
approach, emphasizing cognitive awareness and empowering students with control over their learning
activities.
   One of the primary components of Bloom’s taxonomy involves student self-assessment and/or
self-evaluation, which is closely associated with the development of higher-order thinking skills in
student-centered learning. The student self-assessment [25] allows for valuable development of one’s
learning, as it aligns with principles of double-loop learning [26], student self-regulation and self-
management, and the student-centered classroom [27].
   Within the framework of Bloom’s taxonomy, learning extends beyond mere evaluation of work
to encompass reflection on learning, engagement in thought processes, and enhancement of higher-
order thinking skills. However, a challenge arises regarding the extent to which a teacher can support
students in mastering learning in depth. Self-regulated learning is also necessary, with consistent
support. To overcome this challenge and to master learning effectively, there is promise in current AI
technology-based learning, as it enables the practice of higher-order thinking skills and the development
of cognitive skills through ongoing interaction with generative AI.


3. Implementation
3.1. Lesson plan
We present two lesson plan examples that demonstrate how generative AI can be incorporated into
writing and reading, as well as speaking and listening activities. These lesson plans aim to engage
students in interactive learning experiences while leveraging AI-powered feedback to enhance their
skills. By combining traditional pedagogical approaches with generative AI, educators can create
dynamic and personalized learning environments that cater to the diverse needs and preferences of
students.

Writing & Reading Lesson plan example (Figure 1)
   1. Write an Essay TOPIC: Recently, the number of car accidents due to smartphone use has been
      increased. Provide three solutions to reduce accidents resulting from smartphone use in public
      areas.

   2. Brainstorming and search about the problem and solutions (understand)

   3. Plan & write an outline.

   4. Write Problem solving essay organization. 5 paragraphs essay. (Remember/recall)

   5. AI feedback Accuracy check: Organization, Grammar, Spelling, Remember & Compare the Essay
      organization Understand the feedback from AI device Apply the feedback from AI device Analyze
      the own writing drafts Evaluate the feedback from AI and writing drafts Create & rewrite the
      essays



                                                   20
   6. Revise & rewrite

   7. Final draft

Speaking & Listening Lesson plan example (Figure 2)
   1. Have a conversation with AI-device TOPIC: I am going to England next month.

   2. Prepare a few questions What should I prepare? Where do you recommend to go? What can I do
      in England? Can you give me some advice? Etc.

   3. Submit the conversation script

   4. Evaluate the conversation script and understand the self-level


Figure 1: Writing lesson plan example




Figure 2: Speaking lesson plan example




                                                 21
   In the writing and reading lesson plan example, students are tasked with writing an essay on
the topic of car accidents resulting from smartphone use in public areas. The lesson progresses
through brainstorming and research, outlining, drafting, receiving AI feedback, revising, and ultimately
producing a final draft. AI feedback is utilized to assess organization, grammar, and spelling, helping
students refine their writing skills through iterative improvements. Similarly, in the speaking and
listening lesson plan example, students engage in a conversation with an AI device about their upcoming
trip to England. They prepare questions, submit the conversation script, and evaluate their performance,
gaining insights into their speaking and listening abilities. These lesson plans demonstrate how
generative AI can support and enhance language learning experiences, fostering student engagement
and skill development.

3.2. Preliminary survey
We conducted a preliminary survey to understand students’ thoughts on using ChatGPT as a tool for
learning. The survey aimed to gather insights into students’ experiences, perceived benefits, challenges,
and overall satisfaction with ChatGPT in an educational context. The results provide valuable feedback
for educators and developers to enhance the effectiveness of educational tools that utilize generative AI.
   A questionnaire was designed and distributed to a sample of 84 second-year undergraduate stu-
dents. The survey included both quantitative and qualitative questions, covering various aspects of
their experience, including ease of use, effectiveness, benefits and detriments, and overall satisfaction.
The questions are listed in Appendix A. The survey results provide a detailed overview of students’
experiences and perceptions regarding the use of ChatGPT in their educational activities. The data
indicates that only about 14% of students had previously used ChatGPT in other classes, suggesting
that its adoption in educational contexts may still be relatively new.
   A significant challenge noted was the difficulty in using ChatGPT, with over 91% of students reporting
some level of difficulty. Most commonly, the students had difficulties in designing prompts or saying that
the answer was not what they wanted or that it was too long. Despite these challenges, an overwhelming
majority of students found ChatGPT to be helpful in class, with 98% acknowledging its positive impact,
praising it for being able to give you new ideas and refine your writing. Similarly, 95% of students
agreed that ChatGPT assisted in their learning process by providing them with easily accessible new
knowledge, reinforcing its effectiveness as an educational tool. They also said that it can revise your
answers and improve your thinking skills.
   The survey also explored whether students’ opinions of ChatGPT changed after using it, with
nearly 60% reporting a shift in perspective. The comments reflected both positive experiences and
the recognition of areas where the tool could improve. In terms of potential enhancements, 74% of
respondents believe that ChatGPT needs improvement. The students especially wanted to be sure
that the information is reliable and that ChatGPT should give sources for the generated information.
They also wished it to be easier to get the answers they wanted, which links to the earlier mentioned
difficulties.
   When asked about future use, 90% of students expressed a desire to use ChatGPT in class again,
and 95% were interested in taking a dedicated ChatGPT class. Many students stated that ChatGPT
will be useful for them later when they enter the workforce. These figures suggest strong interest and
confidence in the tool’s potential to support learning.
   Finally, the overall satisfaction level was notably high, with an average rating of 4.23 out of 5. This
satisfaction level underscores the positive reception of ChatGPT among students, despite the identified
challenges and areas for improvement. Overall, the survey results reflect a generally positive attitude
towards ChatGPT, with significant interest in its continued use and development within educational
contexts.
   The survey results indicate that while a significant portion of students found ChatGPT to be a
beneficial tool for learning, there are areas that require attention. Addressing challenges such as
occasional misunderstandings of AI responses and technical issues is crucial. The feedback suggests a
need for enhanced personalization and more robust technical support to maximize the benefits of using



                                                   22
ChatGPT in educational settings.
   Based on the survey findings, several recommendations are proposed to improve the use of ChatGPT
in educational contexts. Incorporating more personalized learning experiences, addressing technical
issues to ensure seamless usage, and providing additional resources and support for students are key
steps to enhance the effectiveness and satisfaction of using AI-driven educational tools. As 90% of the
respodents stated that they would like to take a class where they would learn about using generative
AI. We are planning to implement this as a part of an introductory course for first year students.
   While the overall response is positive, addressing the identified challenges can further enhance the
effectiveness and satisfaction of students using educational tools based on generative AI. The findings
underscore the potential of AI in revolutionizing language education and offer a roadmap for future
improvements in integrating generative AI into educational practices.

3.3. Artificial intelligence system
In practice, the implementation of an AI-powered learning solution leveraging Large Language Models
(LLMs) is made possible by recent advancements in the field of Natural Language Processing (NLP).
LLMs are trained on vast amounts of text data, enabling them to understand and generate human-like
language with remarkable accuracy. This capability allows educational tools powered by LLMs to
provide tailored learning experiences that adapt to each student’s individual needs, preferences, and
learning style. These developments have opened up new possibilities for creating intelligent educational
tools and platforms that can understand, generate, and interact with human language in meaningful
ways. With the aid of LLMs, educators can harness the power of AI to provide personalized and adaptive
learning experiences for students.
   The implementation of an AI-powered learning solution leveraging Large Language Models (LLMs)
can take different forms, depending on factors such as infrastructure, privacy considerations, and the
desired level of control. This comes down to two approaches: utilizing third-party managed models like
ChatGPT or running a local LLM. Using third-party models like involves integrating pre-trained LLMs,
such as ChatGPT, into educational platforms or chatbot applications. These third-party models are
hosted and maintained by external providers, offering out-of-the-box functionality for natural language
understanding and generation, reducing development time and effort.
   Alternatively, there is the option to run a local LLM, either in a centralized or distributed manner. In
a centralized setup, a single server hosts the LLM, while in a distributed setup, each student receives
a dedicated device running a localized instance of the LLM. These approaches prioritize privacy and
security, ensuring data remains on-premises, while also allowing for customization at the cost of
requiring more specialized hardware. While the latter also comes with the added benefit of enabling
offline access, running LLMs on edge devices is still challenging due to performance issues. However,
this might change in the near future as the field is evolving rapidly.

3.3.1. Pilot system
As a pilot system, we are running a centralized local LLM. The architecture of the chatbot is designed to
integrate state-of-the-art AI models with high-performance hardware and a user-friendly interface. The
system uses the LLaMA 3 model [28], chosen for its superior natural language processing capabilities,
running on Ollama1 . The model has been trained on extensive and diverse datasets encompassing
various aspects of the English language, including conversational contexts, academic discourse, and
everyday language use. This comprehensive training allows the models to support users in enhancing
their vocabulary, grammar, pronunciation, and conversational proficiency.
   To optimize the learning experience, the chatbot employs pre-prompts, which are strategically
designed prompts that guide the conversation depending on the chosen learning setting (beginner
conversation, job interview practice, etc.). Pre-prompts help maintain the focus on specific educational
objectives such as vocabulary acquisition, grammatical accuracy, and pronunciation improvement. By
1
    https://github.com/ollama




                                                    23
using pre-prompts, the chatbot can deliver materials tailored to the user’s proficiency level and learning
goals.
   The OpenWebUI2 serves as the primary medium for user interaction with the chatbot. This ChatGPT-
like interface is designed to be intuitive and user-friendly, facilitating smooth and real-time communi-
cation between the user and the chatbot. The OpenWebUI supports speech inputs, enabling users to
practice conversation. Accessible through http, the user interface can be connected to by almost any
device running a modern browser.
   The system’s computational needs are met by the NVIDIA RTX A5000 GPU with 24GB of GDDR6
memory and 8192 CUDA cores. This ensures the efficient execution of complex NLP tasks, thereby
delivering low latency and high throughput. This hardware configuration is essential for maintaining
the system’s responsiveness and ensuring a seamless user experience, particularly during extended and
complex interactions.
   A significant future enhancement involves integrating the chatbot with the AI assistant robot, Temi3 .
This integration aims to create a more immersive and interactive learning environment by combining
the virtual capabilities of the chatbot with the physical presence of Temi. Temi will be connected to the
same LLaMa 3 model, enabling it to engage in real-time conversations, provide immediate feedback, and
facilitate hands-on learning activities. This hybrid approach is expected to enhance user engagement
and improve learning outcomes by offering a dynamic and interactive educational experience. We
are about to hold another survey regarding the use of the pilot system, this time with both students
and teachers, in order to develop it to be better suited for classroom use. After the robot integration
is completed, we plan to do a followup survey for both students and teachers in order to get more
feedback and to optimize the system for classroom use.
   Additionally, we are exploring the integration of Retrieval-Augmented Generation (RAG) technology.
RAG allows the model to pull information from new data sources, for example course-specific databases,
offering students more precise and relevant content tailored to their educational needs. This technology
can significantly enhance the personalization of learning experiences, where the model can adapt not
only to individual learning styles but also to the specific content of a course, providing contextualized
assistance that improves comprehension and retention.


4. Limitations and future prospects
While the integration of generative AI in education holds tremendous promise, several limitations
warrant consideration. Firstly, there’s a risk of diminished human interaction as generative AI tools
may replace meaningful teacher-student engagement, potentially impacting students’ social and emo-
tional development. Privacy concerns arise regarding the collection and utilization of student data for
personalized learning, raising ethical questions about data privacy and security.
   Algorithm bias is another challenge, as generative AI models can perpetuate existing inequalities or
favor certain demographics over others in educational outcomes. Accessibility remains a concern, with
students lacking access to technology or digital literacy skills potentially being left behind in AI-powered
learning environments. Moreover, an overemphasis on standardized testing may overshadow the
cultivation of critical thinking and creativity, limiting the holistic development of students. Dependence
on technology introduces vulnerabilities, such as technical failures or cyberattacks, while teacher
resistance and training needs may hinder effective implementation. Teachers may feel apprehensive
about the integration of AI, fearing that it might replace their role or undermine their authority in the
classroom.
   Additionally, concerns about navigating new technologies, potential job displacement, and the
perceived threat to pedagogical control may further intensify resistance. To mitigate these risks, it is
essential to provide teachers with comprehensive training and ongoing support, demonstrating how AI
can complement rather than replace their expertise. Encouraging collaboration between educators and
2
    https://github.com/open-webui
3
    https://www.robotemi.com/




                                                    24
AI systems can highlight how these tools can assist with routine tasks, such as grading or providing
personalized feedback, allowing teachers to focus on more complex, higher-order teaching activities. By
involving teachers in the development and implementation process, their concerns can be addressed, and
their role in an AI-enhanced educational environment can be clearly defined, ensuring they feel valued
and integral to student success. Ethical considerations surrounding decision-making by generative AI
in education require careful oversight, and the financial costs associated with implementing generative
AI pose challenges for resource-strapped institutions.
   Lastly, there’s a risk of students becoming overly dependent on AI assistance, potentially hindering
the development of independent learning skills. Addressing these limitations requires thoughtful
planning, ongoing evaluation, and a commitment to ethical and inclusive practices in AI integration
within educational settings.
   Still, the future of generative AI in education is promising, with potential to revolutionize personalized
learning, content creation, and assessment. AI can create customized learning paths, generate high-
quality materials, and provide real-time feedback. It will enhance accessibility through inclusive tools
and language translation, and support data-driven decision-making with predictive analytics. AI will
also facilitate lifelong learning and workforce training, and foster global collaboration.
   In the future, we will explore the potential for generalizing the proposed method beyond Japan’s
education system. While the current focus addresses the specific challenges of Japan’s test-oriented and
teacher-centered education, many of the underlying principles—such as fostering personalized learning,
critical thinking, and integrating higher-order thinking skills (HOTS) through AI—are universally
applicable. Future research will involve testing these approaches in different educational contexts
globally, adapting them to local curricula, cultural norms, and technological infrastructures. Additionally,
further investigation will be required to assess how AI systems can mitigate common issues like teacher
resistance, data privacy concerns, and algorithmic biases in a variety of educational environments. We
aim to contribute to a broader, global shift toward more dynamic, personalized, and effective educational
systems. Overall, AI integration in education promises more personalized, inclusive, and effective
learning experiences, preparing students for success in a digital world.


5. Conclusions
The recent introduction of advanced generative AI has ushered in a new era of possibilities across various
fields, including higher education language learning. The advent of this technology presents an array
of opportunities, including the provision of personalized learning experiences tailored to individual
student needs, seamless accessibility around-the-clock, immediate feedback mechanisms, and adaptive
content delivery aimed at enhancing overall learning outcomes. However, within the predominantly
test-oriented and teacher-centered landscape of English education in Japan, the integration of generative
AI remains underutilized, limiting students’ exposure to diverse skills crucial for future career success.
Often, learning experiences tend to be passive, focusing primarily on memorization, comprehension, and
testing, neglecting the development of essential higher-order thinking skills (HOTS) such as analysis,
evaluation, and creativity.
   To address this gap, we propose an innovative generative AI-enhanced language teaching method
designed to cultivate both lower-order thinking skills (LOTS) and higher-order thinking skills (HOTS)
as delineated by the revised Bloom’s taxonomy (2001) for educational goals. While LOTS are typically
developed through passive learning activities in traditional classrooms, the acquisition of HOTS requires
deeper engagement and critical thinking. By integrating generative AI into the learning process, students
can enhance their analytical and evaluative abilities by synthesizing information, presenting solutions,
and expanding their knowledge base. This approach facilitates tailored learning experiences that cater
to individual English proficiency levels and personal interests, thereby optimizing learning outcomes.
As students attain proficiency in HOTS and confidence in their abilities, they are better equipped to
apply their learning to academic and real-world career contexts.
   We conducted a survey to understand students’ thoughts on using AI for learning. The survey




                                                     25
involved 84 second-year university students who answered questions about their experiences using
ChatGPT as a learning tool. While only 14% had prior experience with it, 98% found it helpful in
class, particularly for generating ideas and improving writing. However, 91% faced challenges, mainly
with prompt design and receiving relevant answers. Despite this, 95% acknowledged ChatGPT’s role
in enhancing learning and expressed interest in continued use, with 90% wanting to use it in future
classes. Overall satisfaction was high (4.23/5), highlighting the tool’s potential, though improvements
in reliability and ease of use are needed.
   After analyzing the results of the survey, a generative AI-powered learning system using Large
Language Models (LLMs) has been implemented as a pilot project. Utilizing the LLaMA 3 model,
the system offers personalized educational experiences by understanding and generating human-like
language. The chatbot uses pre-prompts to focus on specific learning goals such as vocabulary, grammar,
and pronunciation. Future enhancements include integrating the chatbot with an AI assistant robot
for a more immersive and interactive learning environment. A followup survey for both students and
teachers will guide further development to optimize the system for classroom use.
   Despite the evident benefits, the integration of generative AI-powered language teaching raises
legitimate concerns regarding limited human interaction, privacy, algorithm bias, and accessibility
challenges. However, within the context of Japanese higher education, these challenges are outweighed
by the opportunities presented. AI-powered language learning environments offer students a safe and
non-judgmental space to overcome communication barriers, foster confidence, and navigate hierarchical
constraints often encountered in traditional English classrooms. Ultimately, the digitalization of the
Japanese education system through AI integration contributes to the development of globally competitive
human resources, equipped with the essential skills and competencies necessary for success in the
digital age.


References
 [1] Y. Nemoto, The Japanese education system, Universal-Publishers, 1999.
 [2] W. K. Cummings, Education and equality in Japan, volume 869, Princeton university press, 2014.
 [3] P. Doyon, A review of higher education reform in modern japan, Higher Education 41 (2001)
     443–470.
 [4] A. Murray, Teacher burnout in japanese higher education, The Language Teacher 37 (2013) 51–55.
 [5] K. Nuske, It is very hard for teachers to make changes to policies that have become so solidified”:
     Teacher resistance at corporate eikaiwa franchises in japan, The Asian EFL Journal Quarterly 16
     (2014) 105–131.
 [6] D. Baidoo-anu, L. Owusu Ansah, Education in the era of generative artificial intelligence (ai):
     Understanding the potential benefits of chatgpt in promoting teaching and learning, Journal of AI
     7 (2023) 52–62. doi:10.61969/jai.1337500.
 [7] J. Su, W. Yang,             Unlocking the power of chatgpt: A framework for apply-
     ing generative ai in education,             ECNU Review of Education 6 (2023) 355–366.
     URL:        https://doi.org/10.1177/20965311231168423.     doi:10.1177/20965311231168423.
     arXiv:https://doi.org/10.1177/20965311231168423.
 [8] T. Adiguzel, M. H. Kaya, F. K. Cansu, Revolutionizing education with ai: Exploring the transfor-
     mative potential of chatgpt, Contemporary Educational Technology (2023).
 [9] J. Jeon, S. Lee, Large language models in education: A focus on the complementary relationship
     between human teachers and chatgpt, Education and Information Technologies 28 (2023) 15873–
     15892.
[10] D. Mhlanga, Open AI in Education, the Responsible and Ethical Use of ChatGPT Towards Lifelong
     Learning, Springer Nature Switzerland, Cham, 2023, pp. 387–409.
[11] J. Qadir, Engineering education in the era of chatgpt: Promise and pitfalls of generative ai for
     education, in: 2023 IEEE Global Engineering Education Conference (EDUCON), 2023, pp. 1–9.
     doi:10.1109/EDUCON54358.2023.10125121.




                                                  26
[12] S. Milano, J. A. McGrane, S. Leonelli, Large language models challenge the future of higher
     education, Nature Machine Intelligence 5 (2023) 333–334.
[13] O. Koraishi, Teaching english in the age of ai: Embracing chatgpt to optimize efl materials and
     assessment, Language Education and Technology 3 (2023).
[14] M. L. Bigge, Learning theories for teachers (1964).
[15] N. T. Bümen, A study on the effectiveness and problems pertaining curriculum development
     departments at private schools in three major city of turkey, Kuram Ve Uygulamada Egitim
     Bilimleri 6 (2006) 655. URL: https://api.semanticscholar.org/CorpusID:141970610.
[16] N. E. Gronlund, Assessment of student achievement. sixth edition., 1998. URL: https://api.
     semanticscholar.org/CorpusID:140774868.
[17] C. G. Johnson, U. Fuller, Is bloom’s taxonomy appropriate for computer science?, in: Proceedings
     of the 6th Baltic Sea conference on Computing education research: Koli Calling 2006, 2006, pp.
     120–123.
[18] R. McBain, How high can students think? a study of students’ cognitive levels using bloom’s
     taxonomy in social studies., Online Submission (2011).
[19] M. H. Oermann, K. B. Gaberson, J. C. De Gagne, C. NPD-BC, et al., Evaluation and testing in
     nursing education, Springer Publishing Company, 2014.
[20] Y. Özden, Learning and teaching, Ankara: Pegem A Yayıncılık, 2011.
[21] J. Poole,       E-learning and learning styles: students’ reactions to web-based language
     and style at blackpool and the fylde college,                    Language and Literature 15 (2006)
     307–320. URL: https://doi.org/10.1177/0963947006066129. doi:10.1177/0963947006066129.
     arXiv:https://doi.org/10.1177/0963947006066129.
[22] M. Valcke, B. De Wever, C. Zhu, C. Deed, Supporting active cognitive processing in collaborative
     groups: The potential of bloom’s taxonomy as a labeling tool, The Internet and Higher Education
     12 (2009) 165–172. doi:10.1016/j.iheduc.2009.08.003.
[23] L. A. Lovell-Troy, Teaching techniques for instructional goals: A partial review of the literature.,
     Teaching Sociology 17 (1989) 28–37. URL: https://api.semanticscholar.org/CorpusID:144379073.
[24] A. LW, K. DR, A. PW, C. KA, R. Mayer, P. PR, J. Raths, W. MC, A Taxonomy for Learning, Teaching,
     and Assessing: A Revision of Bloom’s Taxonomy of Educational Objectives, 2001.
[25] S. Brookhart, Successful students’ formative and summative uses of assessment informa-
     tion, Assessment in Education: Principles, Policy & Practice 8 (2001) 153–169. doi:10.1080/
     09695940120062638.
[26] C. Argyris, Double loop learning in organizations, Harvard business review 55 (1977) 115–125.
[27] H. Daniels, M. Bizar, Methods that matter, York, Maine: Stenhouse (1998).
[28] A. Dubey, A. Jauhri, A. Pandey, A. Kadian, A. Al-Dahle, A. Letman, A. Mathur, A. Schelten, A. Yang,
     A. Fan, A. Goyal, A. Hartshorn, A. Yang, A. Mitra, A. Sravankumar, A. Korenev, A. Hinsvark,
     A. Rao, A. Zhang, A. Rodriguez, A. Gregerson, A. Spataru, B. Roziere, B. Biron, B. Tang, B. Chern,
     C. Caucheteux, C. Nayak, C. Bi, C. Marra, C. McConnell, C. Keller, C. Touret, C. Wu, C. Wong, C. C.
     Ferrer, C. Nikolaidis, D. Allonsius, D. Song, D. Pintz, D. Livshits, D. Esiobu, D. Choudhary, D. Maha-
     jan, D. Garcia-Olano, D. Perino, D. Hupkes, E. Lakomkin, E. AlBadawy, E. Lobanova, E. Dinan, E. M.
     Smith, F. Radenovic, F. Zhang, G. Synnaeve, G. Lee, G. L. Anderson, G. Nail, G. Mialon, G. Pang,
     G. Cucurell, H. Nguyen, H. Korevaar, H. Xu, H. Touvron, I. Zarov, I. A. Ibarra, I. Kloumann, I. Misra,
     I. Evtimov, J. Copet, J. Lee, J. Geffert, J. Vranes, J. Park, J. Mahadeokar, J. Shah, J. van der Linde,
     J. Billock, J. Hong, J. Lee, J. Fu, J. Chi, J. Huang, J. Liu, J. Wang, J. Yu, J. Bitton, J. Spisak, J. Park,
     J. Rocca, J. Johnstun, J. Saxe, J. Jia, K. V. Alwala, K. Upasani, K. Plawiak, K. Li, K. Heafield, K. Stone,
     K. El-Arini, K. Iyer, K. Malik, K. Chiu, K. Bhalla, L. Rantala-Yeary, L. van der Maaten, L. Chen,
     L. Tan, L. Jenkins, L. Martin, L. Madaan, L. Malo, L. Blecher, L. Landzaat, L. de Oliveira, M. Muzzi,
     M. Pasupuleti, M. Singh, M. Paluri, M. Kardas, M. Oldham, M. Rita, M. Pavlova, M. Kambadur,
     M. Lewis, M. Si, M. K. Singh, M. Hassan, N. Goyal, N. Torabi, N. Bashlykov, N. Bogoychev, N. Chat-
     terji, O. Duchenne, O. Çelebi, P. Alrassy, P. Zhang, P. Li, P. Vasic, P. Weng, P. Bhargava, P. Dubal,
     P. Krishnan, P. S. Koura, P. Xu, Q. He, Q. Dong, R. Srinivasan, R. Ganapathy, R. Calderer, R. S.
     Cabral, R. Stojnic, R. Raileanu, R. Girdhar, R. Patel, R. Sauvestre, R. Polidoro, R. Sumbaly, R. Taylor,



                                                       27
R. Silva, R. Hou, R. Wang, S. Hosseini, S. Chennabasappa, S. Singh, S. Bell, S. S. Kim, S. Edunov,
S. Nie, S. Narang, S. Raparthy, S. Shen, S. Wan, S. Bhosale, S. Zhang, S. Vandenhende, S. Batra,
S. Whitman, S. Sootla, S. Collot, S. Gururangan, S. Borodinsky, T. Herman, T. Fowler, T. Sheasha,
T. Georgiou, T. Scialom, T. Speckbacher, T. Mihaylov, T. Xiao, U. Karn, V. Goswami, V. Gupta,
V. Ramanathan, V. Kerkez, V. Gonguet, V. Do, V. Vogeti, V. Petrovic, W. Chu, W. Xiong, W. Fu,
W. Meers, X. Martinet, X. Wang, X. E. Tan, X. Xie, X. Jia, X. Wang, Y. Goldschlag, Y. Gaur, Y. Babaei,
Y. Wen, Y. Song, Y. Zhang, Y. Li, Y. Mao, Z. D. Coudert, Z. Yan, Z. Chen, Z. Papakipos, A. Singh,
A. Grattafiori, A. Jain, A. Kelsey, A. Shajnfeld, A. Gangidi, A. Victoria, A. Goldstand, A. Menon,
A. Sharma, A. Boesenberg, A. Vaughan, A. Baevski, A. Feinstein, A. Kallet, A. Sangani, A. Yunus,
A. Lupu, A. Alvarado, A. Caples, A. Gu, A. Ho, A. Poulton, A. Ryan, A. Ramchandani, A. Franco,
A. Saraf, A. Chowdhury, A. Gabriel, A. Bharambe, A. Eisenman, A. Yazdan, B. James, B. Maurer,
B. Leonhardi, B. Huang, B. Loyd, B. D. Paola, B. Paranjape, B. Liu, B. Wu, B. Ni, B. Hancock, B. Wasti,
B. Spence, B. Stojkovic, B. Gamido, B. Montalvo, C. Parker, C. Burton, C. Mejia, C. Wang, C. Kim,
C. Zhou, C. Hu, C.-H. Chu, C. Cai, C. Tindal, C. Feichtenhofer, D. Civin, D. Beaty, D. Kreymer, D. Li,
D. Wyatt, D. Adkins, D. Xu, D. Testuggine, D. David, D. Parikh, D. Liskovich, D. Foss, D. Wang,
D. Le, D. Holland, E. Dowling, E. Jamil, E. Montgomery, E. Presani, E. Hahn, E. Wood, E. Brinkman,
E. Arcaute, E. Dunbar, E. Smothers, F. Sun, F. Kreuk, F. Tian, F. Ozgenel, F. Caggioni, F. Guzmán,
F. Kanayet, F. Seide, G. M. Florez, G. Schwarz, G. Badeer, G. Swee, G. Halpern, G. Thattai, G. Her-
man, G. Sizov, Guangyi, Zhang, G. Lakshminarayanan, H. Shojanazeri, H. Zou, H. Wang, H. Zha,
H. Habeeb, H. Rudolph, H. Suk, H. Aspegren, H. Goldman, I. Molybog, I. Tufanov, I.-E. Veliche,
I. Gat, J. Weissman, J. Geboski, J. Kohli, J. Asher, J.-B. Gaya, J. Marcus, J. Tang, J. Chan, J. Zhen,
J. Reizenstein, J. Teboul, J. Zhong, J. Jin, J. Yang, J. Cummings, J. Carvill, J. Shepard, J. McPhie,
J. Torres, J. Ginsburg, J. Wang, K. Wu, K. H. U, K. Saxena, K. Prasad, K. Khandelwal, K. Zand,
K. Matosich, K. Veeraraghavan, K. Michelena, K. Li, K. Huang, K. Chawla, K. Lakhotia, K. Huang,
L. Chen, L. Garg, L. A, L. Silva, L. Bell, L. Zhang, L. Guo, L. Yu, L. Moshkovich, L. Wehrstedt,
M. Khabsa, M. Avalani, M. Bhatt, M. Tsimpoukelli, M. Mankus, M. Hasson, M. Lennie, M. Reso,
M. Groshev, M. Naumov, M. Lathi, M. Keneally, M. L. Seltzer, M. Valko, M. Restrepo, M. Patel,
M. Vyatskov, M. Samvelyan, M. Clark, M. Macey, M. Wang, M. J. Hermoso, M. Metanat, M. Raste-
gari, M. Bansal, N. Santhanam, N. Parks, N. White, N. Bawa, N. Singhal, N. Egebo, N. Usunier,
N. P. Laptev, N. Dong, N. Zhang, N. Cheng, O. Chernoguz, O. Hart, O. Salpekar, O. Kalinli, P. Kent,
P. Parekh, P. Saab, P. Balaji, P. Rittner, P. Bontrager, P. Roux, P. Dollar, P. Zvyagina, P. Ratanchan-
dani, P. Yuvraj, Q. Liang, R. Alao, R. Rodriguez, R. Ayub, R. Murthy, R. Nayani, R. Mitra, R. Li,
R. Hogan, R. Battey, R. Wang, R. Maheswari, R. Howes, R. Rinott, S. J. Bondu, S. Datta, S. Chugh,
S. Hunt, S. Dhillon, S. Sidorov, S. Pan, S. Verma, S. Yamamoto, S. Ramaswamy, S. Lindsay, S. Lind-
say, S. Feng, S. Lin, S. C. Zha, S. Shankar, S. Zhang, S. Zhang, S. Wang, S. Agarwal, S. Sajuyigbe,
S. Chintala, S. Max, S. Chen, S. Kehoe, S. Satterfield, S. Govindaprasad, S. Gupta, S. Cho, S. Virk,
S. Subramanian, S. Choudhury, S. Goldman, T. Remez, T. Glaser, T. Best, T. Kohler, T. Robinson,
T. Li, T. Zhang, T. Matthews, T. Chou, T. Shaked, V. Vontimitta, V. Ajayi, V. Montanez, V. Mohan,
V. S. Kumar, V. Mangla, V. Ionescu, V. Poenaru, V. T. Mihailescu, V. Ivanov, W. Li, W. Wang, W. Jiang,
W. Bouaziz, W. Constable, X. Tang, X. Wang, X. Wu, X. Wang, X. Xia, X. Wu, X. Gao, Y. Chen,
Y. Hu, Y. Jia, Y. Qi, Y. Li, Y. Zhang, Y. Zhang, Y. Adi, Y. Nam, Yu, Wang, Y. Hao, Y. Qian, Y. He,
Z. Rait, Z. DeVito, Z. Rosnbrick, Z. Wen, Z. Yang, Z. Zhao, The llama 3 herd of models, 2024. URL:
https://arxiv.org/abs/2407.21783. arXiv:2407.21783.




                                               28
A. Survey Questions
  1. 今まで、他の授業でChatGPTを利用した授業をした事がありますか。
     YES・NO

  2. ChatGPTを使用する際に難しかった事はありましたか。それはなんですか

  3. ChatGPTを使った授業は何に役に立ったと思いますか。

  4. ChatGPTは自分の学びに役に立つと思いますか。

  5. ChatGPTの授業を受ける前と受けた後、ChatGPTに対する意見が変わりましたか。

  6. ChatGPTを使う時に、改善するべきところはありますか。それは何ですか。

  7. 今後の授業でまたChatGPTを使った授業を受けたいですか。YES・NO

  8. ChatGPTの使い方を学ぶ授業は必要だと思いますか。YES・NO

  9. ChatGPTを使った時に、自分の学びをより発展させる事が出来ると思いますか

 10. ChatGPTを使った授業に対する満足度を選んでください。
     □ 非常に満足
     □ 満足
     □ 普通
     □ 不満
     □ 非常に不満

 11. ChatGPTを利用してみて、どう思いましたか。授業の感想を書いてください。(良い
     点、悪い点、難しかった点など、自由に書いてください。)




                         29