=Paper=
{{Paper
|id=Vol-2879/paper28
|storemode=property
|title=Cloud technologies and learning analytics: web application for PISA results analysis and visualization
|pdfUrl=https://ceur-ws.org/Vol-2879/paper28.pdf
|volume=Vol-2879
|authors=Mariia S. Mazorchuk,Tetyana S. Vakulenko,Anna O. Bychko,Olena H. Kuzminska,Oleksandr V. Prokhorov
|dblpUrl=https://dblp.org/rec/conf/cte/MazorchukVBKP20
}}
==Cloud technologies and learning analytics: web application for PISA results analysis and visualization==
Cloud technologies and learning analytics: web application for PISA results analysis and visualization Mariia S. Mazorchuk1,2 , Tetyana S. Vakulenko2 , Anna O. Bychko2 , Olena H. Kuzminska3 and Oleksandr V. Prokhorov4 1 V. N. Karazin Kharkiv National University, 4 Svobody Sq., Kharkiv, 61022, Ukraine 2 Ukrainian Center for Educational Quality Assessment, 5 V. Vinnichenko Str., Kyiv, 04053, Ukraine 3 National University of Life and Environmental Sciences of Ukraine, 15 Heroyiv Oborony Str., Kyiv, 03041, Ukraine 4 The National Aerospace University “Kharkiv Aviation Institute”, 17 Chkalov Str., Kharkiv, 61070, Ukraine Abstract This article analyzes the ways to apply Learning Analytics, Cloud Technologies, and Big Data in the field of education on the international level. This paper provides examples of international analytical researches and cloud technologies used to process the results of those researches. It considers the PISA research methodology and related tools, including the IDB Analyzer application, free R intsvy environment for processing statistical data, and cloud-based web application PISA Data Explorer. The paper justifies the necessity of creating a stand-alone web application that supports Ukrainian localization and provides Ukrainian researchers with rapid access to well-structured PISA data. In particular, such an application should provide for data across the factorial features and indicators applied at the country level and demonstrate the Ukrainian indicators compared to the other countries’ results. This paper includes a description of the application core functionalities, architecture, and technologies used for development. The proposed solution leverages the shiny package available with R environment that allows implementing both the UI and server sides of the application. The technical implementation is a proven solution that allows for simplifying the access to PISA data for Ukrainian researchers and helping them utilize the calculation results on the key features without having to apply tools for processing statistical data. Keywords learning analytics, Cloud Technologies, PISA, web application 1. Introduction Currently, we can observe the rapid growth of the demand in Big Data Analytics and Business Analytics [1]. Big Data and Cloud Computing solutions empower real-time decision-making, identify trends, and allow for creating data models within the most powerful Data Analytics CTE 2020: 8th Workshop on Cloud Technologies in Education, December 18, 2020, Kryvyi Rih, Ukraine " mazorchuk.mary@gmail.com (M. S. Mazorchuk); vakulenko_tetyana@ukr.net (T. S. Vakulenko); bychko.anya@gmail.com (A. O. Bychko); o.kuzminska@nubip.edu.ua (O. H. Kuzminska); o.prokhorov@khai.edu (O. V. Prokhorov) ~ https://www.univer.kharkov.ua/en (M. S. Mazorchuk); https://testportal.gov.ua/l (T. S. Vakulenko); https://nubip.edu.ua/en (O. H. Kuzminska); https://khai.edu/en/ (O. V. Prokhorov) 0000-0002-4416-8361 (M. S. Mazorchuk); 0000-0002-7403-1075 (T. S. Vakulenko); 0000-0001-5609-6978 (A. O. Bychko); 0000-0002-8849-9648 (O. H. Kuzminska); 0000-0003-4680-4082 (O. V. Prokhorov) © 2020 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) 484 solutions. Cloud Technologies provide for a brand-new way of data processing to retrieve data and analytics valuable for business and decision-making [2]. The recent research [3], as well as the agenda of the latest international conferences [4, 5, 6, 7], prove that the Big Data processing approaches are applicable and relevant for education as well. Horizon Report 2020 [8] considers Learning Analytics and Analytics of Student Data to be the most promising areas in educational technologies. Being powered by Cloud Technologies by Google, IBM, Amazon, and Microsoft [9], Learning Analytics can empower effective decision- making on the level of an educational institution, a region, or the whole world. Besides, a vast amount of data is available for the public, so it is easy to utilize for research. On the local level, introducing the Learning Analytics can facilitate: • building the educational trajectories for the college undergraduate students [10], • leveraging data on the students’ activities to support them during online learning [11], analyzing the influence of group behavior on the individual student’s success [12], • analyzing the External Independent Evaluation (EIT) data to decide on the school for a pupil [13]. Across the world, the researchers analyze the volumes of data received from the international exams. For instance, the educational community keeps on track of research that studies the influence of the family background (including social, economic, and cultural factors) on the students’ success [14, 15]. However, many analysts still cannot take the advantage of the analytics provided by interna- tional research such as PISA, TIMSS, PIRLS, PIAAC, NAEP, and TALIS [16]. The results are available for the public, and it’s possible to access the databases and technical data, but not every researcher has a full understanding of the research methodology and tools. For instance, to make full use of data, analysts require background knowledge of calculation methodology, data structure and data processing algorithms of the research they work with. Focusing on the PISA project, which it is currently the most noticeable and comprehensive evaluation project in Ukraine [16]. In 2018, Ukraine participated in the Program for International Student Assessment (PISA) for the first time, yet the question of efficiently utilizing the PISA data is still open to negotiation [17]. The Ukrainian researchers would have leveraged PISA data as a consistent input for analysis and synthesis, indicating and solving problems in our education system. Though, published at the end of 2019, the PISA analytical reports were hardly utilized to analyze the current state of education in Ukraine. We cannot get away from the fact that PISA reports contain analytical data that reveal how our cultural, social, economic, and educational environment influences the success of Ukrainian students. The data is evidenced by precise calculations and introduces the researchers to the most entire gamut. However, not every analyst can manage the research results without additional training, which takes them time and effort. Challenges that arise for them when it comes to processing data provided by PISA or any other international research are the following: • the analysts have not enough background understanding the research structure and evaluation techniques, 485 • the analysts receive too much data, that seems to them unstructured and difficult to process, • the analysts can be confused with the research-specific terminology. We aim to make the results of any international research more accessible, to allow more analysts to use the analytical results for their independent research and get the most out of the supplied data. This research suggests developing a web application that would allow the educational com- munity to assess PISA results in the most transparent and user-friendly format. 2. Analysis of PISA tools for results processing PISA utilizes three main software tools for data processing: the IDB Analyzer application, free environment for static data analysis R intsvy, and the PISA Data Explorer web application powered by cloud technologies. IDB Analyzer is free software that allows for generating scripts for commercial SPSS (Statistical Package for the Social Sciences) or SAS (Statistical Analysis Software) packages, each of those has a cloud solution. The tool considers the students’ selection when calculating the standard deviation and the scheme of the test book rotation (Plausible Value – the probable values of the students’ results). IDB Analyzer generates code that allows users to process descriptive statistics and verify the statistical hypothesis without writing the code in SPSS or SAS. Though, utilizing IDB Analyzer requires both skills with SPSS and/or SAS, and installing commercial software. Besides, it requires data preprocessing (uploading and cleansing). Also, the researchers require certain skills and expertise with data processing. intsvy is a free R program package for processing the PISA, TIMSS, PIRLS, PIAAC and ICILS results. This package includes such functions as data import, data analysis and results visual- ization. The data analysis function considers the complicated selection construct and possible values when calculating grades and standard variance, regression coefficients, correlation coef- ficients, and frequency array. The visualization tools allow for demonstrating aggregated data in standard graphic form, that can be adjusted via the open function code. Likewise, working with intsvy requires users minimal experience with R language and RStudio. PISA Data Explorer is a cloud-based solution for processing PISA results, available at https://www.oecd.org/pisa/data/. PISA Data Explorer requires from the users at least min- imal skills and background with statistical data processing. The main advantage of this tool is that the PISA data should not be uploaded and preprocessed. The whole data is available online. The calculations are conducted online as well. The tool’s disadvantage is limited to the fact that a researcher can only conduct the data analysis limited to the built-in PISA Data Explorer functions. We shall refer to the Education GPS/Explore Data service available at https://gpseducation.oecd.org/Home, as an example of an internationally applied application for PISA data display that allows for receiving the PISA estimate indicators. This service generalizes data on results, publications, and conclusions of the researches held in OECD (PISA, TALIS, PIAAC). Education GPS/Explore Data organizes the research data according to the countries, topics and research agendas. Using this service does not require 486 any specific skills in data analysis. The navigation panel has useful data filters that allow for displaying values for the selected countries. The system stores huge amounts of data, yet due to the structure issues, sometimes it prevents users from finding the data they need. Thus, we came to a conclusion, that any researcher who is interested in an independent study requires either a considerable experience with data analysis and statistical data processing, or should search for generalized data at the Education GPS resource. Using international research results, many local researchers face challenges in understanding the data processing methods utilized by particular research centers. These challenges can occur both on the level of statistical data analysis and on the level of understanding the deep context of the research [18]. For instance, the problem with processing statistical data on student’s contextual characteris- tics and learning environments can be caused by terminology. Also, some PISA conclusions rely on the indexed received from a complicated mathematical model, which bases upon the survey results. These conclusions can be unobvious. To calculate all the values exponents, indexes, or distributions percentage the researcher has to process a big amount of data and understand the whole PISA evaluation strategy. Thus, we can see how data processing issues make research complicated and prevent many researchers from utilizing Learning Analytics. This paper suggests designing a web application that would ensure the Ukrainian researchers’ quick access to PISA data in a comprehensive, interactive, and user-friendly format. This solution can become a highly effective tool to analyze PISA results for individual research and get the most out of the PISA data. 3. Results 3.1. The challenges addressed by the application This work aims to design a web application for displaying PISA results for Ukraine and launch it at http://pisa.testportal.gov.ua/ to make the research data available for the public. The proposed application will allow analysts, researchers, and teachers from various institu- tions to get the most out of the PISA research experience, including: • retrieving the data on the evaluation of semantic and contextual assessments from the complicated PISA research structure, • retrieving the PISA calculation on students’ success (shown in PISA scores and the level passing of students’ competence), • retrieving the PISA calculations on contextual characteristics of the learning environ- ment (including students’ gender, institution’s location, student’s social and economic background), • retrieving the PISA results that demonstrate the research indicators for the national level (e.g. the type of the educational institution can be specified on the national level), • assessing the interactive data visualizations for Ukraine with descriptions for the PISA indicators, • accessing open PISA data on Ukraine and the benchmark countries, 487 • PISA content localized for Ukrainian users. The detailed descriptions for PISA indicators and results implementation in the format of hypertext guide users through the process of searching and help them understand the strategic evaluation results. The content in Ukrainian allows more Ukrainian speaking users to utilize PISA results for their research. 3.2. Application structure The application structure aligns with the structure of PISA research. The survey results allow the researchers to assess various factors that determine the current condition of educational institutions and their effects on the students’ success. This allows for analyzing the causes and effects of the education system in a selected country. The PISA program is a three-year cycle. The evaluation takes place every three years and the results are revealed at the end of each cycle. The results of each cycle are not connected. These results can be compared in a timeline on the countries and key indicators, that provides a thorough grounding for strategic solutions in education. The research structure consists of two main parts: the results on student’s subject competence in math, reading, and natural science and the meaning of various indexes that demonstrate the contextual student’s characteristics, educational environment, motivation, and other factors that can influence the students’ success. The data should be displayed both for each cycle separately and in a timeline. Figure 1 demonstrates the main elements of the PISA research results structure. The questions from the student’s survey reflect the main goals and objectives of the PISA research. These questions highlight the key factors required to analyze what influences the students’ success. The survey also allows for collecting data on such important for representative analytics factors as: • gender, • educational institutions’ locations, • education programs and • students’ social and economic status. The key indicators shaped on the international level allow for comparing the results and backgrounds both for the same and different categories of students within one country. This allows us to compare results for students from different countries based on their gender, region or social and economic status. The type of educational institution or educational program serves as a national indicator and allows for evaluating results only on the level of a particular country. The index values serve as self-sufficient research results that can be analyzed on the interna- tional level to compare students within the country, or countries on the given parameters, and also as factor values which influence student’s success and a range of other indicators. For instance, the index of students’ social, economic, and cultural statuses demonstrates not only the financial situation in a family, but the social and cultural aspects of their lives including their parents’ education, availability of books, musical instruments, educational software, etc. 488 Figure 1: The structure of the PISA study. The index value is an important indicator of the students’ social, economic, and cultural statuses compared to their peers from other countries. Besides, index value influences both students’ success and other indexes received during the analysis. The values of the main indexes, criteria and characteristics can be divided into several categories: • wellbeing, students’ ambitions, expectations, and attitude to education, • resources invested in education, • the learning environment and school climate. Each category has a set of indexes and criterias, received from the questioning students and their educators. Calculated via complicated mathematical models, these indexes allow for comparing values on the international level and separate categories of students. The indexes mostly summarize student’ input to the survey and are considered to be imputed relative values that characterize some aspect of the research. The indexes are compared based on OECD, an average of the countries, that equals to 0. Thus, the indexes mentioned on the graphs or in tables should have explanations to help users interpret the displayed data correctly. Also, we can use interactive graphs to visualize the real-life condition of Ukrainian education compared to the reference countries with similar cultural, social, or economical conditions or 489 shared historical background. With a convenient visualization, the user can concentrate on the appropriate for comparing educational systems with similar social and cultural backgrounds. Studying the experience of countries with similar backgrounds and factors that influence their students’ success is more appropriate than comparing Ukrainian experience to the experience of countries with better initial conditions for education. 3.3. Technical implementation The Ukrainian Center for Educational Quality Assessment experts calculate PISA results with intsvy package in R environment and store them in text documents or tables (.csv or .txt). Based on that data, we build diagrams to demonstrate the distribution of indicators. Figure 2 demonstrates the general structure of the results visualization. Figure 2: Generalized structure of visualisation of PISA results. We utilized R Shiny package (https://shiny.rstudio.com/) to design both the application interface and the server sides, and deploy the application to the Cloud [19]. Figure 3 demonstrates the application structure. The application provides for calculating criteria based on PISA results in R via the intsvy package. The application stores the calculation results in the text files used to visualize data on the server-side. Thus, the application doesn’t conduct any calculations and 490 doesn’t require any time to upload data and process the results. Figure 3: The technical implementation. The application interface is designed as an interactive dashboard and provides for navigation, interactive menu, graphics, and hypertext links. Figure 4 demonstrates the application home page and the main navigation elements that allow users to go to the data structure and access the diagrams of PISA scores distribution and students’ competence levels. Figure 5 demonstrates an interactive graphic that displays the data on student’s success depending on countries and categories. This graphic allows users to compare different categories of students from different countries and see the factors that influence students’ success. With the Description option, the users can get explanations for the main criteria and indexes. This option is available both from the context menu and from the hypertext prompt message. 4. Conclusion Working on this project we came to the conclusion that applying Learning Analytics is not only a trend but an effective tool for improving the system of education at all levels. Utilizing the international level results of Learning Analytics can be challenging for the local researchers. These challenges are connected with managing data analysis tools, understanding the research methodology, and the analysed indicators in the local context. Ukraine participated in PISA in 2018, yet still, only a limited number of researchers work on processing that data. These results are important to understand the problems and make decisions on the different levels of education in Ukraine. Ukrainian specialists have challenges leveraging the PISA data. The PISA calculations method- ology is difficult to understand for researchers without specific background with tooling and a 491 Figure 4: Home page of application. Figure 5: Interactive graphic. good level of English. Thus, this work proves that having an application that would simplify accessing PISA results is pertinent. The application we suggest is a convenient tool for a wide range of people working in the education segment who utilize PISA results for their research. This web application will allow for assessing analytical data based on PISA results in the most convenient, comprehensive, and user-friendly format. This will be an effective tool for accessing PISA data in Ukraine and utilizing the calculations’ results on the key indicators. What is more the users should not apply the tools for statistical data processing. Currently, we only implemented a part of the main functionality. Though, 492 we plan the updates that would allow for uploading the calculated results in table format and utilizing the whole learning analytics on PISA results in Ukraine. References [1] The Digital Enterprise: moving from experimentation to transformation, 2018. URL: http://www3.weforum.org/docs/Media/47538_Digital%20Enterprise_Moving_ Experimentation_Transformation_report_2018%20-%20final%20(2).pdf. [2] K. Andriushchenko, V. Rudyk, O. Riabchenko, M. Kachynska, N. Marynenko, L. Shergina, V. Kovtun, M. Tepliuk, A. Zhemba, O. Kuchai, Processes of managing information infras- tructure of a digital enterprise in the framework of the «Industry 4.0» concept, Eastern- European Journal of Enterprise Technologies 1 (2019) 60–72. URL: http://journals.uran.ua/ eejet/article/view/157765. doi:10.15587/1729-4061.2019.157765. [3] O. Heyer, From Learning to Data Analytics: Some Implications for IT Strategy and Transformation, EDUCAUSE Review 54 (2019). [4] A. Kiv, V. Soloviev, S. Semerikov, CTE 2018 – How cloud technologies continues to transform education, CEUR Workshop Proceedings 2433 (2019) 1–19. URL: http://ceur-ws. org/Vol-2433/paper00.pdf. [5] A. Kiv, M. Shyshkina, S. Semerikov, A. Striuk, M. Striuk, H. Shalatska, CTE 2019 - When cloud technologies ruled the education, CEUR Workshop Proceedings 2643 (2020) 1–59. URL: http://ceur-ws.org/Vol-2643/paper00.pdf, 7th Workshop on Cloud Technologies in Education, CTE 2019 ; Conference Date: 20 December 2019. [6] A. E. Kiv, V. N. Soloviev, S. O. Semerikov, XII International Conference on Mathematics, Sci- ence and Technology Education, Journal of Physics: Conference Series 1840 (2021) 011001. URL: https://doi.org/10.1088/1742-6596/1840/1/011001. doi:10.1088/1742-6596/1840/ 1/011001. [7] A. E. Kiv, V. N. Soloviev, S. O. Semerikov, A. M. Striuk, V. V. Osadchyi, T. A. Vakaliuk, P. P. Nechypurenko, O. V. Bondarenko, I. S. Mintii, S. L. Malchenko, XIII International Conference on Mathematics, Science and Technology Education, Journal of Physics: Conference Series (2021). [8] M. Brown, M. McCormack, J. Reeves, D. C. Brooks, S. Grajek, B. Alexander, M. Bali, S. Bulger, S. Dark, N. Engelbert, K. Gannon, A. Gauthier, D. Gibson, R. Gibson, B. Lundin, G. Veletsianos, N. Weber, 2020 EDUCAUSE Horizon Report: Teaching and Learning Edition Available, EDUCAUSE, Louisville, CO, 2020. URL: https://library.educause.edu/-/media/ files/library/2020/3/2020_horizon_report_pdf. [9] M. S. A. El-Seoud, H. F. El-Ssofany, I. A. T. F. Taj-Eddin, A. Nosseir, M. M. El-Khouly, Imple- mentation of Web-Based Education in Egypt through Cloud Computing Technologies and Its Effect on Higher Education, Higher Education Studies 3 (2013) 62–76. URL: http://www. ccsenet.org/journal/index.php/hes/article/view/27423. doi:10.5539/hes.v3n3p62. [10] V. Pavlenko, A. Prokhorov, O. Kuzminska, M. Mazorchuk, Competence approach to modeling and control of students’ learning pathways in the cloud service, CEUR Workshop Proceedings 1844 (2017) 257–264. [11] S. Walker, T. Olney, C. Wood, A. Clarke, M. Dunworth, How do tutors use data to support 493 their students?, Open Learning: The Journal of Open, Distance and e-Learning 34 (2019) 118–133. URL: https://doi.org/10.1080/02680513.2018.1554476. doi:10.1080/02680513. 2018.1554476. [12] J.-H. Zhang, Y.-X. Zhang, Q. Zou, S. Huang, What learning analytics tells us: Group behavior analysis and individual learning diagnosis based on long-term and large-scale data, Journal of Educational Technology & Society 21 (2018) 245–258. URL: http://www. jstor.org/stable/26388404. [13] EIT 2008-2020: zvity, 2020. URL: https://testportal.gov.ua/ofzvit/. [14] G. Yang, M. Badri, A. Al Rashedi, K. Almazroui, The role of reading motivation, self- efficacy, and home influence in students’ literacy achievement: a preliminary examination of fourth graders in Abu Dhabi, Large-scale Assessments in Education 6 (2018) 10. URL: https://doi.org/10.1186/s40536-018-0063-0. doi:10.1186/s40536-018-0063-0. [15] H. W. Ko, Y. L. Chan, Family factors and primary students’ reading attainment, Chinese Education & Society 42 (2009) 33–48. URL: https://doi.org/10.2753/CED1061-1932420302. doi:10.2753/CED1061-1932420302. [16] S. Arikan, A regression model with a new tool: IDB analyzer for identifying factors predicting mathematics performance using PISA 2012 indices, US-China Education Review 4 (2014) 716–727. [17] PISA-2018: zvity, 2020. URL: http://pisa.testportal.gov.ua/pisa-2018-zvity/. [18] D. V. Vasyl’ieva, M. V. Holovko, Y. O. Zhuk, O. H. Kozlenko, O. I. Liashenko, S. O. Naumenko, V. I. Novos’olova, UROKY PISA-2018: metodychni rekomendatsii, Instytut pedahohiky NAPN Ukrainy, Pedahohichna dumka, Kyiv, 2020, p. 96. [19] H. Wickham, Mastering Shiny, 2020. URL: https://mastering-shiny.org/index.html. 494