<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards the Information Technology Usage for E-Government Portal Assessment based on Web Data Extraction Techniques</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Andrii</forename><surname>Kopp</surname></persName>
							<email>kopp93@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">National Technical University &quot;Kharkiv Polytechnic Institute&quot;</orgName>
								<address>
									<addrLine>Kyrpychova str. 2</addrLine>
									<postCode>61002</postCode>
									<settlement>Kharkiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Oleksandr</forename><surname>Chornenkyi</surname></persName>
							<email>chornenkyi.o.o@gmail.com</email>
							<affiliation key="aff1">
								<orgName type="department">V.N</orgName>
								<orgName type="institution">Karazin Kharkiv National University</orgName>
								<address>
									<addrLine>Svobody sq. 4</addrLine>
									<postCode>61022</postCode>
									<settlement>Kharkiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department" key="dep1">E-Government Web Portal</orgName>
								<orgName type="department" key="dep2">Citizen Portal</orgName>
								<orgName type="department" key="dep3">Information Technology</orgName>
								<orgName type="department" key="dep4">Web Data Extraction. Information Technology and Implementation (IT&amp;I-2023)</orgName>
								<address>
									<addrLine>November 20-21</addrLine>
									<postCode>2023</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Towards the Information Technology Usage for E-Government Portal Assessment based on Web Data Extraction Techniques</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">BC551CEF7F40CFB44469207F2F28607D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T20:01+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Today, interdisciplinary studies in computer science and social sciences, including political science, are inevitable due to the need to work with web-based sources to gain valuable insights, process large amounts of data, and apply various data analysis techniques. Web data extraction or web scraping is important for social and political studies when it is necessary to retrieve data arrays from a website for future analytical processing. Such automatic data collection and processing is a promising interdisciplinary field for social scientists and computer scientists. Therefore, this study aims to improve e-government web portal evaluation processes by proposing a corresponding information technology based on web data extraction techniques. The software implementation of the proposed technology is based on Python and Power BI for computation and visualization, respectively. The proposed toolkit was used to analyze the e-government web portals of two countries selected on the basis of their high egovernment development index, the obtained results of prevailing services on each of citizen portals were analyzed and discussed, and the corresponding conclusions were made.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction 1.Motivation</head><p>Nowadays, the rapid evolution of computer technologies changes scientific approaches to modern issues and provides ways for creating novel and enhancing existing research methods. It is especially considerable for applied science wherein computational technique implementation accelerates the complex applied problem solution requiring large volumes of calculations. Social sciences, which have historical relations with philosophy, have a peculiar wide range of research methods. However, social sciences are also in a transformation state and increasingly using computing technology for research problem solving, which has led to the emergence of computational social sciences. Initially, computational social sciences were associated with agent-based modeling for the simulation of the behavior of an individual or social group under certain conditions. Nevertheless, the Internet spreading, social networks and online platforms popularity increasing within the growth of numbers of Internet users provoked a new large stream of digital data which has become a valuable source of information for social sciences researchers and has led to the expansion of the concept of "computational social science" <ref type="bibr" target="#b0">[1]</ref>. Although earlier researchers have argued that digital data analysis-based computational social science has developed slowly <ref type="bibr" target="#b1">[2]</ref>, more recent studies show that in recent times increased the interest of social sciences scholars in using computational techniques for research <ref type="bibr" target="#b0">[1]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.2.">Related Work</head><p>The use of information technologies for political science research is not as new as it may seem, and began in the second half of the 20th century. The first experiments using computers were aimed at trying to predict election results. Typically, all studies were based on the use of agent-based modeling in different form, using classical theories of political interactions <ref type="bibr" target="#b2">[3]</ref>. The further evolution of the Internet, increasing the power and availability of computer technology provided new research fields for political scientists and tools for expanding methodology. Political science methodology expansion led to the fact that in addition to agent-based modeling, political scientists more often were beginning to use methods related to big data analysis <ref type="bibr" target="#b3">[4]</ref>.</p><p>Scholars in social and political science address "big data" as a broad concept that includes any digital elements left by users or organizations on the Internet that can be read by information technologies <ref type="bibr" target="#b4">[5]</ref>. In the computer science field, big data is usually associated with so-called "5Vs" used to describe its characteristics (value, variety, velocity, veracity, and volume) <ref type="bibr" target="#b5">[6]</ref>.</p><p>Today, for social sciences fields and particularly for political science, when working with big data, it is important to use web scraping tools, which is the automatic extraction of data from websites for further analytical processing <ref type="bibr" target="#b6">[7]</ref>. For political science, this approach can be valuable for defining features of how political parties or government agencies use their websites <ref type="bibr" target="#b4">[5]</ref> and for researching local politics through the mining and analysis of unstructured data from the websites of local government institutions <ref type="bibr" target="#b7">[8]</ref>. Some researchers conclude and we agree that the use of automated data mining in social sciences opens a new way for cooperation between social and computer science researchers <ref type="bibr" target="#b5">[6]</ref>.</p><p>It should be stressed that political science must be in the continuous dynamic movement condition and must permanently react to political changes in the modern world. Today, the policies of many countries aim at the formation of an inclusive information society, which includes widespread digital transformation. Governments create and support open e-government web portals, which aim to facilitate citizens' access to government information and improve the process of providing government services to citizens. However, it should be noted that the politics of different states differ from each other and may have various accents. Under such circumstances, it may be interesting how the policies of different states influence their e-government web portals.</p><p>Thus, we propose an information technology that can help researchers to explore services provided on government web portals, which can be useful for further analysis of different countries' policies. It aims to improve e-government web portal evaluation processes by using web harvesting techniques.</p><p>Therefore, this study is expected to answer the following research questions:  What reference model can be used to evaluate the e-government web portal?  What algorithms can be used to process and harvest the desired e-government web portal data?  How can the extracted data be quantitatively evaluated to compare the policies of different countries and define the prevailing citizen services?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Materials and Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">E-Government Web Portal Services Model</head><p>Let us formally describe the set of services that the e-government web portal is expected to provide:</p><formula xml:id="formula_0">𝑒𝐺𝑆 = {𝑒𝐺𝑆 1 , 𝑒𝐺𝑆 2 , … , 𝑒𝐺𝑆 𝑛 }. (<label>1</label></formula><formula xml:id="formula_1">)</formula><p>Here 𝑛 is the number of services 𝑒𝐺𝑆 1 , 𝑒𝐺𝑆 2 , … , 𝑒𝐺𝑆 𝑛 the e-government web portal is expected to provide, 𝑖 = 1, 𝑛.</p><p>Moreover, for each of the e-government web portal services 𝑒𝐺𝑆 𝑖 , 𝑖 = 1, 𝑛 we propose to define the set of keywords 𝑊 𝑖 , 𝑖 = 1, 𝑛 which completely describes the mentioned service:</p><formula xml:id="formula_2">𝛿: 𝑒𝐺𝑆 𝑖 → 𝑊 𝑖 = {𝑤 𝑖1 , 𝑤 𝑖2 , … , 𝑤 𝑖𝑚 𝑖 }. (<label>2</label></formula><formula xml:id="formula_3">)</formula><p>Here 𝑚 𝑖 is the number of synonymic keywords 𝑤 𝑖1 , 𝑤 𝑖2 , … , 𝑤 𝑖𝑚 𝑖 in 𝑊 𝑖 , 𝑖 = 1, 𝑛 defined for 𝑖-th egovernment web portal service 𝑒𝐺𝑆 𝑖 , 𝑗 = 1, 𝑚 𝑖 .</p><p>Hence, the formal definition of E-Government Web Portal Services (EGWPS) model can be formulated as given below: 𝑒𝐺𝑊𝑃𝑆 = 〈𝑒𝐺𝑆, 𝛿, 𝑊〉.</p><p>Here 𝑊 is the set of keyword sets mapped to each of the e-government web portal services, 𝑊 = {𝑊 1 , 𝑊 2 , … , 𝑊 𝑛 }. Let us graphically illustrate in Fig. <ref type="figure" target="#fig_0">1</ref> the proposed e-government web portal services model. Fig. <ref type="figure" target="#fig_0">1</ref> demonstrates the set of expected services and their keywords used to detect such services on the e-government web portals under assessment. Using the proposed EGWPS model (Fig. <ref type="figure" target="#fig_0">1</ref>), we propose to find the "distance" between the e-government web portal under assessment and the so-called "perfect" e-government web portal (in terms of its contents) described by this model.   The e-government services data extracted from HTML (Hyper Text Markup Language) pages of a corresponding web portal is represented as the set of HTML hyperlink tags <ref type="bibr" target="#b9">[10]</ref>:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Web Data Extraction Algorithm</head><formula xml:id="formula_5">𝐻 = {ℎ 1 , ℎ 2 , … , ℎ 𝑝 }. (<label>4</label></formula><formula xml:id="formula_6">)</formula><p>Here 𝑝 is the number of hyperlinks ℎ 1 , ℎ 2 , … , ℎ 𝑝 located on e-government web portal HTML pages, 𝑘 = 1, 𝑝.</p><p>Each hyperlink tag includes the text and URL (Unified Resource Locator) as it is demonstrated in the meta-model diagram in Fig. <ref type="figure" target="#fig_1">2</ref>.</p><p>The set of e-government web portal services 𝑒𝐺𝑆 extracted from the respective HTML pages is represented as follows: 𝑆 = {𝑆 1 , 𝑆 2 , … , 𝑆 𝑛 }.</p><p>(5) Each service 𝑆 𝑖 , 𝑖 = 1, 𝑛 is expected to be digitally implemented by one or multiple hyperlinks located on the e-government web portal pages:</p><p>𝑆 𝑖 = 𝐻 𝑆 𝑖 ⊆ 𝐻. (6) Here 𝐻 𝑆 𝑖 is the sub-set of hyperlinks extracted from the e-government web portal that implement 𝑖th service detected in the e-government web portal 𝑆 𝑖 , 𝑖 = 1, 𝑛. Therefore, to detect services provided by the e-government web portal using the proposed EGWPS model (Fig. <ref type="figure" target="#fig_0">1</ref>) and the meta-model (Fig. <ref type="figure" target="#fig_1">2</ref>), the following algorithm should be used:</p><p>Given: set of extracted e-government web portal hyperlinks 𝐻 EGWPS model 〈𝑒𝐺𝑆, 𝛿, 𝑊〉 empty set of detected e-government web portal services 𝑆 for each 𝑒𝐺𝑆 𝑖 in 𝑒𝐺𝑆:</p><p>for each ℎ 𝑘 in 𝐻: 𝑊 𝑖 = 𝛿(𝑒𝐺𝑆 𝑖 ) for each 𝑤 𝑖𝑗 in 𝑊 𝑖 : if 𝑤 𝑖𝑗 is a substring of ℎ 𝑘 text:</p><formula xml:id="formula_7">𝑆 𝑖 ← ℎ 𝑘 end end end</formula><p>The input set 𝐻 of the e-government web portal hyperlinks can be extracted using web scraping tools in Python or other programming languages.</p><p>The output set 𝑆 basically represents the instances of Service class defined in the proposed metamodel (Fig. <ref type="figure" target="#fig_1">2</ref>). Furthermore, each service 𝑆 𝑖 , 𝑖 = 1, 𝑛 has multiple hyperlinks that belong to 𝐻.</p><p>Fig. <ref type="figure" target="#fig_3">3</ref> graphically illustrates the proposed algorithm using the UML activity diagram <ref type="bibr" target="#b8">[9]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">E-Government Portal Assessment Metrics</head><p>Finally, we propose the following metrics to assess the e-government web portal in terms of detected services. The following metric allows to find the number of services detected in the e-government web portal under assessment: 𝑆𝐷 = |{𝑆 𝑖 ∈ 𝑆, 𝑆 𝑖 ≠ ∅}|.</p><p>(7) The following metric allows to find the "service richness" calculated as the relative number of services of the e-government web portal under assessment in comparison to the reference EGWPS model (Fig. <ref type="figure" target="#fig_0">1</ref>) <ref type="bibr" target="#b10">[11]</ref>:</p><formula xml:id="formula_8">𝑆𝑅 = 1 𝑛 𝑆𝐷.<label>(8)</label></formula><p>The following metric allows to find the "relative cardinality" of the particular service calculated as the relative number of hyperlinks used to implement the 𝑖-th service detected in the e-government web portal under assessment in comparison to the maximum possible number of hyperlinks used in the same web portal for a certain service <ref type="bibr" target="#b11">[12]</ref>:</p><formula xml:id="formula_9">𝑆𝐶 𝑖 = { 1 max 𝑖=1,𝑛 |𝑆 𝑖 | |𝑆 𝑖 |, max 𝑖=1,𝑛 |𝑆 𝑖 | &gt; 0 0, max 𝑖=1,𝑛 |𝑆 𝑖 | = 0<label>(9)</label></formula><p>The following metric allows to find the total "service balance" to assess the balance of hyperlinks related to services detected in the e-government web portal under assessment:</p><formula xml:id="formula_10">𝑆𝐵 = 1 𝑛 ∑ 𝑆𝐶 𝑖 𝑛 𝑖=1 . (<label>10</label></formula><formula xml:id="formula_11">)</formula><p>Using the following algorithm, it is possible to evaluate an e-government web portal against the EGWPS model. Thus, as a reference model, we can use the experience and best practices of the most advanced e-government web portals, define the set of services 𝑒𝐺𝑆 a portal is expected to provide, and the keywords 𝑊 to detect such services in corresponding HTML web pages.</p><p>Fig. <ref type="figure" target="#fig_4">4</ref> graphically illustrates the proposed algorithm using the UML activity diagram <ref type="bibr" target="#b8">[9]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Information Technology for E-Government Portal Assessment</head><p>Finally, the information technology for e-government web portal assessment can be formally described using the following tuple: 𝑒𝐺𝑊𝑃𝐴𝐼𝑇 = 〈𝑒𝐺𝑊𝑃𝑆, 𝑆, 𝐴𝑀〉.</p><p>(11) Here 𝐴𝑀 is the algorithmic model, which includes the proposed algorithms (Fig. <ref type="figure" target="#fig_3">3</ref> and Fig. <ref type="figure" target="#fig_4">4</ref>) <ref type="bibr" target="#b12">[13]</ref>:</p><formula xml:id="formula_12">𝐴𝑀 = (𝐴 = {𝐴 1 , 𝐴 2 }, 𝑅 ⊂ 𝐴 × 𝐴). (<label>12</label></formula><formula xml:id="formula_13">)</formula><p>Here 𝐴 is the set of algorithms, where 𝐴 1 is the data extraction algorithm and 𝐴 2 is the evaluation algorithm; 𝑅 describes the interconnections between the proposed algorithms when used to assess an e-government web portal. Selected e-government web portals will be analyzed using the proposed information technology implemented using Python, in-build packages, and third-party libraries:</p><p> "urllib" -used the "request" module to open and work with URLs <ref type="bibr" target="#b13">[14]</ref>;  "re"for regular expressions operations to parse web pages <ref type="bibr" target="#b14">[15]</ref>;  "json"to save results as JSON (JavaScript Object Notation) <ref type="bibr" target="#b15">[16]</ref>;  "bs4" -used the "Beautiful Soup" library to scrape information from web pages of citizen portals <ref type="bibr" target="#b16">[17]</ref>.</p><p>Fig. <ref type="figure">5</ref> below demonstrates the Data Flow Diagram (DFD) <ref type="bibr" target="#b17">[18]</ref> of the data processing workflow implemented by the proposed information technology.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 5: Data processing workflow for e-government portal assessment</head><p>According to Fig. <ref type="figure">5</ref>, obtained results are displayed using Power BIa high-performance Business Intelligence (BI) tool for advanced data visualization and data-driven decision making <ref type="bibr" target="#b18">[19]</ref>.</p><p>Fig. <ref type="figure">5</ref> illustrates the hands-on usage of the proposed information technology.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">E-Government Portal Data Extraction: Citizens Viewpoint</head><p>Let us form the reference EGWPS model considering the Integrated Architecture Framework for E-Government (IAFEG) shown in Fig. <ref type="figure" target="#fig_5">6</ref>  <ref type="bibr" target="#b19">[20]</ref>. In this study, we focus on the "Social Sub-system" layer of this framework, in particular -on its "Citizens" perspective <ref type="bibr" target="#b19">[20]</ref>. The citizens' viewpoint according to IAFEG <ref type="bibr" target="#b19">[20]</ref> includes the following services (or topics) expected from an e-government web-portal:</p><p> taxation;  education;  health;  immigration;  employment. The IAFEG-based set of services the e-government web portals are expected to provide 𝑒𝐺𝑆 (from the citizens' viewpoint of the IAFEG "Social Sub-system" layer) and the keywords 𝑊 used to describe each of the services on HTML web pages are given in Table <ref type="table" target="#tab_0">1</ref>.</p><p>According to the "UN E-Government Knowledgebase" and its UN (United Nations) E-Government Survey 2022, top five countries with the highest E-Government Development Index are Denmark (0.9717), Finland (0.9533), Republic of Korea (0.9529), New Zealand (0.9432), and Iceland (0.9410).</p><p>Denmark citizen portal "Life in Denmark.dk" is shown in Fig.</p><p>7 <ref type="bibr" target="#b20">[21]</ref>. The "Life in Denmark.dk" portal offers topics related to immigration, housing, working, family and children, money and taxation, education, healthcare, travel and transportation, pension, rights, leisure and networking, as well as stand-alone digital services (Fig. <ref type="figure" target="#fig_6">7</ref>) <ref type="bibr" target="#b20">[21]</ref>. Table <ref type="table">2</ref> shows hyperlinks detected on the "Life in Denmark.dk" citizen portal according to IAFEG taxation, education, health, immigration, and employment services <ref type="bibr" target="#b19">[20]</ref>. The "Suomi.fi" portal offers similar topics to "Life in Denmark.dk". These topics are connected to family, social security, healthcare, education, working, housing, rights and obligations, finances and taxation, moving and travelling (Fig. <ref type="figure">8</ref>) <ref type="bibr">[22]</ref>. Table <ref type="table">3</ref> shows hyperlinks detected on the "Suomi.fi" citizen portal according to IAFEG <ref type="bibr" target="#b19">[20]</ref>.   Finland citizen portal "Suomi.fi" is shown in Fig. <ref type="bibr">8 [22]</ref>. In this study we focus on the top two countries (Denmark and Finland) and their citizen portals. First of all, their impact according to the E-Government Development Index is greater than 0.95. Another country, which aspirations were highly estimated is Republic of Korea, however, we failed to access its English web portal version. Therefore, we obtained the following e-government web portals evaluation results (Table <ref type="table">4</ref>). Here 𝑆𝐶 𝑖 , 𝑖 = 1,5 describe taxation, education, health, immigration, and employment services. </p><p>Here ∑ 𝐹𝑖𝑡𝑛𝑒𝑠𝑠 = 𝑇𝑟𝑢𝑒 is the number hyperlinks estimated as correctly categorized against IAFEG services <ref type="bibr" target="#b19">[20]</ref>, and ∑ 𝐹𝑖𝑡𝑛𝑒𝑠𝑠 = 𝐹𝑎𝑙𝑠𝑒 is vice versa (see Table <ref type="table">2</ref> -3). Hence, the accuracy of proposed information technology for e-government web portal assessment is 0.80 for Denmark and 0.88 for Finland. However, the total accuracy for both estimated citizen web portals "Life in Denmark.dk" and "Suomi.fi" is 0.83.</p><p>Therefore, the proposed information technology allows to obtain accurate (of 83%) e-government web portal assessment results and can be suggested scholars in social political science fields.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion and Future Work</head><p>In this paper we proposed the information technology for e-government portal assessment based on web data extraction techniques. The study aims to improve the processes of e-government web portal assessment by using web harvesting and data analysis approaches. Therefore, we developed algorithms to extract and assess e-government web portals using the proposed E-Government Web Portal Services reference model and evaluation metrics. The software implementation of the proposed technology is based on Python programming language and Power BI data visualization tool. Such a tool allows nontechnical users, i.e. social or political science scholars, to configure the desired references models and automatically assess e-government web-portals as part of their studies with the accuracy of 83%.</p><p>The following conclusions can be made after the obtained results analysis:  this approach has a room to identify the differences between e-government web portals and the services they provide;</p><p> consequently, for researchers who study and compare the state of information society formation in different countries, in terms of digital services provision, conducting such experiments can be a useful complement to other data-driven methods;</p><p> the need for interdisciplinary cooperation between social and computer science is increasing and such interdisciplinary studies can benefit both domains with new methods and solutions.</p><p>In the future we plan to elaborate metrics proposed to evaluate e-government web portals, as well as conduct a large-scale study, with more government portals and more careful sampling, to identify and study their differences. From the information technology viewpoint, such experiments require advanced techniques to be applied, such as data warehousing, data mining, and data visualization.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Proposed e-government web portal services model</figDesc><graphic coords="3,155.75,128.60,283.43,208.90" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2</head><label>2</label><figDesc>Fig. 2 below illustrates the e-government web portal data meta-model given using the UML (Unified Modeling Language) [9] class diagram.</figDesc><graphic coords="3,113.00,471.91,368.26,58.75" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Meta-model of the e-government web portal data</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Proposed algorithm for e-government services data extraction from a web portal</figDesc><graphic coords="4,127.25,331.87,340.14,293.95" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Proposed algorithm for e-government services evaluation</figDesc><graphic coords="5,113.00,340.37,368.46,261.25" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: Integrated Architecture Framework for E-Government<ref type="bibr" target="#b19">[20]</ref> </figDesc><graphic coords="7,123.18,103.30,348.65,198.40" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 7 :</head><label>7</label><figDesc>Figure 7: Denmark e-Government web portal "Life in Denmark.dk" [21]</figDesc><graphic coords="7,72.00,441.22,453.40,211.48" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>Figure 10 :</head><label>10</label><figDesc>Figure 10: Power BI dashboard Moreover, let us estimate the correlation value between 𝑆𝐵 and E-Government Development Index (EGDI) values [23]. The obtained Pearson's correlation coefficient [24] value is 1.00, which signalize absolute positive relation between EGDI estimated by UN and "service balance" scores. Finally, let us estimate the accuracy of the proposed information technology using the following formula: 𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = (∑ 𝐹𝑖𝑡𝑛𝑒𝑠𝑠 = 𝑇𝑟𝑢𝑒) (∑ 𝐹𝑖𝑡𝑛𝑒𝑠𝑠 = 𝑇𝑟𝑢𝑒) + (∑ 𝐹𝑖𝑡𝑛𝑒𝑠𝑠 = 𝐹𝑎𝑙𝑠𝑒) .(13)</figDesc><graphic coords="10,84.93,72.00,425.11,239.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="6,127.25,195.00,340.10,167.85" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="8,73.25,371.20,448.10,274.48" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Proposed EGWPS model contents based on IAFEG<ref type="bibr" target="#b19">[20]</ref> </figDesc><table><row><cell>Services</cell><cell>Keywords</cell></row><row><cell>Taxation</cell><cell>tax, finance, income, money, debt, credit</cell></row><row><cell>Education</cell><cell>education, school, study, child, training, student</cell></row><row><cell>Health</cell><cell>health, insurance, care, sick, medical, funeral</cell></row><row><cell>Immigration</cell><cell>immigration, citizen, travel, visa, residence, international</cell></row><row><cell>Employment</cell><cell>employment, work, job, business, license, certification</cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Citizen Web Portal Data Analysis</head><p>Fig. <ref type="figure">9</ref> demonstrates the structure of JSON documents produced for Power BI visualization, which contain calculated metrics for assessed citizen web portals.   Here in Fig. <ref type="figure">9</ref> we have the following JSON properties:  "ServicesDetected" represents 𝑆𝐷;  "ServiceRichness" represents 𝑆𝑅;  "ServiceCardinality" represents 𝑆𝐶 𝑖 , 𝑖 = 1,5 accroding to IAFEG citizen services of taxation, education, health, immigration, and employment;</p><p> "ServiceBalance" represents 𝑆𝐵. Fig. <ref type="figure">10</ref> illustrates the Power BI dashboard developed to visualize JSON-based data and display citizen portal web services assessment results. Analyzing obtained results (Table <ref type="table">4</ref> and Fig. <ref type="figure">9</ref> -10), we can assume that:</p><p> both evaluated "Life in Denmark.dk" and "Suomi.fi" citizen service web portals demonstrate the highest service richness values (1.00), which signalize their general correspondence to 5 citizen services defined by IAFEG <ref type="bibr" target="#b19">[20]</ref>;</p><p> evaluated citizen web portals focus differently on provided services: "Life in Denmark.dk" is mostly focused on education (1.00), immigration (0.67), and employment (0.67), while "Suomi.fi" on employment (1.00), education (0.80), and taxation (0.60);</p><p> both evaluated citizen web portals have moderate "service balance" scores of 0.67 for "Life in Denmark.dk" and 0.64 for "Suomi.fi", which confirms the previous observations.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">A</forename><surname>Edelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Wolff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Montagne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">A</forename><surname>Bail</surname></persName>
		</author>
		<idno type="DOI">10.1146/annurev-soc-121919-054621</idno>
	</analytic>
	<monogr>
		<title level="j">Computational Social Science and Sociology</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="61" to="81" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note>Annual Review of Sociology</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">D</forename><surname>Lazer</surname></persName>
		</author>
		<idno type="DOI">10.1126/science.1167742</idno>
	</analytic>
	<monogr>
		<title level="j">SOCIAL SCIENCE: Computational Social Science</title>
		<imprint>
			<biblScope unit="volume">323</biblScope>
			<biblScope unit="page" from="721" to="723" />
			<date type="published" when="2009">5915. 2009</date>
		</imprint>
	</monogr>
	<note>Science</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">F</forename><surname>Voinea</surname></persName>
		</author>
		<idno type="DOI">10.1002/9781118833162</idno>
		<title level="m">Political Attitudes: Computational and Simulation Modelling</title>
				<imprint>
			<publisher>John Wiley &amp; Sons</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Use of information and communication technologies for political science research</title>
		<author>
			<persName><forename type="first">O</forename><surname>Chornenkyi</surname></persName>
		</author>
		<idno type="DOI">10.26565/2220-8089-2022-42-06</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of V.N. Karazin Kharkiv National University</title>
		<imprint>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="page" from="38" to="44" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note>Issues of Political Science</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Big data, analytics and artificial intelligence for sustainability</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">A</forename><surname>Ojokoh</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.sciaf.2020.e00551</idno>
	</analytic>
	<monogr>
		<title level="j">Scientific African</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page">e00551</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences</title>
		<author>
			<persName><forename type="first">A</forename><surname>Luscombe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Dick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Walby</surname></persName>
		</author>
		<idno type="DOI">10.1007/s11135-021-01164-0</idno>
	</analytic>
	<monogr>
		<title level="j">Quality &amp; Quantity</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="page" from="1023" to="1044" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">V</forename><surname>Lowndes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Marsh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Stoker</surname></persName>
		</author>
		<title level="m">Theory and Methods in Political Science</title>
				<imprint>
			<publisher>Bloomsbury Academic</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
	<note>4th ed</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Gather-Narrow-Extract: A Framework for Studying Local Policy Variation Using Web-Scraping and Natural Language Processing</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">L</forename><surname>Anglin</surname></persName>
		</author>
		<idno type="DOI">10.1080/19345747.2019.1654576</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Research on Educational Effectiveness</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="685" to="706" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Unified Modeling Language: A Standard for Designing a Software</title>
		<author>
			<persName><forename type="first">J</forename><surname>Osis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Donins</surname></persName>
		</author>
		<idno type="DOI">10.1016/B978-0-12-805476-5.00001-0</idno>
	</analytic>
	<monogr>
		<title level="m">Topological UML Modeling</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="3" to="51" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">HTML5 and the evolution of HTML; tracing the origins of digital platforms</title>
		<author>
			<persName><forename type="first">R</forename><surname>Tabarés</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.techsoc.2021.101529</idno>
	</analytic>
	<monogr>
		<title level="j">Technology in Society</title>
		<imprint>
			<biblScope unit="volume">65</biblScope>
			<biblScope unit="page">101529</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Evaluation Metrics for Ontology Modules</title>
		<author>
			<persName><forename type="first">B</forename><surname>Gobin-Rahimbux</surname></persName>
		</author>
		<idno type="DOI">10.1109/ICDSIS55133.2022.9915950</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE International Conference on Data Science and Information System (ICDSIS)</title>
				<meeting><address><addrLine>Hassan, India</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022. 2022</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Sampling strategies for extracting information from large data sets</title>
		<author>
			<persName><forename type="first">A</forename><surname>Boicea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-O</forename><surname>Truică</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Rădulescu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E.-C</forename><surname>Buşe</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.datak.2018.01.002</idno>
	</analytic>
	<monogr>
		<title level="j">Data &amp; Knowledge Engineering</title>
		<imprint>
			<biblScope unit="volume">115</biblScope>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Development of software for effective enterprise product policy creation</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">L</forename><surname>Lysytskyi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">Y</forename><surname>Morhun</surname></persName>
		</author>
		<idno type="DOI">10.20998/2079-0023.2018.21.11</idno>
	</analytic>
	<monogr>
		<title level="j">Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="59" to="64" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<ptr target="https://docs.python.org/3/library/urllib.html" />
		<title level="m">urllib -URL handling modules</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<ptr target="https://docs.python.org/3/library/re.html" />
		<title level="m">re -Regular expression operations</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<ptr target="https://docs.python.org/3/library/json.html" />
		<title level="m">json -JSON encoder and decoder</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Richardson</surname></persName>
		</author>
		<ptr target="https://pypi.org/project/beautifulsoup4/" />
		<title level="m">beautifulsoup4, PyPI</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Analyzing data flow diagrams by combination of formal methods and visualization techniques</title>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Dong</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.jvlc.2018.08.001</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Visual Languages &amp; Computing</title>
		<imprint>
			<biblScope unit="volume">48</biblScope>
			<biblScope unit="page" from="41" to="51" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<ptr target="https://powerbi.microsoft.com/en-us/" />
		<title level="m">Microsoft Power BI -Interactive Data Visualization BI Tools</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Big Data Framework for Kenya&apos;s County Governments</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Luvembe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Mutai</surname></persName>
		</author>
		<idno type="DOI">10.4236/jcc.2019.71001</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Computer and Communications</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="9" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<ptr target="https://lifeindenmark.borger.dk/" />
		<title level="m">The official guide to Life in Denmark</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<ptr target="https://publicadministration.un.org/egovkb/en-us/About/Overview/-E-Government-Development-Index" />
		<title level="m">E-Government Development Index (EGDI</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Foundational methods: descriptive statistics: bivariate and multivariate data (correlations, associations)</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Hahs-Vaughn</surname></persName>
		</author>
		<idno type="DOI">10.1016/B978-0-12-818630-5.10084-3</idno>
	</analytic>
	<monogr>
		<title level="m">International Encyclopedia of Education (Fourth Edition)</title>
				<imprint>
			<publisher>Elsevier Science</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="734" to="750" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
