<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Towards the Information Technology Usage for E-Government Portal Assessment based on Web Data Extraction Techniques</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrii Kopp</string-name>
          <email>kopp93@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleksandr Chornenkyi</string-name>
          <email>chornenkyi.o.o@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>E-Government Web Portal, Citizen Portal, Information Technology</institution>
          ,
          <addr-line>Web Data Extraction</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Technical University “Kharkiv Polytechnic Institute”</institution>
          ,
          <addr-line>Kyrpychova str. 2, Kharkiv, 61002</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>V.N. Karazin Kharkiv National University</institution>
          ,
          <addr-line>Svobody sq. 4, Kharkiv, 61022</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>12</fpage>
      <lpage>22</lpage>
      <abstract>
        <p>Today, interdisciplinary studies in computer science and social sciences, including political science, are inevitable due to the need to work with web-based sources to gain valuable insights, process large amounts of data, and apply various data analysis techniques. Web data extraction or web scraping is important for social and political studies when it is necessary to retrieve data arrays from a website for future analytical processing. Such automatic data collection and processing is a promising interdisciplinary field for social scientists and computer scientists. Therefore, this study aims to improve e-government web portal evaluation processes by proposing a corresponding information technology based on web data extraction techniques. The software implementation of the proposed technology is based on Python and Power BI for computation and visualization, respectively. The proposed toolkit was used to analyze the e-government web portals of two countries selected on the basis of their high egovernment development index, the obtained results of prevailing services on each of citizen portals were analyzed and discussed, and the corresponding conclusions were made.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction 1.1.</title>
    </sec>
    <sec id="sec-2">
      <title>Motivation</title>
      <p>
        Nowadays, the rapid evolution of computer technologies changes scientific approaches to modern
issues and provides ways for creating novel and enhancing existing research methods. It is especially
considerable for applied science wherein computational technique implementation accelerates the
complex applied problem solution requiring large volumes of calculations. Social sciences, which have
historical relations with philosophy, have a peculiar wide range of research methods. However, social
sciences are also in a transformation state and increasingly using computing technology for research
problem solving, which has led to the emergence of computational social sciences. Initially,
computational social sciences were associated with agent-based modeling for the simulation of the
behavior of an individual or social group under certain conditions. Nevertheless, the Internet spreading,
social networks and online platforms popularity increasing within the growth of numbers of Internet
users provoked a new large stream of digital data which has become a valuable source of information
for social sciences researchers and has led to the expansion of the concept of “computational social
science” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Although earlier researchers have argued that digital data analysis-based computational
social science has developed slowly [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], more recent studies show that in recent times increased the
interest of social sciences scholars in using computational techniques for research [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
1.2.
      </p>
      <p>
        2023 Copyright for this paper by its authors.
CEUR
trying to predict election results. Typically, all studies were based on the use of agent-based modeling
in different form, using classical theories of political interactions [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The further evolution of the
Internet, increasing the power and availability of computer technology provided new research fields for
political scientists and tools for expanding methodology. Political science methodology expansion led
to the fact that in addition to agent-based modeling, political scientists more often were beginning to
use methods related to big data analysis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Scholars in social and political science address “big data” as a broad concept that includes any digital
elements left by users or organizations on the Internet that can be read by information technologies [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
In the computer science field, big data is usually associated with so-called “5Vs” used to describe its
characteristics (value, variety, velocity, veracity, and volume) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Today, for social sciences fields and particularly for political science, when working with big data,
it is important to use web scraping tools, which is the automatic extraction of data from websites for
further analytical processing [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. For political science, this approach can be valuable for defining
features of how political parties or government agencies use their websites [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and for researching local
politics through the mining and analysis of unstructured data from the websites of local government
institutions [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Some researchers conclude and we agree that the use of automated data mining in social
sciences opens a new way for cooperation between social and computer science researchers [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>It should be stressed that political science must be in the continuous dynamic movement condition
and must permanently react to political changes in the modern world. Today, the policies of many
countries aim at the formation of an inclusive information society, which includes widespread digital
transformation. Governments create and support open e-government web portals, which aim to facilitate
citizens’ access to government information and improve the process of providing government services
to citizens. However, it should be noted that the politics of different states differ from each other and
may have various accents. Under such circumstances, it may be interesting how the policies of different
states influence their e-government web portals.</p>
      <p>Thus, we propose an information technology that can help researchers to explore services provided
on government web portals, which can be useful for further analysis of different countries’ policies. It
aims to improve e-government web portal evaluation processes by using web harvesting techniques.</p>
      <p>Therefore, this study is expected to answer the following research questions:
 What reference model can be used to evaluate the e-government web portal?
 What algorithms can be used to process and harvest the desired e-government web portal data?
 How can the extracted data be quantitatively evaluated to compare the policies of different
countries and define the prevailing citizen services?
2. Materials and Methods</p>
    </sec>
    <sec id="sec-3">
      <title>2.1. E-Government Web Portal Services Model</title>
      <p>}.</p>
      <p>Let us formally describe the set of services that the e-government web portal is expected to provide:
(1)</p>
      <p>Here  is the number of services  1,  2, … ,   the e-government web portal is expected to
provide,  = 1,  .</p>
      <p>Moreover, for each of the e-government web portal services   ,  = 1,  we propose to define the
set of keywords   ,  = 1,  which completely describes the mentioned service:</p>
      <p>Here  is the set of keyword sets mapped to each of the e-government web portal services,  =
{ 1,  2, … ,   }. Let us graphically illustrate in Fig. 1 the proposed e-government web portal services
 : 
 →   = {  1,   2, … ,     }.</p>
      <p>= 〈   ,  ,  〉.</p>
      <p>Here   is the number of synonymic keywords   1,   2, … ,     in   ,  = 1,  defined for  -th
egovernment web portal service   ,  = 1,   .</p>
      <p>Hence, the formal definition of E-Government Web Portal Services (EGWPS) model can be
formulated as given below:
(2)
(3)
model. Fig. 1 demonstrates the set of expected services and their keywords used to detect such services
on the e-government web portals under assessment. Using the proposed EGWPS model (Fig. 1), we
propose to find the “distance” between the e-government web portal under assessment and the so-called
“perfect” e-government web portal (in terms of its contents) described by this model.</p>
      <p>Therefore, we propose to use the web data extraction (or web scraping, web harvesting etc.)
technique to automatically explore and assess e-government web portals against the proposed EGWPS
model (Fig. 1).
2.2.</p>
    </sec>
    <sec id="sec-4">
      <title>Web Data Extraction Algorithm</title>
      <p>Given:
set of extracted e-government web portal hyperlinks 
EGWPS model 〈</p>
      <p>,  ,  〉
empty set of detected e-government web portal services 
for each 
 in</p>
      <p>:
for each ℎ</p>
      <p>in  :
  =  (
for each  

)
in   :
if   is a substring of ℎ text:</p>
      <p>← ℎ
by the e-government web portal using the proposed EGWPS model (Fig. 1) and the meta-model (Fig.</p>
      <sec id="sec-4-1">
        <title>2), the following algorithm should be used:</title>
        <p>end
end</p>
        <p>end</p>
        <p>The input set  of the e-government web portal hyperlinks can be extracted using web scraping tools
in Python or other programming languages.</p>
        <p>The output set  basically represents the instances of Service class defined in the proposed
metamodel (Fig. 2). Furthermore, each service   ,  = 1,  has multiple hyperlinks that belong to  .</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>E-Government Portal Assessment Metrics</title>
      <p>Finally, we propose the following metrics to assess the e-government web portal in terms of detected
services. The following metric allows to find the number of services detected in the e-government web
portal under assessment:</p>
      <p>= |{  ∈  ,   ≠ ∅}|.</p>
      <p>
        The following metric allows to find the “service richness” calculated as the relative number of
services of the e-government web portal under assessment in comparison to the reference EGWPS
model (Fig. 1) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]:
(7)
      </p>
      <p>
        The following metric allows to find the “relative cardinality” of the particular service calculated as
the relative number of hyperlinks used to implement the  -th service detected in the e-government web
portal under assessment in comparison to the maximum possible number of hyperlinks used in the same
web portal for a certain service [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]:
      </p>
      <p>The following metric allows to find the total “service balance” to assess the balance of hyperlinks
related to services detected in the e-government web portal under assessment:</p>
      <p>Using the following algorithm, it is possible to evaluate an e-government web portal against the
EGWPS model. Thus, as a reference model, we can use the experience and best practices of the most
advanced e-government web portals, define the set of services 
a portal is expected to provide, and
the keywords</p>
      <p>to detect such services in corresponding HTML web pages.</p>
      <p>.</p>
      <p>=1
  =
{
max | 
 =1,</p>
      <p>|
0,
|  |,
 =1,
max |  | &gt; 0
 =1,
max |  | = 0
(8)
(9)
(10)
= 〈
,  ,</p>
      <p>〉.
= (
= { 1,  2}, 
⊂  ×  ).</p>
      <p>Here</p>
      <p>
        is the algorithmic model, which includes the proposed algorithms (Fig. 3 and Fig. 4) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]:
Here  is the set of algorithms, where  1 is the data extraction algorithm and  2 is the evaluation
algorithm;  describes the interconnections between the proposed algorithms when used to assess an
e(11)
(12)
government web portal. Selected e-government web portals will be analyzed using the proposed
information technology implemented using Python, in-build packages, and third-party libraries:
 “urllib” – used the “request” module to open and work with URLs [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ];
 “re” – for regular expressions operations to parse web pages [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ];
 “json” – to save results as JSON (JavaScript Object Notation) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ];
 “bs4” – used the “Beautiful Soup” library to scrape information from web pages of citizen portals
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        Fig. 5 below demonstrates the Data Flow Diagram (DFD) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] of the data processing workflow
implemented by the proposed information technology.
      </p>
      <p>
        According to Fig. 5, obtained results are displayed using Power BI – a high-performance Business
Intelligence (BI) tool for advanced data visualization and data-driven decision making [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>Fig. 5 illustrates the hands-on usage of the proposed information technology.</p>
    </sec>
    <sec id="sec-6">
      <title>3. Results and Discussion</title>
    </sec>
    <sec id="sec-7">
      <title>3.1. E-Government Portal Data Extraction: Citizens Viewpoint</title>
      <p>
        Let us form the reference EGWPS model considering the Integrated Architecture Framework for
EGovernment (IAFEG) shown in Fig. 6 [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In this study, we focus on the “Social Sub-system” layer
of this framework, in particular – on its “Citizens” perspective [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The citizens’ viewpoint according
to IAFEG [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] includes the following services (or topics) expected from an e-government web-portal:
 taxation;
 education;
 health;
 immigration;
 employment.
      </p>
      <p>The IAFEG-based set of services the e-government web portals are expected to provide  (from
the citizens’ viewpoint of the IAFEG “Social Sub-system” layer) and the keywords  used to describe
each of the services on HTML web pages are given in Table 1.</p>
      <p>According to the “UN E-Government Knowledgebase” and its UN (United Nations) E-Government
Survey 2022, top five countries with the highest E-Government Development Index are Denmark
(0.9717), Finland (0.9533), Republic of Korea (0.9529), New Zealand (0.9432), and Iceland (0.9410).</p>
      <p>
        Denmark citizen portal “Life in Denmark.dk” is shown in Fig. 7 [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. The “Life in Denmark.dk”
portal offers topics related to immigration, housing, working, family and children, money and taxation,
education, healthcare, travel and transportation, pension, rights, leisure and networking, as well as
stand-alone digital services (Fig. 7) [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Table 2 shows hyperlinks detected on the “Life in
Denmark.dk” citizen portal according to IAFEG taxation, education, health, immigration, and
employment services [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The “Suomi.fi” portal offers similar topics to “Life in Denmark.dk”. These
topics are connected to family, social security, healthcare, education, working, housing, rights and
obligations, finances and taxation, moving and travelling (Fig. 8) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. Table 3 shows hyperlinks
detected on the “Suomi.fi” citizen portal according to IAFEG [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>Keywords
tax, finance, income, money, debt, credit
education, school, study, child, training, student</p>
      <p>health, insurance, care, sick, medical, funeral
immigration, citizen, travel, visa, residence, international</p>
      <p>employment, work, job, business, license, certification</p>
      <p>
        Finland citizen portal “Suomi.fi” is shown in Fig. 8 [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. In this study we focus on the top two
countries (Denmark and Finland) and their citizen portals. First of all, their impact according to the
EGovernment Development Index is greater than 0.95. Another country, which aspirations were highly
estimated is Republic of Korea, however, we failed to access its English web portal version.
      </p>
      <p>Therefore, we obtained the following e-government web portals evaluation results (Table 4). Here
  ,  = 1,5 describe taxation, education, health, immigration, and employment services.</p>
    </sec>
    <sec id="sec-8">
      <title>Citizen Web Portal Data Analysis</title>
      <sec id="sec-8-1">
        <title>Here in Fig. 9 we have the following JSON properties:</title>
        <p> “ServicesDetected” represents  ;
 “ServiceRichness” represents  ;
 “ServiceCardinality” represents   ,  = 1,5 accroding to IAFEG citizen services of taxation,
education, health, immigration, and employment;
 “ServiceBalance” represents  .</p>
        <p>Fig. 10 illustrates the Power BI dashboard developed to visualize JSON-based data and display
citizen portal web services assessment results. Analyzing obtained results (Table 4 and Fig. 9 – 10), we
can assume that:</p>
        <p>
           both evaluated “Life in Denmark.dk” and “Suomi.fi” citizen service web portals demonstrate the
highest service richness values (1.00), which signalize their general correspondence to 5 citizen services
defined by IAFEG [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ];
        </p>
        <p> evaluated citizen web portals focus differently on provided services: “Life in Denmark.dk” is
mostly focused on education (1.00), immigration (0.67), and employment (0.67), while “Suomi.fi” on
employment (1.00), education (0.80), and taxation (0.60);</p>
        <p> both evaluated citizen web portals have moderate “service balance” scores of 0.67 for “Life in
Denmark.dk” and 0.64 for “Suomi.fi”, which confirms the previous observations.</p>
        <p>
          Moreover, let us estimate the correlation value between  and E-Government Development Index
(EGDI) values [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. The obtained Pearson’s correlation coefficient [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] value is 1.00, which signalize
absolute positive relation between EGDI estimated by UN and “service balance” scores. Finally, let us
estimate the accuracy of the proposed information technology using the following formula:
(∑ =   )
(∑ =  ) + (∑ =
        </p>
        <p>
          Here ∑ =  is the number hyperlinks estimated as correctly categorized against IAFEG
services [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], and ∑ =  is vice versa (see Table 2 – 3). Hence, the accuracy of proposed
information technology for e-government web portal assessment is 0.80 for Denmark and 0.88 for
Finland. However, the total accuracy for both estimated citizen web portals “Life in Denmark.dk” and
“Suomi.fi” is 0.83.
        </p>
        <p>Therefore, the proposed information technology allows to obtain accurate (of 83%) e-government
web portal assessment results and can be suggested scholars in social political science fields.
(13)</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>4. Conclusion and Future Work</title>
      <p>In this paper we proposed the information technology for e-government web portal assessment based
on web data extraction techniques. The study aims to improve the processes of e-government web portal
assessment by using web harvesting and data analysis approaches. Therefore, we developed algorithms
to extract and assess e-government web portals using the proposed E-Government Web Portal Services
reference model and evaluation metrics. The software implementation of the proposed technology is
based on Python programming language and Power BI data visualization tool. Such a tool allows
nontechnical users, i.e. social or political science scholars, to configure the desired references models and
automatically assess e-government web-portals as part of their studies with the accuracy of 83%.</p>
      <p>The following conclusions can be made after the obtained results analysis:
 this approach has a room to identify the differences between e-government web portals and the
services they provide;</p>
      <p> consequently, for researchers who study and compare the state of information society formation
in different countries, in terms of digital services provision, conducting such experiments can be a useful
complement to other data-driven methods;</p>
      <p> the need for interdisciplinary cooperation between social and computer science is increasing and
such interdisciplinary studies can benefit both domains with new methods and solutions.</p>
      <p>In the future we plan to elaborate metrics proposed to evaluate e-government web portals, as well as
conduct a large-scale study, with more government portals and more careful sampling, to identify and
study their differences. From the information technology viewpoint, such experiments require advanced
techniques to be applied, such as data warehousing, data mining, and data visualization.</p>
    </sec>
    <sec id="sec-10">
      <title>5. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Edelmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wolff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Montagne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Bail</surname>
          </string-name>
          ,
          <source>Computational Social Science and Sociology, Annual Review of Sociology</source>
          <volume>46</volume>
          (
          <issue>1</issue>
          ) (
          <year>2020</year>
          )
          <fpage>61</fpage>
          -
          <lpage>81</lpage>
          . doi:
          <volume>10</volume>
          .1146/annurev-soc-
          <volume>121919</volume>
          -054621.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Lazer</surname>
          </string-name>
          et al.,
          <source>SOCIAL SCIENCE: Computational Social Science, Science</source>
          <volume>323</volume>
          (
          <issue>5915</issue>
          ) (
          <year>2009</year>
          )
          <fpage>721</fpage>
          -
          <lpage>723</lpage>
          . doi:
          <volume>10</volume>
          .1126/science.1167742.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. F.</given-names>
            <surname>Voinea</surname>
          </string-name>
          , Political Attitudes: Computational and
          <string-name>
            <given-names>Simulation</given-names>
            <surname>Modelling</surname>
          </string-name>
          , John Wiley &amp; Sons,
          <year>2016</year>
          . doi:
          <volume>10</volume>
          .1002/9781118833162.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>O.</given-names>
            <surname>Chornenkyi</surname>
          </string-name>
          ,
          <article-title>Use of information and communication technologies for political science research</article-title>
          ,
          <source>The Journal of V.N. Karazin Kharkiv National University: Issues of Political Science</source>
          <volume>42</volume>
          (
          <year>2022</year>
          )
          <fpage>38</fpage>
          -
          <lpage>44</lpage>
          . doi:
          <volume>10</volume>
          .26565/
          <fpage>2220</fpage>
          -8089-2022-42-06.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Ojokoh</surname>
          </string-name>
          et al.,
          <article-title>Big data, analytics and artificial intelligence for sustainability</article-title>
          ,
          <source>Scientific African</source>
          <volume>9</volume>
          (
          <year>2020</year>
          )
          <article-title>e00551</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.sciaf.
          <year>2020</year>
          .e00551.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Luscombe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Walby</surname>
          </string-name>
          ,
          <article-title>Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences</article-title>
          ,
          <source>Quality &amp; Quantity</source>
          <volume>56</volume>
          (
          <year>2022</year>
          )
          <fpage>1023</fpage>
          -
          <lpage>1044</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11135-021-01164-0.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>V.</given-names>
            <surname>Lowndes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Marsh</surname>
          </string-name>
          , G. Stoker, Theory and Methods in Political Science, 4th ed. Bloomsbury Academic,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K. L.</given-names>
            <surname>Anglin</surname>
          </string-name>
          ,
          <string-name>
            <surname>Gather-</surname>
          </string-name>
          Narrow-Extract:
          <article-title>A Framework for Studying Local Policy Variation Using Web-Scraping and Natural Language Processing</article-title>
          ,
          <source>Journal of Research on Educational Effectiveness</source>
          <volume>12</volume>
          (
          <issue>4</issue>
          ) (
          <year>2019</year>
          )
          <fpage>685</fpage>
          -
          <lpage>706</lpage>
          . doi:
          <volume>10</volume>
          .1080/19345747.
          <year>2019</year>
          .
          <volume>1654576</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Osis</surname>
          </string-name>
          , U. Donins,
          <article-title>Unified Modeling Language: A Standard for Designing a Software</article-title>
          ,
          <source>in: Topological UML Modeling</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .1016/B978-0
          <source>-12-805476-5</source>
          .
          <fpage>00001</fpage>
          -
          <lpage>0</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Tabarés</surname>
          </string-name>
          ,
          <article-title>HTML5 and the evolution of HTML; tracing the origins of digital platforms</article-title>
          ,
          <source>Technology in Society 65</source>
          (
          <year>2021</year>
          )
          <article-title>101529</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.techsoc.
          <year>2021</year>
          .
          <volume>101529</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>B.</given-names>
            <surname>Gobin-Rahimbux</surname>
          </string-name>
          ,
          <article-title>Evaluation Metrics for Ontology Modules</article-title>
          ,
          <source>2022 IEEE International Conference on Data Science and Information System (ICDSIS)</source>
          , Hassan, India,
          <year>2022</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . doi:
          <volume>10</volume>
          .1109/ICDSIS55133.
          <year>2022</year>
          .
          <volume>9915950</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Boicea</surname>
          </string-name>
          , C.
          <article-title>-</article-title>
          <string-name>
            <surname>O. Truică</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Rădulescu</surname>
          </string-name>
          , E.-C. Buşe,
          <article-title>Sampling strategies for extracting information from large data sets</article-title>
          ,
          <source>Data &amp; Knowledge Engineering</source>
          <volume>115</volume>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.datak.
          <year>2018</year>
          .
          <volume>01</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>V. L.</given-names>
            <surname>Lysytskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. Y.</given-names>
            <surname>Morhun</surname>
          </string-name>
          ,
          <article-title>Development of software for effective enterprise product policy creation</article-title>
          ,
          <source>Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies</source>
          <volume>21</volume>
          (
          <year>2018</year>
          )
          <fpage>59</fpage>
          -
          <lpage>64</lpage>
          . doi:
          <volume>10</volume>
          .20998/2079-
          <fpage>0023</fpage>
          .
          <year>2018</year>
          .
          <volume>21</volume>
          .11.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <article-title>urllib - URL handling modules</article-title>
          . URL: https://docs.python.org/3/library/urllib.html.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <fpage>re</fpage>
          -
          <article-title>Regular expression operations</article-title>
          . URL: https://docs.python.org/3/library/re.html.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <article-title>json - JSON encoder and decoder</article-title>
          . URL: https://docs.python.org/3/library/json.html.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Richardson</surname>
          </string-name>
          , beautifulsoup4, PyPI,
          <year>2019</year>
          . URL: https://pypi.org/project/beautifulsoup4/.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , W. Liu,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <article-title>Analyzing data flow diagrams by combination of formal methods and visualization techniques</article-title>
          ,
          <source>Journal of Visual Languages &amp; Computing</source>
          <volume>48</volume>
          (
          <year>2018</year>
          )
          <fpage>41</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jvlc.
          <year>2018</year>
          .
          <volume>08</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Microsoft</given-names>
            <surname>Power BI - Interactive Data Visualization BI</surname>
          </string-name>
          <article-title>Tools</article-title>
          . URL: https://powerbi.microsoft.com/en-us/.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>A. M. Luvembe</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Mutai</surname>
          </string-name>
          ,
          <article-title>Big Data Framework for Kenya's County Governments</article-title>
          ,
          <source>Journal of Computer and Communications</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ) (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . doi:
          <volume>10</volume>
          .4236/jcc.
          <year>2019</year>
          .
          <volume>71001</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <article-title>The official guide to Life in Denmark</article-title>
          . URL: https://lifeindenmark.borger.dk/.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>[22] Citizens - Suomi.fi. URL: https://www.suomi.fi/citizen/.</mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>E-Government Development</surname>
          </string-name>
          <article-title>Index (EGDI)</article-title>
          . URL: https://publicadministration.un.org/egovkb/enus/About/Overview/-E
          <string-name>
            <surname>-Government-</surname>
          </string-name>
          Development-Index.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Hahs-Vaughn</surname>
          </string-name>
          ,
          <article-title>Foundational methods: descriptive statistics: bivariate and multivariate data (correlations, associations)</article-title>
          ,
          <source>in: International Encyclopedia of Education (Fourth Edition)</source>
          ,
          <source>Elsevier Science</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>734</fpage>
          -
          <lpage>750</lpage>
          . doi:
          <volume>10</volume>
          .1016/B978-0
          <source>-12-818630-5</source>
          .
          <fpage>10084</fpage>
          -
          <lpage>3</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>