<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Identification and Modeling of Historiographic Data in the Content of Web Forums</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Lviv Polytechnic National University</institution>
          ,
          <addr-line>Lviv 79013</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The research has developed a series of steps to detect and simulate historiographic information on web forums through Web-Scraping, Data mining and Big Data analytics. The system of posting ranking is described in order to determine the relevance of the content for the needs of the historian, developed an algorithm that determines the significance of the system, its impact on the quality of the historical research. The given code snippets in the SQL query language to get the most useful aggregate numeric and text data will help simplify the research methods and develop new ways to identify knowledge and information in the content of web-forums of historical topics. Monitoring of web-forum user's activity using time series analysis has been completed. The rush hour for the generation of historiographic data in selected thematic sections of the web forum was determined. It is substantiated that processing technologies use for a large number of text data using Natural Language Processing (NLP) and Deep Learning will also allow automating the detection of valuable characteristics and features of the obtained data.</p>
      </abstract>
      <kwd-group>
        <kwd>Social Networks</kwd>
        <kwd>Historiographic Information</kwd>
        <kwd>Python</kwd>
        <kwd>User Behavior</kwd>
        <kwd>Timeseries Data</kwd>
        <kwd>Web Scraping</kwd>
        <kwd>Data Mining</kwd>
        <kwd>Data Warehouse</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Informatization has led to an exponential increase in the amount of information, the
creation of local and global systems and networks, databases and knowledge, the
emergence of fundamentally new technologies that radically changed the
methodology of historical research. The intellectual and informational potential has become the
result of informatization, and the development of both depends on the intensity of the
information society process. These two concepts can be combined in one - the
information and cognitive potential that characterizes the process of informatization. An
important component of information and cognitive potential is the intellectual
potential, which manifests the ability of a person to solve problems using the accumulated
knowledge, skills and experience. The second component is the information potential,
which provides the necessary level of awareness of members of society, that is, the
ability to summarize, search, store and transmit information. Accumulation of
information and intellectual potential as a result of informatization is the accumulation of
historiographical information and the formation of a new resource of historical
knowledge in the Internet environment.</p>
      <p>Exponential growth of electronic resources, observed at the beginning of the XXI
century, opens new perspectives for the development of historical research. The
global computer network of the Internet is a complete source of professional
information for historians. An additional tool for cognition for historians is the study
of a large array of unstructured historical information generated in web forums. To do
this in Digital history, it is important to use various methods of data mining to
automatically detect web documents, retrieve information from web forums, and
identify its general patterns on the Internet.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Scholarly works include a variety of areas for applying knowledge and data
consolidation, using a wide range of resources for research on web communities. E.
Trunzer examines high-performance architecture for gathering and consolidating data
from multiple sources and resources into a single repository, their preparation for
processing, and Big Data Analytics [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The web forum content usage and social
network data analysis for conducting qualitative research, as well as the processes for
data collection, and new knowledge and information acquisition from their processing
is considered in B. McKenna, M. D. Myers and M. Newman.
      </p>
      <p>
        The indicated methods of carrying out qualitative research of data of information
systems, ways of preparation and data collection with the help of Web scraping
frameworks with the purpose of obtaining unique scientific results by researchers in
various fields of science [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        The transformation of large amounts of data into Smart Data in a wide range of digital
humanities (Digital History, Digital Sociology, etc.) is defined in the work of
M. Zeng, which describes ways of transforming "raw" data into useful for historians
and humanities information in order to identify additional knowledge [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
In N. Khymytsia, T. Ustiyanovich work the importance of the research is that it
presents Big Data and history usage [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Scientists P. Zhezhnych, N. Khymytsia,
S. Lisina, O. Morushko in their study carried out a comprehensive analysis and
systematization of mathematical and computer-oriented methods of processing
historical information and generalized the historical experience of using information
technologies in the Ukrainian historiography of science [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In a study by K. Artem,
N. Kunanets, R. Holoshchuk, V. Pasichnik and A. Rzheuskyi conducting scientific
research on the electronic science platform requires the establishment of effective
communication between virtual team members [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        At the same time, in the context of our study, valuable works are those that deal with
virtual communities flow. It is examined in K. Miller studies [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Web communication
formation and text mining refers to H. Rheingold's works [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Other authors consider
virtual communities as a means of communication and education. In particular,
N. Kristakys and J. Fowler on the analysis of social networks show that network
activity is productive [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Works by the scientists R. G. Howard are valuable for our
study as well [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and Yalan, Y., Xianjin, Z., Jinchao, Z., &amp; Xiaorong, H, [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
Historiographic aspect of information about the situation in the ATO area in virtual
communities is examined in the works of A. Peleshchyshyn and N. Khymytsia [
        <xref ref-type="bibr" rid="ref12 ref13">12,
13</xref>
        ]. Language and socio-demographic differences of Internet communications are
explained and covered in scientific publications of S. Fedushko [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Communication
interaction features based on Web forums are analyzed in details by O.
TymovchakMaksymets, O. Trach, V. Vus [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In the works of T. Bilushchak, A. Peleshchyshyn,
M. Komova the historical information searching technology, taking into account
information potential, means of observation, and a retrospective analysis of events
development were considered. The algorithms of search and identification of Internet
sources of historical facts and preconditions that influence truth of historical event
witnesses or the author Internet source are presented [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. System for text data
analysis and mining from web-forum content, which helps to determine author
contibution to specific text work or web-post, was developed and described in the
work of I. Khomytska, V. Teslyuk, A. Holovatyy, and O. Morushko and others [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
Spam and discussion detection using NLP tools and text mining methods is developed
in the research of Y. Chen and H. Chen. Scraping of web-forum based data help to
collect huge amounts information and get it processed [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Linguistic method for
web-content comparison is used in the scientific work of P. Zhezhnych and
O. Markiv, using documentation data and automated tools for automated filling of
tourism documentation via web-forum content [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>The purpose of our study is to apply interpreted object-oriented programming
language Python for data collection, integration into the database; use of the data
warehouse and the language of SQL queries for processing the historiographic information
of web forums and determining the relevance of the data obtained for historians.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Main Part</title>
      <p>In modern conditions, changing the paradigm of historical science, the use of various
research resources of the Internet is a particularly urgent task. Within the framework
of Digital history, scientists will actively involve various information resources for
the study of recent history; sites that cover virtual and real scientific and historical
communication; sites that have sources and special research on history; sites of
scientific and social funds that support historical scientific and educational projects.
Among the wide variety of modern research resources of the Internet, it is worth
highlighting the source-research potential of direct-communication resources, since
they maximize the function of communication, and therefore generate the primary and
secondary sources of historical information. The specifics of directly-communicating
Internet resources (partnerships, forums, social networks, blogs, multimedia web
communities, and so on) lies in their in interactivity, embodied in the very technology
of the WWW.</p>
      <p>
        The most popular types of web communities are forums that are designed to
communicate with network users. Features of social network historical information
are these: information content is unstructured, discussions arise spontaneously around
various information drives (photos, posts of participants, discussions) [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
A historian who uses web forum content should keep in mind that information for
various reasons may be removed or temporarily closed to third-party users. Therefore,
the researcher faces the task of quick identifying, collecting and processing thematic
information. For qualitative analysis of unstructured and voluminous historiographic
information, it is important for historic experts to use the best methods of intelligent
analysis, based on Web Mining system and Data Mining technology.
      </p>
      <p>Data processing involves several stages that will enable the collected data to be
consolidated and integrated into warehouse, make it accessible for later processing,
provide historians with convenient and reliable work with the information they
receive. Thus, in Fig.1, the main components of the data workflow from the web
forums into historiographic information database are highlighted.</p>
      <p>The first step involves generating a request for historical information source (web
forum), parsing data with R/Python programming languages libraries and functions
according to website structure (see Fig.2). Each structural element has its own tag,
which is fairly easy to obtain by parsing and web scraping techniques, but one of the
major drawbacks of this method is the periodic change of class names and identifiers
contained within the HTML tag structure of the site. The frequent parsing problem of
web forums is data encoding (Javascript tags within HTML-tags), which requires
additional requests (Ajax request etc.) generation and data decoding techniques.</p>
      <p>After that, data characteristics with applying of natural language processing/
understanding (NLP, NLU) are determined automatally, which helps to obtain the
maximum amount of useful information, such as: keywords, presence of discussion in the
post, negative or positive post content and other features. So, the collected data must
be integrated into data warehouse or relational database (see Fig.3).</p>
      <p>The main table - "Posts", consisting of 10 characteristics and has relations to other
tables. Automated data processing and the ability to use regular expressions, NLP /
NLU and Deep learning will allow us to identify such characteristics as: a keyword, a
historical events that are described in the posts, the presence of discussion in the post,
chronological boundaries, topics, etc.</p>
      <p>Extracted and integrated into the database posts will be sorted by using the described
below data arranging method, which allows to highlight the most relevant and
interesting posts and those that contain useful for historians information.
The next step is to determine the effectiveness of the ranking system by comparing
the ID order by increasing of the unique post ID (Post_ID in the main table of the
"Posts" database), which is the initial state; and after the post ranking process and
their new sorting order definition in the main table. In order to execute this, an
algorithm has been developed that identifies the difference between two sets of
consolidated data.</p>
      <p>The algorithm works as follows: two datasets are accepted at the input, which must
contain the post identifier (Post_ID). The first set of data - the initial one, which was
not used for post ranking, the second set - the one that "passed" the post ranking
system. After that, the input table length is determined. If it is equal one to another,
depending on the position of the posts and their identifiers in both sets of data, the
similarity between argument 1 and argument 2 is calculated. The result is percentage of
similarity. The lower the percentage of similarity is, the greater significance the post
ranking system has. If the dataset length differs, the program does not perform
computations.
The final stage is ready-for-processing data extraction for historians, which will
greatly simplify their work, and allow a qualitative summarizing and hypothesis
development and building based on data analysis of posts from web forums containing
historiographical information. Data querying from an existing warehouse is possible by
using the SQL query language, working with relational databases. Complex queries
formation for obtaining valuable information will allow getting the most necessary
information in a short time spans and without high resource usage.
In addition, the use of ranking system will allow us to receive up-to-date information
quickly. Historiographic post ranking methods: summing up the existing data
characteristics, which will allow to calculate the information content of a separate post
stored in the database. For the posts ranking system, the following characteristics will
be used from the "Posts" table of the historiographical web-forum information
database: post date and time publication, number of keywords, presence of graphic images
/ illustrations in the post, appropriateness of current number of words with the optimal
number of words, reference to certain resources, and presence of discussion.
Therefore, most features will be sorted by the system and / or filtered by the end user (a
specialist, historian). The appropriateness to the optimal length of a post calculation is
possible using the following equation:
(1)
(2),
  = |  −   |,
where   – value that defines the optimal post's length. The lower this value, the
higher is the appropriateness;
  – number of words in the ni post;
  – the optimal number of words for a web-forum post.</p>
      <p>
        Characteristics that correspond to the logical values True or False will respectively
represent the value displayed on the equation below:
  = {   = 1   = 
  = 0   = 
where   – a characteristic that corresponds to the value True or False (the presence of
images in the post, calls to a certain resource, the presence of discussion in a post etc.)
User activity monitoring in a web-forum is an important feature that should be taken
into account during the stage of conducting exploratory data analysis. It will reveal
certain patterns of online activity, execute some kind of timeseries analysis and obtain
information about when most of data in the historical web forum content is
generated. Powerful libraries and tools that are part of Python's programming language allow
us to collect huge amounts of data from the Internet. For timeseries analysis and
activity monitoring, these tools were used, and the transformation &amp; visualization of the
collected data was carried out in order to obtain certain insights on forum users
activity. Timeseries analysis is a systematic approach by which mathematical and statistical
questions are answered posed by time correlations [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>
        Most research information products and ways to improve them occupies precisely
analysis of user behavior, in particular their activity [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. On Fig. 4. reflects the
activity of users of the historical web forum in a separate thematic block, discussion
of historical issues which lasted for a month.
From this graph, we can see that activity was almost always the same and quite stable
(about 50-75 posts daily), the peak of activity is at the beginning of the second half of
the month. That is, the most active discussion in the forum occurs when the number of
posts crosses the middle, after which the activity lasts for a certain period of time and
gradually decreases.
      </p>
      <p>The key point is to determine the weekday of deploying a discussion on web forums,
especially relevant for historical topics, which allows you to determine when users
themselves are able to participate in resolving disputed issues of the past as well as
the present. So, as we can see in the graph, most users of historical, military-historical
web forums are active on weekdays, whereas on weekends they are less and less
timeconsuming. High activity is observed at the beginning of the week, but gradually it
decreases. Later it rises again (Thursday), and then it finally becomes minimal
(Saturday, Sunday).</p>
      <p>It's important to watch the time of the day when new content is being published on a
web forum. This allows the user, including the historian, to choose the best time to
publish his own post so that other members of the web forum can familiarize with and
respond to it. At the web-site under investigation, the peak of activity was at lunch
(13: 00-15: 00). If we take a gap from 5:00 to 20:00, we can observe a normal
distribution, while from 00:00 to 5:00 - the distribution of Poisson.
Determining the activity of the users of the forum has allowed us to identify the
necessary timeframes for active deployment of discussion on web forum pages, to
model user behavior and analyze it. It also helped to find certain correlations between
the time of day, day of the week and the number of generated posts in the content of
the historical web forum. Consequently, most users are actively posting new posts on
weekdays, especially during the day. Many of them join the discussion on the forum,
only when it appeared in a sufficient number of posts (more than half). This allows
you to find a good time to collect data, as well as participate in discussions in
historical web forums.</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>Modern web forums generate unique sources of historical information that contain
large volumes of important, valuable information from eyewitness events.
Investigating such a large array of unstructured historical information through web
forums is an additional tool of cognition for historians. Web forums provide wide
access to information that has not been filtered. Web Mining technology provides a
high-quality analysis of the content of web-forums and allows you to get new
information about historical events, to conduct operational monitoring of data. Using
interpreted object-oriented programming language Python helps to automate the
following processes: data collection, identification of the main characteristics of the
individual structural elements of the forum, data acqusition, data computing and
processing. The tools used in the research provided for the rapid processing of tabular
data and helped to identify the most interesting characteristics of historical
information that was generated in the content of web forums.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Trunzer</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kirchen</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Folmer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koltun</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vogel-Heuser</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A flexible architecture for data mining from heterogeneous data sources in automated production systems</article-title>
          .
          <source>In: 2017 IEEE International Conference on Industrial Technology, ICIT 2017</source>
          , pp.
          <fpage>1106</fpage>
          -
          <lpage>1111</lpage>
          . Toronto, Canada (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>McKenna</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Myers</surname>
            ,
            <given-names>M. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Social media in qualitative research: Challenges and recommendations</article-title>
          .
          <source>Information and Organization</source>
          <volume>27</volume>
          (
          <issue>2</issue>
          ),
          <fpage>87</fpage>
          -
          <lpage>99</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Zeng</surname>
            ,
            <given-names>M. L.</given-names>
          </string-name>
          :
          <article-title>Smart data for digital humanities</article-title>
          .
          <source>Journal of data and information science 2</source>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Khymytsia</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ustiyanovich</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Application of Big Data in Historical Science</article-title>
          .
          <source>In: Proceedings 7th International Academic Conference of Young Scientists “Humanities and Social Sciences</source>
          <year>2017</year>
          ”,
          <string-name>
            <surname>HSS</surname>
          </string-name>
          <year>2017</year>
          , pp.
          <fpage>368</fpage>
          -
          <lpage>370</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zhezhnych</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khymytsia</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lisina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morushko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Analysis of computer-based methods for processing historical information</article-title>
          .
          <source>In: Proceedings of the 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2017</year>
          , pp.
          <fpage>365</fpage>
          -
          <lpage>368</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Artem</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kunanets</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holoshchuk</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasichnik</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rzheuskyi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Information Support of the Virtual Research Community Activities Based on Cloud Computing</article-title>
          .
          <source>In: Proceedings of the 13th International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2018</year>
          , pp.
          <fpage>199</fpage>
          -
          <lpage>202</lpage>
          . Lviv,
          <string-name>
            <surname>Ukraine</surname>
          </string-name>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Communication Theories: Perspectives, processes, and contexts</article-title>
          .
          <source>2nd edn. McGraw-Hill</source>
          , New York (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rheingold</surname>
          </string-name>
          , H.:
          <article-title>The virtual community: Homesteading on the electronic frontier</article-title>
          .
          <source>AddisonWesley</source>
          Publishing Company, Reading, MA (
          <year>2000</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Christakis</surname>
            ,
            <given-names>N. A.</given-names>
          </string-name>
          :
          <article-title>Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives - How Your Friends' Friends' Friends Affect Everything You Feel, Think, and Do</article-title>
          . In: Christakis,
          <string-name>
            <given-names>N. A.</given-names>
            ,
            <surname>Fowler</surname>
          </string-name>
          <string-name>
            <surname>J. H</surname>
          </string-name>
          . (eds).
          <source>Back Bay Books</source>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. How to:
          <article-title>Manage a Sustainable Online Community</article-title>
          , https://mashable.com/
          <year>2010</year>
          /07/30/sustainable-online-community/.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Yalan</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xianjin</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jinchao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiaorong</surname>
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Comparing digital libraries with virtual communities from the perspective of e-quality</article-title>
          .
          <source>Library Hi Tech</source>
          <volume>32</volume>
          (
          <issue>1</issue>
          ),
          <fpage>173</fpage>
          -
          <lpage>189</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Peleshchyshyn</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khymytsia</surname>
          </string-name>
          , N.:
          <article-title>Historiographic aspect of giving information about the situation in the ATO area in virtual communities</article-title>
          .
          <source>In: Proceedings of the 4th International Scientific Conference “Information</source>
          , Communication, Society”,
          <source>ICS</source>
          <year>2014</year>
          , pp.
          <fpage>238</fpage>
          -
          <lpage>240</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Khymytsia</surname>
          </string-name>
          , N.:
          <article-title>Socio-focused online research sources Eurorevolution 2013-2014 in Ukraine</article-title>
          .
          <source>The state and the army 784</source>
          ,
          <fpage>214</fpage>
          -
          <lpage>223</lpage>
          (
          <year>2014</year>
          ).
          <article-title>(in Ukrainian)</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Fedushko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Development of a software for computer-linguistic verification of sociodemographic profile of web-community member</article-title>
          .
          <source>Webology</source>
          <volume>11</volume>
          (
          <issue>2</issue>
          ), article
          <volume>126</volume>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Korobiichuk</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fedushko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Juś</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Syerov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Methods of Determining Information Support of Web Community User Personal Data Verification System</article-title>
          . In: Szewczyk R.,
          <string-name>
            <surname>Zieliński</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaliczyńska</surname>
            <given-names>M</given-names>
          </string-name>
          . (eds)
          <article-title>Automation 2017</article-title>
          .
          <source>Advances in Intelligent Systems and Computing</source>
          , vol.
          <volume>550</volume>
          , pp
          <fpage>144</fpage>
          -
          <lpage>150</lpage>
          . Springer (
          <year>2017</year>
          ). DOI:
          <volume>10</volume>
          .1007/978-3-
          <fpage>319</fpage>
          -54042-9_
          <fpage>13</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Trach</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vus</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tymovchak-Maksymets</surname>
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Advanced search query for identifying Web-forum threads relevant to given subject area</article-title>
          .
          <source>In: 13th International Conference on Modern Problems of Radio Engineering</source>
          , Telecommunications and Computer Science, TCSET
          <year>2016</year>
          , pp.
          <fpage>849</fpage>
          -
          <lpage>852</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Bilushchak</surname>
            <given-names>Т.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peleshchyshyn</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Komova</surname>
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Development of method of search and identification of historical information in the social environment of the Internet</article-title>
          .
          <source>In: XIth International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2017</year>
          , pp.
          <fpage>196</fpage>
          -
          <lpage>199</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Bilushchak</surname>
          </string-name>
          , Т.,
          <string-name>
            <surname>Myna</surname>
          </string-name>
          , Zh.,
          <string-name>
            <surname>Yarka</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peleshchyshyn</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Integration processes in the archival section of Lviv Polytechnic National University</article-title>
          . In: 12th
          <source>International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2017</year>
          , pp.
          <fpage>200</fpage>
          -
          <lpage>203</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          ). DOI:
          <volume>10</volume>
          .1109/STC-CSIT.
          <year>2017</year>
          .
          <volume>8098768</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Khomytska</surname>
          </string-name>
          , І.,
          <string-name>
            <surname>Teslyuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holovatyy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morushko</surname>
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Methods, models and means of the system for differentiation of phonostatistical structures of engglish functional styles. Development of methods, models and means of authorship attribution of a text</article-title>
          .
          <source>East European Journal of Advanced Technologies</source>
          <volume>3</volume>
          (
          <issue>2</issue>
          ),
          <fpage>41</fpage>
          -
          <lpage>46</lpage>
          (
          <year>2018</year>
          ). DOI:
          <volume>10</volume>
          .15587/
          <fpage>1729</fpage>
          -
          <lpage>4061</lpage>
          .
          <year>2018</year>
          .
          <volume>132052</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Khymytsia</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lisina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morushko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhezhnych</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Peculiarities in generating historical information in virtual communities</article-title>
          .
          <source>In: Proceedings of the 12th Interna-tional Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2017</year>
          ,
          <article-title>1, art</article-title>
          . no.
          <issue>8098799</issue>
          , pp.
          <fpage>336</fpage>
          -
          <lpage>339</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H. H.</given-names>
          </string-name>
          :
          <article-title>Opinion spam detection in web forum: a real case study</article-title>
          .
          <source>In: Proceedings of the 24th International Conference on World Wide Web, WWW</source>
          <year>2015</year>
          , pp.
          <fpage>173</fpage>
          -
          <lpage>183</lpage>
          . Florence,
          <string-name>
            <surname>Italy</surname>
          </string-name>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Zhezhnych</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markiv</surname>
            ,
            <given-names>O.:</given-names>
          </string-name>
          <article-title>A linguistic method of web-site content comparison with tourism documentation objects</article-title>
          .
          <source>In: Proceedings of the 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2017</year>
          , pp.
          <fpage>340</fpage>
          -
          <lpage>343</lpage>
          .
          <string-name>
            <surname>Lviv</surname>
          </string-name>
          (
          <year>2017</year>
          ). DOI:
          <volume>10</volume>
          .1109/STC-CSIT.
          <year>2017</year>
          .
          <volume>8098800</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Shumway</surname>
            ,
            <given-names>R. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoffer</surname>
            ,
            <given-names>D. S.:</given-names>
          </string-name>
          <article-title>Time series analysis and its applications: with R examples</article-title>
          . Springer (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Bernaschina</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brambilla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mauri</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umuhoza</surname>
          </string-name>
          , E.:
          <article-title>A Big Data Analysis Framework for Model-Based Web User Behavior Analytics</article-title>
          . In: Cabot J.,
          <string-name>
            <surname>De</surname>
            <given-names>Virgilio</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Torlone</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          . (eds.) Web Engineering,
          <string-name>
            <surname>ICWE</surname>
          </string-name>
          <year>2017</year>
          , vol
          <volume>10360</volume>
          . Springer, Cham (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>