<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Method of Operative Content Management in Web Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aleksandr Gozhyj</string-name>
          <email>alex.gozhyj@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lyubomyr Chyrun</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Agnieszka Kowalska-Styczen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olga Lozynska</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Petro Mohyla Black Sea National University</institution>
          ,
          <addr-line>Mykolayiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Silesian University of Technologyty</institution>
          ,
          <addr-line>Gliwice</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In the given article content lifecycle support method in Web systems is proposed. The uniform method realizes information resources processing in the Web systems and simplifies the information technology of Web content automation management. In the given paper the main problems Web content function and management services in Web systems are analyzed. In this article, the authors proposed to apply the technique of content analysis of textual information in e-commerce systems for the automation of e-business and decision making by the appropriate person.</p>
      </abstract>
      <kwd-group>
        <kwd>content</kwd>
        <kwd>content analysis</kwd>
        <kwd>e-commerce</kwd>
        <kwd>quantitative content analysis</kwd>
        <kwd>qualitative content analysis</kwd>
        <kwd>content management system</kwd>
        <kwd>content lifecycle</kwd>
        <kwd>information resources</kwd>
        <kwd>Web system</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The current level of complexity of tasks solved through e-commerce systems is
constantly growing: from the automated information collection and interpretation to the
management, design, modeling and forecasting of the various business processes, for
example, forecasting by the e-commerce system the changes in demand for a certain
type of products depending on the thematic discussions by forum users or the analysis
of arrays of comments on these products. However, the most complex tasks are not
solved automatically by e-commerce systems, but by persons who accompany the
process of the corresponding system [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
        ]. There was a need to search for the
nontraditional approaches to the use of information technologies and mathematical
methods in the processes of preparation and decision-making in e-commerce systems
based on the information obtained from forums, comments of product users,
electronic correspondence, search engine and agents results [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4-6</xref>
        ]. To date, most of the
information obtained by the users from the e-commerce system is being studied by the
moderators. For example, large text arrays of comments on the products presented on
the site, or forum information, are mainly filtered by the moderators over a certain
period of time by a certain circle of proxies [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7-10</xref>
        ]. To increase the demand for
products, it is necessary to post only filtered user comments on the site, where not only
obscene language is available (such comments are blocked), but where there is no
advertising of competitors' products. In addition, a detailed analysis of the multitude
of comments on products enables the manufacturer to analyze the market situation
and adjust the demand for its products [
        <xref ref-type="bibr" rid="ref11 ref12 ref13 ref14 ref15 ref16">11-16</xref>
        ]. The study of information needs for the
similar system operation shows that in the decision-making process, along with the
use of factual materials, it is important to involve the textual information about the
reflection of the relationships between various facts, events, persons, etc. and its
automatic processing in a short period of time with minimal financial costs [
        <xref ref-type="bibr" rid="ref17 ref18 ref19 ref20 ref21">17-21</xref>
        ].
The modern market is characterized by an increase in demand for Web content and
growth proposals in public enterprises and public institutions. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] Using Web content
helps optimize the development programs of management, trade and long-term and
economic strategy for production. It is associated with an increase in the complexity
of management and implementation of systems using the predictive Web content of
character. Specialized information resources are most type’s users of Web content,
e.g., online publishing, online magazines, online newspapers, etc. For these users
content is subject for making other Web content [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        The article purpose is to design a typical architecture for content management
systems. The work feasibility lies in the methods development of Web resources
processing in such systems. It promotes the active development in Ukraine e-business
and effective implementation of the Web systems [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The scientific problem is the
lack of a standardized approach to architectural design and development of such
systems and the subsystems of Web resources processing. There is no classification of
the Web systems, which creates problems for the study of information resources
processing methods in these systems. This creates problems for the appropriate software
implementation. This also justifies the purpose, relevance, appropriateness and
research directions. The special feature of Web systems is as follows [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]: openness,
global character, unlimited in time, frankness, direct interaction with the user,
automatic analysis of queries and users data tracking; reducing the cost of e-business
functioning; additional information providing in interactively [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref5 ref6 ref7 ref8 ref9">5-15</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Recent research and publications analysis</title>
      <sec id="sec-2-1">
        <title>The Web systems implementation contributes to the e-business success [1], including:</title>
        <p>business globalization increases the demand for Web content and fast access to it;
uneven functioning business processes according to the regions. leads to increasing
needs on speed, regularity and frequency of obtaining of the necessary Web content;
saving time in the necessary Web content getting; personalization in services provide
in the Web systems; the Web systems integration. Advantages and feasibility of the</p>
      </sec>
      <sec id="sec-2-2">
        <title>Web systems introduction are as follows [1-2]: increased efficiency of Web content</title>
        <p>
          receiving; the cycle reducing of Web content production and sale; costs reducing
associated with the content exchange; the openness of the Web systems due to
customers; automatically inform users of Web content; alternative sales channels creation,
such as online newspaper. Problems of Web resources processing in the such systems
associated with increasing content amounts on the Internet; the rapid prevalence of
Internet access; active e-business development; the set expanding of Web services and
goods; the growth in demand for Web services and products; Internet technology and
facilities creating and applications expansion of Web resources processing techniques
[
          <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1-4</xref>
          ]. In this area are actively working the world's leading manufacturers of Web
resources processing, particularly developed such systems [
          <xref ref-type="bibr" rid="ref1 ref11 ref12 ref13 ref14 ref15 ref16 ref17 ref18 ref19 ref2 ref20 ref21 ref3">1-3, 11-21</xref>
          ]:
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Web content is a business processes object for the Web systems (Table 1). Example is an information product or content for information resource as online newspapers, online publishing, marketing research and consulting services portal, etc.</title>
      </sec>
      <sec id="sec-2-4">
        <title>The content life cycle is a process that content takes place during management through the submission stages. A content lifecycle model does not take into account all stages of information resources processing.</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Problems selection</title>
      <p>Content streams number is greater than the ways number of goods moving to
industrial enterprises. Much of the content flows consists easily formalized and automated
procedures. The main problem is the common approach lack to process modeling,
design and development of Web systems. There is a general and detailed
classification lack of Web systems. This leads to the problem of the general methods definition
for the architecture and algorithms development of these systems functioning. The
existing Web systems do not support the entire content life cycle, do not solve
information resources processing and content management problems (Table 2). This
justifies the research purpose, relevance, appropriateness and directions.</p>
      <sec id="sec-3-1">
        <title>Input information the functioning of Web systems is evidence of a system work ap</title>
        <p>pointment and conditions. They define the main purpose of modeling. They also make
it possible to formulate the requirements for the system formal model S and content
management models. Model of Web systems presented as</p>
        <sec id="sec-3-1-1">
          <title>S  X , C,V , H , Function,T ,Y , where X  x1, x2,, xnX  is the input data to the sys</title>
          <p>tem, C  c1, c2,, cnC  is influences the content flow on the system, V  v1, v2,, vnV  is
the environment influence,</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>H  h1, h2,, hnH  is internal system</title>
          <p>parameters,
Y  y1, y2,, ynY  is the system output characteristics, T  t1, t2,,tnT  is the content
management transaction time. The process S of Web systems functioning described
by the function as y j ti  t   Function  xi , cr , vl , hk , ti  , where xi is the visitor/user
query to the system. Characteristics component y j according to Google Analytics are
the visits number for the time period t , the average Time Spent Online (min:s),
bounce rate (%), achieved the goal, dynamics (%), total viewing pages, page views
number per visit, new visits (%), total unique visitors, traffic sources in % (search
engines, direct traffic or other sites).
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Method of text information analysis</title>
      <p>
        During the investigation of the mechanisms of textual information generation, it was
revealed that the choice of the text information description model depends on how the
probabilistic and linguistic test is constructed and how the extraction of the certain
units is made from the text information [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1-4</xref>
        ]. The probabilistic modeling of textual
information and its components is an introductory, preparatory stage to the description
of the functions of linguistic units in the text. The study of the language and speech
functioning through the probabilistic modeling of textual information relies upon the
models of probability theory, mathematical statistics, and combinatorial analysis. In
linguistic research, and especially in the implementation of the information retrieval
algorithms [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
        ], there are always problems associated with predicting the occurrence
of a certain number of word forms or word combinations belonging to the certain
classes in the segment of a given length. Probabilistic modeling of texts and syllables,
word combinations, grammatical classes also makes it possible to determine the
sample size necessary to provide, with a given probability, the occurrence of at least one
time a corresponding linguistic unit [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Content analysis is one of the known analysis methods for the textual
information. It is the standard method of research in the social sciences, the subject of which
is the analysis of the textual arrays content and communicative correspondence
products (for example, comments, forums, electronic correspondence, articles, etc.).
Content analysis is the quantitative and qualitative analysis of textual information and
textual arrays for the purpose of the subsequent meaningful interpretation of the
obtained quantitative and qualitative regularities. Content analysis is used to study the
sources that are invariant in structure or content, but which exist as a
nonsystematized, randomly organized textual material [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5">1-5</xref>
        ]. The method of content
analysis is to form an abstract text content model from a variety of textual material.
      </p>
      <sec id="sec-4-1">
        <title>There are two types of content analysis: quantitative and qualitative.</title>
      </sec>
      <sec id="sec-4-2">
        <title>Quantitative content analysis (content-related) is a research method of words,</title>
        <p>topics, and messages that focuses on a message content. So, before analyzing the
selected linguistic units, it is necessary to predict their content and determine each
possible observation result in accordance with the researcher expectations. As first step in
conducting a content analysis of this type, the researcher has to create a dictionary in
which each observation will be defined and assigned to appropriate category.</p>
        <p>
          Qualitative content analysis (structural) is a research method in which the
researcher is not so much interested in the message content as in the form and structure
of this message. For example, you need to determine how much time or print space is
allocated to a topic in a particular source, or how many words or columns have been
given to each topic of the corresponding category. Then the relative density of each
topic and category are calculated, and a comparative analysis of the corresponding
topics is carried out for the further prediction of events, processes, and the like. The
relative density P of any category is calculated by the formula: P=R/T, where R is the
number of the analysis units that sets this category, T is the total number of the
analysis units. In [
          <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
          ] authors singled out the following components of the
contentanalytical study.
1. Content analysis deals with a mass of texts, using the typical sociological
procedures for continuous reading or sampling observation while meeting the
representativeness requirements.
2. Content analysis involves structuring, segmentation, decomposition of texts or
defining the meaningful invariants that are repeated in all or in a number of texts
related to the mass collection under investigation.
3. To ensure such a uniformity of segmentation and the isolation of invariants,
content analysis involves a high degree of formalization, the use of harsh operational
rules and formal algorithms to implement the content analysis procedures.
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>4. Content analysis is an analytical and synthetic procedure since it is supposed to</title>
        <p>formalize the division of the whole texts or the selection of their certain items for
the subsequent collection.</p>
      </sec>
      <sec id="sec-4-4">
        <title>5. Content analysis involves the use of methods of probability theory and mathemati</title>
        <p>cal statistics.</p>
        <p>
          According to the Ukrainian researcher in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], the content analysis is a qualitative
and quantitative method of studying the documents, characterized by the conclusions
objectivity and the procedure rigidity. It is the quantitative text processing with the
subsequent results interpretation. The subject of the content analysis can be both
problems of the social reality, expressed or concealed in documents, and the internal
regularities of the study object itself. The key milestones of the formation of e-commerce
information analysis system based on the textual information content analysis are the
following:
1. Definition of the research topic, analysis purpose and object, its chronological and
geographical frameworks, selection principles;
        </p>
      </sec>
      <sec id="sec-4-5">
        <title>2. The classifier formation for the selection of key citations and the preparation of in</title>
        <p>structions for the encoder;</p>
      </sec>
      <sec id="sec-4-6">
        <title>3. The coding of publication fragments;</title>
      </sec>
      <sec id="sec-4-7">
        <title>4. Storing of publication fragments in the database;</title>
      </sec>
      <sec id="sec-4-8">
        <title>5. Processing of publication fragments in an automated mode.</title>
      </sec>
      <sec id="sec-4-9">
        <title>The implementation of these stages leads to the formation of thematically selected arrays of information, in which information on the coverage of all aspects of the problem under investigation is accumulated, taking into account the diversity of opinions and views.</title>
        <p>The construction of the information analysis systems largely depends on the
availability and compliance with the rules for the above steps:
6. The procedure of content-analytical selection should consist of precisely defined
actions, which, without any changes, should be subject to all processing objects;</p>
      </sec>
      <sec id="sec-4-10">
        <title>7. The sections of the classifier on which the publication fragments are coded should be clear and elementary to avoid ambiguity;</title>
      </sec>
      <sec id="sec-4-11">
        <title>8. The interpretation of the research results should cover all the publication fragments</title>
        <p>obtained; conclusions should not be based on any part of the results, but rather take
into account all of them without exception.</p>
        <p>For the automated processing of textual information is of great importance not
only what the occurrence frequency of a particular category of the linguistic unit, but
generally its presence in the test text or absence. Quantitative estimation makes it
possible to draw the objective conclusions about the material focus concerning the
number of uses of the analysis units (key quotes) in the studied texts, such as the
number of positive/negative responses to a certain type of product. Qualitative
analysis allows us to draw the objective conclusions, where there is a certain linguistic unit
in the text and in what context.
6</p>
        <p>The main stages of content analysis in e-commerce
The main stages of content analysis of textual information in e-commerce systems are
the following items.</p>
        <p>1. Identification of an array of sources under investigation or messages through a
set of specified criteria to which each message corresponds:
 a given source type (forum, email, online newspaper, chat, online magazine);
 one message type (article, e-mail, banner, comment);
 specified parties involved in the communication process (addresser, addressee,
recipient);
 confronted size of messages (minimum scope or length);
 message occurrence frequency;
 message distribution method;
 message distribution place;
 time of message emergence and so on.</p>
        <p>2. The formation of the sample multitude of messages. In some cases, the entire
set of sources identified at the first stage is studied, since the message to be analyzed
is often quantitatively limited and available. However, sometimes the content analysis
should be based on a limited sample, extracted from a larger array of information.</p>
        <p>3. Identification of the linguistic units under analysis, for example, words or
topics. The correct choice of the linguistic units to be analyzed is an important part of the
entire work. A word is a simple message element. A topic is the other unit, it is a
separate statement about any object. There are clear requirements for the choice of the
possible linguistic unit to be analyzed:
 large enough to interpret the meaning;
 small enough to not interpret many meanings;
 easily identified;
 the number of units is large enough for sampling.</p>
        <p>If accepted a topic for the unit analysis, the following rules are taken into account:
 the size of the topic does not go beyond a paragraph;
 a new topic arises if there is a substitution of goal, category or someone who
perceives or acts.</p>
        <p>4. Allocation of calculation units, which can coincide with the meaningful units or
be specific in nature. In the first case, the analysis procedure is reduced to the
frequency occurrence count of the allocated meaningful unit; in the other, the researcher,
on the basis of the analyzed material and the research objectives, sets forth the
calculation units that can be:
 the physical length of the texts;
 the text area filled with the meaningful units;
 number of lines (paragraphs, characters, text columns);
 file size and type;
 a number of pictures with a certain content, plot, etc.</p>
        <p>In some cases, researchers use other calculation elements. At this stage of the
content analysis, the fundamental value has a rigorous definition of its operators.</p>
        <p>5. The calculation procedure itself. In general, it is similar to the standard
methods of classification by the selected groups from the formulas of mathematical
statistics and the probability theory. There are also special counting procedures for the
content analysis, for example, the formula for calculating the Janis coefficient c ,
designed to calculate the ratio of positive and negative (towards the selected position)
evaluations, opinions, arguments. The Janis coefficient can be used, for example, to
calculate the ratio of positive and negative thoughts, highlighted in user comments
about products that are sold through the e-commerce system. If the number of positive
evaluations exceeds the negative ones, the Janis coefficient is calculated by the
formula: c  f 2  f  n , where f is the number of positive evaluations; n is the number of
r  t
negative evaluations; r is the volume of the text content that has a direct relationship
to the problem that is being investigated; t is the total volume of the analyzed text. If
the number of positive evaluations is less than the negative, the Janis coefficient is
found by a formula: c  f  n  n2 .</p>
        <p>r  t
6. Interpretation of the obtained results according to the goals and objectives of
the case study. Usually, at this stage, the characteristics of the text material are
identified and evaluated, which allows one to draw conclusions about what the author
wanted to emphasize or hide. Or, based on a statistical set of calculated Janis
coefficients for a certain period of time for a specific category of products, you can predict
changes in demand for the same products.
7</p>
        <p>Advantages of content analysis in e-commerce
The use of content analysis of textual information in e-commerce systems, according
to the authors, provides a number of advantages for simplifying the business dealing
and solves many problems facing business process participants. Here are a few basic
1200
1000
800
600
400
200
0
bookclub.ua
presstime.com.ua
700
830
720
bonprix.ua
timesonline.co.uk
970
advantages of content analysis of the text information in e-commerce systems:
 the automation of the textual information filtering, which is placed on the website
of the e-commerce system by a user;
 the possibility to automatically create a portrait of a permanent user based on
his/her comments;
 the possibility to automatically create a portrait of the target audience based on the
analysis of portraits of permanent users;
 the reduction of the number of moderators who serve the e-commerce system;
 the reduction of the time for filling the site with the text information of a
permanent user due to the automatic processing of this information and the absence of an
intermediate link as a moderator;
 the language barrier elimination due to the automatic generation of dictionaries of
the permanent user and the use of automatic translation.</p>
      </sec>
      <sec id="sec-4-12">
        <title>The content analysis of the user text information distribution makes it possible to qualitatively evaluate the content flow in e-commerce systems for further decisionmaking by the relevant person. The comparison results shown in Fig. 1 approve that there is a need to pay attention to the content analysis in e-commerce systems.</title>
        <p>The average number of system queries</p>
      </sec>
      <sec id="sec-4-13">
        <title>The Web content life cycle presented in the form next major processes communica</title>
        <p>tion as Source  content formation  content management  content
implementation  database.</p>
        <p>Model of Web systems S  X , Formation, C, Management, Realization,Y ,
where X  x1, x2 ,, xnX  is input data set, Formation is content formation operator,
C  c1, c2 ,, cnC  is content set, Management is content management operator,
Realization is content implementation operator, Y  y1, y2 ,, ynY  is output data
set. Below is the content management models classification.</p>
      </sec>
      <sec id="sec-4-14">
        <title>1. Pages generate per request is submitted in the form of the following main</title>
        <p>stages connection as Content  content editing  Database  content presentation
 informational resource. Pages generate model on demand as
ManagementQ  X , C, Q, R, Edit,Y , where X is input data set, C is content set, Y
is pages generated set, Q is query set, R is pages formulation and submission
function, Edit – content editing and updating function.</p>
      </sec>
      <sec id="sec-4-15">
        <title>2. Pages generate model while editing is presented as the next major stage of</title>
        <p>communication as Content  content editing  database  informational resource.</p>
        <p>When making changes to the site content creates a static pages set. Not is taken
into account interactivity site between visitors and content. Pages generation system
model while editing as ManagementE  C, Edit,Y , where C is content set, Y is
static pages set, Edit is content editing function. The pages formation described
func 
tion as y t   Edit c,Weight,t  .</p>
      </sec>
      <sec id="sec-4-16">
        <title>3. Pages generate mixed model combines the advantages of the first two types and</title>
        <p>is presented as a communications major stages as Content  content editing 
Database  content analysis  blocks collection  content presentation 
informational resource. This model is as ManagementM  X , C,Q, R, Edit, Caching ,Y ,
where X is input data set, C is content set, Y is pages generated set, Q is query set,
R is pages formulation and submission function, Edit is content editing and
updating function, Caching is cache formulation function. Cache is update automatically
(after a certain period or when amending certain site sections) or manually (team
administrator). Another approach is to maintain information blocks on the editing site
stage. Then the pages collected from these units when requesting user. The process is
implementing caching. The module generates submission page once. Then it is
downloading from the cache several times faster.</p>
        <p>User queries content analysis allows to qualitatively assess the content flow in the
system. This facilitates the subsequent decisions by the moderator as follows: the
problem situation description and study purpose search; precise definition of the study
object and subject; the object preliminary analysis; concepts substantial clarification
and empirical interpretation; the procedures description for the properties and
phenomena registration; the overall study plan determining; the definition of the sample
type, sources collection and so on. Qualitative content analysis is intended to provide
the necessary means moderator for results analysis (Table 3). With their help identify
the content properties and test them on the general content stream. Then apply the
content stream general properties on its specific thematic part.</p>
      </sec>
      <sec id="sec-4-17">
        <title>Quantitative content analysis consists of the stages presented in Table. 4.</title>
        <p>Stage name</p>
      </sec>
      <sec id="sec-4-18">
        <title>The analysis unit selecting</title>
      </sec>
      <sec id="sec-4-19">
        <title>Units frequency counting</title>
      </sec>
      <sec id="sec-4-20">
        <title>Categorization</title>
      </sec>
      <sec id="sec-4-21">
        <title>Data Mining</title>
      </sec>
      <sec id="sec-4-22">
        <title>Results interpretation Table 4. Quantitative content analysis stages</title>
      </sec>
      <sec id="sec-4-23">
        <title>Relationships are identifying between linguistic units.</title>
      </sec>
      <sec id="sec-4-24">
        <title>Categories finite and excess aggregate are determining to obtain quantitative data of their appearance. Categories irregular sequence is clustering (into groups and classes division). And on the basis of new generalized categories is received.</title>
      </sec>
      <sec id="sec-4-25">
        <title>New knowledge is identifying in the content flow through multiple</title>
        <p>quantitative evaluations. Next qualify them as categories.</p>
      </sec>
      <sec id="sec-4-26">
        <title>Content and semantically-filled results are getting. For this purpose use various statistical mathematical methods and semantic formalisms.</title>
      </sec>
      <sec id="sec-4-27">
        <title>Main task for content management process are the following items: the opera</title>
        <p>tional and retrospective databases forming; databases rotation; users work
personalization; a database create and access to it; personal needs and sources protection;
work statistics keeping; search ensuring in database; output forms generation;
interaction with databases of other subsystems. In Table 5 presents the content management
key stages in the Web systems.</p>
      </sec>
      <sec id="sec-4-28">
        <title>As information technology basis considered annotated database in search engines.</title>
      </sec>
      <sec id="sec-4-29">
        <title>It contains an index, inverse, dictionary tables, etc. In content management systems</title>
        <p>creates a database search primary content pattern (PCP). They used clustering
technology (automatic forming groups with similar content on the criteria PCP). In
content management systems formed database annotations for used in the search process.</p>
      </sec>
      <sec id="sec-4-30">
        <title>Clusters database each record is corresponding cluster definitions and containing its</title>
        <p>description. Database record is performing automatic abstracting methods (digest is
formation of text statistical analysis methods). These methods are used to create the</p>
      </sec>
      <sec id="sec-4-31">
        <title>PCP and descriptions of available users (Fig. 2).</title>
        <p>Content formation Results search and clustering</p>
        <p>Content categorization
New content
stream</p>
        <p>Author
Traditional
database</p>
        <p>Cluster:
content title,
description,
number
clusters
database
Centroid choosing,</p>
        <p>describe
annotations forming
content association
in group of the
likeness</p>
        <p>Content
database
Topics tracking
Themed string</p>
        <p>Fig. 2. Scheme of the Web systems functioning from annotated database</p>
        <p>Thematic
content
Indexing</p>
        <p>Moderator
Database creating
for retrieval pattern
Pattern
database
annotations
database
Clustering
New themes identify</p>
        <p>Load distribution</p>
        <p>Content
Moderator
result
clusters set</p>
        <p>K K K K
...</p>
        <p>K</p>
        <p>User</p>
        <p>Theme query
Quest formation
Search procedure
Result clustering
Result abstracting
Reports generating</p>
        <p>Report submission
SK SK SK ... SK
K</p>
        <p>K
...</p>
        <p>K
C ... C</p>
        <p>...</p>
        <p>Page submission</p>
        <p>Resource formation
The results
display,
including the
name and
description of
the super cluster
Clusters
Content
grouping
Thematic
clusters</p>
        <p>C</p>
        <p>Clusters
grouping of
requests
considering
The cluster
name and
description print</p>
        <p>Output</p>
        <p>K content
C</p>
        <p>C</p>
        <p>Personalization based rules is the content provision to specific users or users
groups of conventional business logic using. For example, using a rule where all those
interested in children's books fall into the group that focused advertising children's
clothing. Rules developed on the content basis that type users in a registration card. In
content management systems use algorithms categorization with personalization using
filters (intelligent agents). Also, algorithms use based on the content analysis of user
behavior. In particular, he analyzes the content to which the user accesses, the sites
visited and more. That is constantly conducted analyzed registered user and user’s
group history with priorities an overwhelming number for interest.</p>
        <p>The full-text search problem in large content arrays is ineffective. The annotated
content search solves the problem exactly: instead of the full content searching to
search on annotations (pattern). Digest remotely similar content and often not
perceived by person. But as the search content pattern with weighted keywords and
phrases it leads to adequate results with full-text search. Digest constructed from
content fragments with the largest weight values. Content analysis is used for digests
automatic generation, the concepts (categories) relationship automatic detection,
relationships automatic clustering to the most important identify, the relationships
automatic detection (e.g., positive and negative). One of the most important tasks in
content analysis is the categorization process. It sets the conceptual grid. In its terms is
the content flow analyzing and new categories generate. In Fig. 2-6 presented the
results of the developed Web systems “Good morning, accountant”
(dobryjranok.com).</p>
        <p>1 5 9 13 17 21 25 29
1
5</p>
        <p>Ukraine (not set) Russia</p>
        <p>Austria</p>
        <p>Italy</p>
        <p>Norway Poland
Fig. 4. Resource visit distribution from different countries for the 07-12/2012 period
200
150
100
50
0
ru
ru-ru
uk
en-us
de
en
en-gb
uk-ua
доброго ранку
бухгалтер
dobryjranok.com</p>
        <p>іт пільги з пдв
добрий ранок,
бухгалтер
Fig. 7. Resource visit distribution of the search engines under the keyword content-search for
the 09-12/2012 period</p>
      </sec>
      <sec id="sec-4-32">
        <title>Globalization and localization is one of the modern market requirements. For suc</title>
        <p>cessful marketing studies is use statistical basis. Globalization is more than a simple
Web-pages conversion from one language to another. It still is to promote local
brands (localization). Within globalization and localization is using content strategy,
information hierarchy and navigation structure. For appropriate systems implement
the following tasks: create content in languages that its list is predefined; global/local
content preservation in different databases; support services for text automatic
translation; content changes track and its approved.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In the article the justification is needed in the information resources processing
methods and means development in electronic commerce content. This is implemented by
the architecture improving of Web systems. Improving such systems is to automate
processes of the Web content formation, management and implementation. This paper
is analyzes the terminology and classification of Web systems. This is done to
determine the characteristic design and simulation patterns, trends, process of Web
systems. Also, the article outlines the shortcomings of existing content management
methods and means. For Web systems is developed and described a content
management model. This made it possible to develop a generic architecture for typical Web
systems and standardized methods of information resources processing in these
systems. The paper improved the overall architecture of Web systems. It is different from
the existing modules presence of information resources processing. This is enabled
the stages implementation of the content lifecycle. The content analysis of textual
information in e-commerce systems allows us to determine the incidence of any
feature of the studied set of texts. In such a case, it is important not so much absolute as
the relative importance of the feature, that is, the characteristic of the place (share)
among other features. For example, the percentage point of discussions by the forum
users on the economic issues in relation to the political issues, or the percentage of
positive comments on products in relation to the negative ones and with respect to all
comments on this product category in the online store. Measuring the ratio between
the features in the text gives the empirical material for understanding the functional
relationships between the elements of reality reflected in the texts, for example,
determining the mood of the audience of Forum on the economic or political situation in
the country and the world. If there are chronologically ordered texts, you can have a
number of portraits of the studied reality fixed over time (the change in demand for
the product category depending on the season, for example, fantastic fiction is read
more in the winter, and detective novels − in the summer) or portraits of the target
audience (the change in demand for the product category according to the gender, for
example, the demand for women's clothing is greater in September than in March). It
allows you to hypothesize about the predictive nature of system elements functioning.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Designing architecture of electronic content commerce system</article-title>
          .
          <source>In: Computer Science and Information Technologies, Proc. of the X-th Int. Conf. CSIT'</source>
          <year>2015</year>
          ,
          <fpage>115</fpage>
          -
          <lpage>119</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hasko</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuchkovskiy</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Process analysis in electronic content commerce system</article-title>
          .
          <source>In: Proceedings of the International Conference on Computer Sciences and Information Technologies</source>
          ,
          <string-name>
            <surname>CSIT</surname>
          </string-name>
          <year>2015</year>
          ,
          <volume>120</volume>
          -
          <fpage>123</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Vysotska</surname>
          </string-name>
          , V.:
          <article-title>Linguistic Analysis of Textual Commercial Content for Information Resources Processing</article-title>
          . In: Modern Problems of Radio Engineering, Telecommunications and Computer Science, TCSET'
          <year>2016</year>
          ,
          <fpage>709</fpage>
          -
          <lpage>713</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dosyn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sachenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Smart Data Integration by Goal Driven Ontology Learning</article-title>
          .
          <source>In: Advances in Big Data. Advances in Intelligent Systems and Computing</source>
          . - Springer International Publishing AG 2017. P.
          <volume>283</volume>
          -
          <fpage>292</fpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Mykich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Algebraic Framework for Knowledge Processing in Systems with Situational Awareness</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          ,
          <volume>217</volume>
          -
          <fpage>228</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Mykich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Algebraic model for knowledge representation in situational awareness systems</article-title>
          .
          <source>In: Computer Sciences and Information Technologies, Proc. of the Int. Conf. CSIT</source>
          ,
          <fpage>165</fpage>
          -
          <lpage>167</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chernukha</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bilushchak</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Mathematical modeling of random concentration field and its second moments in a semispace with erlangian disrtibution of layered inclusions</article-title>
          .
          <source>In: Task Quarterly</source>
          ,
          <volume>20</volume>
          (
          <issue>3</issue>
          ),
          <fpage>295</fpage>
          -
          <lpage>334</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kravets</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Game Model of Dragonfly Animat Self-Learning</article-title>
          .
          <source>In: Perspective Technologies and Methods in MEMS Design</source>
          ,
          <volume>195</volume>
          -
          <fpage>201</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kravets</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kyrkalo</surname>
          </string-name>
          , R.:
          <article-title>Fuzzy logic controller for embedded systems</article-title>
          .
          <source>In: Proc. of the 5th Int. Conf. on Perspective Technologies and Methods in MEMS Design</source>
          ,
          <article-title>(</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kravets</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>The control agent with fuzzy logic, Perspective Technologies and Methods in MEMS Design</article-title>
          , MEMSTECH'
          <year>2010</year>
          ,
          <fpage>40</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chyrun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chyrun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <source>Information Technology of Processing Information Resources in Electronic Content Commerce Systems, Computer Science and Information Technologies</source>
          ,
          <fpage>212</fpage>
          -
          <lpage>222</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Shakhovska</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chyrun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Features of E-Learning Realization Using Virtual Research Laboratory</article-title>
          . In: Computer Science and Information Technologies,
          <source>Proc. of the XI-th Int. Conf. CSIT'</source>
          <year>2016</year>
          ,
          <fpage>143</fpage>
          -
          <lpage>148</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Khomytska</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teslyuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>The Method of Statistical Analysis of the Scientific, Colloquial, Belles-Lettres and Newspaper Styles on the Phonological Level</article-title>
          .
          <source>In: Advances in Intelligent Systems and Computing</source>
          ,
          <volume>512</volume>
          ,
          <fpage>149</fpage>
          -
          <lpage>163</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chyrun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Analysis features of information resources processing</article-title>
          .
          <source>In: Computer Science and Information Technologies, Proc. of the Int. Conf. CSIT</source>
          ,
          <fpage>124</fpage>
          -
          <lpage>128</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Vasyl</surname>
          </string-name>
          , Lytvyn, Victoria, Vysotska, Dmytro, Dosyn, Roman, Holoschuk, Zoriana, Rybchak:
          <article-title>Application of Sentence Parsing for Determining Keywords in Ukrainian Texts</article-title>
          . In: Computer Science and Information Technologies,
          <source>Proc. of the Int. Conf. CSIT</source>
          ,
          <fpage>326</fpage>
          -
          <lpage>331</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uhryn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hrendus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naum</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Analysis of statistical methods for stable combinations determination of keywords identification</article-title>
          .
          <source>In: EasternEuropean Journal of Enterprise Technologies</source>
          ,
          <volume>2</volume>
          /2(
          <issue>92</issue>
          ),
          <fpage>23</fpage>
          -
          <lpage>37</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Davydov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lozynska</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Information System for Translation into Ukrainian Sign Language on Mobile Devices</article-title>
          .
          <source>In: Computer Science and Information Technologies, Proc. of the Int. Conf. CSIT</source>
          ,
          <fpage>48</fpage>
          -
          <lpage>51</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Davydov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lozynska</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Linguistic Models of Assistive Computer Technologies for Cognition and Communication</article-title>
          .
          <source>In: Computer Science and Information Technologies, Proc. of the Int. Conf. CSIT</source>
          ,
          <fpage>171</fpage>
          -
          <lpage>175</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Khomytska</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teslyuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Specifics of Phonostatistical Structure of the Scientific Style in English Style System</article-title>
          .
          <source>In: Computer Science and Information Technologies, Proc. of the XI-th Int. Conf. CSIT'</source>
          <year>2016</year>
          ,
          <fpage>129</fpage>
          -
          <lpage>131</lpage>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pukach</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bobyk</surname>
          </string-name>
          , І.,
          <string-name>
            <surname>Pakholok</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A method for constructing recruitment rules based on the analysis of a specialist's competences</article-title>
          .
          <source>In: Eastern-European Journal of Enterprise Technologies</source>
          ,
          <volume>6</volume>
          /2(
          <issue>84</issue>
          ),
          <fpage>4</fpage>
          -
          <lpage>14</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Zhezhnych</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markiv</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Linguistic Comparison Quality Evaluation of Web-Site Content with Tourism Documentation Objects</article-title>
          .
          <source>In: Advances in Intelligent Systems and Computing</source>
          ,
          <volume>689</volume>
          ,
          <fpage>656</fpage>
          -
          <lpage>667</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>