<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yusifov Mahammad</string-name>
          <email>yusifov.19@mail.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lala Bakirova</string-name>
          <email>lala_bekirova@mail.ru</email>
        </contrib>
      </contrib-group>
      <abstract>
        <p>The thesis includes the main basic concepts of machine learning and its applications. For simplest possible explanation of machine learning concepts, handwritten number recognition software has been represented and on this application, concepts like artificial neurons, weights, edges, connections and others has been explained. Layered structure of artificial network, artificial neuron activation value. What is edges shapes? How the recognition process happening, what is neural network and neurons? What is weight and activation value, Sigmoid and ReLU function? What is the parameter of connections between neurons? These questions and others has been covered in this article.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Neural network</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>neurons</kwd>
        <kwd>auto recognition</kwd>
        <kwd>activation</kwd>
        <kwd>weight</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Machine learning is a field of science that
covers mathematical complex calculations and
equations in order to make computers imitate the
“learning” process of humans. This process
involves a computer algorithm that builds models
based on given training data to make proper
decisions without being manually programmed by
a developer [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Machine learning is considered as
an important functionality and branch of Artificial
Intelligence and the applications of this
interesting technology are very wide: Speech
recognition and other image recognition
techniques, computer vision, email spam, and
malware filtering, virtual personal assistant,
online fraud detection, self-driving cars, stock
market trading, and medicine. And this list can be
expandable. In this thesis, main concepts and
applications of machine learning will be covered.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Neural network</title>
      <p>
        The artificial neural network is a set of
connected units that each represents a neuron, like
in the human brain. Every neuron can be
considered as a function that takes input as a
number and passes as output to the next so-called
artificial neuron [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Figure 1 is an example of
artificial neural network that consists of input,
output and some additional hidden layers that can
be considered as black box for now.
      </p>
      <p>In every layer, neurons are connected to each
other via weights that can be modified with result
of learning or training process. Next, we can take
simple example application handwritten number
recognition and go through these concepts of
neurons.</p>
    </sec>
    <sec id="sec-3">
      <title>2.1. Handwritten recognition number</title>
      <p>Although it is not so challenging to humans to
understand the number on the left which is sloppy
written by me, for computers it is not an easy task.</p>
      <p>How this software converts or translates the
analog number given on the left to the digital
mathematical number that has been given on the
right by the machine? Let’s start with
conceptualizing the term “neuron”.</p>
      <p>
        We can consider a neuron as a unit that holds
a number between 0 and 1. In figure 3 we have 28
x 28 = 784 neurons and each one of these neurons
holds a value that represents the greyscale value
of the corresponding pixel. For example, the
neuron that value is 0 means it is black and the one
with value 1 means it is white. This number is
called “activation” [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>All of these 784 neurons represent the input
layer of our artificial network and the output layer
consists of 10 neurons that each represents a
number as a result. The activation of neurons in
the output layer also represents grayscale value,
the neuron with the highest activation number is
the “choice” of the machine. For example in our
case, the number is 9 and the neural network of
the decision process has been shown in Figure 4.</p>
      <p>When we try to recognize a number, our brain
looks for certain patterns like loops or shapes. For
example, in our example, the number 9 has circle
up top and a line on the right. The number 8 has 2
circles or loops one on top and the other at the
bottom. So our next inner layer can be responsible
for holding each one of these patterns shapes in a
neuron. If the activation of the specific neuron is
close to 1, the possibility of that corresponding
pattern is high. For example, suppose there is one
certain neuron that represents a circle pattern, and
the activation is 0.9. That means the resulting
number can be: 0, 6, 8, 9.
So we can suppose, our second last layer is
responsible for recognizing patterns as shown in
Figure 5.</p>
      <p>But, these patterns themselves also consist of
little shapes, which we can call edges. For
example circle can consists of a bunch of edges.
neurons: weight which is also number. And from
mathematical approach:</p>
      <p>
        1 1 +  2 2 + … +  , (1)
This equation is weighted sum [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>So we also need to have a layer that is
responsible for recognizing these little edges. The
layer that comes after the input layer can be
responsible for that operation. So with 2 layered
16 neurons structure, we can suppose the working
principle of our little AI is in this way: In the first
layer we get input as a handwritten number with
784 neurons, and then it sends a signal to the next
layer in order to detect edges and then comes third
layer to detect bigger shapes as patterns and at the
output we have our result as a number.</p>
      <p>This structure can be also applicable for other
recognition tasks like image and voice
recognition. For example, detecting shapes in the
pictures, or identifying syllables in human voice
can be our “patterns” or “edges” in this structure.</p>
      <p>To be able to detect that edges, we need to
assign new parameter to the connections between</p>
      <p>Weight with the color green means positive
vale and with the red color means negative so
weights with the pixel color red means darker are
and it distinguishes darker and lighter areas on
pixels so that it can detect edges.</p>
      <p>When we compute weighted sum in equation
1, we can come up with any number, but for our
network, we need activation values to be in the
range of 0 and 1 and here we need to use the
specific function that minimizes real number to
the range between 0 and 1. A common function is
the sigmoid function.</p>
      <p>
        Negative inputs end up close to 0 and positive
inputs end up close to 1 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. So the activation of a
neuron in our network is a measure of how
positive the corresponding weighted sum is. But
also we need bias for inactivity. We just need to
extract that value from the weighted sum before
processing the sigmoid function [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>( 1 1 +  2 2 + … +  − (2)
 ) ,
So, the weights indicate what pattern, edges this
specific neuron in the second layer is picking up
on and the bias indicates how high the weighted
sum needs to be before the neuron starts getting
active.</p>
      <p>But there is another better function called
ReLU – REctified Linear Unit.</p>
      <p>
        Using sigmoid function, was very difficult to
train at some point, but ReLU solved this problem
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>Every neuron in the first layer is connected to
the 784 neurons. And each one of these 784
connections has weight and bias. So we have 784
x 16 weights with 16 biases and the connection
between other layers also has weights and biases
according to them. Our network has 13002
weights and biases in total.</p>
      <p>
        Learning in here is, getting the machine to find
a true setting for all of these many numbers so that
it can solve the problem at a time [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        The training process of the network can be
done with the help of certain special databases that
store collections of images of numbers
handwritten. So the machine can train itself to
become a better “predictor” with both having
input and output from the database [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>MNIST database is well-known example of
these databases which stores 70000 images of
digits handwritten by people like high school
students and employees of the American Census
Bureau that are published to help AI researchers.</p>
      <p>
        The more training data we give to the machine,
the more it evolves itself by doing feedback:
taking input from the picture does its operation
and match the result with the real idealistic value
behind the number and correct itself [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Applications</title>
      <p>
        Very popular example of how powerful
artificial intelligence can be is GPT-3 (Generative
Pre-trained Transformer 3) launched by OpenAI
company. This model uses deep learning to
produce texts that are very similar to human words
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. GPT-3 can even code by its own, you just
have to give input commands like create a button
that looks like rectangular and when it is pressed
increase certain value or do some other
operations. For example debuild.co is interesting
website that uses GPT-3 to write code. User gives
orders as input and it generates JavaScript code
according to orders and visually shows the result
at a time [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>GPT-3 functionalities are very wide and that
shows how amazing works can be done using
Machine Learning and Artificial Intelligence.</p>
      <p>In medical field, machine learning has so many
applications and they are currently being
developed. One of the proper examples is
Microsoft's InnerEye Project that identifies
differences between healthy cells and tumor cells
by using 3D radiological images as shown in
Figure 13.</p>
      <p>
        Pfizer uses machine learning to research about
how the immune system of human body can fight
cancer. Insitro is a startup uses data science and
machine learning to develop drugs that cures
people much faster and with high level of success
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>Besides medical field, there are many
interesting fields that use machine learning.
Navigation programs like google maps analyzes
all roads and transportation ways and brings most
relevant choice of travel from location to another.
Social media programs has many features that use
machine learning to process its functionality like
bringing “people you may know”, face detection,
image recognition and others. Self-driving cars
like Tesla works on deep learning and also
includes IoT because of intelligent sensors.
Language translation apps, online video
streaming apps and many others have a touch on
machine learning and expands quickly.</p>
      <p>Another additional machine learning
application can be in psychology field. Model can
be established in order to read and store different
parameters of humans in a database. Database
structure is shown in a following figure.</p>
      <p>This database can be input for model and it can
use machine learning to improve more correct
analyses of human and better treatment for each
specific patient. There are 3 tables in this simple
represented database:
1. patients: stores basic information for
other relations
2. health_parameters: stores vital
information about a human this data is
essential for identifying potential diseases
3. other_parameters: stores information
about daily activites, and possible decisions
that the human can make.</p>
      <p>The second and third tables are connected to
first table via foreign key on id of patient. With
only this small data a well-structured AI can
predict the cause of psychological disease and
develop more relevant therapy for patient. For
example: a married man with weight of 120kg
who is accountant may have heart disease and
with having no hobbies is additional risk
parameter that causes psychological disease on
him. That one example can be extended widely
with the growth of the database. More data
provided to model, more it will be decisive and
correct. If there are 2 persons that have relation
they can also have same problem. For instance, a
partner of current patient had this certain disease
4 months ago as she registered in this database. So
this AI can read all relations of humans even if
they are not bound together but have same
workplace that other patient had, this can be
important information for model to decide. This
complex relation may be difficult for us to see and
decide but it is not the same for a machine. It can
perform big analysis and diagnostic on a human
and identify multiple treatment or therapy
methods. And a doctor who is psychologist can
choose one or more of those method even modify
them. This kind of machine learning application is
not only about psychology field, it also can be
about other specific fields of medical applications.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusions</title>
      <p>In the new era of technology, Artificial
Intelligence will help humans to solve so many
problems in a short time without errors. It can
affect the human life as given example about
identifying the disease on human by giving
database as a training data for machine to learn. In
medicine, special surgeries, in oil manufacturing,
or in some dangerous places that humans should
not be operate, machines will replace and make
appropriate decisions like, or better than humans.
And that complex operations is starting with
handwritten number recognition task. It is
considered as “Hello-World” of Machine
Learning. But as we know every big roads starts
with first little steps.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Acknowledgements</title>
      <p>Thanks to head of cathedra: Doctor of
technical sciences Mrs. Lala Bakirova.</p>
    </sec>
    <sec id="sec-7">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Lau</surname>
          </string-name>
          , Suki. “
          <source>A Walkthrough of Convolutional Neural Network - Hyperparameter Tuning": Medium. 23 August</source>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bishop</surname>
            ,
            <given-names>C. M.</given-names>
          </string-name>
          “
          <article-title>Pattern Recognition and Machine Learning”</article-title>
          , Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Ethem</given-names>
            <surname>Alpaydin</surname>
          </string-name>
          . “Introduction to Machine Learning (Fourth ed.)”
          <fpage>2020</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>[4] "AI has cracked a key mathematical puzzle for understanding our world"</article-title>
          <source>: MIT Technology Review 19 November 2020</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>[5] "Introduction to AI Part 1"</article-title>
          .
          <source>Edzion. 2020-12- 08. Retrieved</source>
          <year>2020</year>
          -
          <volume>12</volume>
          -09.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[6] "AN EMPIRICAL SCIENCE RESEARCH ON BIOINFORMATICS IN MACHINE LEARNING - Journal". Retrieved 28 October</source>
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Berlinski</surname>
            ,
            <given-names>David "</given-names>
          </string-name>
          <article-title>The Advent of the Algorithm"</article-title>
          .
          <source>Harcourt Books</source>
          .
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Zola</surname>
            ,
            <given-names>Andrew.</given-names>
          </string-name>
          "
          <source>Interview Prep: 40 Artificial Intelligence Questions"</source>
          .
          <source>Springboard Blog</source>
          .
          <year>2019</year>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>[9] Sizing the prize: PwC's Global AI StudyExploiting the AI Revolution"</article-title>
          . 11
          <string-name>
            <surname>November</surname>
          </string-name>
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <article-title>"Automation and anxiety"</article-title>
          .
          <source>The Economist. 13 January</source>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Lohr</surname>
          </string-name>
          ,
          <article-title>Steve "Robots Will Take Jobs, but Not as Fast as Some Fear, New Report Says"</article-title>
          .
          <source>The New York Times</source>
          . 2017
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Christopoulou</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tran</surname>
            <given-names>TT</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahu</surname>
            <given-names>SK</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miwa</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <year>Ananiadou</year>
          .
          <article-title>"Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods"</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          .
          <year>2020</year>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Brunn</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diefenbacher</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Courtet</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Genieys</surname>
            <given-names>W. "</given-names>
          </string-name>
          <article-title>The Future is Knocking: How Artificial Intelligence Will Fundamentally Change Psychiatry"</article-title>
          .
          <source>Academic Psychiatry. 2020</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Kocaballi</surname>
            <given-names>AB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ijaz</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laranjo</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Quiroz</surname>
            <given-names>JC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rezazadegan</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tong</surname>
            <given-names>HL</given-names>
          </string-name>
          , et al.
          <article-title>"Envisioning an artificial intelligence documentation assistant for future primary care consultations: A co-design study with general practitioners"</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          .
          <year>2020</year>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Kobie</surname>
            <given-names>N.</given-names>
          </string-name>
          <article-title>"DeepMind's new AI can spot breast cancer just as well as your doctor"</article-title>
          .
          <source>Wired UK. Wired</source>
          . 2020
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>