<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Likelihood Asymptotics for Changepoint Problem</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>CCS Concepts</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>K. O. Obisesan Department of Statistics University of Ibadan Nigeria</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>9</lpage>
      <abstract>
        <p>Changepoint problems are often encountered when series undergo abrupt changes or discontinuities. Detecting changepoints can signal useful actions towards sustainable developments. However the presence of changepoints have often been known to lead to failure of some regular assumptions. In theory much has not been done on which assumptions fail and to what extent will it affect the score functions of the likelihood asymptotic. In this work we concentrate on simulating the likelihood function using R to establish the failure of regular assumption due to the presence of changepoint. The failure of regular assumption is established using various score functions coded in R thereby making it possible to understand the statistical theory and the consequences of the failure of assumptions as a result changepoints.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>likelihood
asymptotic,
regular
assumption,</p>
    </sec>
    <sec id="sec-2">
      <title>1. INTRODUCTION</title>
      <p>Changepoints are referred to as discontinuities that can lead to
non-linearity even in complex functions (Chen and Gupta, 2000).
The causes of changepoints include; changes in locations of
observations, equipment, measurement methods, environmental
effects, regulations, standards and so on. Generally we need to
investigate the potential presence of possible changes in the data
set indicating data quality problems that should be resolved prior
to any subsequent analysis. This will therefore signal signs for
timely protection and knowing this could be highly advantageous
in planning for the future. However Yang.et.al.(2006) noted that
changes do occur even in the best regulated systems. They
indicated that discrepancies in records, occasional disagreement
between documentation and data, abnormal data entry, changed
units of measurement and other problems require adequate
attention. Most times we need to detect the number of
changepoints, or jumps, and their locations whereas it is noted in
Mainly(2001) that it is much easier if the point of change is
known. This case is referred to as intervention analysis. In</p>
      <p>It is indicated in Obisesan.et.al(2013) that the data analysed were
the physico-chemical properties of water samples obtained from
two reservoirs in Oyo State Nigeria. The data were seen to contain
some abrupt changes in behaviour. In the work various charts and
diagrams were engaged in showing the positions and locations of
changepoints and the likelihood function was written to show the
single changepoint detection. However the theory on changepoint
linking failure of assumption was not shown therefore this present
work attempts to extend the likelihood theory to show the
implications of failure of regular assumptions as a result of the
presence of changepoint.</p>
    </sec>
    <sec id="sec-3">
      <title>2. STANDARD TECHNIQUES OF</title>
    </sec>
    <sec id="sec-4">
      <title>LIKELIHOOD ASYMPTOTICS</title>
      <p>To study the inference of changepoint problems such as to
understand its non-standard nature it is important to review some
properties of likelihood functions. The likelihood function for a
scalar parameter based on data as a collection
of independence observations is defined to be
( | )
(
)
∏ (
)
which is simply the joint density of the data, regarded as a
function of the parameter (Rice, 2007). For convenience, we study
the log-likelihood function ( | ) ( ) and write
( )
( | )
(
)
(
)
(</p>
      <p>)
∑
(
)
The maximum likelihood estimate of ̂ which is a value of
that minimizes the log-likelihood function. If the likelihood
function is a differentiable function of then ̂ will be the root
( ) ( )</p>
      <sec id="sec-4-1">
        <title>Moreover, for a local maximum we need</title>
        <p>at ̂</p>
      </sec>
      <sec id="sec-4-2">
        <title>The main assumptions here can be stated simply as</title>
        <p>Assumption 1 : The log-likelihood is a twice
differentiable function.</p>
        <p>Assumption
2 : The
second
derivative
( )</p>
        <p>at ̂
3. THE SCORE FUNCTION: SIMULATION
Under Assumption 1, the first derivative is usually called the
score function: ( ) [∑ ( )] and is regarded as a
function of for fixed X This function plays a central role in
maximum likelihood theory. We can also define the observed
information as
( )
( )
∑
( )
∑
(
)
which is a sum of
defined as
components. Also the Fisher information is
( )
{
( )
}
( )
and
Which can be written
( )
{</p>
        <p>∑
( )
∑
(
(
)}
)
}
{
( )</p>
      </sec>
      <sec id="sec-4-3">
        <title>Where</title>
        <p>( ) refers to single observation information.</p>
        <p>
          Now we show some characteristics of the score function when
data are assumed generated from ( ) so that (assumed true
value of is the parameter to be estimated. If we have an
independent and identically distributed sample of size n, the
loglikelihood is written as
( )
∑
( | )
( )
A careful illustration of the behavior of the score function is given
in Figure 1. This allows the sampling variation of score function
for different models (Normal, Poisson, Binomial and Cauchy) for
samples of size . Figure 1(a) shows 25 score function,
each based on independent and identically distributed sample of
size from N(
          <xref ref-type="bibr" rid="ref1 ref4">4, 1</xref>
          ). Each function is exactly linear and the
score varies around 10 at the true parameter . Figure
1(b) shows score function for 25 independent samples of size 10
from a Poisson distribution with mean 4 (Each function looks
approximately linear) and at the true parameter the score
function also varies around 0. Figure 1(c) shows score function of
25 independent samples of size n=10 from binomial (10, 0.4)
where In Figure 1(d), the score function for Cauchy
distributions (also based on 25 independent samples of size 10)
are rather irregular and fail to behave as the previous models
(although the score function also varies around 0 at but
there is the potential for multiple roots to the score equation). This
case indicates problems with a complicated likelihood.
        </p>
        <p>In all examples in Figure 1, the score varies around zero at the
true parameter value. We can show this is generally the case.
Recall from Section 3 that the score function is the first derivative
of the log-likelihood function where we set ( ) ( ), then
at the true value of
[ ]
The major assumption here is needed to justify interchanging the
order of differentiation and integration and can be stated in</p>
        <sec id="sec-4-3-1">
          <title>Assumption 3 as</title>
          <p>Assumption 3 : The range of integration does not depend on
since ( (
say</p>
        </sec>
      </sec>
      <sec id="sec-4-4">
        <title>This implies that Now</title>
        <p>Therefore using the stated assumptions we have ( ( ))
as required. We have also find the variance of the score function}
as
( (
))
(
(
))
(
( ( (
)]
)))
))
as seen above. We can rewrite (
) as
[</p>
        <p>(
( (</p>
        <p>))
Since the score function is a sum of n independent random
variables, the last equation above shows that
( (</p>
        <p>))
contradicts Assumption 4).</p>
      </sec>
      <sec id="sec-4-5">
        <title>Then for an arbitrary value</title>
        <p>The discussion so far has dealt with the behavior of the score
function at the true parameter value. We now consider its
behavior at other values of . In general (we need to investigate a
case that indicate the existence of changepoint) for = we find
that there may be need for another assumption:
Assumption 4: For the density
( ) on a set of non zero measure.
(</p>
        <p>) differs from
unless
for all x (which itself
therefore we have
# ............................... Poisson Score Functions:
t0&lt;- 4
x&lt;- rpois(n,t0)
theta&lt;- seq(t0/2,t0*2,len=40)
stheta&lt;- -n + sum(x)/theta
plot(theta,stheta,type='n',xlab=expression(theta),</p>
        <p>
          ylab='Score',ylim=c(
          <xref ref-type="bibr" rid="ref15">-5,15</xref>
          ),cex=.6)
for(i in 1:20){
x&lt;- rpois(n,t0)
stheta&lt;- -n + sum(x)/theta
lines(theta,stheta,lwd=.1)
)))
) ( | )
        </p>
        <p>( | )</p>
        <p>( | )
*∑
( | )</p>
        <p>+
)
and
( | )
( | )+ is finite for all . Therefore
Therefore as</p>
        <p>the
) tends to a deterministic function
( )
consider
)
)
( )
( )</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. SIMULATION CODE WITH R.</title>
      <p>In this section the R code used in simulating the likelihood
functions for the Normal and Poison distributions are stated as run
from the prompt. The Binomial and Cauchy distributions follow
similar way. After simulating from the distributions the likelihood
functions are plotted to show the distribution of the parameter. It
is clear from the code that the expected value of the score function
moves around 0.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Consistency of Maximum Likelihood</title>
    </sec>
    <sec id="sec-7">
      <title>Estimators</title>
      <p>We now consider whether ̂ is a constant estimator of</p>
      <sec id="sec-7-1">
        <title>Taylor expansion for ( ) around , we have</title>
      </sec>
      <sec id="sec-7-2">
        <title>Using a</title>
      </sec>
      <sec id="sec-7-3">
        <title>For some</title>
      </sec>
      <sec id="sec-7-4">
        <title>In particular, when</title>
        <p>( )
(
(</p>
        <p>|
Which can be rewritten as
̂
̂
)
( )
( )
|
( )</p>
        <p>(
) and so we can write
( ) ( )
)</p>
        <p>|
( )
|
̂ then we have (nothing that ( ̂)
( )
( ) ( )
)
( )
Note that the numerator of Equation 6 approaches 0 as If
we assume that the denominator is guaranteed nonzero, then
Equation 6 implies that and therefore ̂ . This
requires the following assumption which can be seen as a
strengthened version of Assumption 2.</p>
        <sec id="sec-7-4-1">
          <title>Assumption 5:</title>
          <p>is non- zero in an interval containing
.
6. Limiting Distribution of ̂
As well as demonstrated the consistency of the maximum
likelihood estimator ̂, Equation 5 allows us to establish its
distribution when n is large. Recall again that
Moreover, ( ) is a sum of independent and identically
distributed contributions. Hence from the central limit theorem we
have asymptotically,
( )
|
]
|
( )
[</p>
          <p>|
( ( ) )and
|
|
( )
since it lies
̂
√</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>7. Limiting Chi-Square Distributions:</title>
    </sec>
    <sec id="sec-9">
      <title>Likelihood Ratio Statistic</title>
      <p>Now we discuss the basic test statistic used for testing hypothesis
using the principles of likelihood functions. Suppose that l(.) is the
log-likelihood established from the probability density f . then the
consistency of ̂ implies that we can write
( )
( ̂)
( ̂
) ( ̂)
( ̂
)
( )</p>
      <sec id="sec-9-1">
        <title>Where is between and .</title>
        <p>Then representing the likelihood ratio statistic with
( ( ̂)</p>
        <p>( )) gives
and since (̂)
( ̂
) ( ̂)
( ̂
)
(
)
( )( ̂
by definition we can write that</p>
        <p>( ) ( )
)
( )</p>
        <p>( )
( )( ̂</p>
        <p>)
( )
( )
( )
( )
( )
It is clear that the first part of Equation 8 is asymptotically the
square of a standard normal random variable and it is therefore a
distribution in addition, the last two ratios (( )) and (( ))
tend to 1 using similar arguments to those applied in the previous
subsection. In the same direction, we can obtain the
distribution for a case when is vector (without proof) in that as
above we write [ ( ̂) ( )] ( ̂ ) ( )( ̂
) It is therefore noted that ( ) has an approximate chi-square
distribution on p degree(s) of freedom for repeated sampling of
data from the model. We can write ( )</p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>7.1 The two-mean model</title>
      <p>In Obisesan et.al(2013), the development of changepoint detection
was based on Hinkley(1970) work. Hinkley(1970) considered
sequences of random variables and discussed the point at which
the probability distribution changes using a normal distribution
with changing mean. The asymptotic distribution of the maximum
likelihood estimate discussed in this paper is particularly relevant
to change-point. The author indicated the simplest model over a
whole range of data as ( ) as usual
where ( )is a mean function and refer to error terms. Hinkley
(1970) computed the asymptotic distribution in the normal case
when $\theta_0$ and $\theta_1$ are unknown. The asymptotic
dsitribution is found to be the same when the mean levels are
known. The two-mean model to be considered supposes that there
exist a mean ( ) and mean ( ) and
respectively. He also computed the asymptotic
distribution of the likelihood estimate of the change-point
̂ (where and are known and is unknown) is obtained
from a sample by simply maximizing the likelihood
function of the form
(
)
∏ (
) ∏
(
)
which can be written in form of log likelihood as
(
)
∑
(
) ∑
(
)
( )
Moreover, many cases arise when the mean levels are not known.
The log-likelihood of the observed sequence ( ) is
(
|</p>
      <p>)
{∑(</p>
      <p>)
∑ (
) }
( )
If we assume that
estimators
∑
and ̂
respectively
are</p>
      <p>̂
is known therefore the maximum likelihood
∑
̂
∑ (
) ∑
(
Particularly for convenience. Hinkley (1970) substituted
as known so that Equation 10 becomes
( | )
)
)
) }</p>
      <p>( )
{∑(
∑ (</p>
      <sec id="sec-10-1">
        <title>Assuming that is unknown and putting the maximum</title>
        <p>likelihood estimates of and back into the log-likelihood in
Equation 11 and re-arranging the emerging sums of squares
conditional on t Equation 11 was used to estimate changepoint of
water pollution in Eleyele and Asejire reservoirs in Nigeria. This
confirms the application of the likelihood theory of changepoint.
More on the applications are discussed in Obisesan(2011, 2015).</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>8. RESULTS</title>
      <p>In this work it has been shown that changepoint arises as a
result of failure of some regular assumptions specifically in this
case Assumptions 1 and Assumptions 4 may fail. This work has
justified using simulation in the theory of likelihood function for
the score functions to show the change in parameter allowing
changepoint to occur. The work also justifies the application of
changepoint detection as used in Obisesan et.al(2013). The use of
R has therefore made it possible to show the failure of regular
assumption.</p>
    </sec>
    <sec id="sec-12">
      <title>9. CONCLUSION</title>
      <p>Single changepoint detection has been discussed in the framework
of the failure of regular assumptions that have not been commonly
noticed. Likelihood function was used to merge the two-mean
levels and various score functions were simulated using the
successful statistical computing language R. The theoretical
implications of failure of regular assumptions were discussed and
the failed assumption identified using R . This work has therefore
provided a basis for using computational statistics methods in
solving a mathematical problem.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J. amd Gupta</given-names>
          </string-name>
          ,
          <string-name>
            <surname>A.L.</surname>
          </string-name>
          (
          <year>2000</year>
          ):
          <article-title>Parametric statistical change Point Analysis</article-title>
          . Birkhauser Boston.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Easterling</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          and Peterson,
          <string-name>
            <surname>T.C.</surname>
          </string-name>
          (
          <year>1995</year>
          ).
          <article-title>A New Method for Detecting Undocumented Discontinuities in Climatological Time Series</article-title>
          .
          <source>Int.J. Climatol</source>
          ,
          <volume>15</volume>
          :
          <fpage>369</fpage>
          -
          <lpage>377</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Hanesiak</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.L.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Adverse Weather Trends in the Canadian Arctic J</article-title>
          .Climate,
          <volume>18</volume>
          (
          <issue>2</issue>
          ):
          <fpage>3140</fpage>
          -
          <lpage>3156</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Hinkley</surname>
            ,
            <given-names>D.V.</given-names>
          </string-name>
          (
          <year>1970</year>
          ).
          <article-title>Inference about the Change-point in a Sequence of Random Variables</article-title>
          . Biometrika,
          <volume>57</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Jandhyala</surname>
            ,
            <given-names>V.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fotopoulus</surname>
            ,
            <given-names>S.B.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Evagelopoulos</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>1999</year>
          ).
          <article-title>Change-point Methods for Weibull Models with Applications to Detection of Trends in Extreme Temperature</article-title>
          . Environmetrics,
          <volume>10</volume>
          :
          <fpage>547</fpage>
          -
          <lpage>564</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Jandhyala</surname>
            ,
            <given-names>V.K.</given-names>
          </string-name>
          , and MacNeil,
          <string-name>
            <surname>I.B.</surname>
          </string-name>
          (
          <year>1986</year>
          )
          <article-title>The Changepoint Problem: A review of Applications. Elsevier: In statistical Aspects of Water Quality Monitoring (Eds.</article-title>
          <string-name>
            <given-names>A.H.</given-names>
            <surname>ElShaarawi and R.E</surname>
          </string-name>
          . Kwiatkowski), pages
          <fpage>381</fpage>
          -
          <lpage>387</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lund</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Seymour</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>An update of US temperature trends</article-title>
          .
          <source>J. Climate</source>
          ,
          <volume>18</volume>
          :
          <fpage>4906</fpage>
          -
          <lpage>4919</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Manly</surname>
            ,
            <given-names>B.F.L.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Statistics for Environmental Science and Management</article-title>
          . Chapman Hall/CRC, Boca Raton.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Obisesan</surname>
            <given-names>K.O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lawal</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bamiduro</surname>
            <given-names>T.A</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Adelakun</surname>
            <given-names>A.A</given-names>
          </string-name>
          (
          <year>2013</year>
          )
          <article-title>: Data Visualization and Changepoints Detection in Environmental Data: The Case of Water Pollution in Oyo State</article-title>
          ,
          <source>Nigeria: Journal of Science Research</source>
          : Vol.
          <volume>12</volume>
          :
          <fpage>181</fpage>
          -
          <lpage>190</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Obisesan K.O(2015) Modelling Multiple Changepoints Detection Ph.D Thesis</surname>
          </string-name>
          , Department of Statistics, University of Ibadan.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Obisesan</surname>
            <given-names>K.O</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Changepoint Detection in Time-Series with Hydrological Applications M</article-title>
          .
          <source>Phil Thesis</source>
          , Department of Statistical Science, University College London.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>E.S.</given-names>
          </string-name>
          (
          <year>1954</year>
          ).
          <source>Continuous Inspection Schemes. .Biometrika</source>
          ,
          <volume>41</volume>
          :
          <fpage>100</fpage>
          -
          <lpage>114</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>E.S.</given-names>
          </string-name>
          (
          <year>1955</year>
          ).
          <article-title>A test for a Change in a Parameter Occuring at an Unknown Time Point</article-title>
          . Biometrika,
          <volume>42</volume>
          :
          <fpage>523</fpage>
          -
          <lpage>526</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>E.S.</given-names>
          </string-name>
          (
          <year>1957</year>
          ).
          <article-title>On Problems in which a Change in Parameter Occurs at Unknown Point</article-title>
          . Biometrika,
          <volume>44</volume>
          :
          <fpage>248</fpage>
          -
          <lpage>252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Rice</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <source>Mathematical Statistics and Data Analysis. Duxbury.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.L.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Climatology and Trends in Some Adverse and Fair Weather Conditions in Canada,</article-title>
          <year>1953</year>
          -
          <fpage>2004</fpage>
          . J.
          <source>Geophys. Res 111</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandler</surname>
            ,
            <given-names>R.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isham</surname>
            ,
            <given-names>V.S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Wheater</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Quality Control for Daily Observational Rainfall Series in UK</article-title>
          .
          <source>Water and Environment Journal</source>
          ,
          <volume>20</volume>
          :
          <fpage>185</fpage>
          -
          <lpage>193</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>