<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Efficacy of the POMDP-RTI Approach for Early Reading Intervention</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Umit Tokac</string-name>
          <email>ut08@my.fsu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Russell G. Almond</string-name>
          <email>ralmond@fsu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Educational Psychology and</institution>
          ,
          <addr-line>Learning Systems</addr-line>
          ,
          <institution>Florida State University</institution>
          ,
          <addr-line>Tallahassee, FL 32306</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <abstract>
        <p />
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>A POMDP is a tool for planning: selecting a
policy that will lead to an optimal outcome.
Response to intervention (RTI) is an approach to
instruction, where teachers craft individual plans
for students based on the results of progress
monitoring tests. Current practice assigns
students into tiers of instruction at each time
point based on cut scores on the most recent test.
This paper explores whether a tier assignment
policy determined by a POMDP model in a RTI
setting offer advantages over the current practice.
Simulated data sets were used to compare the
two approaches; the model had a single latent
reading construct and two observed reading
measures: Phoneme Segmentation Fluency (PSF)
for phonological awareness and Nonsense Word
Fluency (NWF) for phonological decoding. The
two simulation studies compared how the
students were placed into instructional groups
using the two approaches, POMDP-RTI and
RTI. This paper explored the efficacy of using a
POMDP to select and apply appropriate
instruction.</p>
      <p>
        INTRODUCTION
Statistics gathered by local school districts reflect that
roughly 30% of their first-grade students read below
grade level standards
        <xref ref-type="bibr" rid="ref18">(Matthews, 2015)</xref>
        . Moreover,
Landerl and Wimmer (2009) reported that 70% of
struggling readers in first grade continued to struggle in
eight grade when no intervention was provided.
        <xref ref-type="bibr" rid="ref17">Mastropieri, Scruggs, and Graetz (2003</xref>
        ) argued that
reading is the main problem for most students with
learning disabilities.
      </p>
      <p>
        <xref ref-type="bibr" rid="ref31">Torgesen (2004)</xref>
        asserts that reading consists of five
components: phonological awareness, phonological
decoding, fluency, vocabulary, and reading
comprehension. According to the Simple View Theory of
Reading Development
        <xref ref-type="bibr" rid="ref14">(Gough &amp; Tunmer, 1986)</xref>
        for
children at young ages, mastery of the first two
components, phonological decoding and phonological
awareness, generate the remaining three reading
components: fluency, vocabulary, and reading
comprehension. A lack of either phonological decoding
or phonological awareness affects the other components
and causes reading difficulties. Because the development
of reading skills is critical, instructors should identify
children with reading difficulties and provide additional
instructional support
        <xref ref-type="bibr" rid="ref12">(Catts, Hogan &amp; Fey, 2003)</xref>
        .
Response to intervention (RTI) is an educational
framework designed to identify students with difficulties
in reading and math, and intervene as early as possible by
providing more intensive instruction for students who
need it. The RTI approach divides instruction into Tiers;
each tier includes different intervention or instruction.
The RTI process starts with screening tests which monitor
general knowledge and skills of all students in the class.
The screening tests are administered on multiple
occasions during a school year. The screening test results
provide teachers with a rough estimate of each student’s
proficiency that guides the assignment of students into
appropriate tiers of instruction. RTI has produced good
results in both research and operational settings, and
hence is considered to be one of the evidence-based
practices for improving reading and preventing learning
disabilities
        <xref ref-type="bibr" rid="ref10 ref15">(Greenwood et al., 2011)</xref>
        .
      </p>
      <p>
        Ideally, the placement into Tiers of students in an RTI
program would be based on their unobservable true
proficiency. As this is unobservable, the placement
decision is instead made basis of the estimates of
proficiency from screening tests. Often in current
practice this is implemented through a cut score on the
most recent screening test. Naturally, a certain amount of
measurement error causes some students to be placed
incorrectly. Considering the entire (both students’
previous screen-tests results and changes in instruction)
history in account should improve the proficiency
estimates performance.
        <xref ref-type="bibr" rid="ref1">Almond (2007)</xref>
        suggested that this
could be done using a partially observed Markov decision
process (POMDP) — partially observed, because the true
student proficiency is latent; a decision process, because
the instructors decide what instruction or intervention to
use between measurement occasions.
      </p>
      <p>
        A POMDP is a probabilistic and sequential model. A
POMDP can be in one of a number of distinct states at
any point in time, and its state changes over time in
response to events
        <xref ref-type="bibr" rid="ref11">(Boutilier, Dean &amp; Hanks, 1999)</xref>
        . One
noteworthy difference between a RTI approach and a
POMDP model is that most RTI approaches use only the
latest test results to identify students’ proficiencies and
assign them to appropriate tier
        <xref ref-type="bibr" rid="ref21">(Nese et al., 2010)</xref>
        . We call
the approach the current-time only-RTI model. On the
other hand, a POMDP-RTI model is the combination of a
periodically applied screening test, and the RTI into a
POMDP model. Additionally, a POMDP considers the
students’ entire histories (both actions and test scores)
when determining appropriate interventions at in order to
identify their current abilities and forecast their future
abilities under competing policies. Therefore, a
POMDPRTI model should perform better than current-time
onlyRTI model.
      </p>
      <p>
        To test the last assertion, this paper compares the
POMDP-RTI model with the current-time only-RTI,
evaluating the predictive accuracy of each model, the
quality of the instructional plans produced and the reading
levels achieved at the end of the year. It does this through
simulation studies based on numbers obtained from fitting
the POMDP model to a group of kindergarten students in
an earlier RTI study
        <xref ref-type="bibr" rid="ref10">(Al Otaiba, Connor, Folsom,
Greulich, Meadows, &amp; Li, 2011)</xref>
        .
      </p>
      <p>
        METHOD
Two simulated datasets were used in order to address how
properly students are assigned to each tier based on their
latent reading score in the POMDP-RTI model compared
based on their observed score in the current-time
onlyRTI model. The initial value of the parameters were based
on a longitudinal Florida Center for Reading Research
(FCRR) study of reading proficiency
        <xref ref-type="bibr" rid="ref10">(Al Otaiba et al,
2011)</xref>
        and data sets were simulated based on the
        <xref ref-type="bibr" rid="ref1">Almond
(2007)</xref>
        model in order to produce realistic data for
answering the research question posed above. The
parameters of the simulation were chosen so that the
distribution of scores on the screening test were similar to
those of the Al Otaiba et al. study at both the initial and
final measurement period.
2.1 THE POMDP-RTI FRAMEWORK
        <xref ref-type="bibr" rid="ref1">Almond (2007)</xref>
        describes a general mapping of a POMDP
into an educational setting. It is assumed that the
student’s proficiency is measured at a number of
occasions. The latent proficiencies of the students is the
hidden layer of the POMDP model. The actual test scores
are the observable outcomes, and the instructional options
for the teacher between measurement occasions are the
action space. The utility is assumed to be an increasing
function of the latent proficiency variable at the last
measurement occasion; thus, it is finite time horizon
model.
proficiency variables at Measurement Occasion m, Rm,
and the observable multivariate outcome variables are
PSFm and NWFm on that occasion. Extending the ECD
terminology,
        <xref ref-type="bibr" rid="ref1">Almond (2007)</xref>
        calls the model for the Rm's,
the proficiency growth model. Following the normal logic
of POMDPs this is expressed with two parts: the first is
the initial proficiency model, which gives the population
distribution for proficiency at the first measurement
occasion. The second is an action, which gives a
probability distribution for change in proficiency over
time that depends on the instructional activity chosen
between measurement occasions.
      </p>
      <p>
        There are two notable differences between the POMDP
models used in this application and those commonly seen
in the literature. First, the models have a fixed and finite
time horizon, with the reward occurring only at the last
time step (although the actions at each step have a cost
which is subtracted from the reward). This removes the
need for the usual discounting of future rewards. The
second is that the Markov process in non-stationary (it is
hoped that the student’s abilities will improve over time).
This produces a potential identifiability issue, as growth is
difficult to distinguish between difficulty shifts in the
measurement instruments
        <xref ref-type="bibr" rid="ref7">(Almond, Tokac &amp; Al Otaiba,
2012)</xref>
        . Assuming that the screening tests have all be
equated, hence are on the same scale, takes care of the
identification issue. An alternative approach would be to
subtract the expected growth from the model, making the
latent proficiency variable represent deviations from the
expected growth model
        <xref ref-type="bibr" rid="ref6 ref8">(Almond, et al., 2014)</xref>
        .
2.1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Proficiency Growth Model</title>
      <p>The model from which the data was simulated was a
unidimensional model of reading with a single latent,
continuous variable: Rnm, the reading ability of individual
n on measurement occasion m. In this case, N was 300
students and M represented the three equally spaced time
points, t1, t2, t3. (RTI screening tests are typically given 3
times per year.)
This study assumed that a teacher provided general
instruction to all the students until the first time point, t1,
and that the initial ability distribution was normal,
R0 ~ N(0,1). As this is a purely latent variable, the scale
and location is arbitrary. Fixing the initial population to
have a standard normal distribution establishes the scale.
After analyzing the results of assessments administered
at t1, the teacher delivered additional and more intensive
instruction to students who were assigned to Tier 2, but
delivered only general instruction to students in Tier 1.
The tier to which student n is assigned at time m is
represented by a(n,m). The growth rate for the students is
assumed to depend on the tier assignment. Thus, for
measurement occasion m &gt; 1,</p>
      <p>Rnm = Rn(m-1) + γa(n,m) ΔTm + ηnm,
(1)
where</p>
      <p>ηnm ~N(0, σa(n,m) ∆  ),
and where ΔTm represents the elapsed time period
between measurement occasions m and m-1 for Tier 1 and
Tier 2. In this study, each school year was equal to 1, and
ΔTm was fixed and equal to 1/M (e.g. M = 3, so ).
The parameter γa(n,m) is a tier-specific growth rate and it
was fixed and had two different initial values for each
tier. We set γam = 0.9 for Tier 1, and γam = 1.2 for Tier 2.
The residual standard deviation, σa(n,m) ∆  , depends on
both a tier-specific rate, σa(n,m), and the length of time,
ΔTm, between measurements (thus, growth is occurring
via a non-stationary Brownian motion process). The
standard deviation of the growth per unit time, σa(n,m), was
fixed to 1 for both tiers.
2.1.2</p>
    </sec>
    <sec id="sec-3">
      <title>Evidence Model</title>
      <p>
        The evidence model involved two independent
regressions, one for each observed variable i. These two
observable variables were chosen because they are critical
reading components for later reading performance in the
first two years of elementary school
        <xref ref-type="bibr" rid="ref26">(Rock, 2007)</xref>
        . Let
Ynmi be the observation for individual n at measurement
occasion m on observed variable i of the proficiency
variables, then:
      </p>
      <p>Rn0 ~ N(0,1)
Ynmi = ai + biRnm +    ,
    ~ N(0, ωi).</p>
      <p>(2)
The reliability of the instruments can be used to determine
b and ω. The reliability of an observed variable i at any
time point was represented as ri. In classical test theory,
the reliability is the squared correlation coefficient
between the true score and the observed score of the
student. This definition translates into an equation as
ri = 1- (Varn(ϵnmi)/ Varn(Ynmi))
where Varn(.) indicates that the variance comes from
individuals (where measurement occasion and instrument
are considered as constant). Then
bi =    /   *√ 2</p>
      <p>and
ωi =   *√1 −  2
In order to make ri = .45 at each time point, tm, for the
measurement of each skill on observed variable i, bi = .98
and ωi = .65 was used at tm. These numbers are
comparable to reading measures commonly used with 1st
grade students. At this point, the model is very close to
the model described in Almond, Tokac and Al Otaiba
(2012), except that the previous work assumed all
students were in the same Tier. Appropriate values for a
and b depend on the scale of the instruments chosen. The
values used in the simulation were chosen so that the
mean and standard deviation of the simulated data
matched the data set from Al Otaiba et al. (2011) at the
first and last time points.
2.1.3</p>
      <sec id="sec-3-1">
        <title>Decision Rules</title>
        <p>The key research question compares the performance of
the system under two different policies. The first is a
fixed decision rule implicit in the current-time RTI policy:
Students who are below a cut-score on either of the two
screening tests are placed into Tier 2 instruction. The
second policy is the optimal policy found by solving the
POMDP. Implementing this policy requires an explicit
specification of the utility function and the cost function
for the instructional options.</p>
        <p>Many RTI implementations used the reference score
(general class median score or some other percentile rank)
as a cut score for assigning each student to either the
Tier 1 or Tier 2 group. The simulated model used
different Tier 2 for each of the two screening tests (NWF
and PSF) giving four possible Tier assignments. For
instance, if a student’s score on the NWF test is lower
than the cut score for NWF but higher for PSF, the
student was assigned to Tier 2 for NWF and Tier 1 for
PSF. (This differs slightly from the common practice
which would put students who fail to meet the cut on
either measure into a single Tier 2.)
The POMDP forecasts expected learning under each
possible outcome and assigns students to tiers in a way
that balances the expected learning gains with the cost of
instruction. The utility function is the expected gain at the
last time point and the cost function is the sum of costs of
applied instruction at each state. The benefit is always
higher for Tier 2, as is the cost. However, the cost exceeds
the utility of the benefit for some regions of the
distribution because the utility is nonlinear, while for
other regions it does not.</p>
        <p>
          The contact hours with the instructor drive the cost of
each block. Cost is high for more intensive instruction in
Tier 2, and, without loss of generality, it is zero for Tier 1,
as all students receive Tier 1 instruction. The cost
function consists of three components: the frequency with
which the group meets, fa, the duration of the meeting
time, da, and size of the group, ga
          <xref ref-type="bibr" rid="ref6 ref8">(Almond &amp; Tokac,
2014)</xref>
          . Then
c(a) = k fa da/ga ,
(3)
represents the model cost of taking action or activity a in
state s, where k is a constant used to put the cost function
on the same scale as the utility function. In this study, the
cost value was fixed at c(Tier 2) = 0.1 and c(Tier 1) = 0.
The utility function is
u(RM) = logit-1(α(RM -β)).
(4)
In this equation α and β are fixed parameters; β is a
proficiency target, which is on the scale of the internal
latent variable RM. Specifically β = 0.5 for Tier 1 and β=
0.1 for Tier 2. Also, α is a slope parameter, and α= 0.8 for
both Tier 1 and Tier 2. High values of α favor bringing
students near proficiency standards above the proficiency
target β, while low values of α give more weight to
enriching students at the high end of the scale and
providing remediation at the low end of the scale
          <xref ref-type="bibr" rid="ref6 ref8">(Almond &amp; Tokac, 2014)</xref>
          . (Almond &amp; Tokac
alternatively recommend using a probit function in place
of a logit, so that α becomes effectively a standard
deviation; however, the as the shape of the logit and
probit curves are so similar, we expect the results using a
probit curve would be similar as well.)
In this case, the total reward is u(RM) – c(a(s,2)) –
c(a(s,3)). The difference between the utility function and
the cost function is the total reward for getting the student
to proficiency level Tier 1 using instruction a(s,2) and
a(s,3) between measurements 1 and 2, and 2 and 3. The
reward is the basis for the assignment of each student to
Tier 1 or Tier 2. The POMDP model forecasts the
expected reward, and balances that with cost during each
period.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>2.2 SIMULATION DESIGN</title>
        <p>The initial value of the simulated data student distribution
at time 0 was based on the FCRR data set (Al Otaiba,
2007). In the FCRR data, the correlation between NWF
and PSF was .65. The simulation generated latent
proficiency variables for each simulee, and simulated
scores on the reading scores on the NWF and PSF test
administered at t1, t2 and t3 in the model. At each time
point, the correlation coefficient between NWF and PSF
was around 0.65 and the same growth and measurement
error residuals were used for both the POMDP-RTI and
current-time only-RTI models.</p>
        <p>
          The proficiency growth model and evidence model
parameters were estimated from the simulated data
through Markov Chain Monte Carlo (MCMC) simulation
using JAGS
          <xref ref-type="bibr" rid="ref22">(Plummer, 2003)</xref>
          . Four independent Markov
chains with random starting positions were used with
500000 iterations. This is consistent with standard
practice
          <xref ref-type="bibr" rid="ref13 ref20">(Gelman, Carlin, Stern &amp; Rubin, 2004; Neal,
2010)</xref>
          .
          <xref ref-type="bibr" rid="ref30">Tokac (2016)</xref>
          describes tests done
convergence and parameter recovery with this model.
for
        </p>
        <p>RESULTS
Data were simulated for students under two different
policies, (1) current-time only-RTI policy where students
are assigned to Tier 1 or Tier 2 based on a cuts scores on
the PSF and NWF tests at the most recent time point, and
(2) a POMDP-RTI policy where each student is assigned
to the tier that maximizes the expected utility for that
student. This resulted in two different simulated series:
  ˇ was the true reading ability under the current-time
only cut score policy and  ^ was the true reading ability
under the POMDP-RTI policy. Note that the two
simulations used the same residuals in equation (1)
(growth residual ηnm) and equation (2) (measurement error
    ). Thus, they differed only by the value of the growth
rate parameter, γa(n,m) , used in equation (1).
Table 3 shows the pattern of Tier assignment under the
two models. At the second time point, the two policies
behave roughly the same assigning the lowest performing
50% of students to Tier 2. However, at the third time
point, substantially fewer students are assigned to Tier 2
under the POMDP-RTI policy. This might be a result of
better placement policies, or simply that the Tier 2
support is less needed in the latter part of the school year.
Table 4 breaks down the differences between the two
policies at time point 3. Recall that the students were
classified into Tiers independently based on the PSF and
NWF measures, resulting effectively in four different
classifications: 1-1 (both in Tier 1), 1-2, 2-1 (mixed), and
2-2 (both Tier 2). Table 4 shows the number of students
who were classified into one of the four groups who were
classified into a different group by the other policy.
Slightly over half (151) students were assigned different
instruction under the different policies.
Thus, there is a fair bit of difference in the placement, but
which placement is better? As this is a simulation
student, the true abilities are known it should be possible
to determine an ideal placement based on the known
simulated abilities. However, the abilities,   ˇ and  ^ ,
are different in the two branches of the assessment
(because a different policy was actually employed).
Therefore, the ideal placements will be different under
each policy.</p>
        <p>In determining the ideal placement, the two mixed
assignments, 1-2 and 2-1, were combined into a single
mixed tier. Cut scores on the latent ability variable were
calculated based on the utilities in equations (3) and (4)
and a single growth step after the last measurement: the
students with abilities higher than 0.1 should be placed
into Tier 1, those lower than -0.4 into Tier 2 and students
in between into the Mixed Tier. Both policies used the
same cut points for determining the ideal placement, but
because the abilities were different, the actual ideal
placement could be different for the two students under
the same policy at Time 3.</p>
        <p>Table 5 presents the number of students placed in each
tier under the actual and ideal placements under both
policies. It also presents a measure of agreement which is
the number of students assigned to that tier in the ideal
placement that were actually assigned to the Tier. The
POMDP-RTI does well under that metric, with all of the
students who should be placed into Tier 1 or 2 correctly
placed in that tier. This policy only had problems with
the mixed tier, with 35% of the students being incorrectly
placed in Tier 1 or Tier 2.</p>
        <p>
          The current-time only-RTI policy did not fare as well.
First, note that under the ideal placement for this policy
fewer students would be in the high-performing Tier 1
group. This is likely due to incorrect assignment at
Time 2. Next, note that agreement rates are lower. So the
POMDP-RTI model did better on two important metrics.
To summarize the agreement numbers, we used Goodman
and Kruskall’s lambda
          <xref ref-type="bibr" rid="ref5">(Almond, Mislevy, Steinberg,
Yan, and Williamson, 2015)</xref>
          . Usually, this adjusts the
raw agreement rate by subtracting out the agreement with
a classifier which simply classifies everybody at the
modal category (which would be the mixed tier for both
policies). However, Tier 1 has a special meaning in the
context of RTI; Tier 1 is the normal whole-class
instruction that is given regardless of the test score.
may have been influenced by the use of the same utility
model used in the POMDP to define ideal placement.
Therefore, by using Tier 1 as the baseline in lambda, the
result is a statistic that describes how much better the RTI
is performing than undifferentiated whole class
instruction. Let ki be the number of students correctly
classified into Tier i, and let kTier1 be the number of
students who should ideally be assigned to Tier 1. Then
λ = ∑   −    1
        </p>
        <p>−    1
Like a correlation coefficient, the value of lambda ranges
between -1 and 1, with 0 representing a classifier which
does no better than simply assigning everybody to the
model category. If it is 1, it means that the policy did a
perfect job of assigning students to the ideal tier. Using
the data in Table 5, λ = 0.74 for POMDP-RTI, λ = 0.51
for Current-time only RTI. So RTI does better than
undifferentiated instruction, but the POMDP-RTI policy
also does better than the current-time only-RTI.</p>
        <p>CONCLUSION
As expected, a policy produced by a POMDP (which is
designed to produce optimal policies) performed better
than current-time only cut-score policy current used in
many RTI implementations. In particular, the
POMDPRTI had a better agreement with the ideal placement (λ =
0.74) than the current-time only model did (λ = 0.51).
The likely reason for the better performance is that the
POMDP model is better able to use the entire student
record, both the history of assessments and instruction
and multiple tests taken at the same time to build a more
accurate estimate of student proficiency, although some
The cut-score approach currently in common use does
have one clear advantage over the POMDP model: it is
simpler to implement and explain. However, if the
POMDP recommendations were integrated into an
electronic gradebook, it might be better received by
teachers. However, while teachers may not feel the need
for the POMDP software to address the Tier 1/Tier 2
placement, there is another aspect of the RTI framework
which was not addressed in this study. During Tier 2,
students receive regular progress monitoring assessments,
and the teacher is supposed to be making fine-grained
adjustments if the student is not responding to the
intervention (hence the name response-to-intervention).
In particular, the teachers can adjust the intensity of the
intervention (equation 3) adding more time on task if
needed, or using less support if the teacher is appearing to
do well. This is a target of opportunity for the POMDP
model, as teachers have responded favorability to the idea
of computer support to help them with tracking and
intervention adjustment for Tier 2 students.1 The present
work shows that POMDPs are a promising approach to
this problem.</p>
        <p>Another limitation of the current work is that it assumes
all students grow at the same rate under each of the
instructional conditions (e.g., given the tier placement).
In practice, many studies looking at RTI have found that
students grow at different rates, with a low growth rate
often corresponding to low initial ability.2 While this
adds complexity to the model, we think that the POMDP
framework will help educators make optimal policy
decisions with this additional information.</p>
        <p>Acknowledgements
We would like to thank the Florida Center for Reading
Research for allowing us access to the data used in this
paper. The data were originally collected as part of a
larger National Institute of Child Health and Human
Development Early Child Care Research Network study.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Cognitive modeling to represent growth (learning) using Markov decision processes</article-title>
          .
          <source>Technology, Instruction, Cognition and Learning (TICL)</source>
          ,
          <volume>5</volume>
          ,
          <fpage>313</fpage>
          -
          <lpage>324</lpage>
          . Retrieved
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          1
          <string-name>
            <given-names>Joe</given-names>
            <surname>Nese</surname>
          </string-name>
          , U. Oregon, private communication.
          <source>May 16</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          2
          <string-name>
            <surname>Young-Suk</surname>
            <given-names>Kim</given-names>
          </string-name>
          , Florida State University.
          <source>Private communication. March</source>
          <volume>31</volume>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Estimating Parameters of Periodic Assessment Models (Repot No</article-title>
          . RM-
          <volume>11</volume>
          -06).
          <article-title>Educational Testing Service</article-title>
          . Retrieved from http://www.ets.org/research/policy_research_rep orts/rm-11-06.pdf
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mislevy</surname>
            ,
            <given-names>R. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinberg</surname>
            ,
            <given-names>L. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Williamson</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <source>Bayesian Networks in Educational Assessment</source>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldin</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Vertical and Stationary Scales for Progress Maps</article-title>
          . In J Stamper,
          <string-name>
            <given-names>Z</given-names>
            <surname>Pardoz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M Mavrikis</given-names>
            , &amp;
            <surname>B. M. McLaren</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 7th International Conference on Educational Data Mining, London, England. Society for Educational Data Mining</source>
          .
          <fpage>169</fpage>
          -
          <lpage>176</lpage>
          . Retrieved from http://educationaldatamining.org/EDM2014/uplo ads/procs2014/long%20papers/169_EDM-2014- Full.pdf
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>G. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tokac</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          , &amp; Al Otaiba,
          <string-name>
            <surname>S.</surname>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Using POMDPs to Forecast Kindergarten Students' Reading Comprehension</article-title>
          . In Agosta,
          <string-name>
            <given-names>J. M.</given-names>
            ,
            <surname>Nicholson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            , &amp;
            <surname>Flores</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. J</surname>
          </string-name>
          . (Eds.),
          <source>The 9th Bayesian Modeling Application Workshop at UAI</source>
          <year>2012</year>
          . Catalina Island, CA. Retrieved from http://www.abnms.org/uai2012-appsworkshop/papers/AlmondEtal.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tokac</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          (
          <year>2014</year>
          , November).
          <article-title>Using Decision Theory to Allocate Educational Resources</article-title>
          . Paper presented at Annual Meeting, Florida Educational Research Association, Cocoa Beach, FL.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hemat</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Parameter Recovery Studies with a Diagnostic Bayesian Network Model</article-title>
          . Behaviormetrika,
          <volume>35</volume>
          (
          <issue>2</issue>
          ),
          <fpage>159</fpage>
          -
          <lpage>185</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Al</given-names>
            <surname>Otaiba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Folsom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            ,
            <surname>Schatschneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Wanzek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Greulich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Meadows</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            , &amp;
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>Z.</surname>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Predicting first grade reading performance from kindergarten response to instruction</article-title>
          .
          <source>Exceptional Children</source>
          ,
          <volume>77</volume>
          (
          <issue>4</issue>
          ),
          <fpage>453</fpage>
          -
          <lpage>470</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Boutilier</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hanks</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>1999</year>
          ).
          <article-title>Decisiontheoretic planning: Structural assumptions and computational leverage</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>11</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>94</lpage>
          . Available from citeseer.ist.psu.edu/boutilier99decisiontheoretic. html
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Catts</surname>
            ,
            <given-names>H. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>T. P. E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Fey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Subgrouping poor readers on the basis of individual differences in reading-related abilities</article-title>
          .
          <source>Journal of Learning Disabilities</source>
          ,
          <volume>36</volume>
          ,
          <fpage>151</fpage>
          -
          <lpage>164</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Gelman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carlin</surname>
            ,
            <given-names>J. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stern</surname>
            ,
            <given-names>H. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Rubin</surname>
            ,
            <given-names>D. B.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Bayesian Data Analysis</article-title>
          .
          <source>Boca Raton</source>
          , FL: Chapman and Hall.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Gough</surname>
            ,
            <given-names>P. B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tunmer</surname>
            ,
            <given-names>W. E.</given-names>
          </string-name>
          (
          <year>1986</year>
          ). Decoding, reading, and reading disability.
          <source>Remedial and Special Education</source>
          ,
          <volume>7</volume>
          ,
          <fpage>6</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Greenwood</surname>
            ,
            <given-names>C. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bradfield</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaminski</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Linas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carta</surname>
            ,
            <given-names>J. J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Nylander</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>The Response to Intervention ( RTI ) Approach in Early Childhood</article-title>
          .
          <source>Focus on Exceptional Children</source>
          ,
          <volume>43</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Landerl</surname>
            <given-names>K</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Wimmer</surname>
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2008</year>
          )
          <article-title>Development of word reading fluency and spelling in a consistent orthography: An 8-year follow-up</article-title>
          .
          <source>Journal of Educational Psychology</source>
          .
          <volume>100</volume>
          (
          <issue>1</issue>
          ):
          <fpage>150</fpage>
          -
          <lpage>161</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Mastropieri</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scruggs</surname>
            ,
            <given-names>T. E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Graetz</surname>
            ,
            <given-names>J. E.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Reading comprehension instruction for secondary students: Challenges for struggling students and teachers</article-title>
          .
          <source>Learning Disability Quarterly</source>
          ,
          <volume>26</volume>
          (
          <issue>4</issue>
          ),
          <fpage>103</fpage>
          -
          <lpage>116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Matthews</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Analysis of an Early Intervention Reading Program for First Grade Students</article-title>
          . Retrieved from http://scholarworks.waldenu.edu/cgi/viewcontent .cgi?article=1395&amp;context=dissertations
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Mislevy</surname>
            ,
            <given-names>R. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinberg</surname>
            ,
            <given-names>L. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Almond</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>On the structure of educational assessment (with discussion)</article-title>
          .
          <source>Measurement: Interdisciplinary Research and Perspective</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Neal</surname>
            ,
            <given-names>R. M.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>``MCMC using Hamiltonian dynamics'', in the Handbook of Markov Chain Monte Carlo</article-title>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brooks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Jones</surname>
          </string-name>
          , and X.-L. Meng (editors), Chapman &amp; Hall / CRC Press, pp.
          <fpage>113</fpage>
          -
          <lpage>162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Nese</surname>
            ,
            <given-names>T. F. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lai</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anderson</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jamgochian</surname>
            ,
            <given-names>M. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamata</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saez</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>J. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alonzo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tinda</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <source>Technical Adequacy of the easyCBM® Mathematics Measures: Grades 3-8</source>
          ,
          <fpage>2009</fpage>
          -2010
          <source>Version (Technical Report No: 1007)</source>
          . Eugene, OR: Behavioral Research and Teaching, University of Oregon.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Plummer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling</article-title>
          .
          <source>Proceeding of the 3rd International Workshop on Distributed Statistical Computing</source>
          , Viena, Austria.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>R</given-names>
            <surname>Development Core</surname>
          </string-name>
          <string-name>
            <surname>Team.</surname>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>R: A language and environment for statistical computing</article-title>
          . Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://www.R-project.org
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Rafferty</surname>
            ,
            <given-names>A. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brunskill</surname>
            ,
            <given-names>E.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Griffiths</surname>
            ,
            <given-names>T. L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Shafto</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Faster teaching by POMDP planning</article-title>
          .
          <source>Proceedings of the 15th International Conference on Artificial Intelligence in Education (AIED2011)</source>
          . Auckland, New Zealand.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Raftery</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          (
          <year>1995</year>
          ).
          <article-title>The number of iterations, convergence diagnostics and generic Metropolis algorithms</article-title>
          . In: Gilks,
          <string-name>
            <given-names>W. R.</given-names>
            ,
            <surname>Spiegelhalter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            ,
            <surname>Richardson</surname>
          </string-name>
          , S., eds.
          <source>Practical Markov Chain Monte Carlo. London: Chapman and Hall.</source>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Rock</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Growth in reading performance during the first four years in school</article-title>
          .
          <source>(Report No: RR-07-39)</source>
          . Princeton, NJ: Educational Testing Service.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Ross</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          (
          <year>1983</year>
          ).
          <article-title>Introduction to stochastic dynamic programming</article-title>
          . London:Academic Press.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Ross</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          (
          <year>2000</year>
          ). Introduction to Probability Models. London: Academic Press.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Tierney</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>1994</year>
          ).
          <article-title>Markov Chain for exploring posterior distributions (with discussion)</article-title>
          .
          <source>Ann. Statist</source>
          .
          <volume>22</volume>
          :
          <fpage>1701</fpage>
          -
          <lpage>1762</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Tokac</surname>
            ,
            <given-names>Umit.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Using partially observed Markov decision processes (POMDPs) to implement a response-to-intervention (RTI) framework for early reading</article-title>
          .
          <source>Doctoral Dissertation</source>
          . Florida State University.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Torgesen</surname>
            ,
            <given-names>J.K.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Avoiding the devastating downward spiral: The evidence that early intervention prevents reading failure</article-title>
          .
          <source>American Educator</source>
          ,
          <volume>28</volume>
          ,
          <fpage>6</fpage>
          -
          <lpage>19</lpage>
          .
          <article-title>Reprinted in the 56th Annual Commemorative Booklet of the International Dyslexia Association</article-title>
          , November,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>