<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Optimal Implementation of Power Saving Techniques in CGR Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tiziana Fanni</string-name>
          <email>tiziana.fanni@diee.unica.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universita degli Studi di Cagliari</institution>
          ,
          <addr-line>DIEE</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Coarse-Grained Recon gurable (CGR) architectures combine high performance with exibility, allowing the execution of a large set of applications over the same substrate. However, they are also required to be energy e cient. This work focuses on a methodology to identify which parts of a CGR architecture may bene t from the application of power saving techniques, guiding the designers towards an optimal implementation of clock gated and power gated designs.</p>
      </abstract>
      <kwd-group>
        <kwd>Recon gurable Systems Power Saving Clock Gating Power Modeling Datapath Merging</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Nowadays small portable devices are required to e ciently execute multiple
fancy functions. Recon gurable systems [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] are a suitable solution for these
colliding requirements. In particular, Coarse-Grained Recon gurable (CGR) systems
o er higher performance with a certain degree of exibility. In CGR systems all
the resources belonging to the possible con gurations are instantiated in the
substrate, and they are multiplexed in time. Thus CGR o er fast recon guration,
paying the cost of the power consumed by the resources present in the substrate
but not involved in active con guration.
      </p>
      <p>Power consumption in digital devices is computed as in Equation 1, where:
1) Plkg is the static power dissipation caused by leakage currents, consumed even
while no circuit activity is present; 2) Pint is the dynamic power consumption
mainly due to the cell switching activity; (3) Pnet is again a dynamic term,
related to the interconnection. With technologies below 90 nm, designers are
required to minimize both static (Plkg) and dynamic (Pint + Pnet) terms.</p>
      <p>P =Plkg + Pint + Pnet
(1)</p>
      <p>A popular technique to minimize dynamic power consumption is the clock
gating (CG). It consists on switching o the clock of resources not involved in
the computation, and it is applied automatically, at gate level, by commercial
synthesizers. On the other hand, power gating (PG) shuts-o the power of
unused logic, thus acting also on static consumption. PG requires the instantiation
of several resources (i.e. sleep transistors to switch on/o the power supply,
isolation cells to avoid the transmission of spurious signals and state retention logic
to maintain the internal state of the gated region) and the rules to manage them
are quite complicated, thus commercial tools do not provide it automatically.</p>
      <p>In a CGR architecture, considering the minimum set of disjointed Logic
Regions (LRs), composed of processing elements that are always active/inactive
together, it is possible to apply to all of them CG or PG techniques. However,
if on one hand CG requires only one AND gate for each region to switch o the
clock tree, PG has a much higher cost due to the additional logic, and the power
overhead may easily overcome the power saved by switching o the unused
resources. The research presented in this paper studied an automatic system-level
analysis and implementation ow to estimate in advance the cost of CG and PG
application in a CGR system, leading to the identi cation of those LRs that can
bene t from the application of these power saving techniques.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Methods and Algorithm</title>
      <p>The proposed model (see Equations 2 and 3) estimate Plkg and Pint of
Equation 1, when PG and CG are applied at LRs level. They are composed of
two terms: 1) Plkg=intON (LRi) - consumption within the considered LRs; 2)
Ext Overlkg=int(LRi) - consumption due to the logic inserted outside the LR.
=</p>
      <p>Plkg=int(LRi) = Plkg=intON (LRi) + Ext Overlkg=int(LRi) =</p>
      <p>X</p>
      <p>[Plkg=int(cmb) + Plkg=int(RC) #rtn+
actors2LRi
+Plkg=int(reg) (#reg</p>
      <p>#rtn)=#reg] T iON +
+[Plkg=int(ISOON ) T iON + Plkg=int(ISOOF F ) T iOF F ] #iso+
+[Plkg=int(ContrON ) T iON + Plkg=int(ContrOF F ) T iOF F ]+
+[Plkg=int(CGON ) T iON + Plkg=int(CGOF F ) T iOF F ]</p>
      <p>actors2LRi
=</p>
      <p>Plkg=int(LRi) = Plkg=intON (LRi) + Ext Overlkg=int(LRi) =</p>
      <p>X</p>
      <p>[Plkg=int(cmb) + Plkg=int(reg) T iON ]+
+[Plkg=int(CGON ) T iON + Plkg=int(CGOF F ) T iOF F ]
(2)
(3)
In particular, the Equations present the following contributions:
{ Combinatorial Logic [Plkg=int(cmb) T iON and Plkg=int(cmb)]: sum of the
contributions of the combinational cells, weighted for the activation time
T iON . CG switches o only the clock tree; therefore, combinational logic is
always active when CG is applied.
{ Sequential Logic [Plkg=int(reg) (#reg #rtn)=#reg T iON and Plkg=int(reg)
T iON ]: this term consider only those registers (#reg) inside the LR that are
not replaced by the retention cells. CG does not have e ect on the static
power, thus when it is applied only the internal contribution is multiplied by
T iON (Plkg(reg) and Pint(reg) T iON ).
{ Retention Cells [Plkg=int(RC) #rtn T iON )]: This term consider the
retention cells inserted to preserve the status of some registers (#rtn), when
PG is applied. Plkg=int(RC) is the consumption of a single retention cell.
{ Isolation Cells [[Plkg=int(ISOON ) T iON + Plkg=int(ISOOF F ) T iOF F ]
#iso]: In the PG case, isolation cells are inserted and their dissipation is
proportional to their overall number.
{ Clock Gating Cells Plkg=int(CGON ) T iON + Plkg=int(CGOF F ) T iOF F ]:
Clock gating cells are used in both PG and CG cases. In the PG case they
are required for the proper operation of the retention cells.
{ Power Controller [Plkg=int(ContrON ) T iON +Plkg=int(ContrOF F ) T iOF F ]:
inserted to properly drive the enable signals to the power saving logic.</p>
      <p>
        This estimation model has been embedded in the automatic design ow for
CGR systems o ered by the Multi-Data ow Composer (MDC) tool [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. MDC
handles the automatic composition and deployment of CGR systems, starting
from the high-level speci cation of the kernels to be executed, represented as
networks [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. MDC o ers also other functionalities: 1) a structural pro ler that
performs the design space exploration of the implementable multi-functional
systems to determine the optimal CGR substrate [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]; 2) a power manager that
partitions the multi-functional network, identifying the minimum set of disjointed
LRs and applies to all of them either CG [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] or PG [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] (generating an ad-hoc
common power format (CPF) le to specify the power intent early in the design
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]); a rapid prototyper to embed the CGR system into a Xilinx compliant IP [
        <xref ref-type="bibr" rid="ref8 ref9">8,
9</xref>
        ]. The MDC power manager has been extended to analyze the design,
exploiting the estimation ow, to identify which regions may mostly bene t of PG and
CG application [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ].
      </p>
      <p>Datapath Merging
s
s
e
c
o
r
P
g
n
i
g
r
e</p>
      <p>M</p>
      <sec id="sec-2-1">
        <title>NETs CGR NET</title>
        <p>Baseline HDL Generation
sn in
iiceoggR ittcaeon
Lo Id
s
i
s
y
l
a
n
A
r
e
w
o
P</p>
        <p>HDL
eG Scripts
n
o
i
LDH traen
L n
D io
H t
/PGG raeen
C G</p>
      </sec>
      <sec id="sec-2-2">
        <title>Synthesizer</title>
      </sec>
      <sec id="sec-2-3">
        <title>Reports</title>
        <p>LR identi cation</p>
        <p>Power Saving Application</p>
        <p>Fig. 1. Proposed analysis and implementation ow.</p>
        <p>Fig. 1 shows the complete analysis and implementation ow. MDC derives the
HDL code of the baseline CGR system and provides the scripts to perform the
synthesis and all the simulations for the system back annotation. Commercial
tools are used to determine the baseline system power reports, which are fed
back to MDC. MDC identi es the LRs and estimates the number of isolation
cells (#iso) at data ow level by analyzing the connections between the di erent
LRs. These data, together with the information gathered by the reports of a
single synthesis run of the baseline CGR system netlist, are used to estimate
power consumption of the identi ed LRs (consumption of the additional power
saving logic - isolation cells and retention cells is estimated by characterizing
the adopted technology libraries).</p>
        <p>The power estimation ow analyzes the LRs according to Algorithm 1:
{ area evaluation: PG overhead may a ect the power consumption of the smaller
LRs. Thus, too small LRs are evaluated only for CG application. The
threshold value is set by the user.
{ PG evaluation: the power variations due to PG and CG application are
estimated. If PG results more convenient than CG, the LR is candidated for the
implementation of PG application. Otherwise, CG is evaluated.
{ CG evaluation: the power variation due to CG application is estimated. If it
is not able to save any power, LR is discarded and its logic will be included
in the always on domain.</p>
        <p>Algorithm 1: Power saving strategy selection for CGR systems.</p>
        <sec id="sec-2-3-1">
          <title>PG set is empty;</title>
          <p>CG set is empty;
foreach LRi in set LRs do</p>
          <p>evaluate area(LRi, areath)
end
function: evaluate area(LRi, areath):
calculate LRi area;
if areaLR &gt; areath then</p>
          <p>evaluate PG(LRi);
else
end</p>
          <p>evaluate CG(LRi);
PG evaluation: evaluate PG(LRi):
estimate PG total overhead;
else
end
if P G total overhead &lt; 0 then
estimate CG total overhead;
if P G total overhead &lt;
CG total overhead then
add LRi to PG set;
add LRi to CG set;
else</p>
          <p>evaluate CG(LRi);
end
evaluate CG(LRi);</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>CG evaluation: evaluate CG(LRi):</title>
          <p>
            estimate CG total overhead;
if CG total overhead &lt; 0 then
add LRi to CG set;
To assess the proposed methodology, two di erent applications are considered,
an FFT application and a computing core accelerating a zoom application. The
Fast Fourier Transform (FFT) [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] is an optimised algorithm for the Discrete
Fourier Transform (DFT) calculation; a DFT of size 2 (radix-2) takes the name
of butter y. The adopted use case involves a CGR radix-2 FFT of size 8, where 4
di erent con gurations are available, FFT that uses: 1) 12 butter ies; 2) 4
butter ies; 3) 2 butter ies; 4) 1 butter y. In this design MDC identi ed 8 LRs. The
Zoom coprocessor is composed of seven computational kernels: 1) absolute value
calculation; 2) Bilevel/grayscale block checking; 3) Linear combination
calculation; 4) Cubic lter convolution; 5) Median calculation; 6) Maximum/minimum
nding; 7) Edge block checking. These kernels have been modelled as networks
and implemented over a CGR hardware accelerator. In this CGR co-processor
MDC identi ed 13 LRs. To validate cross-application and cross-technology
effectiveness of the proposed estimation model, this section presents assessment
on three implementations: FFT targeting a 90 nm CMOS technology; Zoom
targeting a 90 nm CMOS technology; Zoom targeting a 45 nm CMOS technology.
          </p>
          <p>These designs have been synthesized using Cadence RTL Compiler, then
generated post-synthesis simulation reports have been fed back to MDC tool to
estimate the power consumption of gated LRs, by applying Equations 2 and 3.
In order to calculate the accuracy of the proposed estimation models, one power
gated and one clock gated design for each identi ed LR have been implemented.
Tab. 1 reports the real (extracted from the post-synthesis reports) and estimated
percentage power variation for each LR, respectively when CG and PG strategies
are applied. Comparing real power variations with estimated ones we see that
the worst estimation is related to LR4 of FFT, whose percentage power variation
is so low (-0.67%) that it is impossible to estimate it accurately. For this reason,
the algorithm that evaluates the LRs keeps in consideration also their area.</p>
          <p>Tab. 1 also depicts the percentage area of each LR with respect to design area.
As expected, biggest regions (LR1 in FFT and LR2 in Zoom) are the ones with
the highest power saving, regardless of the considered strategy (PG or CG). Also
the composition of the considered LR has an impact on the e ectiveness of the
applied power saving technique. LR1 of FFT is 99% combinational, thus CG can
save really little amount of power in this region. However, simply considering the
LR size may not be su cient to identify the best candidate for power saving;
indeed it is possible to notice that LR3 of FFT is 15.8% of total area, but
switching it o only saves about 6% or total power.</p>
          <p>The presented methodology speeds-up the evaluation of power saving
application estimating in advance the e ectiveness of PG or CG if applied to the LRs
of a CGR system. For a CGR system composed of N input networks, only one of
the baseline design without any power management, and N (one for each kernel)
simulations to retrieve the real switching activity of the system, are required.
Otherwise, it would be necessary to implement a design for each identi ed LR
for both CG and PG applications, plus the baseline one (to evaluate the cost
and bene t of the chosen power saving technique).</p>
          <p>As a follow up of this research it is necessary to improve the model,
considering in the power estimation also the contribution of the interconnection which
currently is not addressed. Furthermore, power switches overhead is not
considered yet; these sleep transistors are inserted only during place and route ow,
and a way to estimate in advance how many switches are going to be inserted
for each LR has not been explored yet. The number of switches has e ect on the
rush currents during power-up transitions, and on the power-down/up timing,
thus also these issues are going to be addressed in a future work.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related Works</title>
      <p>
        To the best of our knowledge, literature does not treat the problem of modeling
PG and CG costs in CGR designs. Sha que at al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] focuses on low-power
techniques and power modeling for FPGAs. In [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], only CG is taken into
account: di erent power states are de ned and their consumption is characterized
by low-level power analysis results. The work in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] focuses on estimating the
leakage reduction for PG and reverse body bias.
      </p>
      <p>
        Other approaches perform an estimation that considers di erent components.
For instance, Stokke at al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] propose a power modeling method for the Tegra
K1 CPU, that taking into account measured rail voltages and ne-grained
hardware activity predictors, expose components such as rail and core leakage
currents. The FALPEM framework [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] provides power estimations at pre-register
transfer level (RTL) stage, speci cally targeting the power consumed by clock
network and interconnection, but PG and CG costs are not de ned. Finally, Li et
al. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] propose an architecture-level integrated power, area, and timing
modeling framework for multi-core systems, that evaluates system building blocks (i.e.,
CPU, buses, etc.) for di erent technology nodes, providing also PG support.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>R.</given-names>
            <surname>Hartenstein</surname>
          </string-name>
          , \
          <article-title>A Decade of Recon gurable Computing: a Visionary Retrospective,"</article-title>
          <source>in Proc. of Design, Automation and Test in Europe (DATE'01)</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>F.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          et al., \
          <article-title>Power-Awarness in Coarse-Grained Recon gurable MultiFunctional Architectures: a Data ow Based Strategy,"</article-title>
          <source>Journal of Signal Processing Systems</source>
          , vol.
          <volume>87</volume>
          , pp.
          <volume>81</volume>
          {
          <issue>106</issue>
          ,
          <string-name>
            <surname>Apr</surname>
          </string-name>
          <year>2017</year>
          . doi:
          <volume>10</volume>
          .1007/s11265-016-1106-9.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>C.</given-names>
            <surname>Sau</surname>
          </string-name>
          et al., \
          <article-title>Automated design ow for multi-functional data ow-based platforms,"</article-title>
          <source>Journal of Signal Processing Systems</source>
          , vol.
          <volume>85</volume>
          , pp.
          <volume>143</volume>
          {
          <issue>165</issue>
          ,
          <string-name>
            <surname>Oct</surname>
          </string-name>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>F.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          et al., \
          <article-title>Power-awarness in coarse-grained recon gurable designs: A data ow based strategy,"</article-title>
          <source>in Workshop on Signal Processing Systems</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>F.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          et al., \
          <article-title>Coarse-grained recon guration: data ow-based power management," IET Computers Digital Techniques</article-title>
          , vol.
          <volume>9</volume>
          , no.
          <issue>1</issue>
          , pp.
          <volume>36</volume>
          {
          <issue>48</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>T.</given-names>
            <surname>Fanni</surname>
          </string-name>
          et al.,
          <source>\Automated Power Gating Methodology for Data ow-based Recon gurable Systems," in Conference on Computing Frontiers</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <fpage>SI2</fpage>
          .,
          <source>Si2 Common Power Format Speci cationTM - Version 2</source>
          .1,
          <string-name>
            <surname>Dec</surname>
          </string-name>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>C.</given-names>
            <surname>Sau</surname>
          </string-name>
          et al.,
          <article-title>\Automatic generation of data ow-based recon gurable co-processing units," in Conf. on Design and Architectures for Signal and</article-title>
          Image Processing,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>C.</given-names>
            <surname>Sau</surname>
          </string-name>
          et al., \
          <article-title>Recon gurable coprocessors synthesis in the MPEG-RVC domain,"</article-title>
          <source>in Conference on ReConFigurable Computing and FPGAs</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>T.</surname>
          </string-name>
          Fanni et al.,
          <source>\Power and Clock Gating Modelling in Coarse Grained Recon gurable Systems," in Conference on Computing Frontiers</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>F.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          et al.,
          <article-title>\Modelling and automated implementation of optimal power saving strategies in coarse-grained recon gurable architectures,"</article-title>
          <source>Journal of Electrical and Computer Engineering</source>
          , p.
          <fpage>27</fpage>
          ,
          <year>2016</year>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2016</year>
          /4237350.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>J.</given-names>
            <surname>Cooley</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Tukey</surname>
          </string-name>
          , \
          <article-title>An Algorithm for the Machine Computation of Complex Fourier Series</article-title>
          , vol.
          <volume>19</volume>
          ,
          <article-title>"</article-title>
          <source>Mathematics of Computation</source>
          ,
          <year>1965</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>M. Sha</surname>
          </string-name>
          que et al.,
          <article-title>\Adaptive Energy Management for Dynamically Recon gurable Processors,"</article-title>
          <source>IEEE Trans. on CAD of Integrated Circuits and Systems</source>
          , vol.
          <volume>33</volume>
          , no.
          <issue>1</issue>
          , pp.
          <volume>50</volume>
          {
          <issue>63</issue>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>J.</given-names>
            <surname>Yi</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Kim</surname>
          </string-name>
          , \
          <article-title>Power modeling for digital circuits with clock gating," IEICE Electronics Express</article-title>
          , vol.
          <volume>12</volume>
          , no.
          <issue>24</issue>
          , pp.
          <volume>20150817</volume>
          {
          <issue>20150817</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. H.
          <string-name>
            <surname>Xu</surname>
          </string-name>
          et al., \
          <article-title>Run-time Active Leakage Reduction by Power Gating and Reverse Body Biasing: An Energy View,"</article-title>
          <source>in Conference on Computer Design</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>K. R. Stokke</surname>
          </string-name>
          et al., \
          <article-title>High-Precision Power Modelling of the Tegra K1 Variable SMP Processor Architecture," in Symposium on Embedded Multicore/Many-core Systems-on-</article-title>
          <string-name>
            <surname>Chip</surname>
          </string-name>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. A.
          <string-name>
            <surname>Chhabra</surname>
          </string-name>
          et al.,
          <article-title>\FALPEM: framework for architectural-level power estimation and optimization for large memory sub-systems,"</article-title>
          <source>IEEE Trans. on CAD of Integrated Circuits and Systems</source>
          , vol.
          <volume>34</volume>
          , no.
          <issue>7</issue>
          , pp.
          <volume>1138</volume>
          {
          <issue>1142</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          et al., \
          <article-title>The McPAT Framework for Multicore and Manycore Architectures: Simultaneously Modeling Power, Area, and Timing,"</article-title>
          <source>TACO</source>
          , vol.
          <volume>10</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>5</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>