<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Workload Impact on BTI HCI Induced Aging of Digital Circuits: A System level Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ajith Sivadasan</string-name>
          <email>ajith.sivadasan@st.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Florian Cacho</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sidi Ahmed Benhassain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vincent Huard</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>STMicroelectronics - 850 rue Jean Monnet</institution>
          ,
          <addr-line>38926 Crolles</addr-line>
          ,
          <country>France Contact:</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>TIMA</institution>
          ,
          <addr-line>46, avenue Félix Viallet, 38031 Grenoble</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>- Workload characterization of digital circuits using industry standard benchmarks gives an insight into the performance and energy characteristics of processor designs. Aging studies of digital circuits due to BTI, HCI is gaining importance since a higher impact on the performance of circuits can be observed as we scale down gate dimensions. For embedded system applications, the workload may very well dictate the lifetime of a system. This article aims to study the influence of different workloads on the degradation of critical path which determines the reliability of a system. A top-down circuit activity and probability analysis is carried out leading to an accurate estimation of aging due to HCI and BTI of critical path elements at the design stage. A dedicated simulation flow has been set up, from RTL simulation down to gate level cell timing analysis mapped onto 28nm FDSOI technology from STMicroelectronics. The objective is to correlate path delay timing with aging of critical path cells. Simulation results indicate that the higher complexity of an execution program may not necessarily lead to a higher rate of degradation of the critical path considering that aging is primarily driven by the workload dependent activity and the probability of critical path combinational logic elements.</p>
      </abstract>
      <kwd-group>
        <kwd>Workload</kwd>
        <kwd>Aging</kwd>
        <kwd>Critical Path</kwd>
        <kwd>Reliability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        Computers were designed to perform tasks faster.
Speed of operation of microprocessors, power consumption,
processor micro-architecture, memory hierarchy, system
architecture [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and task scheduling of application specific
software that drive the microprocessors without a
comprehensive knowledge of the end user specific
requirements has been till the now the major concern for
microprocessor and microcontroller designers [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. It is to
simulate these wide range of possible applications of an end
user that benchmarks and their associated performance metrics
that benchmarks have been developed [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Benchmarks are
designed to represent emerging workloads. EEMBC
(Embedded Microprocessor Benchmark Consortium) deal
with benchmarks for embedded systems [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Benchmarks
scores gives a relative measure of performance of different
processors. Using these different benchmarks for system or
hardware performance analysis gives information to the
designers as to whether design changes need to be made
depending on the workload that it will employ for a certain
application [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Though workload characterization till now has
Lorena Anghel2
just been used for micro-architecture analysis, for our current
aging analysis studies, the same workloads are considered to
influence the aging of the microcontroller circuit under study
in different respective fashions. As observed from our
simulations and references [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] considerable variation can
be observed between the simulation times of different
applications. Number of cycles of certain application to be
employed by the end user will then definitely influence the life
time of the circuit. The automotive sector is one of the markets
of interest for St Microelectronics. Matrix multiplication, FIR
filters benchmarks [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] are used to characterize Embedded
System Applications for the Automotive sector.
      </p>
      <p>
        Silicon on Insulator (SOI) technology involves the
fabrication of a sandwich structure, where a 25nm Buried
oxide layer is sandwiched between a thin undoped silicon
layer and the substrate. This undoped layer is an important
device matching characteristic [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The final SOI thickness of
7nm provides an excellent Short Channel effect (SCE) control
without any change in the leakage current for gate size up to
24 nm. FDSOI technology is aimed at high speed, low voltage
circuit applications providing a 32% and 84% speed increase
for 1V and 0.6V respectively with very little modification to
the prevalent fabrication flow at ST Microelectronics [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
Incidentally, manufacturing process gets simplified because of
elimination of well and field implantation steps. Memory
access times, can be significantly reduced due to high Iread
values for manageable leakage. HCI and BTI effects observed
are comparable to the libraries based on Bulk Technology.
      </p>
      <p>
        This paper makes a direct link between workload at
system level and the device level degradations due to HCI and
BTI. Considerable amount of research into how HCI and BTI
affect the aging of transistors has been done [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Both
HCI and BTI result in an increase in the threshold voltage [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
thus resulting in slower transistors. HCI degradation is
observed predominantly while switching the transistors at high
frequency while a transistor at a constant potential degrade
gradually due to BTI. Efforts have already been made to
consider HCI and BTI effects in the design process [16].
Research into how workload can accelerate the aging has been
explored by [2], [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This paper investigates an industrial
design flow to identify critical paths and critical path elements
of a design that are most susceptible to HCI and BTI
corresponding to a certain workload.
      </p>
      <p>II. DESIGN AND FLOW METHODOLOGY</p>
      <sec id="sec-1-1">
        <title>A. openMSP430 Architecture</title>
        <p>The design under use is an open-source synthesizable 16
bit microcontroller core from TI MSP430 family based on
Von Neumann architecture and written in Verilog HDL. The
modules Frontend Unit, Execution unit, Memory backbone in
Fig 1 were observed to show the maximum activity based on
RTL simulations and so they are the modules that are of
interest in relation to this paper. The openMSP430 was further
synthesized onto 28nmFDSOI technology.</p>
        <p>B. Workload dependent Aging analysis Flow</p>
        <p>Activity is defined as the average number of transitions of
a net per clock period for the entire simulation cycle. Activity
for this paper refers to transition density as mentioned in [20].
Probability in this research paper refers to the probability of
observing a logic 1 at a particular net per clock cycle. Thus
probability for this paper stands for signal probability of being
either a 1 or 0. Signal probability of 1 means the signal is
always at 1 and vice versa.</p>
        <p>Activity and probability information for all nets in the
design corresponding to 10 different Benchmark programs for
a full simulation run is obtained.</p>
        <sec id="sec-1-1-1">
          <title>Frontend Unit</title>
        </sec>
        <sec id="sec-1-1-2">
          <title>Execution Unit</title>
          <p>Register File
ALU</p>
        </sec>
        <sec id="sec-1-1-3">
          <title>Serial Debug Interface</title>
          <p>HW Break
Unit
UART or
I2C</p>
        </sec>
        <sec id="sec-1-1-4">
          <title>DMA Controller, Bootloader, Memory-BIST</title>
          <p>Fig. 1. openMSP430 Design Structure
e
n
o
b
k
c
B Watchdog
a
y
r
o
m
e
M</p>
          <p>Program Memory Interface
Data Memory Interface</p>
          <p>SFR</p>
          <p>RAM</p>
          <p>ROM
Peripheral bus</p>
        </sec>
        <sec id="sec-1-1-5">
          <title>Basic Clock</title>
          <p>Module
16x16
Multiplier
Peripherals</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>III. HIERARCHICAL APPROACH</title>
      <sec id="sec-2-1">
        <title>A. Path Level Analysis</title>
        <p>The activity value varies as we move through the elements
along a certain critical path. The activity value of a net depends
on the cell elements connected to it and the workload. Activity
of the endpoint nets of all chosen critical paths is obtained.
These activity values are plotted against their respective path
delay values. Aging of critical paths which form 10% of
maximum delay has also been referred to as PCP (Potential
Critical Paths) [1]. Activity of endpoints here is considered as
the activity of the critical paths which provides a means to
compare path aging due to HCI and BTI.</p>
        <p>The critical paths with highest activity have been
highlighted in the graph Fig. 3 (bottom). The cell from which
the net originates is expected to see maximum HCI related
0.4 0.6 0.8
Normalized Delay Path</p>
        <p>0.5
Normalized delay path
1
1
Sub-CP with
potential hazard
Sub-CP
without hazard</p>
        <p>Worst CP with ‘1’ probability
Sub-CP with potential hazard</p>
        <p>Worst CP with ‘0’ probability
Worst CP with high activity
400
350
300
tah220500
p
#150
100
50
0</p>
        <p>1
slack path
2
degradation. The nets without activity will have a logic level of
1 or 0 associated with it and this is an indicator of possible
worst case degradation due to BTI as in Fig. 3 (top)</p>
        <p>The cell level analysis of the critical path provides
complete data on how the individual elements in the critical
path ages. In Fig. 4 (bottom), the cells most affected by HCI is
identified. In Fig.4 (top), for a different workloads, probability
of the nets along the Critical Path being either 1 or 0 provides
information on BTI degradation.</p>
        <p>A complex workload can be referred to as the workloads
that lead to high activity nodes. More the number of such high
activity nodes, higher is the complexity of the workload.
Higher complexity of the workload does not necessarily lead to
a higher activity of all the critical paths but it has an impact on
certain potential critical paths [2].</p>
      </sec>
      <sec id="sec-2-2">
        <title>C. Timing Arc degradation</title>
        <p>
          The impact of activity and probability are now reviewed at
cell level. Design in Reliability models [17] [18], are models of
HCI and NBTI degradation of cells. Using Design in
Reliability models, it is possible to evaluate the degradation of
cell for a given stimuli and mission profile. Assuming 2 years
of operating conditions at Vmax, the degradation of a
NAND3A is depicted in Fig 5. At a given input slope and load,
delay of rising and falling arcs are simulated. It is noticeable
that degradation is a function of both the activity and
probability driven respectively by HCI and BTI mechanisms.
However, there is coupling between both these mechanisms, as
explained in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. These effects are not additive but they do
interact with each other. For this particular arc, an always ‘0’ at
input leads to drastic cell degradation because of PMOS
degradation. This gets exacerbated at high activity.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>CONCLUSION</title>
      <p>This paper discusses a flow that provides workload
dependent HCI and BTI related degradation information to the
designer at the very beginning of the design stage. Workload
dependent activity and probability information of the critical
path nets of a digital circuit is gathered. Higher the activity and
probability on a certain net, higher will be degradation due to
HCI and BTI respectively of cells from which the nets
originate. A designer can thus improve the reliability of a
hardware, by taking into account the HCI and BTI aging for a
certain application during the design stage.</p>
      <p>2
0
falling, high activity
falling, low activity
rising, low activity
rising, high activity
[17] Huard, V et al (2007), Design-in-Reliability Approach for NBTI and
hot- Carrier degradations in Advanced Nodes, IEEE Transactions on
Device and Materials Reliability
[18] Huard, V et al (2009), CMOS device design-in reliability approach in
advanced nodes, IEEE International Reliability Physics Symposium,
2009
[19] Najm, F.N. (1993), Transition density: a new measure of activity in
digital circuits, IEEE Transactions on Computer-Aided Design of
Integrated Circuits and Systems
[20] Kleeberger, V.B. (2014), Workload and instruction-aware timing
analysis - The missing link between technology and system-level
resilience, 51st ACM/EDAC/IEEE Design Automation Conference
(DAC)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          et al., (
          <year>2007</year>
          ).
          <article-title>An efficient method to identify critical gates under circuit aging</article-title>
          . IEEE/ACM International Conference on Computer-Aided Design,
          <source>Digest of Technical Papers, ICCAD</source>
          ,
          <fpage>735</fpage>
          -
          <lpage>740</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Mintarno</surname>
          </string-name>
          et al., (
          <year>2013</year>
          ).
          <article-title>Workload dependent NBTI and PBTI analysis for a sub-45nm commercial microprocessor</article-title>
          .
          <source>IEEE International Reliability Physics Symposium Proceedings</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Cacho</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          et al., (
          <year>2009</year>
          ).
          <article-title>Hci/Bti Coupled Model: the Path for Accurate and</article-title>
          Predictive Reliability Simulations
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Ebrahimi</surname>
          </string-name>
          et al. (
          <year>2013</year>
          ).
          <article-title>Aging-aware logic synthesis</article-title>
          . IEEE/ACM International Conference on Computer-Aided Design,
          <source>Digest of Technical Papers, ICCAD</source>
          ,
          <fpage>61</fpage>
          -
          <lpage>68</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>O. Girard.</surname>
          </string-name>
          (
          <year>2010</year>
          ). [Online]
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>John L. K</surname>
          </string-name>
          . et al. (
          <year>1998</year>
          ).
          <source>Workload Characterization: Methodology and Case Studies</source>
          ,
          <year>1999</year>
          ,
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          ,
          <fpage>10</fpage>
          .1109/WWC.
          <year>1998</year>
          .809354
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Maxiaguine A.</surname>
          </string-name>
          et al (
          <year>2004</year>
          ).
          <article-title>Design, Automation and Test in Europe Conference and Exhibition(DATE),</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Li</surname>
            <given-names>Y.</given-names>
          </string-name>
          et al (
          <year>1997</year>
          ).
          <article-title>Computer-Aided Design of Integrated Circuits and Systems</article-title>
          , IEEE Transactions on, Vol:
          <volume>16</volume>
          , Issue:
          <fpage>12</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Hoste</surname>
          </string-name>
          et al. (
          <year>2007</year>
          ). Characterization,
          <string-name>
            <surname>Micro IEEE</surname>
          </string-name>
          Microarchitecture-Independent Workload
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Weicker</surname>
            <given-names>P.</given-names>
          </string-name>
          et al (
          <year>1990</year>
          ).
          <article-title>An Overview of Common Benchmarks</article-title>
          , Computer
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Poovey</surname>
            ,
            <given-names>J.A</given-names>
          </string-name>
          et al, (
          <year>2009</year>
          ).
          <article-title>A Benchmark Characterization of the EEMBC Benchmark Suite</article-title>
          , Micro IEEE
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Planes</surname>
            <given-names>N</given-names>
          </string-name>
          et al (
          <year>2012</year>
          ).
          <article-title>28nm FDSOI technology platform for high-speed low-voltage digital applications</article-title>
          ,
          <source>Symposium on VLSI Technology (VLSIT)</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Federspiel</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          et al(
          <year>2011</year>
          ),
          <article-title>Experimental characterization of the interactions between HCI, off-state and BTI degradation modes</article-title>
          ,
          <source>IEEE International Integrated Reliability Workshop Final Report (IRW)</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Huard</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          et al, (
          <year>2008</year>
          ),
          <article-title>NBTI degradation: From transistor to SRAM arrays</article-title>
          ,
          <source>IEEE International Reliability Physics Symposium</source>
          , (IRPS).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Huard</surname>
            <given-names>V.</given-names>
          </string-name>
          et al, (
          <year>2013</year>
          )
          <article-title>Advances in industrial practices for optimal performance/reliability/power trade-off in commercial high-performance microprocessors for wireless applications</article-title>
          ,
          <source>IEEE International Reliability Physics Symposium</source>
          , (IRPS)
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>