Technical Publications and Papers
You will find here a very exhaustive list of white papers on the topic of Soft Errors.
Abstracts are included. If you would like a full paper, please click on its title.
Recent articles 2010-2011
- DFT 2011
: TIMA and iRoC Technologies :
"A Practical Approach to Single Event Transients
Analysis For Highly Complex Designs"
Abstract: Single Event Transients are considerably
more difficult to model, simulate and analyze than the
closely-related Single Event Upsets. The work
environment may cause a myriad of distinctive
transient pulses in various cell types that are used
in widely different configurations. We present
practical methods to help characterizing the standard
cell library using dedicated tools and results from
radiation testing. Furthermore, we analyze the SET
propagation in logic networks using a standard
(reference) serial fault simulation approach and an
accelerated fault simulation technique, taking in
account both logic and temporal considerations. The
accelerated method provides similar results as the
reference approach while offering a considerable
increase in the simulation speed. However, the
simulation approach may not be feasible for large
(multi-million cells) designs that could benefit from
static analysis methods. We benchmark the results of a
static, probabilistic approach against the reference
and accelerated methods. Finally, we discuss the
integration of the SET analysis in a complete Soft
Error Rate analysis flow.
- RADECS 2011
: EADS and iRoC Technologies :
"Assessment and comparison of the low energy
proton sensitivity in 65nm to 28nm SRAM devices"
Abstract: The low energy proton sensitivity is
investigated on 65 to 28nm SRAM devices. The
experiments are based on a novel cost and
time-efficient methodology, which allows irradiations
in high energy facilities.
- IOLTS
2011 : Politechnico di Torino and
iRoC Technologies :
"A new IP core for fast error detection and
fault tolerance in COTS-based solid state
mass memories"
Abstract: Commercial-off-the-shelf (COTS)
components are crucial for the success of future
space missions as, although not being
specifically designed for space, they are the
only components able to meet the performance
requirement that new missions impose to
designers. COTS memories are particularly
appealing as large memory arrays can be
implemented, which can be made immune to
radiations by means of cost-effective
information redundancy schemes. In this paper,
we present an intellectual property (IP) core
implementing a (12, 8) Reed- Solomon (RS) code
for error mitigation that is suitable for COTS
Flash NAND and NOR memories. The main novelty of
the proposed scheme consists in its architecture
which is based on a shortened Reed-Solomon code
with a fast error detection feature. Two
implementations have been studied: a fully
combinational scheme that provides error
detection and correction in the access cycle and
a two-stage pipeline with early (in-cycle) error
detection a 1-cycle latency correction. The
pipelined version presents the specific
advantages of minimizing the time penalty
associated to a traditional RS implementation.
We have characterized the area and timing
performance of the proposed architectures in a
variety of FPGA implementations, obtaining a
maximum frequency of 47 MHz for the
combinational implementation and 53 MHz for the
pipelined one (in a Virtex 6 FPGA), with a quick
5 ns error detection. In addition, we have
characterized the fault resiliency of the
proposed schemes with respect to Single Event
Transients and Single Event Faults.
- IOLTS 2011
: iRoC Technologies :
"A
Comprehensive Soft Error Analysis Methodology for
SoCs/ASICs Memory Instances"
Abstract: Memory blocks are important features of any
design, in terms of functionality, silicon area and
reliability. Embedded SRAM instances are critical
contributors to the overall Soft Error Rate of the
system, requiring a careful consideration of the
reliability aspects and adequate sizing of the error
mitigation capabilities. While error detecting and
correcting codes are widely available and particularly
effective against most types of Single Event Effects,
Multiple Bit Upsets and progressive errors
accumulation may defeat the error correction
capabilities of standard SECDED codes. Accordingly,
the paper presents an overall approach to the
structural and functional SER analysis of the memory
instances in addition to error mitigation efficiency
estimation. Moreover, intrinsic, nominal, SER figures
are not a realistic indicator of the memory behavior
for a given application. We propose instead, an
opportunity window metric, associated to the notion of
data lifetime in the memory, as extracted from
functional simulations. Lastly, based on the
opportunity window figures, targeted and efficient
fault simulation campaigns can be prepared to estimate
high-level functional failures induced by Single
Events. The overall memory SER evaluation aims at
assisting the designers to improve the performances of
the design and to document the reliability figures of
the system.
- IOLTS 2011
: Kobe University and Kyushu University, Japan:
"Multiple-Bit-Upset and Single-Bit-Upset Resilient
8T SRAM Bitcell Layout with Divided Wordline
Structure"
Abstract: This paper presents a new 8T (8-transistor)
SRAM cell layout mitigating multiple-bit upset (MBU)
in a divided wordline structure. Because bitlines
along unselected columns are not activated, the
divided wordline structure eliminates a half-select
problem and achieves low-power operation, which is
often preferred for low-power / low-voltage
applications. However, the conventional 8T SRAM with
the divided wordline structure engenders MBUs because
all bits in the same word are physically adjoining.
Consequently, error correction coding (ECC) techniques
are difficult to apply. This paper presents a new 8T
cell layout pattern that separates internal latches in
SRAM cells using both an n-well and a p-substrate. We
investigated an SEU cross section of nMOS that is
3.5–4.5 times higher than that of pMOS. Using an iRoC
TFIT simulator, we confirmed that the proposed 8T cell
has better neutron-induced MBU tolerance. The MBU in
the proposed 8T SRAM is improved by 90.70% and the MBU
soft error rate (SER) is decreased to 3.46 FIT at 0.9
V when ECC is implemented. Additionally, we conducted
Synopsys 3-D TCAD simulation, which indicates that the
LET threshold (LETth) in single-event upset (SEU) is
also improved by 66.47% in the proposed 8T SRAM by a
common-mode effect.
- Selse 2011
: Samsung and iRoC Technologies :
"Thermal Neutron SER Testing and Analysis:
Findings from a 32nm HKMG SRAM Case Study"
Abstract: We developed a set of testing and analysis
techniques to quantify the impact of terrestrial
thermal neutrons on SRAM SER in a 32nm high-K
metal-gate process. We found that the SBU SER caused
by terrestrial thermal neutrons is less than 10% of
the terrestrial cosmic-ray high-energy neutron SER on
the same device and that the MBU/SEFI/SEL impact from
thermal neutrons is negligible. In this paper, we also
share some lessons learned during the various test
campaigns, such as dosimetry calibration, equivalent
flux conversion, and packaging-material shielding
effect, which are important factors to account for
when some ambiguity may still exist in current
industry standard on thermal neutron SER testing.
- NSREC 2010
: EADS and iRoC Technologies :
"Methodology for Testing Low Energy Proton Effect
on Electronic Devices"
Abstract: A methodology is proposed to
experimentally assess the sensitivity of a component
with regard to the direct ionization of protons and
allows irradiation tests in high energy proton
facilities. The efficiency of the methodology is
evaluated during the radiation testing of a 65 nm bulk
SRAM device.
- White Paper from
iRoC Technologies :
"Thermal
neutron vs low energy neutrons: their fundamental
differences concerning SER"
TFIT related papers
SoCFIT related papers
Other conferences papers
TFIT related papers
MEASUREMENT OF NEUTRON-INDUCED SINGLE EVENT TRANSIENT PULSE WIDTH PULSE NARROWER THAN 100ps
Abstract: A novel SET pulse measurement circuit is proposed which can detect pulses narrower than 100ps. Alternation of SET pulses during the propagation through the chain of target cells is minimized, which is attributed to small chain length (typically 20). This circuit configuration contributes to obtaining pulse distribution similar to that observed in actual circuit in use. Distribution of SET pulse width measured by our circuit through the white neutron beam testing agrees well with that estimated by computer simulation.
STUDY ON INFLUENCE OF DEVICE STRUCTURE DIMENSIONS AND PROFILES ON CHARGE COLLECTION CURRENT CAUSING SER PULSE LEADING TO SOFT ERROR IN LOGIC CIRCUITS
Abstract: Current responses due to the strike of ionized particle onto nMOS transistor of 90nm and 55nm generation have been analyzed through 3D device simulations. From the current response, duration of charge collection (tcc) is determined, which correlated strongly with the width of erroneous pulse (SET pulse). Causes of the difference between tcc values of 90nm and 55nm generation MOSFETs have been investigated and it is found that the difference in STI depth and width of p-well contact line between these two generations influences tcc mainly. This is because that the resistance below the p-well contact affects the ability to pull out the excess holes remaining in the channel region. It is also shown that there is room for reducing tcc and hence SET pulse width by well profile engineering.
SoCFIT related papers
A MULTI-PARTNER SOFT ERROR RATE ANALYSIS OF AN INFINIBAND HOST CHANNEL ADAPTER
Abstract: We present the results of a Soft Error Rate (SER) analysis flow performed on a high-performance, commercial InfiniBand Host Channel Adapter. The primary goals of this evaluation consist in characterizing the device reliability from a SER perspective exhaustively, taking into account all the factors affecting the SER sensitivity in a systematic approach. This method consequently helps the designers implementing the optimal error mitigation methodology. The work presented in this paper follow a practical, systematical method that takes into account the important stages of the design flow from the SER figures of the standard cell library to SER-related field problems. Moreover, this project represented the opportunity to reunite in a common framework the technology provider, the chip designers and the SER-solutions provider. In addition, field data show no contradiction to the predicted results and an evolution of the SER analysis for the next generation of the device has been scheduled.
Other conferences papers
Abstract: Soft Error phenomena induced by the Sea-level cosmic neutron have been investigated by using a simulation system that covers from an individual MOSFET device level to an LSI-chip level.This system consists of several kinds of simulations codes tools, such as mixed-mode 3D device simulator, SPICE circuit simulator and analysis tools of gate-levels net-lists. A comprehensive practical simulation flow is demonstrated in this paper on commercial 90 nm generation logic devices and standard cells.
A VARIATION OF SRAM ALPHA-INDUCED SOFT ERROR RATE WITH TECHOLOGY NODE
Abstract: this document presents a compilation of results from tests performed by iRoC Technologies on SER induced by alpha particles on SRAM memories for technology nodes from 180nm to 65 nm. The aim of this study is to establish the variation of sensitivity with technology node for SEU and MCU and to analyze the possible influence of different designs and technological parameters at a given technology node.
A SYSTEMATICAL METHOD OF QUANTIFYING SEU FIT
Abstract:
We present a
practical,
systematical
method for the
evaluation of
the Soft Error
Rate (SER) of
microelectronic
devices.
Existing
methodologies,
practices and
tools are
integrated in
a common
approach while
highlighting
the need for
specific data
or tools. The
showcased
method is
particularly
adapted for
evaluating the
SER of very
complex
microelectronic
devices by
engineers
confronted to
increasingly
demanding
reliability
requirements.
A LOW-COST SINGLE-EVENT LATCHUP MITIGATION SCHEME
Abstract: Single-event latchup is one of the most threatening single event effects as the induced current may destroy the affected device. Existing latchup mitigation schemes may induce a very high area cost or may require modifying the fabrication process. In this paper we present a new single-event latchup mitigation approach implemented at design level that protects devices from destruction and preserve circuit state at very low area cost.
Characterizing Laser-Induced Pulses in ICs: Methodology and Results
Abstract: Like other silicon integrated circuit (IC) domains, the smart card market is very competitive and main actors are constantly trying to design the cheapest and safest circuits to ensure their consumers' satisfaction. These specificities lead smart cards actors to design standard cell-based tamper resistant ICs and to characterize their circuits' sensitivity.
In this paper, we present some experimental results aimed at characterizing the sensitivity to laser-induced transient pulses (and their duration) of elementary cells from the 0,13 μm standard cell library STM HCMOS9GP. They were obtained for a specially designed and fabricated experimental circuit. The data obtained here allow us for better understanding of laser attacks and would be used to design integrated circuits with better protection against such attacks. The final goal of this work is to build and augment new simulation models dedicated to laser-induced faults aimed specifically at smart card industry.
IRPS 06 (accepted)
Abstract: Measurement of soft error rates (SER) of ten commercial SRAMs of 0.35 μm to 90 nm technologies have been completed at the ILL thermal neutron facility. Results establish the sensitivity of old and recent SRAMs showing the impact of 10B concentrations in BPSG and p-type regions. 10B results are also compared to high-energy neutron SER. [Keywords: soft error, single-event upset (SEU), thermal neutrons, cross-section, boron].
Cancer-Radiotherapy
Equipment
as a Cause of
Soft Errors in
Electronic
Equipment
(IEEE,
Wilkinson and
all, Sep 2005)
Abstract:The
undesirable
production of
secondary
neutrons by
cancer-radiotherapy
linear
accelerators
(linac) has
been
demonstrated
to cause soft
errors in
nearby
electronics
through the 10B(n,
a)7Li
reaction. 10B
is a component
in the BPSG
used as a
dielectric
material in
some
integrated-circuit
(IC)
fabrication
processes.
Modeling of Transients Caused by a Laser Attack on Smart Cards
Abstract:
Several
techniques for
extracting
data from
smart cards
have been
described in
the
literature,
including the
so called
differential
fault analysis
(DFA) that
relies on
perturbing
the chip
operations to
deduce the
data. In this
paper, we
present some
experimental
results of the
DFA that
relies on
using a laser
beam.
Robust System Design with
Built-In
Soft-Error
Resilience
(IEEE,
Intel-iRoC,
Feb 2005)
Abstract:
Transient
errors caused
by terrestrial
radiation pose
a major
barrier to
robust system
design. A
system’s
susceptibility
to such errors
increases in
advanced
technologies,
making the
incorporation
of effective
protection
mechanisms
into chip
designs
essential. A
new design
paradigm
reuses
design-for-testability
and debug
resources to
eliminate such
errors
As nanometer processes are adopted, reliability has once again become one of the hottest semiconductor topics. Among reliability failure mechanisms, soft errors are becoming the biggest cause of failures in-the-field. By striking device transistors, alpha particles and atmospheric neutrons can cause unpredictable bit flips called soft errors. While a phenomena that has been around forever, it has become a general market issue as activation energies drop and the amount of SRAM memory embedded in new chip generations explodes. The iRoC Technologies paper will detail ways of evaluating the SER threat, and will present new techniques and design tips for overcoming technology obstacles and achieving highly reliable ICs.
New Trends in the Soft Error Threat
Reliability is one of the major concerns for advanced semi-conductor manufacturers. Soft errors induced by alpha particles and atmospheric neutrons are among these concerns. By hitting the transistors, such particles may cause unpredictable bit flips in ICs (integrated circuit). This effect is becoming more and more preeminent as the industry moves along the technology roadmap to smaller, more sensitive technologies. To quantify the sensitivity of their chips to soft errors, it is necessary for chip manufacturers to run radiation testing on their latest IC generation
Design for Soft-Error Robustness To Rescue Deep Submicron Scaling
Progress in technological scaling allows the integration into a single chip of hundreds of millions of transistors, moving quickly to the multi-billion transistor capacities. The integration of complex systems into a single chip, that may include heterogeneous parts such as logic, SRAM, DRAM non-volatile memories, analog and even micromechanical and optical parts, is becoming a reality
VDSM IC Logic and Memory Signal Integrity and Soft Errors
Due to the VDSM evolution and an electronic systems market moving at a neck-breaking speed, the semiconductor industry is facing exciting new challenges all the time. As minimum layout dimensions continue to shrink and correspondingly the number of functions that can be put on a SoC continues to grow, signal integrity is becoming a major issue. Some of the growing effects are the so-called “transient errors” which are due to temporary conditions of use and the environment
Embedded Robustness IPs for transient error free ICs
Shrinking process geometries will make it imperative for designers to start paying attention to transient-error protection. Self-correcting intelligence embedded in ICs protects electronic systems against such unpredictable and insidious errors. Infrastructure IPs that focus on transient faults are a leading type of self-correcting intelligence
Single Event Effect in Avionics
The occurrence of single event upset (SEU) in aircraft electronics has evolved from a series of interesting anecdotal incidents to accepted fact. A study completed in 1992 demonstrated that SEUs are real, that the measured in-flight rates correlate with the atmospheric neutron flux, and that the rates can be calculated using laboratory SEU data. Once avionics SEU was shown to be an actual effect, it had to be dealt with in avionics designs
Single Event Upset at Ground Level
Ground level upsets have been observed in computer systems containing large amounts of random access memory (RAM). Atmospheric neutrons are most likely the major cause of the upsets based on measured data using the Weapons Neutron Research (WNR) neutron beam
Latent Damage in CMOS Devices From Single-Event Latchup
Permanent damage effects are studied for several types of CMOS circuits that are sensitive to single-event latchup. The emphasis is placed on latent damage effects, where devices remained functional despite significant structural damage to their interconnects. This type of damage was observed during latchup testing with both laser pulses and heavy ions. Microscopic examination of the damaged regions after latchup testing revealed small metallic spheres and cracked, voided interconnects
Neutron Induced Latchup in SRAMs at Ground Level
Neutron-induced single-event latchup has been studied in SRAMs manufactured by several different vendors. These SRAMs span different cell designs (six-transistor and four-transistor cells), technology generations (0.25 µm to 0.14 µm) and power supplies (5 V to 1.5 V). While some technologies appear to be latchup-free in neutron environments, others have neutron-induced latchup failure-in-time (FIT) rates as high as 300 FIT/Mbit at room temperature and maximum rated voltage. Latchup FIT rates increase dramatically with temperature. The observed latchup rates can lead to very high failure rates in systems with large amounts of memory, and can’t be circumvented using error correction
Microbeam mapping of single event latchups and single event upsets in CMOS SRAMS
The first simultaneous microbeam mapping of single event upset (SEU) and latchup (SEL) in the CMOS RAM HM65162 is presented. We found that the shapes of the sensitive areas depend on VDD, on the ions being used and on the site on the chip being hit by the ion. In particular, we found SEL sensitive sites close to the main power supply lines between the memory-bit-arrays by detecting the accompanying current surge. All these SELs were also accompanied by bit-flips elsewhere in the memory (which we call `indirect' SEUs in contrast to the well known SEUs induced in the hit memory cell only). When identical SEL sensitive sites were hit farther away from the supply lines only







