Aadam: A Fast, Accurate, and Versatile Aging-Aware Cell Library Delay Model using Feed-Forward Neural Network
Seyed Milad Ebrahimipour1, Behnam Ghavami2, Hamid Mousavi1, Mohsen Raji3 Zhenman Fang2, Lesley Shannon2 1Shahid Bahonar University of Kerman, {miladebrahimi, hamidmousavi0}@eng.uk.ac.ir 2Simon Fraser University, {behnam_ghavami, zhenman, lesley_shannon}@sfu.ca 3Shiraz University, [email protected] ABSTRACT 1 INTRODUCTION With the CMOS technology scaling, transistor aging has become With the CMOS technology scaling, the reliability of circuits has one major issue affecting circuit reliability and lifetime. There are become one of the major issues affecting digital circuit designs1 [ , 7]. two major classes of existing studies that model the aging effects In addition to the correct functionality of the circuit, a longtime in the circuit delay. One is at transistor-level, which is highly ac- lifespan is also crucial in many application fields such as aerospace, curate but very slow. The other is at gate-level, which is faster but defense, and medical industries [5, 13]. Transistor aging is a key less accurate. Moreover, most prior studies only consider a limited source of failure that threatens the lifetime reliability of digital subset or limited value ranges of aging factors. circuits. It leads to a degradation of the electrical characteristics of In this paper, we propose Aadam, a fast, accurate, and versatile transistors and subsequently, a considerable increase of the device aging-aware delay model for generic cell libraries. In Aadam, we delay [19]. For example, the Negative Bias Temperature Instability first use transistor-level SPICE simulations to accurately charac- (NBTI) aging phenomenon, a major parametric reliability issue, terize the delay degradation of each library cell under a versatile may increase the circuit delay by up to 30% [14]. This impact may set of aging factors, including both physical parameters (i.e., initial eventually lead to violations of circuit timing constraints, reduction threshold voltage and transistor width/length ratio) and operat- of mean time to failure, and faster wear-out of the system. ing conditions (i.e., working temperature, signal probability, input To steer clear of the aging effects and guarantee the correct signal slew range, output load capacitance range, and projected functionality of the circuit for the projected lifetime (푡), designers lifetime). For each library cell, we then train a feed-forward neu- have to include a safety timing margin called timing guard band in ral network (FFNN) to learn the relation between the input aging the design [6, 27]. This timing guard band may decrease the circuit factors and output cell delay degradation. Therefore, for a given frequency, as shown in Equation 1. input circuit and a given combination of aging factors, we can use 1 the trained FFNNs to quickly and accurately infer the delay degra- 푓 푟푒푞푢푒푛푐푦 = ; 퐷 (푡) = 퐷 (0) + 퐷 (푡) (1) 퐷 (푡) 퐺퐵 dation for each gate in the circuit. Finally, to effectively estimate the aging-aware lifetime delay of large-scale circuits, we also inte- where 푓 푟푒푞푢푒푛푐푦 denotes the targeted frequency of the design and grate Aadam into a state-of-the-art static timing analysis tool called 퐷 (푡) represents the lifetime delay. 퐷 (0) and 퐷퐺퐵 (푡) respectively OpenTimer. Experimental results demonstrate that Aadam achieves denote the initial delay of the circuit and the delay of the tim- fast estimation of the aging-induced delay with high accuracy close ing guard band. To reduce the performance overhead imposed by to transistor-level simulation. 퐷퐺퐵 (푡), it is essential to accurately estimate the aging-induced delay degradation of the circuit and apply a minimum 퐷퐺퐵 (푡). KEYWORDS However, it is nontrivial to quickly and accurately estimate the Aging, Reliability, Delay Model, Machine Learning, Cell Library aging-induced delay degradation as there are many factors that affect the delay degradation. They include both physical parame- ACM Reference Format: ters (i.e., initial threshold voltage and transistor width/length ratio) Seyed Milad Ebrahimipour1, Behnam Ghavami2, Hamid Mousavi1, and operating conditions (i.e., working temperature, signal proba- Mohsen Raji3 and Zhenman Fang2, Lesley Shannon2. 2020. Aadam: A Fast, Accurate, and Versatile Aging-Aware Cell Library Delay Model using bility, input signal slew range, output load capacitance range, and Feed-Forward Neural Network. In IEEE/ACM International Conference on projected lifetime), which will be explained in detail in Section 2.1.2. Computer-Aided Design (ICCAD ’20), November 2–5, 2020, Virtual Event, USA. As will be detailed in Section 2.2, the prior studies that model ACM, New York, NY, USA, 9 pages. https://doi.org/10.1145/3400302.3415605 the aging effects in the circuit delay can be divided into thefol- lowing categories. The first category of work34 [ ] uses full-circuit Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed transistor-level SPICE simulation to achieve high accuracy, but runs for profit or commercial advantage and that copies bear this notice and the full citation at very slow speed. The second category of work [2, 28–30, 37, 40] on the first page. Copyrights for components of this work owned by others than ACM uses lookup table (LUT) based gate-level models to achieve faster must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a speed at the expense of lower accuracy. The third category of work fee. Request permissions from [email protected]. [16, 26] only performs SPICE simulation for a set of critical paths, ICCAD ’20, November 2–5, 2020, Virtual Event, USA but is still slow for large-scale circuits. The last category of work © 2020 Association for Computing Machinery. ACM ISBN 978-1-4503-8026-3/20/11...$15.00 [20, 25, 38, 39] starts to use traditional machine learning techniques https://doi.org/10.1145/3400302.3415605 such as support vector machine (SVM) and non-linear regression ICCAD ’20, November 2–5, 2020, Virtual Event, USA Seyed Milad Ebrahimipour1, Behnam Ghavami2, Hamid Mousavi1, Mohsen Raji3 and Zhenman Fang2, Lesley Shannon2 model to predict the delay caused by the NBTI effect at gate-level or the channel and degrades the transistor’s mobility (휇). Second, critical path level. In fact, most prior studies only consider a limited because of the non-epitaxial structure of 푆푖푂2, the vertical electric subset of aging factors and/or limited value ranges of aging factors. field leads to defect generation in the interface of the transistor To address those issues in prior studies, in this paper, we propose and the formation of defects inside the 푆푖푂2. These defects cause Aadam, an accurate yet fast aging-aware delay model for generic accumulated charges around and within the gate dielectric, which cell libraries by considering a versatile set of aging factors that are ultimately increases the threshold voltage (푉푡ℎ) of the transistor. summarized in Section 2.1.2. The major idea behind Aadam is to For a projected lifetime (푡), the delay of a transistor (퐷(푡)) is leverage 1) the high accuracy of transistor-level SPICE simulation inversely proportional to its drain current (퐼퐷 (푡)), and 퐼퐷 (푡) is a to characterize the delay degradation of library cells only; 2) the function of the transistor’s mobility (휇(푡)) and threshold voltage learning power of feed-forward neural networks (FFNN) [18] to (푉푡ℎ (푡)), as shown in Equation 2 (where 푉푑푑 denotes the supply accurately and quickly predict the aging-induced delay at gate- voltage). As a result, with 휇(푡) decreasing and 푉푡ℎ (푡) increasing level under the versatile combinations of aging effects; and 3) the during the operation of the transistor, 퐼퐷 (푡) decreases and the fast aging-aware static timing analysis (STA) of large-scale circuits transistor delay (퐷(푡)) increases. using the state-of-the-art STA tool called OpenTimer [21]. 1 휇(푡) Aadam provides a fully automated framework for aging-aware 퐷(푡) ∝ ; 퐼 (푡) ≈ (푉 − 푉 (푡))2 (2) 퐼 (푡) 퐷 2 푑푑 푡ℎ STA of large-scale circuits. In Aadam, we first characterize the 퐷 delay degradation of each library cell under a versatile set of aging 2.1.2 Gate-Level. Each gate in the circuit is exposed to different factors as mentioned earlier, using the accurate transistor-level operating conditions that can lead to different aging-induced delay SPICE simulations. Based on these characterized data, we train an degradations over the projected lifetime (t). FFNN to learn the relation between the input aging factors and 1. The working temperature (T) has a great impact on the trap output delay degradation for each library cell. As a result, these generation of transistors in a gate. As 푇 increases, the rate of trained FFNNs can quickly and accurately predict the aging-induced interface and oxide trap generation increases [3, 4], i.e., 푉푡ℎ (푡) delay at gate-level. Finally, we integrate these trained FFNN models increases faster and 휇(푡) decreases faster. This includes both to a state-of-the-art STA tool called OpenTimer [21] to effectively Positive Bias Temperature Instability (PBTI) effect in NMOS estimate the aging-aware lifetime delay of large-scale circuits. We transistors and Negative Bias Temperature Instability (NBTI) plan to release our Aadam toolflow to the public in the near future. effect in PMOS transistors. In our experiments, we use the open-source Nangate 45nm 2. The signal probability (휆) also has a great impact on the trap generic cell library [31], and a few circuits from the ISCAS’85 [10], generation. The higher the signal probability of the inputs of a ISCAS’89 [9], ITC’99 [12], and OpenTimer [21] benchmark suites. gate, the greater the operation cycle (duty cycle) of its transistors. Compared to the accurate transistor-level SPICE simulation [35, 36], As a result, more interface traps and oxide traps will be generated Aadam achieves almost identical accuracy for predicting the lifetime inside the transistors of the gate, i.e., the aging rate of the gate delay of a gate while achieving four orders-of-magnitude speedup. will increase. Compared to the prior LUT-based model [2] and SVM-based model 3. The input signal slew range ([푆푚푖푛, 푆푚푎푥 ]) and output load ca- [38, 39] that predict the lifetime delay at gate-level, Aadam achieves pacitance range ([퐶푚푖푛, 퐶푚푎푥 ]) of a gate can lead to different better accuracy and can accurately predict for input aging factor interface trap generations and different values for 푉푡ℎ (푡) and combinations that are not captured in these models. Compared to 휇(푡) of each transistor in a gate [24]. the prior critical path based model [20] that performs aging-aware 4. Physical parameters, such as initial threshold voltage (푉푡ℎ (0)) STA at circuit-level, Aadam also achieves better accuracy in predict- and transistors’ width/length ratio (푊 /퐿 푟푎푡푖표), also affect the ing the circuit lifetime delay. In addition, Aadam can consistently aging-induced gate delay. provide high prediction accuracy when there are perturbations to the circuit, while the prediction accuracy of the critical path based 2.1.3 Circuit-Level. Stemming from the transistor-level, aging can model [20] significantly decreases after the circuit perturbation. lead to timing violations at the circuit-level. Since each gate of the Finally, for large-scale circuits with 138.9K to 255.3K gates, our circuit is operated under a different combination of the above fac- aging-aware delay estimation with Aadam only adds around 5 to tors, the aging-induced delay degradation of each gate is different. 12 seconds extra runtime overhead, which can be well tolerated. As a consequence, a critical path of the circuit might become non- critical and vice versa over the projected lifetime. Since the lifetime delay of each circuit path is changing due to the aging effects, eval- 2 RELATED WORK AND OUR NOVELTY uating aging impacts on the circuit lifetime cannot be determined 2.1 Aging Effects and Factors by only analyzing the critical paths of the circuit. Instead, all of the 2.1.1 Transistor-Level. The transistor aging phenomenon occurs paths that may violate the timing constraints of the circuit should due to the formation of interface traps (breaking of 푆푖 − 퐻 bonds at be jointly considered. the 푆푖 − 푆푖푂2 interface) and oxide traps (capturing of charges in the oxide vacancies within the dielectric). First, during the operation of 2.2 Aging-Aware Delay Modeling the transistor, the horizontal electric field over the gate dielectric Although there are a lot of studies that try to model the 푉푡ℎ degra- increases the kinetic energy of the carriers. This results in charge dation due to aging effects, there are only a few studies that aim to build-up near the transistor drain, leading to defect formation in the compute the aging-induced gate and/or circuit delay degradation, drain. These interface traps interact with the charge carriers inside which can be classified as the following categories. Aadam: A Fast, Accurate, and Versatile Aging-Aware Cell Library Delay Model using Feed-Forward Neural Network ICCAD ’20, November 2–5, 2020, Virtual Event, USA
2.2.1 Full-Circuit Transistor-Level Modeling. An aging-aware In [25], a non-linear regression model has been introduced to transistor-level timing analysis method is introduced in [34]. First, estimate the delay degradation of a circuit due to NBTI effects. the initial circuit without aging effect is simulated under different This method exploits the critical path set, which consists of the operating conditions of each transistor. Then, using the collected critical path and near-critical paths with delays higher than 80% of information, the 푉푡ℎ shift due to aging is computed for every tran- the critical path delay. Then, using a non-linear regression model, sistor in the entire circuit. Finally, the lifetime 푉푡ℎ is applied to various NBTI operating conditions are mapped to the delays of the each transistor and the circuit delay for the projected lifetime is critical path set. An extension of this model is presented in [20] to calculated. Despite the high accuracy, this method is very slow due estimate the lifetime delay of the circuit considering time variant to transistor-level simulation, which makes it infeasible to be used operating conditions. As stated before, due to the large number of for large-scale industrial circuits. paths to be considered, both of these methods are impracticable to analyze the aging effects of the large-scale industrial circuits. 2.2.2 LUT-based Gate-Level Modeling. To compensate for the run- time overhead of transistor-level simulations, some studies use gate- 2.2.5 Summary. In summary, prior studies for aging-aware delay level simulation models at the expense of lower accuracy. In [30, 37, modeling usually suffer from at least one of the following issues: 40], a Look-Up Table (LUT) based gate delay model is introduced low speed, low accuracy, or limited consideration of aging factors. to correlate the NBTI-induced 푉푡ℎ shift of PMOS transistors to the corresponding gate delay degradation. However, it considers 2.3 Our Novelty neither the PBTI effect in NMOS transistors nor the slope of rising In this paper, we propose a feed-forward neural network (FFNN) and falling signals of a gate. Kiamehr et. al. [28] propose to use an based method to model the aging-induced delay degradation, which aging-aware standard cell library. They extend the cell library and 1) considers a versatile set of aging factors as summarized in Sec- provide multiple copies of each library cell considering different tion 2.1.2, 2) achieves high accuracy that is close to transistor-level input signal probabilities. Then, according to the expected input modeling, and 3) achieves high speed that is close to the gate-level signal probability of each gate, technology-mapping is done and modeling. Moreover, once our FFNN models (one model per library a robust cell from the extended cell library is used to replace the cell) are trained for a cell library at a specific process technology, initial gate. However, they only consider the input signal probability. they can be used to estimate the lifetime delay of any circuit under In [2, 29], an accurate LUT based method is introduced to estimate different aging effects without the need of retraining. Finally, we both NBTI-induced and PBTI-induced delay degradations of each achieve fast aging-aware static timing analysis (STA) for large-scale gate. However, all these methods are limited to model the operating circuits by integrating our model with a state-of-the-art STA tool conditions under which the LUT is built. called OpenTimer [21].
2.2.3 Critical-Path Transistor-Level Modeling. With an eye toward 3 DESIGN OF AADAM analyzing the effects of aging on complex designs such as proces- sors, Karimi et.al. [26] consider the effects of aging on the critical The overall design of our Aadam framework is shown in Figure 1, path only (using the SPICE simulation). However, since the aging which consists of three major phases. First, we use transistor-level effects may change the critical paths into non-critical onesand SPICE simulation to accurately characterize the delay degradation vice versa, only considering the critical path of the circuit is not of each cell from a generic cell library, under a versatile set of aging sufficient. Another method16 in[ ] considers the effects of the top factors summarized in Section 2.1.2. Second, using the characterized 20% critical paths. However, this method is infeasible to analyze aging-aware cell delay dataset, we train a feed-forward neural 1 the aging effects on very large-scale integrated circuits used inthe network (FFNN) for each cell in a generic cell library to learn industry. For example, as shown in [14], the b19 benchmark circuit the relation between its input aging factors and output cell delay [12] has more than 108 paths, and considering the paths with only degradation. Using the trained FFNN models, we can infer the 5% relative slack time may lead to more than 107 paths, which are circuit delay degradation caused by a versatile set of aging effects intractable for analyzing. with high accuracy and fast runtime, without the need of model retraining. Finally, to effectively estimate the lifetime delay ofa 2.2.4 Machine Learning based Modeling. Recently, applying ma- large-scale circuit under specified aging effects, we implement an chine learning techniques in EDA (Electronic Design Automation) aging-aware Static Timing Analysis (STA) toolflow by extending a tools has gained increasing attention [22, 23, 32], and machine state-of-the-art STA tool called OpenTimer [21] with our trained learning based aging estimation for the circuit is not an exception FFNN models. [20, 25, 38, 39]. In [38, 39], a two-stage workload-dependent NBTI model has been introduced. First, a method is proposed to find 3.1 SPICE Characterization of Cell Library a set of representative flip flops that are important to be tracked The left part of Figure 1 presents our proposed flow to characterize with regard to the aging effects. And then, they calculate the aging- the aging-induced delay for each library cell using accurate SPICE induced delay degradation of these flip flops by sensing their actual simulation. First, we characterize the threshold voltage (푉푡ℎ) degra- signal probabilities at runtime. They use a Support Vector Machine dation of a transistor in the technology library using SPICE sim- (SVM) model to map the signal probabilities of these flip flops to ulation. Here we consider the transistor characteristics, including the delay degradation of the circuit. However, they assume other parameters such as supply voltage and temperature as constants 1A typical generic cell library has around 100 cells, which is much smaller than the and only consider the signal probability. number of gates in a typical industrial circuit. ICCAD ’20, November 2–5, 2020, Virtual Event, USA Seyed Milad Ebrahimipour1, Behnam Ghavami2, Hamid Mousavi1, Mohsen Raji3 and Zhenman Fang2, Lesley Shannon2
Transistor Gate-Level Netlist Technology Library Characteristics Test Data Training Data of the Circuit e.g. T, λ, Vth(0), W/L, t Activity Parasitic Estimator Information SPICE Extraction Simulation ` ProcessTraining Signal Degraded Parasitic Information of Probability and each Cell (SPEF File) Transistor Model Activity Factor of each Cell Cell Operating Analysis (STA)
Conditions Cell TimingStatic SPICE Library e.g. [Cmax,Cmin], Simulation [Smin,Smax] Test Accuracy of
Model ` Aging-aware Cell Delay Dataset ` NO Enough YES ` Accuracy Circuit Timing Reports
Cell Library Characterization FFNN Model Training Aging-Aware Static Timing Analysis Figure 1: Overview of our aging-aware Static Timing Analysis (STA) framework the temperature (푇 ), signal probability (휆), initial threshold voltage Input b1 b2 b3
(푉푡ℎ (0)), transistor sizes (푊 /퐿 푟푎푡푖표푠), and projected lifetime (푡) x1 of the transistor. For each combination of these aging factors, a Output degraded transistor model is created using the SPICE simulation xp Y results, including both NMOS and PMOS transistors. Second, for each cell from the cell library, we perform another round of SPICE x simulation to calculate the lifetime delay and the slope of the cell. d Here we consider the the degraded transistor models for the cell Figure 2: Architecture of our Feed-Forward Neural Network. and other cell characteristics, including the input signal slew range The input corresponds to the input operating conditions and ([푆푚푖푛, 푆푚푎푥 ]) and output load capacitance range ([퐶푚푖푛, 퐶푚푎푥 ]). physical parameters of a cell, and the output corresponds to Finally, we get an accurate aging-induced delay degradation for the estimated aging-induced delay degradation of the cell. each library cell under different combinations of the aging factors.
3.2 FFNN Model for Cell-Level Aging Delay where 푊1, 푊2, 푊3, and 푊4 are the weights matrices of each layer, 푏 , 푏 , and 푏 are the bias vectors of each hidden layer, which need Influencing the aging-induced delay degradation of a cell from 1 2 3 to be trained for the FFNN. 푓 (1) , 푓 (2) , and 푓 (3) are three nonlinear a given combination of its operating conditions and physical pa- functions; in this paper, we use the ReLU (Rectified Linear Units) rameters (as summarized in Section 2.1.2) can be considered as a nonlinear activation functions. regression problem. Therefore, we propose to solve this problem For each cell in the cell library, we will train one such FFNN using a feed-forward fully-connected neural network (FFNN) [18]. model to learn the relation between the input operating conditions Figure 2 presents the architecture of our proposed FFNN. The and physical parameters of the cell and the output delay degradation input layer corresponds to the input operating conditions and phys- of the cell. Then for each gate in the circuit, we can use our trained ical parameters of a gate. The input feature vector 푋 that feed to FFNN models to inference its aging-induced delay degradation. To the input layer can be defined as: train each FFNN model using the aging-aware cell delay dataset 푋 = [푥1, 푥2, . . . , 푥푑 ] from Section 3.1, we take the following steps: (3) = [푇, 휆, 푆푚푖푛, 푆푚푎푥,퐶푚푖푛,퐶푚푎푥,푊 /퐿,푉푡ℎ (0), 푡] 1. Normalization of input feature vector 푋. As described in Equa- The output layer just has one neuron that corresponds to the es- tion 3, each feature 푥푖 in our input feature vector 푋 is on a timated aging-induced delay degradation of a cell, i.e., 퐷(푡) = 푌. different scale. For example, the temperature 푇 ranges from ◦ ◦ We use three hidden layers in our FFNN and each hidden layers 20 퐶 to 120 퐶, while the signal probability 휆 ranges from 0 to 1. consists of 256 neurons. The relation between the output 푌 and Such different scaling of the features makes the FFNN training input feature vector 푋 can be expressed as: more difficult and it would make the trained model more depen- dent on the choice of units used in the input features. To address (3) (2) (1) 푌 = 푊4 · 푓 푊3· 푓 푊2·푓 (푊1·푋+푏1) + 푏2 + 푏3 (4) this issue, we normalize our input features from the training Aadam: A Fast, Accurate, and Versatile Aging-Aware Cell Library Delay Model using Feed-Forward Neural Network ICCAD ’20, November 2–5, 2020, Virtual Event, USA
data set using the normalization technique [33], i.e., the aging-induced delay degradation. Second, the trained FFNN ′ 푥푖 − 휇푋 models for each library cell are used inside the STA tool to infer the ∀ 푋 ∈ 푖푛푝푢푡푆푒푡 : 푥푖 = (5) 휎푋 aging-induced delay of the circuit at the projected lifetime (푡). Note that all other aging factors are already available in the standard where 푥 ′ denotes the normalized value of the 푖th feature in input 푖 STA tool and do not need a separate extraction. feature vector 푋. 휇 and 휎 respectively represent the mean 푋 푋 While our aging-aware STA toolflow is generic for any standard and standard deviation of the input feature vector 푋. STA tools, for illustration purpose, we have used a state-of-the-art 2. Initialization of weight matrices and bias vectors. We set the STA tool called OpenTimer [21] for the integration. The original weight matrices (푊 , 푊 , 푊 , and 푊 ) according to Xavier ini- 1 2 3 4 OpenTimer is unaware of aging effects but uses a block-based tialization [15], and initialize the bias vectors (푏 , 푏 , and 푏 ) of 1 2 3 approach for fast STA. As a result, our aging-aware STA flow based the hidden layers to zero. on OpenTimer can avoid the slow speed issue of prior critical-path 3. Fine tuning using back propagation. Finally, we fine tune the based approaches summarized in Section 2.2.3 and 2.2.4, and achieve weight matrices and bias vectors, so that the predicted delay fast static timing analysis for large-scale industrial circuits (results degradation 푌 from our FNN model approaches to the actual in Section 4.5). characterized delay degradation 푌ˆ from Section 3.1. To calculate the difference between 푌 and 푌ˆ, we use Mean Squared Error (MSE) for 푁 sample training datasets, i.e.,: 4 EXPERIMENTAL RESULTS 푁 1 Õ 4.1 Experimental Setup = − ˆ 2 푀푆퐸 푌푖 푌푖 (6) We use Google’s Tensorflow [17] machine learning framework 푁 = 푖 1 to build our proposed FFNN for aging-aware delay modeling. As To minimize the MSE, we use standard back propagation with described in Section 3.2, our FFNN has three hidden layers, each of the gradient descent algorithm [8]. which has 256 neurons. It is implemented in Python and runs on a desktop machine with an Intel Core i7 (2.5 GHz) processor and 3.3 Aging-Aware Static Timing Analysis 8GB of DRAM. To get the training and testing datasets, we use the Timing analysis is the process of determining the circuit delay and following setup. analyzing its timing issues. There are two major approaches for tim- At the transistor-level, we use the 45nm predictive technology ing analysis of a circuit at logic-level: timing simulation and Static model [41] for both NMOS and PMOS transistors. The HSPICE Timing Analysis (STA). Timing simulation is done by exercising the MOSFET Model Reliability Analysis (MOSRA) [35, 36] has been circuit using a set of input vectors and gates’ parameters [11]. The used under different working temperatures푇 ( ), signal probabili- comprehensiveness of this method depends on the number of input ties (휆), initial threshold voltage (푉푡ℎ (0)) values, transistor 푊 /퐿 vectors and gates’ parameters used for simulation. As a result, it ratios, and projected lifetime (푡). The ranges for the variables are: ◦ is almost infeasible to do a complete verification of all timing con- 푇 : [20:120] 퐶, 휆: [0:1], 푉푡ℎ (0): [-0.5:-0.1] V for PMOS transistors straints of modern industrial circuits which may have more than and [0.1:0.5] V for NMOS transistors, and 푡: [0:10] years. We also 100 million gates. In contrast, STA is carried out statistically which consider 6 different 푊 /퐿 ratios: 1X (minimum sized), 2X, 4X, 8X, makes it independent of the number of input vectors and gates’ 16X, 32X (maximum sized). parameters [11, 21]. By analyzing the entire circuit once, it provides At the gate-level, we use the open-source Nangate 45nm generic a simpler and faster way to analyze the timing of the circuit. cell library [31] and its SPICE netlists to get realistic netlists of A standard STA tool [11, 21] requires the following inputs from different combinations of cells (gates). Parasitic information isin- a circuit: 1) the gate-level circuit netlist, 2) the parasitic information cluded based on the layout at the 45nm technology node. The 푆푚푖푛 contained in the Standard Parasitic Exchange Format file (SPEF and 푆푚푎푥 are set to 5ps and 950ps, respectively. The 퐶푚푖푛 and 퐶푚푎푥 file), and 3) the timing table of the cells. Logic level STA utilizes the are set to 0.25fF and 25fF, respectively. The supply voltage (푉푑푑 ) is timing data obtained from annotations in standard delay format, set to 1.1V and the HSPICE tool is used to measure gate delays. and establishes look-up tables of gate delays to quickly retrieve a The dataset used for each library cell can be denoted as constant propagation delay. In these tools, the delay of the circuit 푑푎푡푎푃푎푖푟 = 푋푖, 푌ˆ푖 , where i denotes the sample index in the is a function of the input slew range and output load capacitance dataset, 푋푖 denotes the input feature vector defined in Equation 3, (for a specific temperature). and 푌ˆ푖 denotes the corresponding lifetime delay calculated by Incorporating aging-aware analysis into a standard STA tool HSPICE simulation (i.e., the ground true label for our FFNN). For makes the complexity of the timing description for the cells grow each library cell, we first randomly shuffle all its datasets andthen quickly, due to a versatile set of aging factors that affect the delay split them into two disjoint sets: the training set and the validation degradation as summarized in Section 2.1.2. Therefore, we modify set. In this paper, we use 1000 samples for the training dataset. For the standard STA toolflow to integrate our trained FFNN models the results presented in Section 4.2 and 4.3, we use 600 samples for for fast aging-induced delay inference. The revised aging-aware the validation set. STA flow is shown in the right part of Figure 1, with thefollow- ing two major changes. First, the gate-level netlist of the circuit is fed to a logic simulator to compute the signal probability (휆) pro- 4.2 Validation for Our Cell Delay Model files according to the running workload. This provides the signal We verify the accuracy of our proposed model by comparing the probability (휆) of each transistor inside each gate, which affects predicted delay of a gate with the true delay obtained from SPICE ICCAD ’20, November 2–5, 2020, Virtual Event, USA Seyed Milad Ebrahimipour1, Behnam Ghavami2, Hamid Mousavi1, Mohsen Raji3 and Zhenman Fang2, Lesley Shannon2