MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach

Huang, Shijie; Wang, Lingfei

doi:10.3390/mi14020386

Open AccessArticle

MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach

by

Shijie Huang

^1,2,3,* and

Lingfei Wang

^1,2,3,4,*

¹

Key Laboratory of Microelectronics Devices and Integrated Technology, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China

²

State Key Laboratory of Fabrication Technologies for Integrated Circuits, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China

³

University of Chinese Academy of Sciences, Beijing 101408, China

⁴

Peng Cheng Laboratory, Shenzhen 518066, China

^*

Authors to whom correspondence should be addressed.

Micromachines 2023, 14(2), 386; https://doi.org/10.3390/mi14020386

Submission received: 16 January 2023 / Revised: 30 January 2023 / Accepted: 1 February 2023 / Published: 4 February 2023

(This article belongs to the Special Issue Emerging CMOS Devices, Volume II)

Download

Browse Figures

Versions Notes

Abstract

:

The continued scaling-down of nanoscale semiconductor devices has made it very challenging to obtain analytic surface potential solutions from complex equations in physics, which is the fundamental purpose of the MOSFET compact model. In this work, we proposed a general framework to automatically derive analytical solutions for surface potential in MOSFET, by leveraging the universal approximation power of deep neural networks. Our framework incorporated a physical-relation-neural-network (PRNN) to learn side-by-side from a general-purpose numerical simulator in handling complex equations of mathematical physics, and then instilled the “knowledge’’ from the simulation data into the neural network, so as to generate an accurate closed-form mapping between device parameters and surface potential. Inherently, the surface potential was able to reflect the numerical solution of a two-dimensional (2D) Poisson equation, surpassing the limits of traditional 1D Poisson equation solutions, thus better illustrating the physical characteristics of scaling devices. We obtained promising results in inferring the analytic surface potential of MOSFET, and in applying the derived potential function to the building of 130 nm MOSFET compact models and circuit simulation. Such an efficient framework with accurate prediction of device performances demonstrates its potential in device optimization and circuit design.

Keywords:

surface potential; MOSFET; artificial neural network; DIBL; compact model

1. Introduction

Compact models work as a bridge between the fabrication process and circuit design. They are designed to accurately reproduce minute details of device electrical characteristics, which are essential in the design of digital, analog, mixed-signal, and RF-integrated circuits. This requires the model to be accomplished in a manner consistent with the device operation physics, and with a model structure that remains invariant of fabrication process particulars. Several types of compact model for a Mental-Oxide-Silicon Field-Effect transistor (MOSFET) have been developed, including the threshold-voltage(

V_{th}

)-based compact model, the inversion-charge(

q_{i}

)-based compact model, and the surface-potential(

ϕ_{s}

)-based compact model. Among these compact models, the

ϕ_{s}

-based compact model has achieved widespread success and is most frequently used in modern circuit simulators for its accurate description of major physical effects, which are responsible for the characteristics of scaled MOSFETs. The expression formulation of

ϕ_{s}

is the key component of

ϕ_{s}

-based compact models, and needs to be carefully designed in solving implicit transcendental equations. Aggressive down-scaling of the device and the resultant physical effects have made it very challenging to obtain analytic solutions from complex equations in mathematical physics [1,2]. On the other hand, though numerical solvers [3] can be high-quality alternatives, the non-differentiable solution is unfit for circuit simulation or design optimization; furthermore, a solver may take hours just to solve the equation for a single condition, being computationally intractable even if it can be applied on demand.

Can we free ourselves from challenging theoretic effort and maximally automate the process of building an analytic device surface potential expression that can be applied efficiently in system simulation, design, and optimization? The strength of an artificial neural network (ANN) is considered as one of the effective ways by which to achieve the goal. An ANN endeavors to recognize underlying relationships in a set of data through a process that mimics the way the human brain operates and can adapt to changing input, so the network generates the best possible result without needing to redesign the output criteria. An ANN consists of an input layer of neurons taking the input data, one or two hidden layers of neurons processing the input data, and a final layer of output neurons sending the predicted output, which is compared with the actual output. Based on the error, the parameters (weight and bias) in an ANN are changed and are then fed into the network again to improve the accuracy of prediction. Compared to other machine learning algorithms, an ANN has the ability to learn and adapt to unknown systems, and can fully approximate any complex nonlinear relationship, giving analytical expression of the input and output data, which is essential in the compact model development.

Several studies have been conducted to apply artificial neural networks in building models for MOSFET. In the literature [4,5], the electrical characteristics of MOSFET, including I-V curves and C-V curves, were directly adopted as the training data of the ANN. Two hidden layers were involved in the network and the learning result is achieved. However, with little consideration of the physics of the device’s working principle, the modeling variation is complex in the pure-deep-learning-based model, which is important in device design and optimization. In [6], a deep-learning-assisted MOSFET I-V compact modeling method was proposed. In the method, the ANN acts as a correction term to the traditional BSIM compact model, which increases the compatibility of the classical model with advanced node devices. This method presents a possible prospect for the development of future compact models. However, the direct correction to the I-V curve by the ANN also conceals the important physical information in advanced MOSFETs, which is the optimization and development of scaling devices.

In this work, we propose an innovative artificial neural network (ANN) [7] to obtain the surface potential for compact modeling, and thus, preserve physical information in the subsequent process of building the compact model. The key idea is to use a dedicated numerical simulator [8] (e.g., TCAD) as a “teacher”, and feed its output across highly diverse device data to the “student”, a so-called physical-relation-neural-network (PRNN). The PRNN is a universal approximator that combines general-purpose data fitting with domain-specific physical relations. Therefore, it can effectively mimic the behavior of the teacher, and instill the learned “knowledge” from the solver into the neural network, effectively closing the gap between discrete, numerical simulations and continuous, analytic modeling. We demonstrate the impressive results of our approach in obtaining accurate, analytic surface potential in 130 nm MOS devices. It only requires a simple substitution to generalize to the new parameters of the device, and human intervention involved in different devices/equations [9] is also expected to be minimal. Our framework will be particularly useful in device simulation acceleration, speeding up and coupling the development of micro-device physics-based compact models and device design optimization.

2. Method Framework

The global picture of the proposed framework is shown in Figure 1. First, TCAD [8] software is used to perform the simulation and generate the training data. The data are fed into a physical-relation-neural-network (PRNN), which is composed of both a physical-relation layer to account for basic, low-level physical prior knowledge, and general-purpose fully-connected layers to further capture data nonlinearity. After training and network optimization, the resultant analytic surface potential is further applied to building 130 nm MOSFET semi-classical compact models (e.g., I-V/C-V characteristics) and circuit simulation.

2.1. TCAD Simulation

The 130 nm node MOSFET is stimulated with the Sentaurus Device TCAD tool, as shown in Figure 2. The simulated training data have two parts: preselected device data/parameter

x_{i}^{'} s

(i for sample index), and corresponding surface potential [10] value

y_{i}^{'} s

computed by TCAD. Each

x_{i}^{'} s

is a d-dimensional vector specifying: the thickness of the gate insulation layer (

T_{ox}

), the gate to source voltage (

V_{gs}

), the drain to source voltage (

V_{ds}

), the length of the channel region (

L_{g}

), the temperature (

T

), the doping concentration in the channel region (

N_{d}

), and channel locations (x) (Table 1 for details). The

x_{i}^{'} s

should cover a diverse range of device/operation conditions to generate a rich training dataset. The simulated potential value

y_{i}

is the difference between substrate electrostatic potential and electrostatic potential (one nanometer below the channel surface) [11].

2.2. Physical-Relationship (PR) Layer

The PR-Layer groups the variables in each

x_{i}

and applies group-wise transform to account for the desired interaction between parameters; the details are shown in Equation (1), which reflect useful prior knowledge on some simple but fundamental relations of physics [12]. From a learning perspective, incorporating justified variable interactions can effectively reduce sample complexity; i.e., the amount of data needed for training an accurate model [13,14].

v_{i} {= [V}_{gs} {, V}_{ds} {, N}_{d} {, T, x, e}^{λ \frac{x - L}{t_{si}}} {, e}^{λ \frac{- x}{t_{si}}} {, V}_{ds} e^{λ \frac{- L}{t_{si}}} {, V}_{bi} (1 - e^{λ \frac{- L}{t_{si}}} {), V}_{gs}^{2} {, V}_{gs} - V_{ds}, \log (\frac{L_{Di} V_{gs}}{t_{si}})] .

(1)

Here,

v_{i} \in ℝ^{12 \times 1}

,

λ = \sqrt{\frac{ε_{ox} t_{si}}{T_{ox} ε_{si}}}

,

L_{Di} = \sqrt{\frac{ε_{si} v_{t}}{{qn}_{i}}}

is the intrinsic Debye length,

V_{b} {= v}_{t} \ln (\frac{N_{sd} N_{d}}{n_{i}^{2}})

is the build-in potential at the source/drain terminal. Min-Max normalization is then adopted to standardize the data

v_{i}

[15].

2.3. Fully-Connected (FC) Layer

In FC-Layers, all neurons in one layer will be fully connected to all neurons in the next layer. These are general-purpose network components, and serve as a nice complement to the PR-Layer in capturing complex nonlinear relations [16] between the device data and surface potential. As shown in Figure 1, we cascade two FC-layers right after the PR-layer, with 64 and 32 neurons, respectively, each activated by the sigmoid function. The two FC-layers are as follows [17].

h_{i}^{1} = σ (W_{1}^{T} \cdot v_{i} {+ b}_{1}),

(2)

h_{i}^{2} = σ (W_{2}^{T} \cdot h_{i}^{1} {+ b}_{2}) .

(3)

Here,

h_{i}^{1} \in ℝ^{64 \times 1}, h_{i}^{2} \in ℝ^{32 \times 1}

are the neurons in the hidden layer,

W_{1} \in ℝ^{12 \times 64}

,

W_{2} \in ℝ^{64 \times 32}

are the coefficient weight term matrices of the two FC-layers,

b_{1} \in ℝ^{64 \times 1}

and

b_{2} \in ℝ^{32 \times 1}

are the bias term, and

σ (\cdot)

is an entry-wise sigmoid function [18].

σ (u) = \frac{1}{1 + \exp (- u)} .

(4)

The sigmoid function is well bounded and allows for efficient computation of the gradient. Finally, the predicted surface potential associated with each

x_{i}^{'}

is computed by

\hat{y_{i}} {= w}^{T} \cdot h_{i}^{2} + b .

(5)

Here,

w \in ℝ^{32 \times 1}

and

b \in ℝ

are model coefficients and bias.

\hat{y_{i}} \in ℝ

is the predicted value of the ANN. Mean-Squared-Error (MSE)

\sum {(y_{i} - \hat{y_{i}})}^{2} / n

[19,20] is used as the loss function, which is iteratively minimized with stochastic gradient descent [21].

Upon the completion of the training process, the analytic expression of the surface potential can be written as:

\hat{y_{i}} (or ϕ_{s} {) = w}^{T} σ (W_{2}^{T} \cdot σ (W_{1}^{T} \cdot {PR (x}_{i} {) + b}_{1}) {+ b}_{2}) + b .

(6)

which is a concise model and can be efficiently evaluated.

2.4. Surface Potential Written in Verilog-A

To further verify the applicability of the framework in circuit simulation, the trained artificial neural network should be transformed into the form of Verilog-A [22], which is the commonly used hardware description language for MOSFET and other electronic components. Verilog-A is the analogy subset of Verilog-AMS [23], originally intended for modeling the behavior of analog and mixed-signal systems. Despite significant initial resistance, Verilog-A has emerged as the de facto stand language for defining and distributing compact models. In 2004, constructs explicitly for the purpose of compact modeling were added. Considering the incompatibility of matrix calculations in Verilog-A language, an automation script (Python [24], for example) is adopted to accelerate the transform process. Two steps are divided to realize this purpose. First, the value of the parameters in each layer should be entered into Verilog-A. In the framework, these parameters include physical parameters (for example, vacuum dielectric constant of silicon [25]

ε_{si}

, Planck constant k, and Unit charge constant q) used in PRNN, Min and Max value for normalization, bias term

b_{i}

, and weights term

w_{i}

. Table 2 (I) shows the pseudo code [26] for inputting

w_{i}

. The value of

w_{i}

is read from the saved txt file and written to Verilog-A by intermediate variable a. Then, in the second step, the forward propagation process of the artificial neural network is realized in Verilog-A. In this step, the implementation of the calculation process in the framework includes the data normalization, neurons (

h_{i}

) in the hidden layers and denormalization of data at output data

\hat{y_{i}}

. Table 2 (II) shows the pseudo code for transforming

h_{i}^{2}

. The calculation of weight and bias terms between each neural is conducted following Equations (3) and (4).

2.5. Establishing the Compact Model

After obtaining the analytical surface potential expression, the related compact model for MOSFET can be built. In this work, a semi-classical compact model is developed based on the combination of trained surface potential expression and the classical compact model. The main equations are shown as follows: [27,28,29,30,31]

I_{ds} {= u}_{eff} C_{ox} \frac{W}{L} (V_{gf} (ϕ s_{d} - ϕ s_{s}) - 0.5 (ϕ s_{d}^{2} - ϕ s_{s}^{2})) .

(7)

u_{eff} = \frac{u_{0}}{1 + {MUE * E}_{eff}^{THEMU} + CS {(\frac{q_{d}}{q_{d} + q_{i}})}^{2}}

(8)

V_{gf} = Log [1 + e^{(V_{gs} - V_{fb}) * Sl}] / Sl

(9)

ϕ s_{s} = ϕ s [V_{gs}, V_{ds}, T, \dots, x]- ϕ s[0, 0, T, \dots, x]

(10)

ϕ s_{d} = ϕ s_{s} + V_{dsx}

(11)

V_{dsx} = {gV}_{ds} {(1 + {\frac{{gV}_{ds}}{V_{dsat}}}^{m})}^{- \frac{1}{m}}

(12)

Here,

u_{eff}

[29] is carrier mobility,

u_{0}

is carrier mobility at a low electrical field,

q_{d}

is the normalized charge of the depletion region,

q_{i}

is the normalized charge of the inversion region, (MUE,THEMU) is the fitting parameters accounting for the mobility degradation [32] caused by the surface roughness and phonon scattering. Coulomb scattering is introduced using the parameter CS.

E_{eff}

denotes the effective vertical field at the potential midpoint.

V_{gf}

[33] accounts for the subthreshold region with parameter Sl acting as the correction parameter to the subthreshold swing.

ϕ s_{s}

is the surface potential obtained from the ANN at x = 0.01 um.

V_{dsat}

[31] is the saturation voltage with m acting as a fitting parameter.

The gate capacitance model is also necessary in circuit simulation and can be calculated as follows [29,34,35,36]:

Q_{g} = {W * C}_{ox} \int_{0}^{L} (V_{g} - V_{fb} - ϕ s [V_{gs}, V_{ds}, T, \dots, x]) dx

(13)

Q_{d} = {W * C}_{ox} \int_{0}^{L} q_{i} \frac{y}{L} dx

(14)

Q_{S} = {W * C}_{ox} \int_{0}^{L} q_{i} (1 - \frac{y}{L}) dx

(15)

C_{mn} = - \frac{\partial Q_{m}}{\partial V_{n}}, m \neq n; C_{mm} = \frac{\partial Q_{m}}{\partial V_{m}}

(16)

3. Method Validation and Discussion

We evaluated the proposed framework by computing the surface potential in 130 nm MOSFET. We generated 540,000 training samples and 100,000 testing samples by TCAD simulation. The d-dimensional device data

x_{i}^{'} s

were generated by randomly sampling each variable from their feasible domains. The evaluation results are reported in Figure 3. The left coordinate in Figure 3 shows the relationship between the MSE testing error and the training iteration process. It can be found that during the training process, the MSE loss decreases and stabilizes at

9.58 \times 10^{- 7}

, which is in millivolts, indicating a highly accurate result. The learning rate is an important hyper-parameter in the training process. A large learning rate promotes the rapid reduction of learning errors, while a small learning rate contributes to the convergence of the model. In this work, the learning rate is set to gradually decrease from 2 × 10⁻⁵ to 1 × 10⁻⁸ using the cosine annealing algorithm (a half cycle is adopted) during the training process, as the right coordinate in Figure 3 shows.

Figure 4a plots the 2D surface potential along the device channel. At a low gate voltage, the surface potential in the middle of the channel is determined by gate−channel work function differences. The surface potential of the drain and source terminal are raised by the PN junction[37], which is induced by different doping types of channel and source/drain terminals, and are hardly affected by

V_{gs}

. When

V_{gs}

increases from 0 (V) to 1.4 (V), the surface potential in the channel increases due to the electrical field induced by gate voltage

V_{gs}

, while the potential at the drain/source terminal stays almost fixed. Thus, the minimum surface potential moves from the middle of the channel to the source terminal, which is a challenging feature that a traditional 1D Poisson equation [38] solution fails to capture. We found an excellent match between our model predictions and the TCAD simulation. Figure 4b plots the surface potential at the source and drain terminal versus the gate voltage, respectively. Excellent agreement is achieved between the TCAD simulation result and the PRNN result. When the gate voltage increases, the surface potential at the drain/source terminal increases first and then gradually saturates, which is consistent with previous reports [8].

Figure 5 plots the patterns of the minimum surface potential (

ϕ s_{\min}

), as well as the location of the minimum potential (x_min) along the channel; both w.r.t. drain voltage. It can be seen that the two patterns show an opposite trend; with the increase in drain terminal voltage, x_min moves to the source terminal and the

ϕ s_{\min}

is increased, which is called the drain-induced barrier lowing (DIBL) effect.

ϕ s_{\min}

decides the threshold voltage of the device according to Equation (8) [39].

Δ V_{t} = - η Δ ϕ s_{\min} = - {σ V}_{ds} .

(17)

Here,

η

is the ideality factor and is assumed to be independent of bias condition.

σ

is a physical parameter that reflects the influence of

V_{ds}

to threshold voltage. The DIBL effect causes an excess injection of the charge carrier into the channel and gives rise to an increased subthreshold current [40]. The overlaps of the gate depletion zone with the source/drain depletion zone share its depletion charge [41], and the shared charge is balanced by a counter charge distributed between the gate electrode and the source and drain contacts, which brings a shift in threshold voltage. With the introduction of the PRNN, analysis of the device against the DIBL effect could be conducted in a facile way, and thus facilitates the optimization of device performance.

Figure 6 shows the comparison of the developed surface potential based semi-classical compact and TCAD simulation results. Good agreement is achieved between the n-type transfer characteristic curve against different drain voltages (a), the output characteristic curve (b), the transfer characteristic curve against different operation temperatures [42], and (d) small signals gate capacitance

C_{gg}

. It shows that our model can well describe the device performance.

To further verify the applicability of the framework to modern microelectronic circuit design, the proposed compact model was transformed into Verilog-A and circuits simulation was conducted by including the MOSFET devices as new active components of the circuit simulator, as Figure 7 shows. A ring oscillator circuit with seven-stage inverters was connected in series to generate oscillation [43]. Figure 7a shows the Vout-vs-Vin curves of the inverter. When the size of the p-type MOSFET increases, the driver capability of the pull up increases, and thus, the Vout-vs-Vin curves shift to the right. Figure 7b shows the transient simulation results. When the size of the p-type MOSFET transistor decreases, the frequency of the oscillator decreases as well, and saturation appears at the lowest point of the oscillation. These simulation results clearly demonstrate the applicability and usefulness of our framework in circuit simulation applications.

Compared to the classical compact model method [29,44], the proposed framework avoids the numerical iteration process [14] in solving the surface potential expression, and thus, saves a great deal of effort in model design. Considering that most physical effects caused by device scaling directly act on surface potential, the proposed framework can better achieve underlying physic scaling compared to those works that directly train electrical properties of a device that is incompatible with model variation. Furthermore, in the literature [6], due to training data being obtained through the subtraction of device electrical properties and classical compact model output, the ANN acts as a correction term to existing models and contains scarcely underlying physical information. In contrast, the training data of the surface potential in this framework is obtained from TCAD simulators, which are based on equations of mathematical physics and reflect the physical relationship between device parameters and surface potential, thus containing more systematic physical information of the device.

4. Conclusions

We exploited the universal approximation power of artificial neural networks in learning from large amounts of simulation data to generate accurate, generalizable MOSFET compact models in a highly automated manner. Impressive results were reported in building the analytic surface potential of a 130 nm MOSFET, which proved to be of benefit in device optimization. Furthermore, our work reveals the great potential of modern artificial intelligence techniques in boosting microelectronic research. The accurate, generalizable, and automated compact model development not only reduces the gap between theory and computing, but it is also expected to bring new vigor to vast landscapes in design, simulation, and the optimization of very large-scale circuit systems.

Author Contributions

Conceptualization, writing—review and editing, funding acquisition, L.W.; Methodology, formal, software, validation, analysis, writing, investigation, writing—original draft preparation, S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by National key research and development program (2018YFA0208503), by the National Natural Science Foundation of China (Grant Nos. 62274178),and by the Opening Project of Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics, Chinese Academy of Sciences, and by the National Natural Science Foundation of China (Grant Nos. 61890944, 61725404,61874134, 61888102, 92264204, 61720106013, 61904195,and 62004214), by the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDB30030000, XDB30030300).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rios, R.; Mudanai, S.; Shih, W.-K.; Packan, P. An efficient surface potential solution algorithm for compact MOSFET models. In Proceedings of the IEDM Technical Digest. IEEE International Electron Devices Meeting, San Francisco, CA, USA, 13–15 December 2004; pp. 755–758. [Google Scholar]
Brews, J.R. A charge-sheet model of the MOSFET. Solid State Electron. 1978, 21, 345–355. [Google Scholar] [CrossRef]
Cham, K.M.; Oh, S.-Y.; Moll, J.L.; Lee, K.; Vande Voorde, P.; Chin, D. Numerical Simulation Systems. In Computer-Aided Design and VLSI Device Development; Springer: Berlin/Heidelberg, Germany, 1988; pp. 13–21. [Google Scholar]
Wang, J.; Kim, Y.H.; Ryu, J.; Jeong, C.; Choi, W.; Kim, D. Artificial Neural Network-Based Compact Modeling Methodology for Advanced Transistors. IEEE Trans. Electron Devices 2021, 68, 1318–1325. [Google Scholar] [CrossRef]
Habal, H.; Tsonev, D.; Schweikardt, M. Compact Models for Initial MOSFET Sizing Based on Higher-order Artificial Neural Networks. In Proceedings of the 2020 ACM/IEEE 2nd Workshop on Machine Learning for CAD (MLCAD), Virtual Event Iceland, 16–20 November 2020; pp. 111–116. [Google Scholar]
Kao, M.-Y.; Kam, H.; Hu, C. Deep-learning-assisted physics-driven MOSFET current-voltage modeling. IEEE Electron Device Lett. 2022, 43, 974–977. [Google Scholar] [CrossRef]
Abiodun, O.I.; Jantan, A.; Omolara, A.E.; Dada, K.V.; Mohamed, N.A.; Arshad, H. State-of-the-art in artificial neural network applications: A survey. Heliyon 2018, 4, e00938. [Google Scholar] [CrossRef] [PubMed]
Sharma, R.K.; Gupta, M.; Gupta, R.S. TCAD assessment of device design technologies for enhanced performance of nanoscale DG MOSFET. IEEE Trans. Electron Devices 2011, 58, 2936–2943. [Google Scholar] [CrossRef]
Gnudi, A.; Ventura, D.; Baccarani, G.; Odeh, F. Two-dimensional MOSFET simulation by means of a multidimensional spherical harmonics expansion of the Boltzmann transport equation. Solid-State Electron. 1993, 36, 575–581. [Google Scholar] [CrossRef]
Ortiz-Conde, A.; Sánchez, F.G.; Guzmán, M. Exact analytical solution of channel surface potential as an explicit function of gate voltage in undoped-body MOSFETs using the Lambert W function and a threshold voltage definition therefrom. Solid-State Electron. 2003, 47, 2067–2074. [Google Scholar] [CrossRef]
Neamen, D.A. Semiconductor Physics and Devices: Basic Principles; McGraw-Hill: New York, NY, USA, 2003. [Google Scholar]
Hamid, H.A.E.; Guitart, J.R.; Iniguez, B. Two-Dimensional Analytical Threshold Voltage and Subthreshold Swing Models of Undoped Symmetric Double-Gate MOSFETs. IEEE Trans. Electron Devices 2007, 54, 1402–1408. [Google Scholar] [CrossRef]
Minton, S.; Carbonell, J.G.; Knoblock, C.A.; Kuokka, D.R.; Etzioni, O.; Gil, Y. Explanation-based learning: A problem solving perspective. Artif. Intell. 1989, 40, 63–118. [Google Scholar]
Yu, B.; Lu, H.; Liu, M.; Taur, Y. Explicit Continuous Models for Double-Gate and Surrounding-Gate MOSFETs. IEEE Trans. Electron Devices 2007, 54, 2715–2722. [Google Scholar] [CrossRef]
Patro, S.; Sahu, K. Normalization: A preprocessing stage. arXiv 2015, arXiv:1503.06462. [Google Scholar] [CrossRef]
Zhou, Z.-H. Machine Learning; Springer Nature: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
Zhang, Z. Artificial Neural Network; Springer: Cham, Switzerland, 2018; pp. 1–35. [Google Scholar]
Jain, A.K.; Mao, J.; Mohiuddin, K.M. Artificial neural networks: A tutorial. Computer 1996, 29, 31–44. [Google Scholar] [CrossRef]
Christoffersen, P.; Jacobs, K. The importance of the loss function in option valuation. J. Financ. Econ. 2004, 72, 291–318. [Google Scholar] [CrossRef]
Wallach, D.; Goffinet, B. Mean squared error of prediction as a criterion for evaluating and comparing system models. Ecol. Model. 1989, 44, 299–306. [Google Scholar] [CrossRef]
Amari, S.-I. Backpropagation and stochastic gradient descent method. Neurocomputing 1993, 5, 185–196. [Google Scholar] [CrossRef]
McAndrew, C.C.; Coram, G.J.; Gullapalli, K.K.; Jones, J.R.; Nagel, L.W.; Roy, A.S.; Roychowdhury, J.; Scholten, A.J.; Smit, G.D.; Wang, X.J. Best practices for compact modeling in Verilog-A. IEEE J. Electron Devices Soc. 2015, 3, 383–396. [Google Scholar]
Christen, E.; Bakalar, K. VHDL-AMS-a hardware description language for analog and mixed-signal applications. IEEE Trans. Circuits Syst. II Analog Digit. Signal Process. 1999, 46, 1263–1272. [Google Scholar] [CrossRef]
Van Rossum, G. Python Reference Manual; CWI: Nampa, ID, USA, 1995. [Google Scholar]
Han, S.M.; Aydil, E.S. Reasons for lower dielectric constant of fluorinated SiO₂ films. J. Appl. Phys. 1998, 83, 2172–2178. [Google Scholar] [CrossRef]
Bellamy, R.K.E. What does pseudo-code do? A psychological analysis of the use of pseudo-code by experienced programmers. Hum.–Comput. Interact. 1994, 9, 225–246. [Google Scholar] [CrossRef]
Bucher, M.; Lallement, C.; Enz, C.; Krummenacher, F. Accurate MOS modelling for analog circuit simulation using the EKV model. In Proceedings of the 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World, ISCAS 96, Atlanta, GA, USA, 15 May 1996; Volume 704, pp. 703–706. [Google Scholar]
Lee, C.S.; Pop, E.; Franklin, A.D.; Haensch, W.; Wong, H.S.P. A Compact Virtual-Source Model for Carbon Nanotube FETs in the Sub-10-nm Regime—Part I: Intrinsic Elements. IEEE Trans. Electron Devices 2015, 62, 3061–3069. [Google Scholar] [CrossRef]
Gildenblat, G.; Li, X.; Wu, W.; Wang, H.; Jha, A.; Langevelde, R.V.; Smit, G.D.J.; Scholten, A.J.; Klaassen, D.B.M. PSP: An Advanced Surface-Potential-Based MOSFET Model for Circuit Simulation. IEEE Trans. Electron Devices 2006, 53, 1979–1993. [Google Scholar] [CrossRef]
Pei, G.; Ni, W.; Kammula, A.V.; Minch, B.A.; Kan, E.-C. A physical compact model of DG MOSFET for mixed-signal circuit applications-part I: Model description. IEEE Trans. Electron Devices 2003, 50, 2135–2143. [Google Scholar]
Ytterdal, T.; Cheng, Y.; Fjeldly, T.A. MOSFET device physics and operation. In Device Modeling for Analog and RF CMOS Circuit Design; John Wiley and Sons: Hoboken, NJ, USA, 2003; pp. 1–45. [Google Scholar]
Esseni, D.; Abramo, A. Modeling of electron mobility degradation by remote Coulomb scattering in ultrathin oxide MOSFETs. IEEE Trans. Electron Devices 2003, 50, 1665–1674. [Google Scholar] [CrossRef]
Ghibaudo, G.; Aouad, M.; Cassé, M.; Martinie, S.; Poiroux, T.; Balestra, F. On the modelling of temperature dependence of subthreshold swing in MOSFETs down to cryogenic temperature. Solid-State Electron. 2020, 170, 107820. [Google Scholar] [CrossRef]
Zong, Z.; Li, L.; Jang, J.; Lu, N.; Liu, M.J. Analytical surface-potential compact model for amorphous-IGZO thin-film transistors. J. Appl. Phys. 2015, 117, 215705. [Google Scholar] [CrossRef]
Park, H.-J.; Ko, P.K.; Hu, C. A charge sheet capacitance model of short channel MOSFETs for SPICE. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst. 1991, 10, 376–389. [Google Scholar] [CrossRef]
Elmasry, M.I. Capacitance calculations in MOSFET VLSI. IEEE Electron Device Lett. 1982, 3, 6–7. [Google Scholar] [CrossRef]
Elamaran, D.; Suzuki, Y.; Satoh, H.; Banerjee, A.; Hiromoto, N.; Inokawa, H.J.M. Performance comparison of SOI-based temperature sensors for room-temperature terahertz antenna-coupled bolometers: MOSFET, PN junction diode and resistor. Micromachines 2020, 11, 718. [Google Scholar] [CrossRef]
Guo, J.; Zhao, Y.; Yang, G.; Chuai, X.; Lu, W.; Liu, D.; Chen, Q.; Duan, X.; Huang, S.; Su, Y. A new surface potential based compact model for independent dual gate a-IGZO TFT: Experimental verification and circuit demonstration. In Proceedings of the 2020 IEEE International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 12–18 December 2020; pp. 22.26.21–22.26.24. [Google Scholar]
Fjeldly, T.A.; Shur, M. Threshold voltage modeling and the subthreshold regime of operation of short-channel MOSFETs. IEEE Trans. Electron Devices 1993, 40, 137–145. [Google Scholar] [CrossRef]
Chamberlain, S.G.; Ramanan, S. Drain-induced barrier-lowering analysis in VSLI MOSFET devices using two-dimensional numerical simulations. IEEE Trans. Electron Devices 1986, 33, 1745–1753. [Google Scholar] [CrossRef]
Dargar, A.; Srivastava, V.M. Thickness modeling of short-channel cylindrical surrounding double-gate MOSFET at strong inversion using depletion depth analysis. Micro Nanosyst. 2021, 13, 319–325. [Google Scholar] [CrossRef]
Osman, A.A.; Osman, M.A. Investigation of high temperature effects on MOSFET transconductance (g/sub m/). In Proceedings of the 1998 the 4th International High Temperature Electronics Conference. HITEC (Cat. No. 98EX145), Albuquerque, NM, USA, 14–18 June 1998; pp. 301–304. [Google Scholar]
Srivastava, N.A.; Priya, A.; Mishra, R.A. Design and analysis of nano-scaled SOI MOSFET-based ring oscillator circuit for high density ICs. Appl. Phys. A 2019, 125, 533. [Google Scholar] [CrossRef]
Duarte, J.P.; Khandelwal, S.; Medury, A.; Hu, C.; Kushwaha, P.; Agarwal, H.; Dasgupta, A.; Chauhan, Y.S. BSIM-CMG: Standard FinFET compact model for advanced circuit design. In Proceedings of the ESSCIRC Conference 2015—41st European Solid-State Circuits Conference (ESSCIRC), Graz, Austria, 14–18 September 2015; pp. 196–201. [Google Scholar]

Figure 1. Schematic of the PRNN Framework. Stimulation data from TCAD is firstly transformed at the physical-relationship layer to contain more fundamental relations of physics. Then, the pretreated data is trained by fully-connected artificial neural networks.

Figure 2. Schematic of the 130 nm MOSFET device with a polysilicon gate stimulated in TCAD. The surface potential is defined as the difference between the substrate electrostatic potential and the electrostatic potential (one nanometer below the channel surface).

Figure 3. Mean Square Error (MSE) and learning rate with training iterations.

Figure 4. An excellent match between our PRNN model (lines) and the TCAD numerical simulation (symbols) for (a) the surface potential

ϕ_{s}

along the device channel against different gate voltages and (b) the surface potential against the gate voltage at the source/drain terminal.

Figure 4. An excellent match between our PRNN model (lines) and the TCAD numerical simulation (symbols) for (a) the surface potential

ϕ_{s}

along the device channel against different gate voltages and (b) the surface potential against the gate voltage at the source/drain terminal.

Figure 5. (a) The lowest potential (

ϕ s_{\min}

) w.r.t. drain voltage changes and (b) the lowest surface potential location (

x_{\min}

) andThe DIBL effect is observed for the shift of

x_{\min}

and

ϕ s_{\min}

with

V_{ds}

.

Figure 5. (a) The lowest potential (

ϕ s_{\min}

) w.r.t. drain voltage changes and (b) the lowest surface potential location (

x_{\min}

) andThe DIBL effect is observed for the shift of

x_{\min}

and

ϕ s_{\min}

with

V_{ds}

.

Figure 6. A good match between our MOSFET compact model (line) and the TCAD solution data (symbol) for (a) output characteristics, (b) transfer (both in conventional and logarithmic coordinates) characteristics, (c) transfer characteristics with different temperature, and (d) gate capacitance to gate voltage.

Figure 7. (a) The circuit diagram of a ring oscillator composed of seven-stage inverters connected in series; the inverter structure is shown at the bottom. The relation between V-out and V-in for the inverter: with the size of the p-type MOSFET pull up transistor increasing, the pull up driver capability increases as well, causing the curves to shift to the right. (b) Transient simulation results of the ring oscillator: with the size of the p-type MOSFET transistor decreasing, the frequency of the oscillator is reduced and saturation appears at the lowest point of oscillation.

Table 1. Features of stimulated device data

x_{i}

.

Table 1. Features of stimulated device data

x_{i}

.

Term	Min Value	Max Value	Std
$V_{gs}$ (V)	−1.4	1.4	0.81
$V_{ds}$ (V)	0	1.4	0.418
T (K)	275	300	14.14
x (nm)	0	70	48.59
L (nm)	50	70	7.07
$N_{d} ({cm}^{2}$ )	1 × 10¹⁷	1 × 10¹⁹	3.7 × 10¹⁸
$ϕ_{s}$ (eV)	−0.08438	2.405	0.49443

Table 2. The pseudo code for transforming the ANN to Verilog-A language.

(I) Parameter Inputfor

w_{i}

for

w_{i}

in framework
with file.open(

w_{i}

_value.txt) as f1:

for line in f1.readlines():

line_list = line.split()
for j in range(len(line_list)):

a = “parameter real

w_{i}

” + str(i) + ’ = ‘+str(line_list[j]) + ’;’

f.write(a)
i = i + 1
i = 1

(II) Forward Propagation Realization for

h_{2}

for i in range(1,len(hidden layer 2)):

a = ‘

h_{2}

’ + str(i) + ’ = ‘ + ’1/(1 + exp(-(‘

for j in range(1,len(hidden layer 1)):

a = a + ’

w_{2}

’ + str(i) + str(j) + ’*

h_{1}

’ + str(j) + ’ + ’

a = a + ’

b_{2}

’ + str(i) + ’;’
f.write(a)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Huang, S.; Wang, L. MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach. Micromachines 2023, 14, 386. https://doi.org/10.3390/mi14020386

AMA Style

Huang S, Wang L. MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach. Micromachines. 2023; 14(2):386. https://doi.org/10.3390/mi14020386

Chicago/Turabian Style

Huang, Shijie, and Lingfei Wang. 2023. "MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach" Micromachines 14, no. 2: 386. https://doi.org/10.3390/mi14020386

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

MOSFET Physics-Based Compact Model Mass-Produced: An Artificial Neural Network Approach

Abstract

1. Introduction

2. Method Framework

2.1. TCAD Simulation

2.2. Physical-Relationship (PR) Layer

2.3. Fully-Connected (FC) Layer

2.4. Surface Potential Written in Verilog-A

2.5. Establishing the Compact Model

3. Method Validation and Discussion

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI