<<

Investigation of Alternative Power Architectures for CPU Regulators

Julu Sun

Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of

Doctor of Philosophy in Electrical Engineering

Fred C. Lee, Chairman Dusan Boroyevich Ming Xu William T. Baumann Carlos T.A. Suchicital Elaine P. Scott

November 22nd, 2008 Blacksburg, Virginia

Keywords: two-stage, , high power density, high efficiency

Investigation of Alternative Power Architectures for CPU Voltage Regulators

Abstract

Since future will have higher current in accordance with Moore’s law, there are still challenges for voltage regulators (VRs). Firstly, high efficiency is required not only for easy thermal management, but also for saving on electricity costs for data centers, or battery life extension for . At the same time, high power density is required due to the increased power of the microprocessors. This is especially true for data centers, since more microprocessors are required within a given space (per rack). High power density is also required for laptop computers to reduce the size and the weight.

To improve power density, a high frequency is required to shrink the size of the output and output of the multi-phase buck VR. It has been demonstrated that the output bulk capacitors can be eliminated by raising the VR control bandwidth to around 350kHz. Assuming the bandwidth is one-third of the switching frequency, a VR should run at 1MHz to ensure a small size. However, the efficiency of a 12V VR is very poor at 1MHz due to high switching losses. As a result, a 12V VR can only run at 300kHz to 600kHz, and the power density is very low.

To attain high efficiency and high power density at the same time, two-stage power architecture was proposed. The concept is “Divide and Conquer”. A single-stage VR is split into two stages to get better performance. The second stage has about 5V-6V input voltage; thus the duty cycle can be extended and the switching losses are greatly reduced compared with a single- stage VR. Moreover, a sub-20V MOSFET can be used to further improve the efficiency at high frequencies.

The first stage of the proposed two-stage architecture is converting 12V to 5-6V. High efficiency is required for the first stage since it is in series with the second stage. Previous first stage which is a has good efficiency but bulky size due to low frequency operation. Another problem with using a buck converter is that light-load efficiency of the first stage is poor. To solve these problems, switched- voltage dividers are proposed. Since

the first stage does not require , the sweet point for the can be determined and high efficiency can be achieved. At the same time, since there are no magnetic components for the switched-capacitor voltage divider, high power density can be achieved. By very careful design, a power density of more than 2000W/in3 with more than 97% efficiency can be achieved for the proposed voltage divider. The light-load efficiency of the voltage divider can be as high as 99% by reducing the switching frequency at light load.

As for the second stage, different low-voltage devices are evaluated, and the best device combinations are found for high-frequency operation. It has been demonstrated that 91% efficiency can be achieved with 600kHz frequency, and 89% efficiency can be achieved with a 1MHz frequency for the second stage. Moreover, adaptive on-time control method and a non- linear structure are proposed to improve CCM and DCM efficiency for the second stage respectively.

Previously the two-stage VR was only used as a CPU VR. The two-stage concept can also be applied to other systems. In this dissertation, the two-stage power architecture is applied to two different applications: laptop computers and high-end server microprocessors. The common characteristics of the two applications are their thermal design power (TDP) requirement. Thus the first stage can be designed with much lower power than the maximum system power. It has been demonstrated that the two-stage power architecture can achieve either higher efficiency or higher power density and a lower cost when compared with the single-stage VR.

To get higher efficiency, a parallel two-stage power architecture, named sigma architecture,

is proposed for VR applications. The proposed sigma VR takes advantage of the high-efficiency, fast-transient unregulated converter (DCX) and relies on this converter to deliver most of the output power, while using a low-power buck converter to achieve voltage regulation. Both the DCX converter and the buck converter can achieve around 90% efficiency when used in the sigma VR, which ensures 90% efficiency for the sigma VR. The small-signal model of the sigma VR is studied to achieve adaptive voltage positioning (AVP). The sigma power architecture can also be applied to low-power point of load (POL) applications to reduce the magnetic component size and improve the efficiency. Finally, the two-stage VR and the sigma VR are briefly compared.

iii

To my family:

My parents: Songtai Sun and Xiuhua Zhao My wife: Juan Liu My daughters: Stella Sun and Isabella Liu

iv

ACKNOWLEDGEMENTS

With sincere appreciation in my heart, I would like to thank my advisor, Dr. Fred C. Lee for his guidance, encouragement, and support. There are many things I learned from him, and one of the most important one is to “think hard before doing something” since I used to do things without careful thought. I believe everything I’ve learned from Dr. Lee will benefit my future career.

I am also grateful to the other five members of my advisory committee, Dr. Dushan Boroyevich, Dr. Ming Xu, Dr. William T. Baumann, Dr. Carlos T.A. Suchicital, and Dr. Elaine P. Scott, for their support, suggestions and encouragement.

I am especially indebted to my colleagues in the PMC group (including VRM and power architecture sub-groups). I would like to specially thank Dr. Ming Xu, who is the VRM group leader and also my best friend at CPES. He inspired me and helped me a lot with my research, and I learned a tremendous amount from him. I would also like to thank two other friends: Dr. Jinghai Zhou and Dr. Yuancheng Ren. They encouraged me during my first year of study at CPES. Thanks also to all the other team members: Dr. Kaiwei Yao, Dr. Jia Wei, Dr. Yang Qiu, Dr. Juanjuan Sun, Mr. Yu Meng, Dr. Shuo Wang, Dr. Kisun Lee, Dr. Bing Lu, Mr. Chuanyun Wang, Mr. Dianbo Fu, Ms. Yan Jiang, Mr. Doug Sterk, Mr. David Reusch, Dr. Xu Yang, Dr. Yan Xing, Dr. Yugang Yang, Dr. Ke Jin, Mr. Andrew Schmit, Mr. Authur Ball, Dr. Ching-Shan Leu, Mr. Yan Dong, Mr. Jian Li, Mr. Bin Huang, Mr. Ya Liu, Mr. Yi Sun, Mr. Pengju Kong, Mr. Yucheng Ying, Mr. Qiang Li, Mr. Pengjie Lai, Mr. Zijian Wang, Mr. Daocheng Huang, Mr. Qian Li and Mr. Zheng Luo. It was a pleasure to work with such a talented and creative group. I would also like to thank all of the other students I have met in CPES during my five-year study, especially to Dr. Lingyin Zhao, Dr. Wenduo Liu, Dr. Qian Liu, Ms. Yan Liang, Ms. Jing Xu, Ms. Michele Lim, Mr. Rixing Lai, Mr. Honggang Sheng, Mr. Di Zhang and Mr. Puqi Ning. It has been a great pleasure, and I’ve had fun with them.

I would also like to thank the wonderful members of the CPES staff who were always willing to help me out, Ms. Teresa Shaw, Ms. Linda Gallagher, Ms. Teresa Rose, Ms. Ann Craig, Ms. Marianne Hawthorne, Ms. Elizabeth Tranter, Ms. Michelle Czamanske, Ms. Linda Long, Mr. Steve Chen, Mr. Robert Martin, Mr. Jamie Evans, Mr. Dan Huff, and Mr. David Fuller. Specially, I would like to thank Ms. Keara Axelrod for her editing of this dissertation.

v

With much love, I would like to thank my parents, my wife and my twin daughters. My parents pray for me every day as I am staying in a different country. My wife encourages me every time I feel frustrated. My twin daughters have brought me a lot of fun during their first two years.

This work was supported primarily by Analog Devices, C&D Technologies, CRANE, Delta Electronics, HIPRO Electronics, Infineon, International , Intersil, FSP-Group, Linear Technology, LiteOn Tech, Primarion, NXP, Renesas, , Richtek, Texas Instruments. Also, this work made use of ERC shared facilities supported by the National Science Foundation under Award Number EEC-9731677.

vi Table of Contents

Table of Contents

Chapter 1. Introduction ...... 1

1.1. Evolution of Microprocessors ...... 1

1.2. High Efficiency and High Power Density Requirement for Voltage Regulators ...... 3

1.3. Limitations of Single-Stage Multi-phase Buck Voltage Regulators for High Efficiency and High Power Density ...... 16

1.4. Two-Stage Voltage Regulator Architectures – “Divide and Conquer” Concept ...... 20

1.5. Dissertation Outline ...... 22

Chapter 2. High Power Density Non-Isolated Bus Converters ...... 24

2.1. Background ...... 24

2.2. Review of Previous 12V Two-Stage VR Research ...... 26

2.2.1. Two-Stage VR for Desktop Computers ...... 26

2.2.2. Two-Stage VR for Laptop Computers ...... 28

2.3. Possible Improvement for Two-Stage VR ...... 30

2.4. Proposed Switched-capacitor Voltage Divider as the First Stage ...... 31

2.4.1. Basic Switched-Capacitor Circuits ...... 31

2.4.2. Proposed Switched-Capacitor Voltage Divider ...... 34

2.4.3. Challenges for the proposed voltage divider ...... 37

2.4.4. Minimizing Power Losses by Circuit Analysis and Design ...... 38

2.4.5. Design of C1, C2 and C3 ...... 42

2.4.6. Light-Load Efficiency and Gate-Driving Method ...... 45

2.4.7. Prototype ...... 47

2.4.8. Effect of Parasitic Inductors for High Frequency Operation ...... 50

2.5. Transient Analysis of the Proposed Voltage Divider ...... 58

vii Table of Contents

2.5.1. Output Impedance ...... 58

2.5.2. Stability of Two-Stage VR with Voltage Divider as the First Stage ...... 60

2.5.3. Interaction between VRs ...... 61

2.6. Startup and Protection of the Proposed Voltage Divider ...... 64

2.6.1. Proposed Soft-Start Method for Switched-Capacitor Voltage Divider ...... 64

2.6.2. Short Circuit and Over Current Protection ...... 67

2.7. Other Related Topologies ...... 69

2.7.1. Controllability: Achieving Output Voltage Regulation ...... 69

2.7.2. ZVS Voltage Divider ...... 72

Chapter 3. Two-Stage Architecture with Improved Second-Stage

Efficiency 74

3.1. Low Voltage Rating Devices for Second Stage ...... 74

3.1.1. Introduction of New Device FOM Proposed by CPES ...... 74

3.1.2. Efficiency Improvement by Great Wall LDMOS for Top ...... 79

3.1.3. Efficiency Improvement by Ciclon Lateral-Trench MOSFET ...... 80

3.1.4. Applying Ciclon 12V Lateral-MOSFET to Top Switch using an Improved Loss Model ...... 82

3.2. Proposed System Level Two-Stage VR for Laptop and Server Computers ...... 87

3.2.1. New Situations for Two-stage VR ...... 87

3.2.2. Proposed System Level Two-Stage Power Architecture for Laptop Computers ...... 87

3.2.3. Evaluation of Two-Stage VR for Laptop ...... 93

3.2.4. System Level Two-Stage Power Architecture for High-End Servers ...... 97

3.2.5. Evaluation of Two-Stage Architecture for High-End Servers ...... 99

3.3. Light Load Efficiency Improvement ...... 101

viii Table of Contents

3.3.1. Introduction ...... 101

3.3.2. Proposed Adaptive On-time Control to Improve Buck CCM Efficiency ...... 104

3.3.3. Improve DCM Efficiency with Non-linear Inductor ...... 114

Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture –

Sigma Voltage Regulators ...... 125

4.1. Introduction ...... 125

4.2. Proposed Sigma VR Structure ...... 126

4.3. Design Sigma VR with Scalability ...... 129

4.3.1. Buck Converter Design in Sigma VR ...... 129

4.3.2. DCX Design in Sigma VR ...... 129

4.3.3. Sigma Cell Concept for Scalability ...... 134

4.4. Modeling and Transient Analysis of Sigma VR ...... 138

4.5. Startup and Light Load Efficiency Improvement of Sigma VR...... 146

4.6. Comparison between Sigma VR and Two-Stage VR ...... 152

4.7. Three Other Implementations of Sigma VR ...... 153

4.8. Other Applications for Sigma Architecture ...... 154

4.8.1. Sigma Architecture for Low Current POL Converters ...... 154

4.8.2. Sigma Architecture for Server and Telecom Applications ...... 157

Chapter 5. Conclusions and Future Work ...... 159

5.1. Improvements of Series Two-Stage Power Architecture ...... 159

5.2. Analysis and Design on Proposed Sigma Power Architecture...... 160

5.3. Future Work ...... 161

References ...... 163

ix

List of Figures

Figure 1.1 The number of integrated on the die for microprocessors ...... 1

Figure 1.2 The speed of Intel microprocessors [2] ...... 2

Figure 1.3 Intel microprocessors’ power road map [5] ...... 3

Figure 1.4 A four-phase interleaved voltage regulator ...... 3

Figure 1.5 Load-line specifications from Intel VRD 11.0 ...... 4

Figure 1.6 The relationship between the load-line specifications and time domain waveforms .... 5

Figure 1.7 A typical power delivery path for today’s processors ...... 5

Figure 1.8 A lumped circuit model of the power delivery path of today’s ...... 6

Figure 1.9 Constant output impedance looking from the CPU die ...... 7

Figure 1.10 Impedance requirement from the CPU socket (VR sensing point) ...... 8

Figure 1.11 Bulk capacitance and cost vs. VR bandwidth ...... 9

Figure 1.12 Data center installed base and spending [11] ...... 10

Figure 1.13 Power flow example in one server system [13] ...... 10

Figure 1.14 Light-load efficiency target for laptop voltage regulators ...... 11

Figure 1.15 Rack drawing more power [11] ...... 12

Figure 1.16 One example of a server with VR highlighted ...... 13

Figure 1.17 Power delivery from 12V VR to high-end server microprocessors ...... 13

Figure 1.18 CPES proposed solution to minimize the interconnection effect ...... 14

Figure 1.19 ZVRM proposed by Intel [16] ...... 15

Figure 1.20 5V input CPU voltage regulators ...... 16

Figure 1.21 12V input VR ...... 16

Figure 1.22 Effciency of 12V and 5V VRs (fs = 1MHz, top switch: Si7112, bottom switch: two Si7112) ...... 17

x

Figure 1.23 State-of-the-art VR efficiency at 300kHz, 600kHz and 1MHz ...... 17

Figure 1.24 Loss breakdown of a six-phase buck converter running at 300kHz, 600kHz and 1MHz (120A) ...... 18

Figure 1.25 FOM vs. breakdown voltage ...... 19

Figure 1.26 Efficiency of 5V input VR with low voltage rating devices (fs = 1MHz, GWS15N15 + IRF6691) ...... 19

Figure 1.27 Two-stage VR Solution ...... 20

Figure 1.28 Two-stage VR with buck as the first stage [22] ...... 21

Figure 1.29 Another parallel type of two-stage voltage regulators – sigma VR ...... 22

Figure 2.1 Two-stage architecture for the isolated application [26] ...... 24

Figure 2.2 Factorized power architecture by Vicor [27] ...... 25

Figure 2.3 Efficiency of FPA ...... 25

Figure 2.4 Another two-stage architecture with isolation ...... 26

Figure 2.5 350kHz bandwidth can eliminate output capacitors [22] ...... 26

Figure 2.6 Two-stage VR with buck as the first stage [22] ...... 27

Figure 2.7 Overall efficiency comparison between single-stage and two-stage VRs [22] ...... 27

Figure 2.8 Cost comparison between single stage and two-stage VRs [22] ...... 28

Figure 2.9 The two-stage architecture in laptop application with wide input range [23] ...... 28

Figure 2.10 The CPU power consumption and the bus voltage following ABVP control strategy [23] ...... 29

Figure 2.11 The transient waveforms of output current, output voltage and bus voltage [22] ..... 29

Figure 2.12 The efficiency improvement by ABVP control strategy [23] ...... 29

Figure 2.13 First stage in two-Stage VR ...... 30

Figure 2.14 The experimental efficiency of the 1st stage...... 31

xi

Figure 2.15 Overall efficiency comparison between single stage and two-stage @ 1MHz [22] . 31

Figure 2.16 A typical converter for low power application ...... 32

Figure 2.17 Another switched-capacitor converter with current sink ...... 33

Figure 2.18 Equivalent circuit for two capacitors in parallel ...... 34

Figure 2.19 Proposed switched-capacitor voltage divider ...... 35

Figure 2.20 Switched-capacitor converters for high power application [31] ...... 36

Figure 2.21 Prototype of the four-level switched-capacitor DC/DC converter and tested efficiency [31] ...... 37

Figure 2.22 Switching performance of switched-capacitor voltage divider ...... 38

Figure 2.23 The equivalent circuit of the voltage divider ...... 39

Figure 2.24 Peak current on MOSFET vs. C2 (R = 6mΩ, fs = 375kHz, Io = 12A) ...... 43

Figure 2.25 C2 RMS current vs. kN and τN ...... 43

Figure 2.26 C2 RMS current vs. kN for different C2 (each curve corresponds to one C2 value) . 44

Figure 2.27 Output and input AC current vs. C1 and C3 ...... 45

Figure 2.28 Light-load efficiency improvement for proposed VD ...... 47

Figure 2.29 Proposed gate driving for VD ...... 47

Figure 2.30 Prototype and efficiency of 70W voltage divider for laptop ...... 48

Figure 2.31 Prototype and efficiency of 150W voltage divider for server ...... 49

Figure 2.32 The influence from the parasitic inductors ...... 51

Figure 2.33 Loss breakdown of voltage divider (Vin = 12V, Po = 70W, Lp = 3nH) ...... 51

Figure 2.34 The equivalent circuit of the voltage divider with parasitic inductance ...... 52

Figure 2.35 Inductor current with different C2 for half of the switching period ...... 54

Figure 2.36 Total power loss vs. C2 for a given switching frequency ...... 55

Figure 2.37 Total power loss vs. C2 for different switching frequencies ...... 55

xii

Figure 2.38 Optimal C2 and minimum power loss vs. frequency ( Lp = 3nH, R = 6mΩ, Vin =

12V, Io = 12A, driving loss included) ...... 56

Figure 2.39 Power loss vs. C2 for different parasitic ...... 57

Figure 2.40 650kHz, 70W voltage divider prototype and tested efficiency curve ...... 57

Figure 2.41 Equivalent circuit to derive output impedance ...... 59

Figure 2.42 Output impedance of the voltage divider (R=10mohm, Ro=1mohm, Co=88µF) ...... 60

Figure 2.43 An example of Zo1 and Zin2 ...... 60

Figure 2.44 Experiment setup and test results ...... 61

Figure 2.45 Two VRs share the same voltage divider ...... 61

Figure 2.46 No interaction between different VRs due to low output impedance of voltage divider ...... 63

Figure 2.47 Comparison of transfer function from io1 to vo2 for voltage divider first stage and buck first stage ...... 64

Figure 2.48 Startup waveforms without soft-start (Vin=3V) ...... 64

Figure 2.49 Device startup trajectory with Vgs = 5V (Device: HAT2165) ...... 65

Figure 2.50 Proposed soft-start method for voltage divider ...... 66

Figure 2.51 Simulation result of the proposed soft-start method ...... 66

Figure 2.52 Device junction temperature at startup ...... 67

Figure 2.53 Experimental result at startup with the proposed method (Vin=12V) ...... 67

Figure 2.54 Test result of Vin/2-Vo vs. Io ( Vin = 12V ) ...... 68

Figure 2.55 Simulation result at short circuit and over current ...... 68

Figure 2.56 Protection of voltage divider based on the voltage difference between Vin/2 and Vo 69

Figure 2.57 Three-level resonant buck (fs = fo) ...... 70

Figure 2.58 Efficiency comparison between 3-level resonant buck and voltage divider ( C2 =

50µF, Lo = 25nH, fs = 160kHz for 3-level resonant Buck ...... 71

xiii

Figure 2.59 Two control methods to regulate output voltage ...... 71

Figure 2.60 Low power density 48V bus converters ...... 72

Figure 2.61 Proposed high power density 48V-12V bus converter ...... 72

Figure 2.62 Proposed ZVS voltage divider ...... 73

Figure 2.63 Simulated waveforms for ZVS voltage divider ...... 73

Figure 3.1 Three basic structures for low-voltage application ...... 74

Figure 3.2 One example of lateral-trench MOSFET Structure ...... 75

Figure 3.3 The definition of Qgd, Qgs2 and Qg ...... 75

Figure 3.4 Figure of merit vs. breakdown voltage for different MOSFET structures ( All devices are for VRM applications) ...... 76

Figure 3.5 New figure of merit vs. breakdown voltage for different MOSFET structures (All devices are for VRM applications) ...... 77

Figure 3.6 Test efficiency with Si7106 and RJK0303 as top switch ...... 78

Figure 3.7 Efficiency improvement from Great Wall 15V devices ...... 79

Figure 3.8 Compare Ciclon devices with the best bottom switch ...... 80

Figure 3.9 Comparison of Ciclon devices with the best top switch ...... 81

Figure 3.10 A new device loss model which includes parasitic inductance and non-linear capacitors ...... 82

Figure 3.11 Comparison between loss model and physical model ...... 83

Figure 3.12 Piece-wise linear model for calculation of the turn-off loss in [51] ...... 84

Figure 3.13 A new piece-wise linear model for calculation of turn-off loss ...... 84

Figure 3.14 Breakdown of the improved model shows more accuracy ...... 85

Figure 3.15 Sweep die size of 12V device technology for top switch ...... 86

Figure 3.16 Final efficiency achievement ...... 86

Figure 3.17 Cost difference between 2004 and 2008 for desktop VR ...... 87

xiv

Figure 3.18 Intel’s Narrow VDC power architecture ...... 88

Figure 3.19 One possible two-stage solution for laptop VRs ...... 88

Figure 3.20 Proposed system level two-stage power architecture for laptop ...... 89

Figure 3.21 Two-stage system for Narrow VDC power architecture ...... 90

Figure 3.22 Inductance determines the transient performance for non-linear control ...... 91

Figure 3.23 Number of bulk capacitors vs. inductance for CPU VR ...... 92

Figure 3.24 Minimum switching frequency to meet the output ripple requirement ( Vin = 6V, Vo

= 1.2V, three phases, Io = 0.1A, constant on-time control) ...... 93

Figure 3.25 Evaluation board for two-stage solution ...... 93

Figure 3.26 Two different two-stage designs ...... 94

Figure 3.27 CPU VR Efficiency comparison between single-stage and two-stage Design #1 .... 95

Figure 3.28 CPU VR Efficiency comparison between single-stage and two-stage Design #2 .... 95

Figure 3.29 Transient response of CPU VR ...... 96

Figure 3.30 Footprint comparison for CPU VR ...... 96

Figure 3.31 DDR VR efficiency comparison ...... 97

Figure 3.32 ZVRM proposed by Intel...... 97

Figure 3.33 Proposed two-stage power architecture for future server microprocessors ...... 98

Figure 3.34 1MHz two-stage VR prototype and efficiency (No bulk capacitors) ...... 100

Figure 3.35 Control bandwidth and transient waveforms ...... 100

Figure 3.36 A typical power consumption breakdown in laptop ...... 101

Figure 3.37 Inductor current waveforms for constant on-time control with different loads .... 102

Figure 3.38 Improved light load efficiency with constant on-time control ...... 103

Figure 3.39 Synchronous buck converter ...... 104

Figure 3.40 Output voltage ripple vs. on-time ...... 104

xv

Figure 3.41 Maximum on-time vs. load current to meet inductor requirement ...... 106

Figure 3.42 Maximum on-time vs. load current after considering both ripple and inductor requirement ...... 106

Figure 3.43 Power losses vs. Ton at CCM for one-phase of a three-phase buck ...... 108

Figure 3.44 Tested power loss vs. on-time with different load currents ...... 110

Figure 3.45 Proposed adaptive on-time control to improve CCM efficiency- Case 1 ...... 111

Figure 3.46 Proposed adaptive on-time control to improve CCM efficiency – Case 2 ...... 112

Figure 3.47 Transient performance comparison between constant on-time control and adaptive on-time control ...... 113

Figure 3.48 Loss breakdown at DCM Vin = 12V,Vo = 0.9V, Io = 2A, Ton = 0.5µs, fs = 200kHz 114

Figure 3.49 Inductor current with increased on-time ( Dashed line is with larger on-time ) ..... 114

Figure 3.50 Output current ripple increases under DCM operation, which increase the voltage ripple...... 115

Figure 3.51 Conflict between efficiency and voltage ripple at DCM ...... 116

Figure 3.52 Switching frequency is reduced by increasing output inductance at during DCM . 116

Figure 3.53 Preferred output inductance to improve light-load efficiency ...... 117

Figure 3.54 Proposed saturable core to achieve desired inductance ...... 118

Figure 3.55 Increased on-time for DCM operation with the proposed non-linear inductor ...... 119

Figure 3.56 Calculated switching frequency with proposed control with non-linear inductor .. 120

Figure 3.57 Core loss estimation (Material: 3F3) ...... 120

Figure 3.58 Large inductance can be seen at CCM if Isat is large ...... 121

Figure 3.59 Trade-off design for LL ...... 122

Figure 3.60 Experimental results with the non-linear inductor (Vin = 12V, Vo = 1.05V) ...... 124

Figure 3.61 2nd stage of two-stage VR efficiency with non-linear inductor ...... 124

Figure 4.1 Parallel power architecture proposed by MIT ...... 126

xvi

Figure 4.2 Proposed sigma VR architecture ...... 127

Figure 4.3 Voltage regulation of sigma VR ...... 128

Figure 4.4 Buck converter in sigma VR ...... 129

Figure 4.5 DCX example 1: ZVS quasi-resonant converter proposed by CPES ...... 130

Figure 4.6 DCX example 2: LLC resonant converter working at fixed frequency ...... 131

Figure 4.7 is required to reduce output voltage ripple for ZVS-QR DCX ...... 132

Figure 4.8 Efficiency comparison between ZVS-QR DCX and LLC DCX (fs = 600kHz) ...... 133

Figure 4.9 Loss break down of ZVS-QR DCX and LLC DCX @ 80A ...... 133

Figure 4.10 Comparing 60A sigma cell with 3-phase buck ...... 134

Figure 4.11 Scalable VR with sigma Cell ...... 135

Figure 4.12 Current of DCX can be shared by controlling buck duty cycle ...... 135

Figure 4.13 One design example: 120A VR for VR11 ...... 136

Figure 4.14 Tested efficiency for sigma VR ...... 137

Figure 4.15 30A sigma VR cell ...... 137

Figure 4.16 30A sigma VR cell design ...... 138

Figure 4.17 Small-signal model for the power stage ...... 139

Figure 4.18 Current injection control to achieve AVP from Intel ...... 140

Figure 4.19 Control diagram and desired impedance by sensing buck current only ...... 141

Figure 4.20 Small single block diagram of the sigma VR ...... 142

Figure 4.21 Output impedance when the current loop closed (Zoi) ...... 142

Figure 4.22 Reduced T2 bandwidth of sigma VR - with the same output capacitance with buck ...... 143

Figure 4.23 Reduced output capacitance of sigma VR - with the same T2 bandwidth with buck ...... 143

Figure 4.24, Loop gain and closed-loop impedance ...... 144

xvii

Figure 4.25 Transient waveforms (Vo initial spike can be ignored) ...... 144

Figure 4.26 Bulk capacitors are required with large Lout and Rout ...... 145

Figure 4.27 Transient performance of sigma VR ...... 146

Figure 4.28 VR startup with no-load condition ...... 146

Figure 4.29 Huge current observed if there is input and output voltage difference for DCX .... 147

Figure 4.30 Proposed start-up method for sigma VR ...... 148

Figure 4.31 Experimental result for soft-start method ...... 148

Figure 4.32 Light load efficiency comparison ...... 149

Figure 4.33 Loss break at light load for sigma VR ...... 149

Figure 4.34 Shut-down DCX at light load to improve light load efficiency ...... 150

Figure 4.35 Activate DCX at heavy load for sigma VR ...... 151

Figure 4.36 Light-load efficiency comparison by shutting down DCX at light load ...... 151

Figure 4.37 Wide input voltage range for buck converter if the input voltage range is wide .... 152

Figure 4.38 Implementation #2 of sigma concept ...... 153

Figure 4.39 Implementation #3 of sigma concept ...... 154

Figure 4.40 Implementation #4 of sigma concept ...... 154

Figure 4.41 Inductor size is too large for POL converters ...... 155

Figure 4.42 Implementation #2 is selected for POL converters ...... 155

Figure 4.43 Detailed circuit diagram for POL application ...... 156

Figure 4.44 Inductor size comparison of sigma POL and buck POL converters ...... 156

Figure 4.45 Efficiency comparison ...... 157

Figure 4.46 Sigma architecture for server and telecom applications ...... 158

xviii

List of Tables

Table 2.1 Parameters in the 70W Voltage Divider prototype…………………………………...48

Table 2.2 Parameters in the 150W Voltage Divider prototype...... 49

Table 3.1 Comparison between Si7106 and RJK0303……………………………………...….. 77

Table 3.2 Comparison between GWS15N15 and RJK0303……………………………….…… 79

Table 3.3 Comparison between CSD13301 and IRF6691……………………………………… 80

Table 3.4 Comparison between CSD16402 and GWS15N15………………………………….. 81

Table 3.5 Comparison between two two-stage designs for laptop……….………………….….97

Table 4.1 Circuit parameters for sigma VR and 6-phase buck VR…………………………….136

Table 4.2 An example of 10A buck design…………………………………………………….138

xix Chapter 1. Introduction

Chapter 1. Introduction

1.1. Evolution of Microprocessors

The microprocessor, which is also known as the or CPU, is widely used in many applications such as computer systems and handheld devices. To achieve better performance, more transistors are being integrated into the microprocessor. Since 1971, when the first microprocessor, Intel’s 4-bit 4004 chipset, was released, more and more transistors have been integrated into the microprocessor, following Moore’s Law, which states that the number of components on an doubles roughly every two years. Figure 1.1 shows the historical data of the number of transistors integrated into Intel’s microprocessor [1]. The dual- core , which was released in 2006, has more than one billion transistors in it.

Figure 1.1 The number of transistors integrated on the die for Intel microprocessors

With more transistors integrated into the microprocessor, better performance is achieved. Figure 1.2 shows the speed of Intel microprocessors [2]. Millions of instructions per second (MIPS) is a measure of a computer’s processing speed. Intel targets to build a microprocessor with ten trillion instruction per second (TIPS) by 2015, which is about ten billion times faster than the first 4004 microprocessor.

1 Chapter 1. Introduction

Figure 1.2 The speed of Intel microprocessors [2]

These advances in microprocessor technology challenge the power supplies of these devices. With a single microprocessor, a device’s power consumption is roughly proportional to the clock frequency, the total lumped capacitance C of the transistors, and the square of the core voltage [3], [4]. In order to get a higher operating speed, the clock frequency must be higher and the power consumption must increase accordingly. Moreover, the scaling of microprocessors will increase capacitance C, and the power consumption increases. This power consumption eventually results in greater heat dissipation, which is a huge challenge for the thermal management of the microprocessors.

Starting in 2005, Intel changed the microprocessor structure from the single core to core multi-processor (CMP); this product uses multiple cores on the die, and includes dual-core and quad-core processors [2]. Assuming the maximum performance of the single core is 1.00x with a power consumption of 1.00x, in order to increase the performance to 1.13x, the clock frequency must be increased by 20% and the supply voltage (VCC) should be increased proportionally. The power consumption is increased as 1.73x (≈ 1.23). Meanwhile, if the clock frequency is reduced by 20%, the supply voltage can be reduced by 20% and the power consumption becomes 0.51x (≈ 0.83) with 0.87x performance. The dual-core processor uses two under-clocked cores, which increases the performance to 1.73x, with almost the same power consumption. Since the supply voltage is reduced, the processors demand more current. With more cores, the supply voltage is further reduced, and the current becomes higher. Figure 1.3 shows the power road map of Intel’s microprocessors, which were surveyed in [5]. In the near future, the voltage will be under 1V and the current demand will be larger than 150A.

2 Chapter 1. Introduction

CPU Core Voltage and Current 2.2 140

2.0 120

1.8 Vcc 100

(V) 1.6 Icc 80 cc V 1.4 60 Icc (A)

1.2 40

1.0 20

0.8 0 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010

Figure 1.3 Intel microprocessors’ power road map [5]

1.2. High Efficiency and High Power Density Requirement for Voltage Regulators

A. Multi-phase Buck Voltage Regulators

In order to supply power to the microprocessor with high current and low voltage demand, the dedicated voltage regulator (VR) is used. In low-end computer systems, since there is no isolation requirement, synchronous buck converters are widely used for VRs because of their simple structure, low cost and low conduction loss. An improvement on the synchronous buck converter is the adoption of the multiphase interleaved technique [6] (illustrated in Figure 1.4). By paralleling several synchronous buck converters and phase shifting their drive signals, the interleaving approach can reduce both the input and output current ripples, improve the transient response, and distribute power and heat.

L

Q1 Q2 Q2

V in Co RL L

Q3 Q4 Q4

L

Q5 Q6 Q6

L

Q7 Q8 Q8

Figure 1.4 A four-phase interleaved voltage regulator

3 Chapter 1. Introduction

B. Transient Requirement for Voltage Regulators

Today’s VR faces the stringent challenge of not only high current but also a strict transient response requirement. In the Intel VRD 11.0 specification, the maximum current slew rate is 1.9A/ns at the CPU socket [7]. Figure 1.5 shows the VR output load line from the Intel VRD 11.0 specification. The vertical axis is the VR output voltage deviation from the voltage identification (or VID, which is a code supplied by the processor that determines the reference

output voltage), and the horizontal axis is the CPU current (ICC). The maximum, typical and minimum load lines are defined by (1-1):

Vmax = VID − RLL ⋅ I CC

Vtyp = VID − RLL ⋅ I CC − TOB (1-1)

Vmin = VID − RLL ⋅ I CC − 2⋅TOB

where RLL is the load-line impedance and TOB is the tolerance band. The VR output voltage should be within this load line band for both static and transient operation.

0 20406080100120140 0 ‐0.02 Vmax LL ‐0.04 Vtyp LL ‐0.06 Vmin LL ‐0.08 (V)

V ‐0.1 Δ ‐0.12 ‐0.14 ‐0.16 ‐0.18 Icc (A)

Figure 1.5 Load-line specifications from Intel VRD 11.0

Figure 1.6 shows the relationship between the load-line specifications and the time domain waveforms [7]. When the CPU current (IO=ICC) is low, the VR output voltage should be high, and when the CPU current is high, the VR output voltage should be low. This is achieved by the VR controller, which is known as the adaptive voltage positioning control (AVP). The VR output voltage must also follow the load line during the transient, with one exception. During the load

4 Chapter 1. Introduction

step-down transient, the VR output voltage can be over Vmax load line for 25µs. This overshoot should not exceed the overshoot relief, which is VID+50mV in the VRD 11.0 specification.

< 25 us Overshoot releif (VID + 50mV)

VID Min Load SS Window V ma V x LL O V m in LL Max Load SS Window Voltage

IO

Current Icc max

Figure 1.6 The relationship between the load-line specifications and time domain waveforms

The voltage overshoots, which cannot meet this specification, will cause higher processor operating temperature, and this may result in damage or a reduced processor life span. The processor temperature rise from higher functional may lead to operation at low power states, which directly reduces processor performance. The voltage undershoots may cause system lock-up, “blue screening,” or data corruption.

The above concerns clarify why output filter capacitors are so important for a processor to operate properly. Figure 1.7 shows a typical power delivery path for today’s processors [9], [10]. In order to supply sufficient energy to the processor, sufficient energy storage components must be placed in different locations.

Processor Die Packaging Cap Bulk Cap For 478-pin socket

OLGA Oscon Socket

Cavity Cap

Figure 1.7 A typical power delivery path for today’s processors

5 Chapter 1. Introduction

The capacitors closest to the VR are so-called ‘bulk’ capacitors. In most of today’s designs, a special type of electrolytic capacitor ⎯ the Oscon capacitor ⎯ is widely used as the bulk capacitor. It has relatively low ESR and ESL values compared to the general-purpose electrolytic capacitor, while still having a large capacitance; for example: C=560µF, ESR=6-10mΩ, ESL=3nH. Since Oscon capacitors have relatively high profile, it cannot be used in applications in which low profile is required, e.g. laptop computers. Another high-density capacitor called specify polymer capacitor (SPCAP) is now widely used as bulk capacitors in laptop or server computers due to its low profile. SPCAP has even better performance than Oscon capacitors; for example: C=390µF, ESR=7mΩ, ESL=2nH. Following the bulk capacitors in the power path, multi-layer ceramic capacitors (MLCCs) are used as the decoupling capacitors. The MLCC has even lower ESR and ESL values than the Oscon capacitors, but the capacitance is small. A typical value would be: C=22µF, ESR=4mΩ, ESL=1nH. Because of their small package sizes, these MLCCs are usually located in the socket cavity. So sometimes it is referred as ‘cavity’ capacitors. The packaging capacitors are those included in the CPU package. They are also ceramic capacitors with extremely low ESL. There is also one on-die capacitor to help the transient performance at high current slew-rate.

Figure 1.8 shows a lumped circuit model of the power delivery path of today’s microprocessor [9], [10]. The selection of different types of capacitors is an attempt to achieve constant output impedance for best transient performance.

VR Socket CPU

Lvr Rvr Lskt Rskt Lvia Rvia L VID Lblk LMB pkg Idie Rdie Rblk RMB Rpkg C C C blk MB Cpkg die

Cavity cap Bulk cap Sensing point

Figure 1.8 A lumped circuit model of the power delivery path of today’s microprocessor

Figure 1.9 explains this using the constant output impedance concept [9]. Each type of capacitor has a different resonant frequency, so that they can cover a certain range of the

6 Chapter 1. Introduction

frequency to make the overall output impedance lower than the desired impedance, as shown in Figure 1.9(a). The overall output impedance is shown in Figure 1.9(b), and it can achieve constant or lower output impedance than the desired impedance with very wide frequency range.

Since packaging capacitors and on-die capacitors are beyond the circuit designer’s control, Intel defines only the socket load line impedance requirement [7], as shown in Figure 1.10. The impedance includes VR, bulk capacitors and cavity capacitors. Since the number of cavity capacitors is limited by the CPU socket, the maximum number is eighteen 22µF ceramic capacitors with 1206 footprint. As a result, it can only cover the frequency range from around 390kHz to 2MHz. Below 390kHz, bulk capacitors must work with VR to get the desired impedance.

|Z|

Desired

VR Bulk Cap Close‐loop (Oscon) Cavity Cap (MLCC)

Packaging cap Die Cap (MLCC) f fc

(a) Impedances for different decoupling capacitors

|Z|

Desired

f

(b) Overall output impedance from die side

Figure 1.9 Constant output impedance looking from the CPU die

7 Chapter 1. Introduction

|Z|

RLL

VR Bulk Cap Cavity Cap Close‐loop (MLCC)

f fc 390kHz 2MHz

Figure 1.10 Impedance requirement from the CPU socket (VR sensing point)

For example, assume VR control bandwidth is 50kHz, which means the impedance below 50kHz can be taken care by VR close loop. Therefore, there is a gap between the VR control bandwidth and the cavity capacitors from 50kHz to 390kHz. As a result, ten 560µF Oscon capacitors must be added to fill the gap. If VR control bandwidth can be pushed to 390kHz, there is no gap between VR impedance and cavity capacitance impedance, which indicates that there are no bulk capacitors required. With different VR control bandwidth, different number of the bulk capacitors should be used, as shown in Figure 1.11. The capacitor parameters are: Oscon capacitor: C=560µF, ESR=6mΩ, ESL=3.3nH, price: $0.15/piece; ceramic capacitor: C=100µF, ESR=1.4mΩ, ESL=1nH, price: $0.25/piece. A very interesting phenomenon is that within 50kHz and 390kHz control bandwidth, capacitance does not change for Oscon capacitors but gradually changed for ceramic capacitors. The reason is that larger ESL of Oscon capacitors limits the number of the capacitors. From Figure 1.11(b), it can be observed that with 150kHz VR bandwidth, it is better to switch from the Oscon capacitor to the ceramic capacitor to reduce the footprint while keeping the same cost.

8 Chapter 1. Introduction

100000

10000 Oscon Cap

1000 Ceramic Cap

100 Capacitance (µF) Capacitance

10 1.E+04 1.E+05 1.E+06 fc (Hz)

(a)Capacitance vs. VR bandwidth 10

Oscon Cap

1

Ceramic Cap Capacitor cost cost ($) Capacitor

0.1 1.E+04 1.E+05 1.E+06

fc (Hz)

(b)Capacitor cost vs. VR bandwidth

Figure 1.11 Bulk capacitance and cost vs. VR bandwidth

C. High Efficiency Requirement for Voltage Regulators

As described in Figure 1.3, the current of the CPU will continue to increase. Because of this, the thermal management of the power system will become more and more challenging. This problem is even severe in a data center, where many CPUs are used for high performance. Figure 1.12 shows the increase of data center installed bases for the worldwide server market [11]. It can be seen that the cost for powering and cooling the data center increases much faster than new server spending due to the high current requirement for CPUs.

9 Chapter 1. Introduction

Figure 1.12 Data center installed base and spending [11]

Figure 1.13 shows an example of the power flow of one server system [12], [13]. The load power is normalized to be 100W. The total input power can be as high as 285W - 405W, which is three or four times higher than the load power. If we exclude the cooling power, due to low efficiency of power supplies the power efficiency of the whole system is only 50%, which means power supplies consume an amount of power similar to the final load.

(a) Power delivery for server systems

(b) Efficiency breakdown for each stage

Figure 1.13 Power flow example in one server system [13]

10 Chapter 1. Introduction

As a result, it is required to improve the efficiency of each stage of the power delivery system in the server. Web giant Google™ proposed to improve their entire server power system’s efficiency to 90% [14]. They estimate that if this is deployed in 100 million PCs running for an average of eight hours per day, this new standard would save 40 billion kilowatt- hours over three years, or more than $5 billion at California’s energy rates. However, this efficiency target is not that realistic, so they decreased the target to 85%. Assuming the PSU can achieve 95% efficiency, the VR must provide at least 90% efficiency!

A similar efficiency requirement is proposed by IBM [15] and Intel [12]. Some efforts have already been made, so VR efficiency has increased from 75% to 85% - 86% [13]. However, there is still a 3% - 4% gap between the state-of-the-art VR efficiency and the target VR efficiency.

Efficiency as discussed above refers to full-load efficiency. Light-load efficiency of the VR is also very important, since the CPU goes into sleep states very frequently to save power. When the computer is in hibernate mode, or when there is no software running, the operating system (OS) sends a signal through the advanced configuration and power interface (ACPI), and the system goes into sleep mode. When the CPU goes into sleep mode, the clock frequency and the supply voltage are reduced. The light-load efficiency is particularly important for laptop voltage regulators, since the laptop VR goes to sleep about 80% of the time, and light-load efficiency is very important for battery life extension.

95% 90% 85% 80% Efficiency 75% 70% 0.1 1 10 100 Io(A)

Figure 1.14 Light-load efficiency target for laptop voltage regulators

11 Chapter 1. Introduction

Figure 1.14 shows an example of the light-load efficiency expectation from Intel. The expected efficiency is 90% for the active state (9A to 45A), 80% at 1A, and 75% at 300mA for the sleep states. Since the fixed power loss, which consists of gate driving losses, core losses, etc., becomes dominant at light load, it is very challenging to meet the efficiency requirement.

D. High Power Density Requirement for Voltage Regulators

At the same time, data center power density is increasing by approximately 15% annually. Figure 1.15 shows the rack power increasing year by year. The power could go as high as 25kW in 2009 [11]. The servers per rack increase dramatically. In 1996, there was an average of seven servers per rack. This number increased to ten servers per rack in 2002 and 14 in 2005. It is projected to increase to 20 servers per rack by 2010!

Figure 1.15 Rack drawing more power [11]

More servers per rack mean more CPUs, which indicates that the CPU voltage regulators increase correspondingly. Figure 1.16 shows an example of a server motherboard with two CPUs and two VRs. It can be calculated that the VR occupies more than 13% of the total motherboard footprint. To reduce the server size, it is necessary to reduce the VR size. If we perform further calculation, it can be seen that the passive components of the VR, which are the output inductors and output capacitors, are about 60% of the total VR size. So if we want to achieve a high- power-density VR, the passive component size must be reduced. In the future, since CPU current will continue to increase, more VR phases should be designed to power higher current. With

12 Chapter 1. Introduction more VR phases, the size of the VR will be further increased. As a result, reducing the passive component size is very important for server voltage regulators.

VR

Figure 1.16 One example of a server motherboard with VR highlighted

The above VR is mainly for processors. For Itanium processors which are widely used in high-end servers, the power delivery is very different, as shown in [16], [17], [18]. The VR delivers power from one side of the OLGA board through the power connector. There are three kinds of decoupling capacitors: bulk capacitors on the VR board, die-side capacitors (DSC) on the OLGA, and a pin-side capacitor (PSC) underneath the microprocessor.

Power connector Processor DSC PSC VR Inductor Bulk caps (die side cap) (pin side cap)

OLGA Socket motherboard

Figure 1.17 Power delivery from 12V VR to high-end server microprocessors

13 Chapter 1. Introduction

In the future, server microprocessors will have more and more power consumption and transient requirements. Based on Intel’s information, the expected next-generation Itanium CPU current is around 170A and the load-line resistance is 0.8mΩ at the CPU packaging point. The main limitation of the current power delivery approach for the future VR specification is the interconnection effect. The parasitic resistance of the interconnection is 0.6mΩ which causes a huge distribution loss of about 17.3W at 170A load current. The large ESL of the interconnection (400pH) will introduce a huge voltage spike with a fast load transient, which is hard to reduce with VR control. Many capacitors must be added to meet the transient requirement. Based on the design methodology introduced in [10], to meet the future VR transient requirements it can be predicted that the die-side capacitor number should be sixteen times as many as the current solution, and the pin-side capacitor number should be eleven times as many as the current solution. Due to the limited footprint of the OLGA, it is impractical to include so many capacitors.

To solve this problem, CPES proposed the new power delivery structure where the VR is put on the OLGA to minimize the interconnection effect [19]. The concept is shown in Figure 1.18, where the VR is put on the OLGA and delivers power from four sides of the microprocessor. The power delivery path is much shorter than in the previous architecture. Therefore, efficiency and transient response are improved. Simulations show that the parasitic resistance from the VR to the die-side capacitor is about 0.11mΩ, which reduces the distribution loss by 82%. The parasitic inductance is about 6pH, which results in a 75% die-side capacitor reduction.

VR

Processor VR VR DSC PSC OLGA (die side cap) (pin side cap) VR

VR VR

Socket motherboard

Figure 1.18 CPES proposed solution to minimize the interconnection effect

14 Chapter 1. Introduction

This concept is also used in industry. For example, Intel and other companies are now considering the Z-Axis VRM (ZVRM) structure, as shown in Figure 3.32 [16]. There is a small difference between the ZVRM and CPES’ proposed structure; the ZVRM is plugged into the OLGA instead of putting it on the OLGA. The reason for this is that ZVRM failure does not cause CPU failure. Nevertheless, the concept is still to put VR as close as the microprocessor to minimize the power delivery path, like CPES’ proposed structure.

(a)

ZVRM Processor ZVRM DSC Inductor Bulk cap Bulk cap Inductor DSC PSC (die side cap)

OLGA

Socket motherboard (b)

Figure 1.19 ZVRM proposed by Intel [16]

(a) Original Drawing (b) Concept redrawing

In this arrangement, the VR must be very small to put it on (or around) the OLGA. As we know, the bulky elements of the VR are the output inductors and output capacitors. Based on the information from Intel, VR switching frequency must be pushed to about 800kHz to fit the size to the space around CPU.

A similar requirement exists for the laptop VR, since the laptop demands smaller size and lighter weight.

15 Chapter 1. Introduction

1.3. Limitations of Single-Stage Multi-phase Buck Voltage Regulators for High Efficiency and High Power Density

A. Low Efficiency due to High Input Voltage

At the very beginning of the use of voltage regulators, since the output current was only about 15A, a 5V input from a silver box was widely used. The efficiency of this type of VR is about 90% [6].

5V12V VRMVRM Processor Silver AC 0.8~1.6V Box 2V/15A

Figure 1.20 5V input CPU voltage regulators

However, as the current of the CPU keeps getting higher and higher, the input current from 5V is also higher, which gives much more burden to the silver box. Furthermore, a high input current can cause high distribution loss since the distance between the silver box and the VRM is very long.

To meet the higher current VR requirements, the VR input changed from 5V to 12V. As a result, the input current is low, so both of the above problems are solved. However, the efficiency drops to about 80%.

12V VRMVRM Pentium Processor Silver AC 0.8~1.6V Box

Figure 1.21 12V input VR

16 Chapter 1. Introduction

Nowadays, the device performance is better so the efficiency gap between 12V VR and 5V VR is reduced. Nevertheless, there is still 4% efficiency difference in an example shown in Figure 1.22.

90% 88% 86% 84% 82% 80% 78%

Efficiency 76% Vin=5V 74% Vin=12V 72% 70% 2 6 10 14 18 Io (A)

Figure 1.22 Effciency of 12V and 5V VRs (fs = 1MHz, top switch: Si7112, bottom switch: two Si7112)

Figure 1.23 shows the tested efficiency curves for six-phase voltage regulators with 12V input and 1.2V output and the best available devices. Each phase contains one top switch, the RJK0305, one bottom switch, the RJK0301, and one output inductor. It clearly shows that the VR efficiency drops a good deal at 600kHz and 1MHz, which indicates the switching losses becomes dominant for high-frequency operation, since conduction losses are not related to the switching frequency.

95%

90%

85%

80% 300kHz 600kHz 75% 1MHz

70% 0 20 40 60 80 100 120 140 Io(A)

Figure 1.23 State-of-the-art VR efficiency at 300kHz, 600kHz and 1MHz

17 Chapter 1. Introduction

Figure 1.24 shows the loss breakdown of the above voltage regulator at full load [21]. It clearly shows two dominant losses: the top switch turn-off loss and the bottom switch conduction loss. Since the turn-off loss is roughly proportional to the input voltage and the commutation time, a high input voltage (12V) is a major reason for high turn-off loss. As for the bottom switch, since it conducts current for most of the time due to a small duty cycle (0.1 for 12V input 1.2V out), the conduction loss is also very high.

16 14 300kHz (W)

12 600kHz 10 1MHz Loss 8 6 4 Power 2 0

Figure 1.24 Loss breakdown of a six-phase buck converter running at 300kHz, 600kHz and 1MHz (120A)

B. Device Limitation of Multi-Phase Buck

For low-voltage applications, there are three basic types of device technology widely used in industry: the vertical diffusion MOS (VDMOS), the trench MOS [45] and the lateral diffusion MOS (LDMOS) [46]. Among these structures, the trench MOSFET has the lowest on-resistance,

while the lateral MOSFET has the fastest switching speed due to low Cgd. For sub-30V applications, the trench MOS and lateral MOS are widely used.

Recently, the lateral-trench MOSFET structure was developed to obtain the benefits of

both the lateral MOSFET and the trench MOSFET. Both Cgs and Cgd can be reduced comparing with trench MOSFET, and on resistance is lower than lateral MOSFET.

The traditional device figure of merit (FOM = Qgd * Rdson) [20] is shown in Figure 1.25. A lower FOM means the device can achieve lower power loss. For 12V voltage regulators, a 30V MOSFET should be used for both top and bottom due to the device’s hard-switching. Unfortunately, most of lateral MOSFET and lateral-trench MOSFDET are low voltage rating

18 Chapter 1. Introduction devices with sub-20V breakdown voltage. Figure 1.25 shows that a 30V device’s FOM is much larger than the FOM of sub-20V devices. Although companies put a lot of effort into improving device performance at 30V, e.g. the improvement of the Renesas D8 to D9 and the Infineon OptiMOS 2 to OptiMOS 3, the lowest FOM for 30V devices is still higher than the FOM of most sub-20V devices with lateral or lateral-trench structure. As a result, the power losses on both the top switch and bottom switch are larger. The MOSFET used in Figure 1.23 is the best available 30V MOSFET, a Renesas D9 generation, but the efficiency is not that exciting even at 300kHz switching frequency.

40 35 Infineon ‐ Trench 2 Vishay ‐ Trench 30 IRDirectFET 25 Renesas ‐ Trench D8 20 Infineon ‐ Trench 3 Renesas ‐ Trench D9 15 10 FOM=Qgd*Rds Greatwall‐ Lateral Ciclon‐ Lateral‐Trench 5 0 5 101520253035 Vds (V)

Figure 1.25 FOM vs. breakdown voltage

Figure 1.26 shows the efficiency of 5V VR with low voltage rating devices. It can be seen that the efficiency can be improve about 9% comparing with 12V VR with 30V devices.

90% 88% 86% 84% 82% 80% 78% 76% Vin=5V with low voltage devices Efficiency 74% Vin=5V with 30V device 72% Vin=12V dwith 30V devices 70% 2 6 10 14 18 Io (A)

Figure 1.26 Efficiency of 5V input VR with low voltage rating devices (fs = 1MHz, GWS15N15 + IRF6691)

19 Chapter 1. Introduction

C. Low Power Density Due to Low Frequency

As explained above, the frequency of the 12V VR is relatively low due to efficiency requirement. The frequency is about 200kHz~300kHz for laptop VR. For server VR the frequency range varies from 300kHz to 600kHz. Assuming the VR bandwidth is about one sixth of the frequency, the bandwidth of server VR is only 50kHz to 100kHz, which means at least 10 Oscon capacitors should be used. Low frequency also results in bulky inductor size. These are the reasons why we see so many capacitors and so large output inductors in Figure 1.16.

1.4. Two-Stage Voltage Regulator Architectures – “Divide and Conquer” Concept

In order to achieve high efficiency and high power density at the same time, this dissertation proposes using two-stage power architectures. The concept is “Divide and Conquer”. A single-stage VR is split to a two-stage VR and both of the two-stage can achieve high efficiency. There are two types of two-stage VR connection: series two-stage and parallel two- stage.

A. Series Two-Stage Solution: CPES Proposed Two-Stage Solution [22], [23]

To avoid the problems of the 5V input and to take advantage of the high efficiency of a 5V VR, a two-stage VR solution was proposed, as shown in Figure 1.27. By adding a high- efficiency first stage, both the silver box and the VR can be satisfied. Thus the first stage works like a buffer. The second stage is to conquer the high current requirement with low input voltage and low voltage rating devices and the first stage just needs to step down the 12V input to 5V.

5V 12V 1st Stage 2ndVRMVRMStage Pentium Processor Silver AC 0.8~1.6V Box

Figure 1.27 Two-stage VR Solution

20 Chapter 1. Introduction

Figure 1.28 shows one possible implementation of two-stage concept. The first stage is buck converter which converts 12V input to 5V. Since it does not require fast transient response, low switching frequency is desired for high efficiency. With 300kHz switching frequency, the first stage efficiency is more than 97%. The second stage is multi-phase buck converter with 5V input. It can achieve higher switching frequency with similar efficiency as 12V VR.

Voltage step-down converter Multi-Phase Buck 2nd Stage

Q11 Q1 Q2 io1 io Q12 Cbus Co RL

Q3 Q4 io2 300~500KHz

Q5 Q6 io3 …

Figure 1.28 Two-stage VR with buck as the first stage [22]

B. Parallel Two-Stage Solution: Proposed Sigma Voltage Regulators [25]

The previous two-stage structure is in series; thus each stage should be able to handle the full power. A more efficient way to deliver power is that two stages deliver power to the load in parallel, and use a high-efficiency unregulated converter to handle most of the power. A parallel- type two-stage approach is proposed, as shown in Figure 1.29. The input-side of the two converters is in series instead of in parallel. It is named as quasi-parallel architecture, or sigma architecture. The DC/DC (DCX) delivers most of the power, so high efficiency is expected. The low-power buck converter only needs to take care of the output voltage regulation and transient performance. Another benefit of this architecture is that low-voltage-rating devices can be used in both the DCX and buck converter.

21 Chapter 1. Introduction

isolation

** + n 1 Vin1 DCX - Vin Buck + Vin2 - Io

Figure 1.29 Another parallel type of two-stage voltage regulators – sigma VR

1.5. Dissertation Outline

From the above discussion, the CPES proposed series two-stage and quasi-parallel power architectures are good candidates for the voltage regulators. In this dissertation, I will focus on the investigation of these two power architectures.

This dissertation consists of five chapters organized as follows:

Chapter 1 gives an introduction of the research background. It highlighted the main challenges of VR design which are high efficiency and high power density requirement. The limitations of the current single-stage solution are illustrated in very detail.

Chapter 2 and Chapter 3 outline the improvement of the CPES proposed two-stage architecture.

Chapter 2 proposes a new first stage with very high power density and high efficiency. The first stage of the proposed two-stage architecture is converting 12V to 5-6V. High efficiency is required for the first stage since it is in series with the second stage. Previous first stage which is a buck converter has good efficiency but bulky size due to low frequency operation. Another problem with using a buck converter is that light-load efficiency of the first stage is poor. To solve these problems, switched-capacitor voltage dividers are proposed. Since the first stage does not require voltage regulation, the sweet point for the voltage divider can be determined and high efficiency can be achieved. At the same time, since there are no magnetic components for the

22 Chapter 1. Introduction

switched-capacitor voltage divider, high power density can be achieved. The detailed analysis of the switched-capacitor voltage divider will be provided in Chapter 2.

Chapter 3 focuses on how to improve the second stage’s efficiency with the help of low- voltage-rating devices, and how to improve the light-load efficiency. The different low-voltage devices will be evaluated, and the best device combinations are found for high-frequency operation. Moreover, adaptive on-time control method and a non-linear inductor structure are proposed to improve CCM and DCM efficiency for the second stage respectively. Previously the two-stage VR was only used as a CPU VR. The two-stage concept can also be applied to other systems. Two different applications will be studied for two-stage architecture: laptop computers and high-end server microprocessors. It will be demonstrated that the two-stage power architecture can achieve either higher efficiency or higher power density and a lower cost when compared with the single-stage VR.

In Chapter 4, a high efficiency parallel two-stage power architecture (Sigma VR) is

proposed, and the design is illustrated in detail. The proposed sigma VR takes advantage of the high-efficiency, fast-transient unregulated converter (DCX) and relies on this converter to deliver most of the output power, while using a low-power buck converter to achieve voltage regulation. Both the DCX converter and the buck converter can achieve around 90% efficiency when used in the sigma VR, which ensures 90% efficiency for the sigma VR. The small-signal model of the sigma VR will be studied to achieve adaptive voltage positioning (AVP). The sigma power architecture can also be applied to low-power point of load (POL) applications to reduce the magnetic component size and improve the efficiency. Finally, the two-stage VR and the sigma VR are briefly compared.

Chapter 5 provides a summary of the dissertation.

23 Chapter 2. High Power Density Non-Isolated Bus Converters

Chapter 2. High Power Density Non-Isolated Bus Converters

2.1. Background

Two-stage power architecture is originally used for the isolated DC/DC converter applications [26], as shown in Figure 2.1. The voltage regulation is achieved by the first stage which is a buck converter. The second stage is totally unregulated. The major benefit is to achieve self driven for the synchronous rectifier for the second stage. Also the secondary side gate driving loss can be partially recovered, which is very important for high frequency operation.

* *

Figure 2.1 Two-stage architecture for the isolated application [26]

Vicor applies the above two-stage architecture to 48V VRM application, which is called as factorized power architecture (FPA) [27]. A FPA is shown in Figure 2.2. We can see that the first stage is regulated, while the second stage is totally unregulated. The difference of FPA comparing with the architecture proposed in [26] is that the first stage is a buck-boost type converter which can achieve ZVS for all switches; the second stage is a LLC resonant converter running at resonant frequency. It is indicated that the unregulated resonant converter can achieve much higher efficiency than the regulated one since it can work with an optimal point. As a result, the second stage can achieve very high efficiency with high frequency. In addition, the second stage can use low-voltage-rating devices to improve the performance.

24 Chapter 2. High Power Density Non-Isolated Bus Converters

Pre‐regulated Module Voltage Transformation Module

Voltage regulation

Figure 2.2 Factorized power architecture by Vicor [27]

Figure 2.3 shows the efficiency of the FPA structure for 48V application. The overall efficiency can be as high as 88% with a switching frequency greater than 1MHz.

Vbus = 48V PRM VTM 36~75V input 1.5V/100A

η=97% × η=91% @ 1.5V/100A, fs =1MHz fs=1.3MHz

Figure 2.3 Efficiency of FPA

However, since the second stage is unregulated, the transient performance is not easy to achieve. Another problem with the FPA is that since the second stage is unregulated, the stages cannot be in parallel to increase the current capability. As a result, this structure has a problem with scalability.

To get better transient performance and achieve more scalability, another two-stage architecture was proposed for 48V applications [22], as shown in Figure 2.4. The first stage is isolated and unregulated, and the second stage just uses a simple buck converter, which can achieve very good transient performance. It is also well known that buck converter can achieve scalability very easily by using multi-phase interleaving structure. Nowadays, this structure is widely used in telecom industry and it is called as intermediate bus architecture (IBA).

25 Chapter 2. High Power Density Non-Isolated Bus Converters

The above two-stage structure is used for isolated applications. For 12V VR, since no isolation is required, is the two-stage architecture still a valid approach? The following section will review some research done for 12V two-stage VR.

First Stage Second Stage Isolation Voltage Primary-Side * * Secondary-Side Output Power Circuit Power Circuit Regulator Voltage

Control Path Fixed Duty-Cycle

Figure 2.4 Another two-stage architecture with isolation

2.2. Review of Previous 12V Two-Stage VR Research

2.2.1. Two-Stage VR for Desktop Computers

For desktop VRs, cost is the greatest concern. In reference [22], it is proven that with around 350kHz control bandwidth, the output bulk capacitors can be eliminated, as shown in Figure 2.5.

60

40

c 20 f =320kHz

0 Io 100A 20

40 3 4 5 6 7 1 .10 1 .10 1 .10 1 .10 1 .10

0 9

100 80º Vo 100mV

0 200

300

6 400 3 4 5 6 7 1 .10 1 .10 1 .10 1 .10 1 .10

(a)Measure loop gain (b)Transient response without bulk capacitors

Figure 2.5 350kHz bandwidth can eliminate output capacitors [22]

26 Chapter 2. High Power Density Non-Isolated Bus Converters

To achieve 350kHz control bandwidth, a 2MHz VR is required, assuming the bandwidth is one-sixth of the switching frequency. The efficiency of a 2MHz single-stage VR is only about 75%, so it is not applicable. Figure 2.6 shows the two-stage architecture. The first stage efficiency is about 97%, and the second stage efficiency is as high as 89% with 2MHz switching frequency due to the low input voltage (5V). Figure 2.7 shows that a 2MHz two-stage VR can achieve even higher efficiency than a 1MHz single-stage VR.

Voltage step-down converter Multi-Phase Buck 2nd Stage

Q11 Q1 Q2 io1 io Q12 Cbus Co RL

Q3 Q4 io2 300~500KHz

Q5 Q6 io3 …

Figure 2.6 Two-stage VR with buck as the first stage [22]

System Efficiency Comparison 90 88

) 86 84 82

Efficiency (% Efficiency 80 78 76 0 20 40 60 80 100 120 Io (A)

Single stage @ 1MHz Two stage @ 2MHz and Vbus=5V

Figure 2.7 Overall efficiency comparison between single-stage and two-stage VRs [22]

Figure 2.8 shows the cost comparison between the single-stage VR and the two-stage VR. The two-stage VR can save about 5% of the cost when compared with the single-stage VR. We can thus conclude that the two-stage approach is a more cost-effective solution and a promising candidate for desktop voltage regulators.

27 Chapter 2. High Power Density Non-Isolated Bus Converters

$3.50 Single stage $3.00 Two stage VR $2.50 $2.00 $1.50 $1.00 $0.50 $0.00 ) r t T u e ET ive Lin F Lo /1 0m ag Dr F st op bot FE Cer Cin u st Controller T 0 ir F (56 coupling cap n e D co s- O

Figure 2.8 Cost comparison between single stage and two-stage VRs [22]

2.2.2. Two-Stage VR for Laptop Computers

The requirement for laptop VRs is different from the requirement for desktop VRs. Since the laptop VR is used in portable devices, the size and the weight are very important, which implies that VR should run at a high switching frequency. At the same time, since the laptop VR is powered by batteries on the road, the efficiency is very important to extend the battery life. However, the laptop VR has a very wide input voltage range (8.7V-19V) to cover both the adapter voltage and battery voltage, which makes it even more challenging than the desktop VR with the input voltage range of 12V±10%. From both power density and efficiency points of view, a single-stage VR is not a good solution for laptop VRs.

To achieve both high efficiency and high power density with a wide input voltage range, the two-stage VR is proposed for laptop applications [23], with the configuration shown in Figure 2.9. Compared with the two-stage approach in desktop applications, the only difference is the wide input voltage range. Since the laptop computer has a much wider input range, it is more desirable to regulate the bus voltage, which makes the second stage easier and more efficient.

~5V L Q11 Q1 Q2 Vin Q12 C bus Co RL 8.7~19V L Q3 Q4

Regulated, L Q5 Q6 High fs low f 1st stage 2nd stage s

Figure 2.9 The two-stage architecture in laptop computer application with wide input range [23]

28 Chapter 2. High Power Density Non-Isolated Bus Converters

One more benefit from the two-stage VR is that the bus voltage (input voltage for the second stage) can be optimized to get higher light-load efficiency. Reference [22], [23] proposed an ABVP concept to improve the two-stage efficiency under light load conditions. The basic idea is to reduce the bus voltage under light load conditions to reduce the switching-related losses of the second stage. The analysis shows a trend that when the load is lighter, the optimal bus voltage is lower. As discussed previously, the optimal bus voltage is around 5-6V under heavy load; under light load, the optimal bus voltage gradually decreases. Based on this, the control strategy is illustrated in Figure 2.10.

CPU C0 C0 Power

C1 C2C1 C1 C2C1 C1 C2C1 C2C3 C3 C2C3 C3 C2C3 C3

Time Vbus

Time

Figure 2.10 The CPU power consumption and the bus voltage following ABVP control strategy [23]

1MHz Two-Stage VR Efficiency

90% Two-stage With ABVP

20A 80% Two-stage w/o ABVP Io

Single stage 20mV 70% Efficiency Vo 60%

Vbus (With Gate Drive Loss) 1V 50% 0 10 20 30 40 50 Output Current (A) Vbus=3V Vbus=3V Vbus=6V

Figure 2.11 The transient waveforms of output Figure 2.12 The efficiency improvement by ABVP current, output voltage and bus voltage [22] control strategy [23]

29 Chapter 2. High Power Density Non-Isolated Bus Converters

A prototype has been built to verify the feasibility of this concept [22]. The input voltage is 12V. The output voltage transitions from 1.28 to 1.3V as the load jumps from 0 to 20A. The bus voltage varies from 4.2 to 5.3V as the load changes from 20A to 0A. The inductance of the first stage is 10uH. The inductance of the second stage is 300-nH for each phase. The first-stage switching frequency is 200kHz and that of the second stage is 1MHz. The bus capacitance is 1mF and the output capacitance is 1mF. Figure 2.11 shows the experimental waveforms. It can be seen that the bus voltage jumps from 4V to 5V as the output current transitions from 0A to 20A. Both ABVP and AVP functions are realized. The experimental efficiency in Figure 2.12 indicates that the ABVP concept is able to improve the efficiency throughout the whole load range. The improvement is more significant at lighter loads.

2.3. Possible Improvement for Two-Stage VR

In the previous two-stage research, a buck converter was proposed for the first stage, as shown in Figure 2.6 and Figure 2.9. It is configured to run at a low switching frequency, e.g. a couple hundred kHz, to attain high efficiency. However, this low switching frequency is the first stage to a large size, and counteracts the space savings of the high-frequency second stage. As we know, high power density is very important for server and laptop applications. Figure 2.13 shows two different VR designs in the two-stage approach. One is the version for a desktop or laptop, the other is the VRM version for a server. It can be clearly seen that the first stage has a size similar to the second stage. Detailed analysis reveals that the two-stage approach with a buck converter as the first stage cannot cause too much size reduction.

Both sides are used for 1st stage

Industry Design for 1U VRM 500KHz 1st Stage

(a) 120W two-stage VR for desktop (b) 130W two-stage VRM for server

Figure 2.13 First stage in two-Stage VR

30 Chapter 2. High Power Density Non-Isolated Bus Converters

Another problem of using a buck converter in the first stage is its poor light-load efficiency [22], as shown in Figure 2.14. Even with pulse-skipping control [28] to improve light-load efficiency, the light-load efficiency is still not that high. Since the first stage is in series with the second stage, the low light-load efficiency of the first stage causes low light-load efficiency in the overall system directly. Figure 2.15 shows that due to the low light-load efficiency of the first stage, the two-stage overall efficiency at light load is even lower than the single stage approach. This is not acceptable since light-load efficiency becomes more and more efficient. As a result, this chapter explores more candidates for the first stage.

98 System Efficiency Comparison 90 97 89

) 88 96 87 95 86 85

94 (% Efficiency 84 Efficiency 83 93 82 0 20406080100120 92 Io (A) 0 5 1015202530 Single stage @ 1MHz Two stage @ 1MHz and Vbus=5V Output Current (A)

Figure 2.14 The experimental efficiency of the 1st stage. Figure 2.15 Overall efficiency comparison between Solid line: constant frequency single stage and two-stage @ 1MHz [22] Dash line: Pulse-skipping @ DCM

As for the second stage, since the input voltage is about 5V-6V, low-voltage-rating devices (20V or lower) can be used. References [22]and [23] do not pay too much attention to low- voltage-rating devices since such devices were not popular at that time. In Chapter 3, a number of low-voltage-rating devices are evaluated and the efficiency benefits are proven by experimental tests. Moreover, some methods are proposed to improve the light-load efficiency of the second stage, which are also discussed in Chapter 3.

2.4. Proposed Switched-capacitor Voltage Divider as the First Stage

2.4.1. Basic Switched-Capacitor Circuits

As stated above, using a buck converter in the first stage is too bulky due to the large output inductors. To get a high power density, it is better to have no inductors in the circuit. Thus we looked into a switched-capacitor technology where capacitors are used as the energy transfer.

31 Chapter 2. High Power Density Non-Isolated Bus Converters

Figure 2.16 shows a typical switched-capacitor converter [29]. When S1 and S2 are on, the

input voltage source charges C2 while Co provides the energy to the load. Assuming that the R

and C2 time constant is much smaller than half of the switching period, C2 is charged to Vin.

When S3 and S4 are on, C2 charges both Co and the load. Finally, C2 is discharged to Vo. Since

the output voltage is regulated, there should be some voltage difference between Vin and Vo. At

the moment when S3 and S4 turn on, the voltage difference between C1 and Co is (Vin-Vo). Therefore a large R is required to limit the current and protect the devices. The output voltage is controlled by controlling the power loss in the devices. Larger losses lead to lower output voltage. Since R must be high enough to serve as the buffer between two voltage sources, the current cannot be very high. The efficiency of this converter can be simply calculated by Vo/Vin, and Vo is always lower than Vin. A larger voltage difference between Vin and Vo leads to lower efficiency. This is why it can only be used in low-power applications.

S1 S4

‐Vo Vin C2

S2 C o RL S3

R R

_ _

+ _

+ + C + C 2 RL 2 RL Vin _ V Co in Co

S1,S2 on, S3, S4 off S3,S4 on, S1, S2 off

Figure 2.16 A typical switched capacitor converter for low power application

Another type of switched-capacitor converter is shown in Figure 2.17 [30]. When Q1 and

Q3 are on, Q1 is controlled as a current sink to limit the current. So if we look at the equivalent

circuit, it is like a current-fed circuit during this period. When Q2 and Q4 are on, since there is a

voltage difference between C2 and Co, a large R must be presented to limit the current. The output voltage can be simply expressed as:

32 Chapter 2. High Power Density Non-Isolated Bus Converters

Vo = 2DI on RL (2-1) where D is the duty cycle of Q1 and Q3. The efficiency of this circuit is 2Vo/Vin [30]. Again, if Vo is much less than Vin/2, the efficiency is very low.

Q1

Q2 C2 V in + Q3

V Co o Q4 RL ‐

Ion R C2 + ‐ + C V RL 2 RL in Co Vin ‐ Co

Q ,Q on, Q , Q off Q1,Q3 on, Q2, Q4 off 2 4 1 3

Figure 2.17 Another switched-capacitor converter with current sink

From the above two examples, a basic concept of switched-capacitor technology can be generalized. Figure 2.18 shows the case of two capacitors in parallel with a switch. As we know, two capacitors cannot be in parallel directly if there is a voltage difference between them. Therefore, there should be some mechanism to limit the current. As shown in Figure 2.18, there are at least three cases when two capacitors can be in parallel without a high surge current:

(1) Resistance R is large to limit the current when the voltage difference is large;

(2) S1 serves as a current sink to limit the current when the voltage difference is large;

(3) Resistance R can be small if the voltage difference between C1 and C2 is small.

It can be seen that the first two cases have higher power loss but they can achieve voltage regulation. The power loss of the third case is small but it cannot achieve voltage regulation.

33 Chapter 2. High Power Density Non-Isolated Bus Converters

Large R Ion Small R

+ + V + + + + 1 V2 V1 V V1 V ‐ ‐ ‐ ‐ 2 ‐ ‐ 2

V1‐V2 is large V1‐V2 is large V1‐V2 is small

(a) (b) (c)

Figure 2.18 Equivalent circuit for two capacitors in parallel

2.4.2. Proposed Switched-Capacitor Voltage Divider

For the two-stage power architecture, since the second stage can achieve voltage regulation, the first stage can be unregulated as long as it can lower the input voltage from 12V to the 5V-6V range. This converter can be treated as a non-isolated bus converter. As a result, it is possible for us to use a switched-capacitor circuit with the concept shown in Figure 2.18(c) to get lower power loss. Figure 2.19(a) shows the concept of the proposed voltage divider. The key is that Q1-Q4 have very low on-resistance and the output voltage is unregulated. By careful

design, the voltage on capacitors C2 and Co are almost half of the input voltage, so there is a very small voltage difference between their capacitances. A 50% duty cycle for all switches is

applied. An improved version is shown in Figure 2.19(b), where Co is split into C1 and C3. The output ripples are the same as long as Co is equal to the sum of C1 and C3. However, C1 also

serves as the input decoupling capacitor. Since C1 provides the AC current path, the real

decoupling capacitor for the Vin can be reduced. Also the voltage rating of C1 is only half of the input voltage, which can decrease the converter size.

34 Chapter 2. High Power Density Non-Isolated Bus Converters

Q1

Q2 C2 Vgs 50%Duty Cycle V in + Q3 Q2 & Q4

V Co o Q4 RL Q & Q ‐ 1 3 t t t t t t 0 1 2 3 4 5

Small R Small R C2 + ‐ + RL C2 R Vin C L o Vin ‐ Co

Q1,Q3 on, Q2, Q4 off Q2,Q4 on, Q1, Q3 off

(a) Concept of the proposed voltage divider

Q1

C1 Q2 C2

+ Q3 Vin V C3 o Q4 RL ‐

(b) Improved version of voltage divider

Figure 2.19 Proposed switched-capacitor voltage divider

Recently, another switched-capacitor circuit with a similar concept has been proposed [31] for high-output power automotive applications. As shown in Figure 2.20, since there are both a 42V battery and 14V battery in the system, a bi-directional DC/DC converter is required to power both the 42V loads and 14V loads. It can be seen that there are no voltage regulation function required for the system.

35 Chapter 2. High Power Density Non-Isolated Bus Converters

(a) Four-level DC/DC converters with switched-capacitor technology

(b) Equivalent circuit for each stage

Figure 2.20 Switched-capacitor converters for high power application [31]

Figure 2.20(b) shows the equivalent circuit for each stage, where R1 is the total resistance

in the loop. To deliver more current, R1 and R2 must be very small. To limit the current, the

voltage difference between C1 and Vo must be very small. This can be controlled by selecting

large C1 and C2. C1 is almost the same as Vo, while C2 is almost 2Vo. Finally, the ratio between

Vin and Vo is about 3:1. There is small voltage difference which depends on the load current.

The problem with this circuit design is that the switching frequency is too low (5kHz) since the author does not analyze the switching performance of this circuit very carefully. As a result,

36 Chapter 2. High Power Density Non-Isolated Bus Converters

there are too many capacitors needed to limit the current in the circuit which counteracts the size saving by eliminating the magnetic components, as shown in the prototype in Figure 2.21.

Figure 2.21 Prototype of the four-level switched-capacitor DC/DC converter and tested efficiency [31]

(fs = 5kHz, C1: 10*3300µF/7m electrolytic capacitors and C2: 10*2200µF/12m electrolytic)

2.4.3. Challenges for the proposed voltage divider

(A) Minimize the power losses and achieve highest efficiency

Previously, some methods are used to analyze and design the switched-capacitor circuits. For example, the state-space averaging method is used in [30] and the time-domain analysis method is applied in [32]. However, there are some limitations for the previous methods.

• Since most of them are designed for the regulated converters, the efficiency is well determined by the ratio between the output voltage and the input voltage. As a result, they did not pay attention to the power loss analysis for the switched-capacitor circuit.

• Most of the methods neglected the switching losses in the circuit.

• All of the methods neglected the influence from the parasitic inductors.

As we know, high efficiency must be achieved for the first stage of two-stage VR, so the power losses must be analyzed for the proposed voltage divider. Moreover, switching losses can be very high without very careful design due to the parasitic inductors. All of the above issues are solved with the detailed analysis in the following sections.

(B) System design consideration with voltage divider as the first stage

Since the voltage divider is used as the first stage and it can be followed by several different second stages, two important concerns emerge. The first is the stability of the whole

37 Chapter 2. High Power Density Non-Isolated Bus Converters

system. The second concern is how the bus voltage, which is the output voltage of the voltage divider, responds to the VR load transient. If the bus voltage changes a lot with one VR load transient, it may cause undesirable voltage noise with other VRs.

(C) Start-up and protection of the voltage divider

As mentioned above, at steady state, since Vc2,Vc1 and Vc3 are very close, the current on devices is relatively small. However, at startup and fault operation, the voltage difference between capacitors is very large, which causes a huge surge current on the devices. So some actions must be taken for start-up and protection.

2.4.4. Minimizing Power Losses by Circuit Analysis and Design

To simplify the design, the parasitic inductors in the circuit are firstly ignored. The power loss model with parasitic inductors will be introduced later.

(A) Switching loss analysis

Compared to the buck DC/DC converter, the proposed circuit has the following advantages in terms of the switching performance:

• All switches can be turned off in a zero voltage condition (ZVS)

Figure 2.22 (a) shows the turn-off of Q1 and Q3. Since there is no current charging the

junction capacitor of Q3, the Vds stays at zero after it turns off. The Q1 voltage also stays at

zero. This means Q1 and Q3 turn off with zero voltage. There is no turn-off loss for the

switches. The same conclusion applies for Q2 and Q4.

Q1 Q1

Q2 I Q C2 C2 C2 2

V Q3 Q in I O V 3 C4 in C4 Q 4 Q4

(a) Turn-off of Q1 and Q3 (b) Turn-on of Q2 and Q4

Figure 2.22 Switching performance of switched-capacitor voltage divider

38 Chapter 2. High Power Density Non-Isolated Bus Converters

• All switches can be turned on in a zero current condition (ZCS)

Figure 2.22(b) shows the turn-on for Q2 and Q4. Since the voltage difference between C2

and C4 is very small, the current on Q2 and Q4 is almost zero before their on-resistance drops to

a very small value. This means Q2 and Q4 turn on with zero current. However, there is capacitive turn-on loss for the switches. The total turn-on losses of the four switches are:

2 Pturn _ on = CeqVin f s (2-2)

where Ceq is the energy equivalent capacitance of each MOSFET with half of the input voltage.

• There are no body conduction losses.

Because there is no inductor and its current is freewheeling, the body diode of the device will not conduct during the dead time, which is normally reserved in the to avoid the short-through among the switches in the totem-pole structure.

(B) Conduction Loss Analysis

To simplify the deviation for the conduction, the parasitic inductors and the ESR of the

capacitors are ignored. The resulting equivalent circuit is shown in Figure 2.23, where RDSon is the on-resistance of each MOSFET. Since the switching loss of the voltage divider is minimized, now we would like to know how to calculate the conduction loss of the circuit.

+ C + C + C2 v 1 1 c2 v v _ c1 + c1 + _ R=2R _ iC2 DSon

Vin + Vin R=2RDSon C2 vc2 Io _ Io _ C3 _ C3 iC2

0~T /2 Ts/2~Ts s

Figure 2.23 The equivalent circuit of the voltage divider

39 Chapter 2. High Power Density Non-Isolated Bus Converters

To derive the RMS current on C2, the voltage on C2 must be solved first. Since the circuit is symmetric during the 0-Ts/2 period and the Ts/2-Ts period, the solution during 0-Ts/2 is provided as the example.

The voltage on C2 during 0-Ts/2 can be easily solved as:

v (t) = k e−t /τ + B ⋅t + k (2-3) c2 1 2

where k1 and k2 are variables which are to be determined, and B and τ are:

I (C +C )C B = o τ = R⋅ 1 3 2 (2-4) C + C + C C +C +C 1 2 3 1 2 3

Due to the charge balance, the average absolute current on C2 is Io. Then:

T / 2 Ts / 2 T I ΔV = v (T / 2) − v (0) = s i (t)dt = s o c2 c2 s c2 ∫ C 2 (2-5) C2 0 2C2

During 0-Ts/2:

vc2 = Vin − R ⋅ic2 − vo (2-6)

During Ts/2-Ts:

vc 2 = vo − R ⋅ ic 2 (2-7)

Then:

1 Ts 1 TTss/ 2 V = v dt = [ (V − R⋅i − v )dt + (v − R⋅i )dt] c2(avg) ∫ c2 ∫∫in c2 o o c2 Ts 0 Ts 02T / s (2-8) V TTss/ 2 V TTss/ 2 = in + I − v dt + v dt = in − v dt + v dt 2 c2(avg) ∫∫o o 2 ∫∫o o 02Ts / 02Ts / Due to the symmetry of the circuit:

1 Ts / 22 1 Ts V = v dt = v dt oavg ∫ o ∫ o (2-9) Ts / 2 Ts / 2 0 Ts / 2 Then:

40 Chapter 2. High Power Density Non-Isolated Bus Converters

Vin Vc2(avg ) = (2-10) 2

From (2-5) and (2-10), two boundary conditions for vc2 can be derived (For the curve as (2-3), the average voltage is in the middle of the peak-to-peak voltage):

Vin T2Io Vin T2 I o vc2 (0) = − vc2 (Ts / 2) = + (2-11) 2 4C 2 4C 2 2

From (2-3) and (2-11), k1 and k2 can be derived as:

V T I k = A, k = in − 2 o − A (2-12) 1 2 2 4C 2 where

I T I T o s − o s 2C2 2(C1 + C2 + C3 ) (2-13) A = −T /(2τ ) e s −1

Finally, the current on C2 during 0-Ts/2 period can be expressed as:

A −t /τ ic2 (t) = C2 (− e + B) (2-14) τ The RMS current can be calculated as

A2 4AB(e−Ts /(2τ ) −1) − (e−Ts /τ −1) (2-15) I = C B 2 + τ 2RMS 2 T s

After vc2 is solved, vc1 and vo can be solved correspondingly. This completes the analytical solution of the proposed voltage divider.

The peak and RMS current on each MOSFET is:

A I peak = C 2 (− + B) τ (2-16)

I QRMS = I 2RMS / 2

41 Chapter 2. High Power Density Non-Isolated Bus Converters

To normalize (2-15), we define:

I 2N = I2RMS / Io τ N =τ /Ts k N = C2 /(C1 + C3 ) (2-17)

Substituting (2-17) into (2-15), the normalized RMS current on C2 can be expressed as:

2 −1 / τ N k N (2 + k N ) [0.5 /(1+ k N )] e −1 I 2N = + (2-18) (1+ k ) 2 τ (e −0.5 / τ N −1) 2 N N

Because the average currents of C1 and C3 are zero, the output DC current has to be equal

to the sum of the currents of Q2 and Q3. It can be easily seen that the current of C2 is rectified by

Q2 and Q3 as the DC output current. Therefore,

I 2 = I o (2-19) avg

Due to the charge balance of C2, the positive area of the current of C2 should be equal to its negative area. According to (2-19), the average amplitude of either positive or negative current

pulse should be Io. It is known that Q1 conducts the positive current pulse of C2 in 50% duty cycle, so the average input current should be:

I = I / 2 in o (2-20)

.Based on the conservation of energy, the output voltage can be calculated with:

V I 2 R + P V = in − 2rms sw (2-21) o 2 I o

where Psw is the total switching-related loss, which could be limited to a tiny level by the appropriate capacitor design.

2.4.5. Design of C1, C2 and C3

Capacitor C2 needs to be large enough to limit the peak current when the capacitors are in parallel, as shown in Figure 2.24.

42 Chapter 2. High Power Density Non-Isolated Bus Converters

100

80

60 (A) )

peak 40 I

20

0 5 4 4 5.10 1.10 1.5.10

C2 (µF)

Figure 2.24 Peak current on MOSFET vs. C2 (R = 6mΩ, fs = 375kHz, Io = 12A)

Since the switching-related losses of the proposed voltage divide can be small enough to be ignored, the MOSFET conduction losses are the only concern for efficiency in this area. Using

(2-18), Figure 2.25 shows the relationship of I2N with kN and τN. It is clearly shown that larger kN or larger τN leads to lower RMS current. If kN and τN are determined, C2 and C1+C3 can be determined as follows:

τ (k +1)T τ N (k N +1)Ts N N s C = C1 + C3 = (2-22) 2 k R R N

τN increases kN increases

(a) Normalized C2 RMS current I2N vs. kN (b) Normalized C2 RMS current I2N vs. τN

Figure 2.25 C2 RMS current vs. kN and τN

43 Chapter 2. High Power Density Non-Isolated Bus Converters

⎛ 0.2 ⎞ I2N ⎜kN , ⎟1.15 C2 increases ⎝ kN+1 ⎠ ⎛ 0.3 ⎞ I2N ⎜kN , ⎟ 1.1 ⎝ kN+1 ⎠ ⎛ 0.4 ⎞ I2N ⎜kN , ⎟ ⎝ kN+1 ⎠1.05 ⎛ 0.5 ⎞ I2N ⎜kN , ⎟ ⎝ kN+1 ⎠ 1 ⎛ 0.6 ⎞ I2N ⎜kN , ⎟ ⎝ kN+1 ⎠0.95

0.9 1 1.5 2 2.5 3 3.5 4 4.5 5 kN

Figure 2.26 C2 RMS current vs. kN for different C2 (each curve corresponds to one C2 value)

It is preferable that both kN and τN be large, whereas a large kN and τN require large

capacitors for C1 through C3, which means a large footprint and high cost as well. The trade-off

between the RMS current of C2 and total capacitance can be done in two ways: one is a large kN and low τN, the other is low kN and large τN. Both of them can set the RMS current in the vicinity of the minimum value.

Given (kN+1)* τN, C2 can be determined by Equation (2-22). Figure 2.26 clearly shows

how the RMS current C2 is impacted by kN for a given C2. A large C2 can greatly diminish the

effect of kN on the RMS current. Consequently, it is proposed that C2 be designed with a large

enough capacitance for high efficiency being the foremost consideration. After that, C1+C3 can be determined taking other factors into consideration.

Normally C1 and C3 can be selected with consideration of output voltage ripple and input

AC current. Figure 2.27(a) shows an example of the influence of the C1+C3 on the output voltage

ripple. A proper C1+C3 should be selected to meet the ripple requirement. Figure 2.27(b) shows

an example of the influence of C1 on the input AC RMS current. An optimal C1 will give the lowest input AC current.

For better understanding, a design example is shown as below.

Circuit configuration is: Vin=12V, Iomax=12A, R = 6mΩ (Device: Si7104), fs = 375kHz.

44 Chapter 2. High Power Density Non-Isolated Bus Converters

First, it is determined that kN=1.2 and τN=0.15 to achieve low RMS current and output

ripple. Then, from Equation (2-22) we see that C2 is equal to 130µF, and C1+C3 is 90µF. To achieve the lowest input AC current, C1=45µF; then C3 = 45µF.

Finally the RMS current on each capacitor should be checked to decide how many capacitors must be paralleled to satisfy the current rating requirement of each capacitor. Based on calculations, the maximum RMS current for the three capacitors are:

I1RMS = 3A, I2RMS = 13A, I3RMS = 3A

Generally, low-ESR capacitors such as the multi-layer ceramic capacitor (MLCC) are preferred for the proposed voltage divider, which can reduce the power losses of the capacitors. Each 22µF TDK ceramic capacitor has a 2A RMS current capability. Two paralleled 22µF capacitors are used for C1 or C3. For C2, six paralleled 22µF capacitors are needed. o / V in / I inAC oripple I V

C /(C +C ) kN 1 1 3 (a) (b)

(a) Normalized Vo ripple vs. kN ((kN+1)τN=0.35, Io=12A)

(b)Normalized Input AC RMS current vs. C1/(C1+C3) (kN=1.2, τN = 0.14, Io = 12A, Iin = 6A)

Figure 2.27 Output ripple and input AC current vs. C1 and C3

2.4.6. Light-Load Efficiency and Gate-Driving Method

(A) Light-Load Efficiency

After the capacitor design is finished, τΝ and kN are determined. Then τN =τ*fs is proportional to the switching frequency. Figure 3.24 shows that the lower the τN is, the higher the

45 Chapter 2. High Power Density Non-Isolated Bus Converters

C2 RMS current will be. This means that for a given load current, a lower frequency means a higher conduction loss. Thus at heavy load, a higher frequency is needed. However, a lower frequency also leads to lower gate driver loss, which is normally the dominant loss at light load. Thus, lowering the switching frequency at light load can improve the light-load efficiency of the proposed voltage divider. Figure 2.28(a) shows a batch of efficiency curves at different frequencies. If the peak efficiency points of those curves are connected, an optimal efficiency curve can be achieved by varying the switching frequency in respect to the load current, as shown in Figure 2.28(b).

(B) Gate Driving Method

As mentioned above, there is not a stringent demand on gate driving for the devices. As a result, the most cost-effective way of driving the devices is to use a driving transformer. In low- voltage applications, the commercial boost trap gate driver IC can be utilized for concise circuitry. Figure 2.29(a) shows the driving schematic. Two gate driver ICs are used to drive the four . A boost trap scheme is used to get the PWM input signal for the upper two

MOSFETs. The voltages on C1 and C3 can also be utilized as the voltage sources of the gate

driver. Figure 2.29(b) shows the test gate signals for the proposed gate driving when Vin=12V

and Io=10A. It shows that the proposed gate driving works very well.

99.0% f decreases 98.8% s 300k

98.6% 250k 200k 98.4% 150k

98.2% 100k Efficiency 98.0% 50k

97.8% 0 5 10 15 Io (A)

(a) Efficiency vs. Io for different frequencies

46 Chapter 2. High Power Density Non-Isolated Bus Converters

100% 99% 98% 97% Variable frequency 96% 95% 94% Efficiency Efficiency 93% 92% 91% fixed frequency 90% 0.1 1 10 100 Po (W)

(b)Boosting light-load efficiency by lowering switching frequency at light load

Figure 2.28 Light-load efficiency improvement for proposed VD

Q1 Vg_Q1

Q2

Vin

Vgs_Q4

Q3

Q4

(a) Driving scheme (b)Tested gate signals

Figure 2.29 Proposed gate driving for VD

2.4.7. Prototype

One prototype has a 12V input with 6V/12A output for laptop applications. The prototype has achieved 1kW/inch3 power density. For reference, Table 2.1 gives the component

47 Chapter 2. High Power Density Non-Isolated Bus Converters parameters. Since the voltage rating for the devices is half of the input voltage, a 12V MOSFET which has better performance than 30V MOSFET is applied this voltage divider.

Table 2.1 Parameters in the 70W Voltaage Divider prototype

Switching frequency 375kHz

Capacitance of C1, C2 and C3 C1, C3: 2 22µF MLCC C2: 6 22µF MLCC

Switches, Q1~Q4 12V device: Vishay Si7104 (Rdson = 3.1mohm)

Driver 2× LM2722, National Semiconductor

(a) Prototyppe of 70W voltage divider

100% 99% 98% 97% variable frequency fixed frequency 96% Over power 95% 94% Efficiency 93% 92% 91% 90% 0.1 1 10 100 Po (W)

(b) Tested efficiency (including over power efficiency)

Figuure 2.30 Prototype and efficiency of 70W voltage divider for laptop

The overall efficiency is given in Figure 2.30(b). It can be seen that it can achieve high efficiency with 70W output power. One more benefit from the switched-capacitor voltage divider is that it has very good over-power capability. However, buck has poor over-power capability due to inductor core saturation with high current.

48 Chapter 2. High Power Density Non-Isolated Bus Converters

Another prototype has a 12V input/6V 25A output for server VR application. The circuit parameters are shown in Table 2.2. Figure 2.31 shows the prototype and tested efficiency. The power density is also about 1kW/in3.

Table 2.2 Parameters in the 150W Voltage Divider prototype

Switching frequency 350kHz

Capacitance of C1, C2 and C3 C1, C3: 2 44µF MLCC C2: 10 22µF MLCC

Switches, Q1~Q4 12V device: Ciclon 13301 (Rdson=1.3mΩ)

Driver 2× LM2722, National Semiconductor

(a) Prototype 99.5% 99.0% 98.5% 98.0% 97.5%

Effciency 97.0% fixed frequency 96.5% variable frequency 96.0% 95.5%

10Po (W) 100

(b) Tested efficiency

Figure 2.31 Prototype and efficiency of 150W voltage divider for server

49 Chapter 2. High Power Density Non-Isolated Bus Converters

2.4.8. Effect of Parasitic Inductors for High Frequency Operation

If we look at the previous 375kHz 70W voltage divider design for [33], [36], we

see there is too much capacitance (C1 = C3 = 44µF, C2 = 132µF), which adds more cost and a larger footprint. In a 150W voltage divider design, the capacitance is even larger. Ideally, there is no switching-related loss for the voltage divider except for capacitive turn-on loss. To keep the same RMS current on devices (parasitic inductance is ignored), we should have:

C ∝1/ f 2 s (2-23) where fs is the switching frequency. This clearly shows that the capacitance can be reduced by raising the frequency of voltage divider.

The previous analysis does not include parasitic inductors since it works at a low switching

frequency and C2 is pretty large. It is a simplified first-order model. In the real circuit, there is parasitic inductors, such as the inductor ESL, trace inductors, device parasitic inductors, etc., as

shown in Figure 2.32(a). Figure 2.32(b) shows the moment when Q2 and Q4 turn off, where Lp is

the total parasitic inductance in the loop. Then the total energy of the parasitic inductors when Q2 and Q4 are turned off is:

E = 0.5L I 2 (2-24) L p off

The following analysis focuses on the device packaging inductors [37]. Part or all of the energy can be dissipated in the loop, which introduces turn-off loss. When the switching frequency becomes high, the turn-off loss caused by parasitic inductors becomes high correspondingly, and can no longer be ignored.

50 Chapter 2. High Power Density Non-Isolated Bus Converters

Ld Q1 Q1 Ls C1 C Coss 2 Q2 Q2

Vin + + Q3 Q Vin 3 Ioff L V p O VO Q4 C 3 Q4 - -

(a) Parasitic inductance in the voltage divider (b) Equivalent circuit when Q2 and Q4 are turned off

Figure 2.32 The influence from the parasitic inductors

After considering the parasitic Lp, the loss breakdown of the voltage divider at different frequencies is obtained by SABER simulation, as shown in Figure 2.33. It clearly shows that at low frequencies, switching loss is negligible, while at high frequencies, switching loss can be very high. Another interesting phenomenon is that at high frequencies a higher C2 does not mean lower loss, which cannot be predicted by the previous first-order model.

After considering the parasitic Lp, the proposed voltage divider is similar to the resonant switched-capacitor converter [38] or three-level buck converter [39]. However, since the parasitic inductors cannot be simply treated as the lumped inductance for the entire transition time, it is more complex and the efficiency cannot be easily predicted.

2.5 3.5 2 C2=66uF 3 C2=10uF C2=132uF 2.5 C2=44uF 1.5 2 1 1.5 Loss (W) Loss (W) 1 0.5 0.5 0 0 Tot al Tot al Gate driving Total loss Tot al Tot al Gate driving Total loss: conduction switching loss conduction switching loss loss loss loss loss

(a) Switching frequency = 330kHz (b) Switching frequency = 1MHz

Figure 2.33 Loss breakdown of voltage divider (Vin = 12V, Po = 70W, Lp = 3nH)

51 Chapter 2. High Power Density Non-Isolated Bus Converters

With some assumptions, the switching losses and conduction losses can also be estimated for the voltage divider with parasitic inductance. The assumptions are:

1) The loop inductance when Q1 and Q3 are on is the same as the loop inductance when Q2

and Q4 are on. When this is true is the device parasitic inductance (Ld+Ls) is dominant. 2) The inductance is small enough that the energy in the inductor is dissipated in the loop when the devices are turned off. As a result, the current is zero when the switches are on. 3) The output capacitance is large enough that it can be treated as a voltage source.

Based on the above assumptions, the equivalent circuit can be drawn as Figure 2.34. Since

the circuit is symmetrical, the solution for 0-Ts/2 is given in the following analysis.

+ v C2 _ c2

iL Vo L P LP iL R Vo Vin + C v Io 2 _ c2 Io

0~Ts/2 Ts/2~Ts

Figure 2.34 The equivalent circuit of the voltage divider with parasitic inductance

Since the equivalent is a typical second-order system, the solution can be easily solved with the following boundary conditions:

Vin Ts Io Vin Ts Io vc2 (0) = − vc2 (Ts / 2) = + iL (0) = 0 (2-25) 2 4C2 2 4C2

The final solution for the C2 voltage within half of the switching cycle is:

2 (a) When 4LP C 2 − (RC 2 ) > 0

52 Chapter 2. High Power Density Non-Isolated Bus Converters

(t) = A e −αt sin(ωt) + A e −αt cos(ωt) + V − V vC 2 1 2 in o

T I / 2C A α s o 2 2 A2 = αT αT A1 = α ω − s ω − s ω sin( T )e 2 + cos( T )e 2 −1 ω 2 s 2 s (2-26) Ts I o Vin Vo = A2 + + 4C2 2

2 4LPC2 − (RC 2 ) where α = R /(2L P ) , and ω = 2L C P 2 .

2 (b) When 4LPC2 − (RC 2 ) < 0

(t) = A e f1t + A e f 2t + V − V v C 2 1 2 in o

Ts I o / 2C2 f A A = 2 2 2 f2Ts / 2 f1Ts / 2 A1 = − e −1− (e −1) f 2 / f1 f1

f 2 Ts I o Vin Vo = A2 (1− ) + + (2-27) f1 4C2 2

where

− RC + (RC ) 2 − 4L C − RC − (RC ) 2 − 4L C f 1 = 2 2 P 2 , f 2 = 2 2 P 2 2LPC2 2LPC 2

Based on (2-26) or (2-27), the inductor current can be derived by:

d i (t) = C V (t) (2-28) L 2 dt C 2

The shape of the inductor current during half of the switching period is shown in Figure

2.35. It clearly shows that with a smaller C2, both turn-off current and RMS current are high.

With a larger C2, the RMS current can be reduced. However, the turn-off current is high. It

seems there should be an optimal C2 for each switching frequency.

53 Chapter 2. High Power Density Non-Isolated Bus Converters

25

20

t)15

t)10 C2=40uF

5 t) C2=20uF

Inductor current (A) current Inductor 0

5 C2=15uF 7 7 7 02.10 4 .10 6 .10 Timet (s)

Figure 2.35 Inductor current with different C2 for half of the switching period

( fs = 650kHz, Lp = 3nH, R = 6mΩ, Vin = 12V, Io = 12A)

With (2-28) and the circuit condition, the conduction losses and switching losses can be estimated as follows:

2 Pcond = R ⋅ I RMS

2 2 Psw = CeqVin f s + Lp Ioff fs (2-29)

2 Ts / 2 I = i (t)2 dt I = i (T / 2) RMS ∫ L , off L s Ts 0

Figure 2.36 shows the relationship of the switching loss, conduction loss and total power losses and C2. The minimum switching loss occurs when the resonant frequency of Lp and C2 is

equal to the switching frequency. It shows that there is an optimal C2 that can provide the

minimum total power loss, and the optimal C2 is not necessarily equal to the C2 value where the switching loss is the minimum.

54 Chapter 2. High Power Density Non-Isolated Bus Converters

10

Total loss )

1 Conduction loss

Switching loss Power Loss (W)

0.1 6 5 4 3 1 .10 1 .10 1 .10 1 .10 C2C2(F)

Figure 2.36 Total power loss vs. C2 for a given switching frequency

( fs = 650kHz, Lp = 3nH, R = 6mΩ, Vin = 12V, Io = 12A)

Figure 2.37 shows the relationship between the total power losses and C2 for different switching frequencies. It shows that there is a different optimal C2 for different frequencies that can provide the minimum power loss.

10

)

)

fsincreases Power Loss (W)

1 6 5 4 3 1 .10 1 .10 1 .10 1 .10 C2C2 (F)

Figure 2.37 Total power loss vs. C2 for different switching frequencies

( Lp = 3nH, R = 6mΩ, Vin = 12V, Io = 12A, driving loss not included)

55 Chapter 2. High Power Density Non-Isolated Bus Converters

Figure 2.38 shows that, as expected, C2 can be reduced by raising the frequency. However, higher frequency also means higher power loss, so there is a trade-off in design between C2 reduction and voltage divider efficiency. In our design, a 700kHz frequency is chosen as the starting point for the trade-off. The final frequency is based on the experimental result.

140 3.3 120 3.1 2.9 (W) 100 2.7 80 2.5

C2 value 2.3 Loss 60 C2(µF) 40 Power loss 2.1 1.9 20 1.7 0 1.5 Power 200 700 1200 1700 2200 fs (kHz)

Figure 2.38 Optimal C2 and minimum power loss vs. frequency

( Lp = 3nH, R = 6mΩ, Vin = 12V, Io = 12A, driving loss included)

The above analysis is based on the given device parasitics, Ld = Ls=0.5nH and Lp = 3nH. Figure 2.39 shows the impact from the different parasitic inductances with 700kHz frequency.

When the inductance decreases, a higher C2 is required to have lower power loss. In terms of C2 reduction, higher parasitic inductance is preferred. However, higher inductance also means higher power loss, even at the optimal point. Another disadvantage of higher inductance is that when Lp is higher, power loss is very sensitive to the C2 value. As we know, Lp cannot be predicted very accurately, and capacitance has a large tolerance. Therefore it is better to decrease

parasitic inductance to minimize the negative effect from the sensitivity of C2.

To minimize parasitic inductance, at least two changes can be made. One is to choose device packages with lower parasitic inductance, e.g. DirectFET, PolarPak, Powerpak1212, etc. The other is to improve the PCB layout. Both possibilities need to be considered in real hardware design.

56 Chapter 2. High Power Density Non-Isolated Bus Converters

2.5 Ld=0.5nH 2.3 Ld=0.3nH 2.1 Ld=0.1nH 1.9

1.7

Power loss loss Power (W) 1.5

1.3 0 20406080100 C2 (µF)

Figure 2.39 Power loss vs. C2 for different parasitic

Figure 2.40 shows the 70W 650kHz voltage divider for a laptop two-stage VR. Compared with the 375kHz voltage divider, it doubles the power density (2200W/in3) and has a 47% footprint reduction and a 70% capacitance reduction. The efficiency is around 97%, which is only 0.5% lower than the 375kHz design.

375kHz VD 47% footprint reduction 650kHz VD Footprint: 1.2cm*2.2cm Doubled power density Footprint: 1.0cm*1.4cm Power density: 1000W/in3 Power density: 2200W/in3

70% capacitance reduction C1=C3=44µF, C2=132µF C1=C3=20µF, C2=35µF

100% 375kHz 99% 650kHz 98%

97%

96% Efficiency Efficiency 95%

94% 0 20406080 Po (W)

Figure 2.40 650kHz, 70W voltage divider prototype and tested efficiency curve

57 Chapter 2. High Power Density Non-Isolated Bus Converters

2.5. Transient Analysis of the Proposed Voltage Divider

Since the voltage divider is used as the first stage and it can be followed by several different second stages, two important concerns emerge. The first is the stability of the whole system. The second concern is how the bus voltage, which is the output voltage of the voltage divider, responds to the VR load transient. If the bus voltage changes a lot with one VR load transient, it may cause undesirable voltage noise with other VRs. A transient analysis is required to answer the above two questions.

2.5.1. Output Impedance

The most important performance parameter of the voltage divider is its output impedance,

Zo. The circuit necessary to derive the Zo is illustrated in Figure 2.41, where the input voltage is

shortened and Rc1, Rc2, and Rc3 represent the ESR values of the capacitors. Since C1 and C3 are in parallel, they can be lumped with Co = C1+C3, Ro = Rc1||Rc3, assuming C1 and C3 are the same

type of capacitors. The equivalent circuit is shown in Figure 2.41(b), where R = 2RDSon + Rc2 and

RDSon is the on-resistance of each MOSFET. Normally, R >> Ro and C2 >> Co, so R and Co form a low-pass filter in the circuit. If the low-pass filter corner frequency is much lower than the switching frequency, we can apply the state-space average model [41], [42]:

During 0~Ts/2:

⎛dv / dt ⎞ ⎛−1/[(R + R )C ] −1/[(R + R )C ]⎞⎛v ⎞ ⎛− R /[(R + R )C ]⎞ ⎜ c2 ⎟ = ⎜ o 2 o 2 ⎟⎜ c2 ⎟ + ⎜ o o 2 ⎟i ⎜ ⎟ ⎜ ⎟⎜ ⎟ ⎜ ⎟ o ⎝ dvco / dt ⎠ ⎝−1/[(R + Ro )Co ] −1/[(R + Ro )Co ]⎠⎝vco ⎠ ⎝ R /[(R + Ro )Co ] ⎠ During Ts/2~Ts:

⎛dv / dt ⎞ ⎛−1/[(R + R )C ] 1/[(R + R )C ] ⎞⎛v ⎞ ⎛ R /[(R + R )C ]⎞ ⎜ c2 ⎟ = ⎜ o 2 o 2 ⎟⎜ c2 ⎟ + ⎜ o o 2 ⎟i (2-30) ⎜ ⎟ ⎜ ⎟⎜ ⎟ ⎜ ⎟ o ⎝ dvco / dt ⎠ ⎝ 1/[(R + Ro )Co ] −1/[(R + Ro )Co ]⎠⎝vco ⎠ ⎝ R /[(R + Ro )Co ] ⎠

Averaging the above two states with the same weight yields:

⎛dv / dt ⎞ ⎛−1/[(R + R )C ] 0 ⎞⎛v ⎞ ⎛ 0 ⎞ ⎜ c2 ⎟ = ⎜ o 2 ⎟⎜ c2 ⎟ + ⎜ ⎟i ⎜ ⎟ ⎜ ⎟⎜ ⎟ ⎜ ⎟ o ⎝ dvco / dt ⎠ ⎝ 0 −1/[(R + Ro )Co ]⎠⎝vco ⎠ ⎝ R /[(R + Ro )Co ]⎠

Based on the average state equation, the output impedance of the voltage divider can be easily derived:

58 Chapter 2. High Power Density Non-Isolated Bus Converters

R(1+ sCo Ro ) Zo (s) = (2-31) 1+ sC (R + R ) o o

Q1 R R + - Rc1 Zo v Q C2 C2 C C 2 - + 2 + 2 C1 vC2 + + - + Q 3 VO iO VO i O Rc2 Ro Ro V R O + + c3 vCo C v C - o Co o Q - - - 4 C 3 - Q and Q on Q and Q on 1 3 2 4

Figure 2.41 Equivalent circuit to derive output impedance

This model matches very well with the SIMPLIS simulation, which can represent the experimental result, as demonstrated in Figure 2.42(a). At low frequency, the impedance is

determined by R≈2RDSon. So the output impedance of the voltage divider is very low due to the low on resistance of the low-voltage MOSFET.

Another interesting phenomenon is that the output impedance is independent of C2 as long as the state space averaging is applied. This is verified by SIMPLIS simulation, as shown in Figure 2.42(b).

In a real circuit, since there is parasitic inductance from the capacitor ESL, the device’s parasitic inductance, PCB trace inductance, etc., at high frequencies the output impedance of the voltage divider is dominated by the ESL in the circuit, which is shown in Figure 2.42(c).

20 20 Simplis C2=132uF 30 Model 30 C2=220uF 40 40 Gain Gain 1 50 50 Gain 2 Gain 60 60 Gain (dB) Gain Gain (dB) Gain 70 70 80 80 3 4 5 6 7 3 4 5 6 7 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 freq frequencyfreq (Hz) Frequency (Hz)

59 Chapter 2. High Power Density Non-Isolated Bus Converters

(a) Comparison between model and SIMPLIS (b) Simulated Zo for different values of C2

simulation ( C2 = 132µF)

20 w/o ESL 30 With ESL 40 Gain 1 50 Gain 4

Gain (dB) Gain 60 70 80 3 4 5 6 7 1 . 10 1 . 10 1 . 10 1 . 10 1 . 10 Frequencyfreq (Hz)

(c) Simulated Zo after considering the ESL in the circuit

(C2=132µF, ESL of C2 is 1nH, ESL of Co is 500pH)

Figure 2.42 Output impedance of the voltage divider (R=10mohm, Ro=1mohm, Co=88µF)

2.5.2. Stability of Two-Stage VR with Voltage Divider as the First Stage

For a cascaded system, if the output impedance of the first stage (Zo1) and the input

impedance of the second stage (Zin2) meet (2-32) it is a stable system [40].

| Z / Z |< 1 o1 in2 (2-32)

Figure 2.43 shows an example of Zo1 and Zin2 (CPU VR input impedance). It is clear that

Zo1 is much lower than Zin2 for all frequencies, so the system is stable.

30 20 10 0 Zin2 -10 -20

Gain (dB) -30 Zo1 -40 -50

1k 2k 4k 10k 20k 100k 400k 1M 2M 4M 10M

Frequency (Hz)

Figure 2.43 An example of Zo1 and Zin2

Figure 2.44 verifies the above analysis with experimental results. Although the CPU has a fast load transient, the bus voltage only has low variation, and the output voltage of the CPU VR

60 Chapter 2. High Power Density Non-Isolated Bus Converters can meet the adaptive voltage position (AVP) requirement by Intel. The whole system is stable and works very well.

20A/div Io 50mV/div

Zo1 Zin2 Vo Vin Vbus V o 20A/div Voltage divider CPU VR 50mV/div

io

Vbus 0.5V/div0.5V/div t: 100us/DIV

(a)Experiment setup (b)Transient waveforms

Figure 2.44 Experiment setup and test results

2.5.3. Interaction between VRs

To save on cost, several VRs may share the same voltage divider as their first stage. Therefore it is important to study if there is interaction between these VRs.

Figure 2.45 shows an example with two VRs sharing the same first stage. It is important to know if VR #1 has fast load transient, and the nature of the voltage noise generated on VR #2.

That means the io1 to vo2 transfer function need to be studied.

v bus vo1 Voltage divider VR #1 iin1 i bus io1 vin

iin2 vo2 VR #2

io2

Figure 2.45 Two VRs share the same voltage divider

61 Chapter 2. High Power Density Non-Isolated Bus Converters

Assume io1 has small signal perturbation, while io2 is constant. Then:

^ ^ vbus vbus (2-33) ZoVD = ^ = ^ ^ ibus iin1 + iin2

^ ^ ^ iin1 = i o1 G + vbus / Z (2-34) iii1 in1

^ ^ iin2 = vbus / Z (2-35) in2

where ZoVD is the output impedance of the voltage divider, Giii1 is the transfer function from io1 to iin1, and Zin1 and Zin2 are the input impedances of the two VRs.

Substituting (2-34) and (2-35) into (2-33), we have:

^ vbus G Z = iii1 oVD (2-36) ^ 1− Z / Z − Z / Z i o1 oVD in1 oVD in2

Since ZoVD<

^ v o2 Z o21 = ^ ≈ Gvg 2Giii1Z oVD (2-37) i o1

where Gvg2 is the audio-suitability of VR #2.

Figure 2.46(a) is an example of Zo21. Again, due to the low output impedance of the voltage

divider, the gain of Zo21 is also very low (below -100dB). It means if io1 has 50A load step, the 5 maximum generated voltage noise on vo2 is 50/10 =0.5mV, so there is almost no interaction between the two VRs. Figure 2.46(b) shows the experimental results with CPU VR and DDR VR sharing the same first stage. It shows that when CPU VR has a load transient, there is no voltage noise observed on the output voltage of DDR VR.

62 Chapter 2. High Power Density Non-Isolated Bus Converters

Voltage divider Vbus CPU VR Io_CPU Po=70W

DDR VR Vo_DDR

-50

-70

-90

-110 20A/div -130 Io_CPU -150 Gain (dB) -170 0.5V/div -190 Vbus -210 1.E+03 1.E+04 1.E+05 1.E+06 Frequency (Hz) Vo_DDR 0.1V/div

(a)The simulated transfer function from io1 to vo2 (b)Tested waveforms

Figure 2.46 No interaction between different VRs due to low output impedance of voltage divider

In the above analysis, only two VRs are considered. Actually the results can also be applied to the system with multiple VRs. Assume VR #n has a load transient, the transfer function to VR #n current to VR #m output voltage is:

^ vom Zomn = ^ ≈ GvgmGiiin ZoVD (2-38) i on

As a result, there should have no interaction among VRs with the voltage divider as the first stage of two-stage power architecture.

As a comparison, Figure 2.47 shows the transfer functions from io1 to vo2 for voltage divider as the first stage and buck as the first stage. They can achieve the similar peak gain. However, buck first stage needs three 270µF output SP capacitors with 50kHz close-loop control bandwidth. Voltage divide only has two 22µF ceramic capacitors with open loop.

63 Chapter 2. High Power Density Non-Isolated Bus Converters

-50

-70 VD as first stage Buck as first stage -90

-110

-130

-150 Gain (dB) Gain -170

-190

-210 1.E+03 1.E+04 1.E+05 1.E+06 Frequency (Hz)

Figure 2.47 Comparison of transfer function from io1 to vo2 for voltage divider first stage and buck first stage

2.6. Startup and Protection of the Proposed Voltage Divider

2.6.1. Proposed Soft-Start Method for Switched-Capacitor Voltage Divider

As mentioned above, at steady state, since Vc2,Vc1 and Vc3 are very close, the current on devices is relatively small. However, at startup, the voltage difference between capacitors is very large, which causes a huge surge current on the devices. A huge input current is observed in

Figure 2.48 even with very low input voltage (Vin = 3V). This means the devices can be destroyed at startup. As a result, a soft-start should be applied for the voltage divider.

2V/div Vin

5A/div Iin

Vo 5V/div

Figure 2.48 Startup waveforms without soft-start (Vin=3V)

64 Chapter 2. High Power Density Non-Isolated Bus Converters

The reason for the high current is the high voltage difference between the capacitors and

the small RDSon of the devices. The reason for the small RDSon is that a high gate-source voltage is applied to the device. The MOSFET trajectory at startup (shown in Figure 2.49) verifies that the

high Vgs is one of the major reasons for a high startup current.

700 700

600 600 Vgs=5V

500500

400400 Id(A) Id(A) 300300

200200

100100

00 024681012024681012 Vds(V) Vds(V)

Figure 2.49 Device startup trajectory with Vgs = 5V (Device: HAT2165)

Since the MOSFET is a voltage-controlled when it operates in the saturation

region, the device current can be limited by reducing the Vgs at startup. Therefore, the gate- source voltage of MOSFETs can be increased gradually from its threshold voltage to limit the device current. Figure 2.50 illustrates the proposed soft-start scheme. C1 and C3 can be pre- charged to half of the input voltage with RC network. After that, the gate driving voltage is

ramped up to charge C2 voltage to half of the input voltage smoothly. Figure 2.51 shows the simulation result with the proposed soft-start method, where the device current can be limited to a relatively low value during the voltage divider startup. Figure 2.52 verifies the device junction temperature by treating the device’s power as a single pulse, which is worse than the real case. The maximum junction-to-case temperature rise is only 10oC. Thus there should be no safety- related problems for the device.

65 Chapter 2. High Power Density Non-Isolated Bus Converters

Vbus Vin Voltage divider Vo CPU VR Po=70W

Vdriver Io Soft Start VR controller PowerGood

Vin + Q1 C1 R Vbus Q C2 2 Vbus Vin + Vdriver Q3 V C3 R C2 Q4 - - PowerGood

Io t

Figure 2.50 Proposed soft-start method for voltage divider

The experimental result shown in Figure 2.53 verifies the proposed soft-start method. It works at no load, and the input current only has a small spike when the driver voltage reaches the threshold voltage of the device.

Vdriver

30A ID_Q1

VDs_Q1

ID_Q1*VDS_Q1

Vo

VC2

Figure 2.51 Simulation result of the proposed soft-start method

66 Chapter 2. High Power Density Non-Isolated Bus Converters

Simulate device power as a single pulse:

PD PDM=80W

100µs

0.03

100u

o Tjc = 0.03*Rjc*PDM = 0.03*4.17*80 = 10 C

Figure 2.52 Device junction temperature at startup

Vdriver (5V/div)

Vo (2V/div)

Iin (5A/div)

Figure 2.53 Experimental result at startup with the proposed method (Vin=12V)

2.6.2. Short Circuit and Over Current Protection

The general method used to protect the short circuit and over current is to sense the output current and to take protective action correspondingly. However, this requires a current-sensing circuit, which adds more cost and power loss. For the voltage divider, the load current can be estimated as [33]:

67 Chapter 2. High Power Density Non-Isolated Bus Converters

Vin / 2 −Vo Io ≈ (2-39) R o

As long as the conduction loss is the dominant loss for the voltage divider, (2-39) is valid. The test result shown in Figure 2.54 verifies the above equation. This means the voltage difference between Vin/2 and Vo can represent the output current of the voltage divider. Thus no current-sensing circuit is required.

0.18 0.16 0.14 0.12 0.1 0.08 0.06 Vin/2-Vo (V) Vin/2-Vo 0.04 0.02 0 0 5 10 15 Io (A)

Figure 2.54 Test result of Vin/2-Vo vs. Io ( Vin = 12V )

The above equation shows that the voltage difference will increase significantly at short circuit, and will increase linearly at over-current, as shown in Figure 2.55.

Short circuit Over current

Io Io

Vin/2-Vo Vin/2-Vo

Figure 2.55 Simulation result at short circuit and over current

68 Chapter 2. High Power Density Non-Isolated Bus Converters

The detailed protection scheme is presented in Figure 2.56, where Vth1 is much larger than Vth2. If the voltage difference is larger than Vth1, the four MOSFETs are turned off immediately. If the voltage difference is larger than Vth2, which is proportional to the maximum load current, it will delay some time to avoid the noise problem. After that, if it is still larger than Vth2, the four MOSFETs are turned off to shut down the voltage divider.

ΔV Vth1 ΔV Vth2

Vth1>>Vth2 Shutdown Shutdown

Td

(a) Short circuit protection (b) Over current protection

ΔV = 0.5V −V in o

Figure 2.56 Protection of voltage divider based on the voltage difference between Vin/2 and Vo

2.7. Other Related Topologies

2.7.1. Controllability: Achieving Output Voltage Regulation

It has been demonstrated that the proposed voltage divider can achieve high efficiency and high power density. Since it is used as the first stage of the two-stage solution, no regulation is required. However, the ABVP concept cannot be applied with the switched-capacitor voltage divider. Also, since it is unregulated, the voltage divider cannot work in parallel to extend the current capability.

To achieve voltage regulation, one inductor must be inserted. Since we do not want the inductor size to be large, a three-level resonant buck [39] is proposed for the voltage divider, as shown in Figure 2.57. If the duty cycle for each switch is 50% and the switching frequency is

equal to the resonant frequency of C2 and Lo, all switches can achieve zero-current switching

(ZCS). As a result, the switching loss can be minimized and the size of Lo can be reduced.

69 Chapter 2. High Power Density Non-Isolated Bus Converters

Q1

C2 Q2 Lo A

Q V 3 + in Co V O Q 4 -

Q1 Q1

C2 Q2 Lo C2 Q2 Lo + + v vc c - - V iL + V iL + in Q3 in Q3 Co V Co V O O Q Q 4 - 4 -

ILN 1 ILN fo = 2π LoC2

Vin/2 VcN V 0 0 Vin/2 cN

fo = fs fo = fs

Figure 2.57 Three-level resonant buck (fs = fo)

Figure 2.58 shows the efficiency comparison between the three-level buck converter and the voltage divider with a similar cost. It can be seen that the three-level resonant buck converter can achieve higher efficiency due to a lower switching frequency. Their footprints are the same, but the voltage divider still has higher power density due to lower profiles.

Figure 2.59 shows two different control methods to regulate the output voltage. In terms of voltage gain, the duty cycle control is more preferred since the gain is controlled with a wide frequency range for frequency control.

70 Chapter 2. High Power Density Non-Isolated Bus Converters

100.0%

99.0%

98.0%

97.0% Voltage divider 96.0%

Efficiency 3-level resonant buck 95.0%

94.0% 0 20406080 Po (W)

Figure 2.58 Efficiency comparison between 3-level resonant buck and voltage divider

( C2 = 50µF, Lo = 25nH, fs = 160kHz for 3-level resonant Buck

C2 = 132µF, fs = 375kHz for voltage divider. Devices: 4*Si7104)

Vgs fs=1/Ts 50%Duty Cycle Q1

Q & Q 2 4 Q 2 D

Q1 & Q3 Q3

Ts Q4

0.6 fs fo 0.6 0.5 0.5

0.4 0.4 0.3 R=0.5ohm 0.3 R=0.5ohm R=5ohm 0.2 R=5ohm 0.2 M=Vo/Vin 0.1 M=Vo/Vin 0.1 0 0 200 300 400 500 600 700 0 0.1 0.2 0.3 0.4 0.5 0.6 D fs (kHz)

(a) Frequency control (b) Duty cycle control

Figure 2.59 Two control methods to regulate output voltage

71 Chapter 2. High Power Density Non-Isolated Bus Converters

2.7.2. ZVS Voltage Divider

Now let us look at some other possible applications for the voltage dividers. Since the proposed voltage divider can achieve extremely high power density, it is desirable for applications where high power density is preferred. Bus converters are one possible application. Figure 2.60 shows transformer-based step-down conversion bus converters to improve efficiency [44]. The transformer is not necessarily required since the input 48V voltage is a safety voltage.

Inductor Transformer

Figure 2.60 Low power density 48V bus converters

Figure 2.61 shows the proposed high power density bus converter with two voltage dividers in series. The power density is about 1050W/in3, which is 10 times higher than the conventional bus converters. The efficiency is about 96%, which is similar as the conventional bus converters.

Q1 Q5 98.0% C1 C4 Q Q C2 2 C 6 5 97.0% 48V + + 12V V Q3 Q7 in C3 VO1O VO Q Q C6 4 8 96.0%

- - Efficiency

95.0% 0 100 200 300 400 Po (W)

Figure 2.61 Proposed high power density 48V-12V bus converter

For a high input voltage, since there is capacitive turn-on loss for the voltage divider, ZVS operation is preferred for higher efficiency. Again, to achieve ZVS turn-on, inductors must be inserted to provide energy to charge and discharge the junction capacitors of the MOSFETs.

72 Chapter 2. High Power Density Non-Isolated Bus Converters

Figure 2.62 shows a proposed ZVS voltage divider. It adds a small inductor L resonating with the output capacitor to create energy on L to achieve ZVS operation.

Q1

Q2

+ Q3 Vin L

VO Q4 -

Figure 2.62 Proposed ZVS voltage divider

Figure 2.63 is the simulated waveforms to demonstrate that all of the switches can achieve ZVS operation. As a result, the efficiency can be further improved.

Vds_Q4

Vgs_Q4

Vds_Q1 ZVS is Vgs_Q1 achieved

IL1

Vo

Figure 2.63 Simulated waveforms for ZVS voltage divider

( L = 8nH, C1+C3=16µF, C2 = 66µF, Vin = 48V, Io = 20A, fs = 330kHz)

73 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.1. Low Voltage Rating Devices for Second Stage

3.1.1. Introduction of New Device FOM Proposed by CPES

For low-voltage applications, there are three basic types of device technology widely used in industry: the vertical diffusion MOS (VDMOS), the trench MOS [45] and the lateral diffusion MOS (LDMOS) [46]. These structures are shown in Figure 3.1. Among these structures, the trench MOSFET has the lowest on-resistance, while the lateral MOSFET has the fastest switching speed. For sub-30V applications, the trench MOS and lateral MOS are widely used.

VDMOS Trench MOS LDMOS

Figure 3.1 Three basic structures for low-voltage application

Recently, the lateral-trench MOSFET structure was developed to obtain the benefits of both the lateral MOSFET and the trench MOSFET. Figure 3.2 shows one structure of a lateral- trench MOSFET [47].

Traditionally Qgd*Rdson, which is known as the figure of merit (FOM), has been used to evaluate device performance [48]. The definition of Qgd is shown in Figure 3.3. It is expected that this FOM is derived as a part of the device’s minimum loss. Hence minimizing this value means minimizing the device’s total loss. Figure 3.4 shows a summary of the FOM vs. breakdown voltage for different types of commercial MOSFET. All devices are for VRM

74 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency applications, and the devices for the same type of MOSFET are obtained from the same company to make a fair comparison. It can be seen that most of the MOSFETs represented are trench MOSFETs.

Figure 3.2 One example of lateral-trench MOSFET Structure

Vgs

VDR

Vplt

Vth

Q Q gs2 gd Q Qg gate

Figure 3.3 The definition of Qgd, Qgs2 and Qg

There are some trends clearly shown in Figure 3.4:

• For each structure, the FOM decreases when the breakdown voltage decreases; and • With a lower breakdown voltage (below 20V), the LDMOS and the lateral-trench MOSFET show significant benefits in terms of the FOM.

75 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

40 35 Infineon ‐ Trench 2 Vishay ‐ Trench 30 IRDirectFET 25 Renesas ‐ Trench D8 20 Infineon ‐ Trench 3 Renesas ‐ Trench D9 15 10 FOM=Qgd*Rds Greatwall‐ Lateral Ciclon‐ Lateral‐Trench 5 0 5 101520253035 Vds (V)

Figure 3.4 Figure of merit vs. breakdown voltage for different MOSFET structures ( All devices are for VRM applications)

The above FOM is actually derived from high-voltage applications, and it is not suitable for low-voltage high-current applications since it neglects the Qgs2 effect, which is related to the current rising time (for turn-on) or the current falling time (for turn-off). It also does not include the impact of the gate-driving voltage. A better FOM that is more suitable for low-voltage high- current applications was proposed recently by CPES. The new device figure-of-merit is proposed in [49]:

FOM=+⋅⋅() Q K Q R gdgsgsdson22 (3-1) V 2(VV− V ) KV()1=+ DR plt DR plt gs2 DR VV−⋅⋅ VIR plt th in o g where Qgd, Qgs2, VDR, and Vth are shown in Figure 3.3, Vin is the input voltage, Io is the load current, and Rg is the gate resistance of the MOSFET, including the driver resistance.

Figure 3.5 shows the new FOM vs. breakdown voltage. It can be observed that there are some differences from Figure 3.4. For example, the 25V trench MOSFET from Vishay has a higher new FOM than the 30V devices due to its larger Qgs2. It indicates that 25V may not be an optimal design for VRM application.

76 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

120 Vishay ‐ Trench 100 Infineon ‐ Trench 2 80 60 IRDirectFET Infineon ‐ Trench 3 40 Renesas ‐ Trench D8 Renesas ‐ Trench D9 20 Greatwall‐ Lateral Ciclon‐ Lateral‐Trench 0

FOM=(Qgd+Kgs2*Qgs2)*Rds 5 101520253035 Vds (V)

Figure 3.5 New figure of merit vs. breakdown voltage for different MOSFET structures (All devices are for VRM applications)

Let’s verify the new FOM by comparing the 20V Vishay Trench MOSFET (Si7106) with the 30V Renesas D9 MOSFET (RJK0303), as shown in Table 3.1. The on-resistance of the two devices is very close. Since the Si7106 has a much lower Qgd, its old FOM is lower than the

FOM of the RJK0303. However, due to a larger Qgs2, the Si7106 has a larger new FOM than the RJK0303. As a result, the power loss of the Si7106 should be higher than the RJK0303 power loss for VR applications.

Table 3.1 Comparison between Si7106 and RJK0303

0 Rds (@75 C) Qgs2 (@20A) Qgd (@ 10V) Old FOM New FOM Vgs = 5V

Vishay 20V 6.6nC 2.8nC 5.62mΩ 15.7nC*mΩ 66.7nC*mΩ (Si7106)

Renesas 30V 2nC 5.2nC 4.6 mΩ 23.9nC*mΩ 36.9nC*mΩ (RJK0303)

77 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Figure 3.7 shows the test result with a single-phase buck configuration. The bottom switch and other circuit parameters are the same, but there are two different top switches. The figure clearly shows that RJK0303 can achieve better full-load efficiency than the Si7106, which matches the prediction from the new FOM, but would not be predicted if using the old FOM.

93%

92%

91% RJK0303 90% Si7106 Efficiency Efficiency 89%

88% 5 10152025 Io(A)

Figure 3.6 Test efficiency with Si7106 and RJK0303 as top switch

(Vin = 6V, Io = 20A, L = 200nH, fs = 600kHz, bottom switch: IRF6691 for both cases)

Another important factor in designing device packaging is the device packaging parasitics. To operate, the MOSFET must be connected to the external circuit using a certain packaging method. Different packaging methods use different ways to connect the MOSFET die to the pins; for example, they might use bonding, a solid copper strap, and so on. These packaging

methods introduce packaging parasitics to the common-source inductor (Ls) and the drain

inductor (Ld). Both the turn-on and turn-off losses will increase by increasing the common-

source parasitic inductance (Ls). The total switching loss is tremendously impacted by the

common-source parasitic inductance (Ls). Increasing the drain-side parasitic inductor (Ld) would decrease the turn-on loss, but would also increase the turn-off loss. In addition, it would not significantly influence the total switching loss in some ranges, because the gain from the turn-on loss and the loss from the turn-off loss would cancel each other out.

78 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.1.2. Efficiency Improvement by Great Wall LDMOS for Top Switch

Since our input voltage is about 6V, it is reasonable to use a 15V device from Great Wall as the top switch [46]. We compare it with the best 30V trench MOSFET, which is from Renesas. Table 3.2 shows that the Great Wall 15V MOSFET has a lower new FOM, which indicates that, its efficiency could be better.

Table 3.2 Comparison between GWS15N15 and RJK0303

0 Qgs2 @ Qgd @ Rds (@75 C) Old FOM New FOM Id = 20A Vds = 10V Vgs = 5V

Great Wall 15V 2.7nC 5.2nC 3.26mΩ 17nC*mΩ 29nC*mΩ (GWS15N15)

Renesas 30V 2nC 5.2nC 4.6 mΩ 23.9nC*mΩ 36.9nC*mΩ (RJK0303)

Figure 3.7 shows that, as expected, the Great Wall 15V LDMOS can achieve about 1% higher efficiency at full load than the RJK0303, which is the best 30V device. The efficiency of a 12V input buck converter with 30V devices is only 86%, which is 4% lower than a 6V buck converter.

93% 92% 91% 90% 89% 88% 87% GWS15N15 Efficiency 86% RJK0303 85% 84% 0 4 8 12162024

Io(A)

Figure 3.7 Efficiency improvement from Great Wall 15V devices

(Vin = 6V, Io = 20A, L = 200nH, fs = 600kHz, bottom switch: IRF6691 for both cases)

79 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.1.3. Efficiency Improvement by Ciclon Lateral-Trench MOSFET

Let’s take a look at bottom switch. For the bottom switch, the new FOM does not apply since there are no switching losses. Although there is a new bottom-switch FOM [49], since the die size cannot meet the minimum power loss requirement, that FOM is not that helpful in choosing a better bottom switch. Thus a very straightforward way to select a bottom switch is to test the performance of the bottom switch by testing different bottom switches with the same top switch. Table 3.3 shows the comparison between Ciclon’s 12V lateral-trench MOSFET with the best bottom switches available before this switch was made available (IRF6691). It can be observed that both the on-resistance and the total gate charge of the CSD13301 are lower than that for the IRF6691. As a result, higher efficiency can be expected.

Table 3.3 Comparison between CSD13301 and IRF6691

0 Rds @75 C and 5V Qg @ Vgs = 5V Rds*Qg

Ciclon 12V CSD13301 1.44mΩ 30nC 43.2nC*mΩ

IR 20V IRF6691 2.16mΩ 47nC 80nC*mΩ

Figure 3.8 shows that, as expected, the CSD13301 can achieve about 1.5% higher efficiency at full load than the IRF6691 can.

93%

92%

91%

90%

Efficiency CSD13301 IRF6691 89%

88% 0 4 8 12 16 20 24 Io(A)

Figure 3.8 Compare Ciclon devices with the best bottom switch

(Vin = 6V, Io = 20A, L = 200nH, fs = 600kHz, top switch: RJK0303 for both cases)

80 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Since Ciclon does not design a good 12V MOSFET for the top switch, a 25V lateral-trench MOSFET (CSD16402) is selected as a top switch and compared with what was previously the best top switch, the Great Wall 15V MOSFET (GWS15N15). Table 3.4 shows that the Ciclon 25V lateral-trench MOSFET has a lower new FOM than the Great Wall 15V lateral MOSFET.

Table 3.4 Comparison between CSD16402 and GWS15N15

Qgs2 Qgd Rds (@750C) Old FOM New FOM Id=20A Vds=10V Vgs=5V

Ciclon 25V 1.6nC 2.3nC 4.6 mΩ 10.6nC*mΩ 21nC*mΩ CSD16402

Great Wall 15V 2.7nC 5.2nC 3.26mΩ 17nC*mΩ 29nC*mΩ (GWS15N15)

Figure 3.9 shows that the CSD16402 and the GWS15N15 can achieve similar efficiency. The reason for this is the Great Wall 15V MOSFET’s lower packaging inductance, which compensates the device performance. Since the packaging of CSD16402 is PowerPak, which can be easily soldered, while Great Wall’s device has BGA packaging, the CSD16402 is selected for the second stage.

94%

93%

92%

91%

Efficiency 90% CSD16402 GWS15N15 89%

88% 0 4 8 12 16 20 24 Io(A)

Figure 3.9 Comparison of Ciclon devices with the best top switch

(Vin = 6V, Io = 20A, L = 200nH, fs = 600kHz, top switch: CSD13301 for both cases)

81 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.1.4. Applying Ciclon 12V Lateral-MOSFET to Top Switch using an Improved Loss Model

3.1.4.1. An Accurate Loss Model

As shown in Figure 3.5, the Ciclon 12V lateral-trench MOSFET has the lowest FOM. However, the device is only available for the bottom switches due to its large die size. In the following section, a very accurate loss model is developed to see how the efficiency can be improved if we change the die size of the 12V MOSFET to make it suitable for the top switch.

It has been proven that for high-frequency operation, the conventional model, which does not include any parasitic inductance, is no longer suitable. To get an accurate value for the power loss, a new analytical loss model is developed in [51] which includes the parasitic inductance and non-linear capacitance of the devices, as shown in Figure 3.10.

Ld1 LS1 Vphase

Ld2 Driver LS2

(a) Parasitic inductance is considered

Cds1 Cgd2

Cgs1

Cgd1 Cds2 Driver HAT2165 Cgd2

(b) Non-linear capacitors of the device are considered

Figure 3.10 A new device loss model which includes parasitic inductance and non-linear capacitors

82 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Figure 3.11 shows the comparison between the loss model and the physical model, where the power loss is absolutely correct. It can be observed that except for the top switch turn-off loss, all of the loss model’s other losses are very correct.

1.5 physical model Loss model (W)

1 Loss

0.5 Power

0

Figure 3.11 Comparison between loss model and physical model

(1 HAT2168+1 HAT2165, Vin = 12V,Vo = 1.3V, Io = 12.5A, fs = 1MHz)

A piece-wise linear model is used in [51] for the calculation of the turn-off loss, as shown

in Figure 3.12. The reason for the inaccuracy for the turn-off loss is that the drain current (iD) is

treated as a constant current source during the t1 - t2 period. However, since the Vds of the top

switch increases during this period, the Vds of the bottom switch should decrease

correspondingly, which means there should be some current on Coss_bot. Since the load current doesn’t change during this period, the drain current of the top switch should be changed during this period.

Figure 3.13 shows a new piece-wise model for the calculation of the turn-off loss. The

main change is for the t1 - t2 period. Due to the large Coss of the bottom switch, there is some

amount of the load current shifted from the top switch to the Coss of the bottom switch during this

period. As a result, the Coss of the bottom switch serves as a snubber capacitor for the top switch, which helps to reduce the turn-off loss of the top switch. During the turn-off period, the Cds of the top switch is extremely small, so it is neglected during the calculation.

83 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

D

G I0 30 v DS Io S 20

10 ) Coss_bot iD

0 t1~t2

10 9 9 9 9 02.10 4 .10 6 .10 8 .10 t t t1 t2 t3

t2~t3

Figure 3.12 Piece-wise linear model for calculation of the turn-off loss in [51]

D + G iD _ vDS_top 30 Io 25 S v 20 DS_top

id1_of()t + 15 vDS_bot v ()t Coss_bot ds1_of iD _ 10 vds1_bot()t

5 t1~t2 vDS_bot

0

5 9 9 9 9 02.10 4 .10 6 .10 8 .10 t t t1 t2 t3

t2~t3

Figure 3.13 A new piece-wise linear model for calculation of turn-off loss

84 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

With the improved method, it can be seen from Figure 3.14 that the turn-off loss of the top switch is now more accurate.

1.6 1.4 physical model 1.2 Improved loss model 1 (W)

loss model 0.8 Loss 0.6 0.4 Power 0.2 0

Figure 3.14 Breakdown of the improved model shows more accuracy

( 1 HAT2168 + 1 HAT2165, Vin = 12V, Vo = 1.3V, Io = 12.5A, fs = 1MHz)

3.1.4.2. Sweeping the Die Size for Better Efficiency

Since the CSD13301 12V device is designed as a bottom switch, its efficiency is not good if we use it as the top switch. Thus, if the die size can be reduced, we can use 12V devices for the top switch as well.

With the help of the above improved loss model, it is easy to sweep the die size for efficiency. The die size of CSD13301 is normalized to be 1, so the on-resistance is inversely

proportional to the die size (excluding the packaging resistance) while Cgs, Cgd , Cds and the trans-conductance (gm) are proportional to the die size. In the loss model, these values are changed correspondingly when the die size is swept.

Figure 3.15 shows the results from the model by sweeping the normalized die size from 0.2 to 1. Since the Ciclon 25V MOSFET (CSD16402) can achieve 91% efficiency, the Ciclon 12V MOSFET with 30% die size as CSD13301, can achieve 0.7% higher efficiency.

85 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

91.8% 91.6% 91.4% 0.7% 91.2% 91.0%

Effciiency 90.8% 90.6% 90.4% 00.20.40.60.811.2 Normalized Die size

Figure 3.15 Sweep die size of 12V device technology for top switch

(Vin = 6V, Io = 20A, L = 200nH, fs = 600kHz, bottom switch: CSD13301)

3.1.4.3. Final Efficiency Achievement with Low Voltage Rating Devices

Figure 3.16 shows the final efficiency achievement for the second stage with the low- voltage-rating devices. The solid line is the test efficiency result with a Ciclon 25V top switch and a 12V bottom switch. It can be observed that the efficiency at full load is about 91% for 600kHz switching frequency and 89% for the 1MHz switching frequency. The dashed line is the projected efficiency result with an optimized 12V top switch and a 12V bottom switch. The efficiency can improved to about 92% for 600kHz and 90% for the bottom switch.

95% 94% 93% 92% 91% 90% 89% 88% CSD16402+CSD13301 600kHz Efficiency 87% CSD16402+CSD13301 1MHz 86% (0.3XCSD13301)+CSD13301 600kHz 85% (0.2 X CSD13301+CSD13301) 1MHz 84% 0 4 8 12162024 Io(A)

Figure 3.16 Final efficiency achievement

(Vin = 6V, L = 200nH for fs = 600kHz, L = 100nH for fs = 1MHz)

86 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.2. Proposed System Level Two-Stage VR for Laptop and Server Computers

3.2.1. New Situations for Two-stage VR

All of the previous research on two-stage VRs is based on cost savings for desktop PCs, and on keeping a similar cost and reducing the footprint (for laptops). This is because the output bulk capacitors (Oscon capacitors for desktops and SP capacitors for laptops) used to be very expensive. The cost savings of the two-stage VR can compensate the additional first-stage cost.

However, capacitor costs have been greatly reduced due to large demand for capacitors. As shown in Figure 3.17, in 2004, the Oscon capacitor cost was about 25% of the total VR cost. However, the Oscon capacitor cost in 2008 is only about 15% of the total VR cost in 2004. As a result, two-stage VRs are even more expensive than single-stage VRs.

Year 2004 Year 2008 Total VR Cost: “1” Total VR Cost: “0.9” 1st Stage Cost: “0.2” 1st Stage Cost: “0.2” Oscon Cap Cost: “0.25” Oscon Cap Cost: “0.15” Cost Reduction: 5% Cost Increase: 5%

Figure 3.17 Cost difference between 2004 and 2008 for desktop VR ( Cost is normalized to the total VR cost in 2004)

For laptop VRs, there is a similar situation. As a result, the previous argument of using a two-stage VR to save costs is no longer true. A new architecture must be proposed to keep the cost similar while improving efficiency or reducing the footprint.

3.2.2. Proposed System Level Two-Stage Power Architecture for Laptop Computers

One example of the two-stage VR concept is Intel’s Narrow VDC [68], as shown in Figure 3.18. With the help of a system charger VR, the bus voltage is the same as the battery voltage, so the voltage range is narrowed, which provides significant benefits for the VRs downstream.

87 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

AC input: 90~265V AC/DC adapter 19V LDO I/O power Keep alive System charger VR VRs VR and LDO 8.7V~12.6V

Battery pack VR VR VR VR VR

5V 3.3V Boost Main power CPU Graphics DDR

LCD

Figure 3.18 Intel’s Narrow VDC power architecture

The bus voltage of Narrow VDC structure is still too high (8.7V~12.6V). To reduce bus voltage, one possible two-stage solution for laptop VRs is shown in Figure 3.19. Since the CPU, graphics, DDR and the main power, which require 5V and 3.3V power, are four major loads, a two-stage solution is applied for the VRs of these loads. However, this architecture is not practical. First, there are too many first stages, which add more cost and footprint. Second, the total first-stage power is too high at beyond 150W, assuming 85% VR efficiency. It is not an effective solution.

AC input: 90~265V LDO I/O power Keep alive AC/DC adapter VRs VR and LDO 8.7-19V 19V Power path switch 1st 1st 1st 1st 1st 8.7-16.8V Battery Battery VR VR VR VR VR charger pack

Boost Main power CPU Graphics DDR 25W 16.5W 54W 16W 18W LCD

Figure 3.19 One possible two-stage solution for laptop VRs

88 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Actually, the laptop has a total load power regulation function to meet the system’s thermal design power (TDP) requirement. For example, when the laptop detects that the total load power is beyond a certain value, it will lower the CPU clock frequency to reduce CPU power. As a result, although the peak power may exceed the system’s designed power for a very short period, the thermal designed power can be greatly reduced. For example, in the above laptop power architecture, the maximum load power can be as high as 110W, but the thermal design power is only about 60W, which is around half of the maximum power. Therefore, if all of the VRs share the same first stage, the first-stage thermal power can be as low as 70W (assuming 85% VR efficiency). Figure 3.20 shows the proposed system’s two-stage power architecture. The low- power first stage greatly reduces the cost and footprint when compared with the previous architecture, so it is a promising power solution for laptops.

AC input: 90~265V LDO I/O power Keep alive AC/DC adapter VRs VR and LDO 8.7-19V 19V Power path switch First stage with 70W power 3 cells: 8.7-12.6V 4 cells: 11.6V-16.8V Battery Battery VR VR VR charger pack VR VR 5V 3.3V Boost Main power CPU Graphics DDR

25W 16.5W 54W 16W 18W LCD

Figure 3.20 Proposed system level two-stage power architecture for laptop

This system’s two-stage VR can also be applied to the Narrow VDC power architecture, as shown in Figure 3.21.

89 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

AC input: 90~265V AC/DC adapter 19V LDO I/O power Keep alive System charger VR VRs VR and LDO 8.7V~12.6V

1st Stage Battery pack VR VR VR VR VR

5V 3.3V Boost Main power CPU Graphics DDR

LCD

Figure 3.21 Two-stage system for Narrow VDC power architecture

From the proposed architecture we can see that the first stage is very important to the system performance. First, the first stage should have high power density so that it occupies very little of the real estate. Secondly, the first stage should have an extremely high efficiency so that the overall efficiency of the two-stage VR is high. From Chapter 2 we can see that the proposed switched-capacitor voltage divider is a perfect solution for the two-stage VR.

Now let us discuss how to design the output inductance and output capacitance for the second stage of the two-stage VR. For simplicity, the CPU VR is used as an example, but the concept can be applied to other VRs.

In laptop VRs, non-linear control methods, such as constant on-time control and hysteretic control, are widely used. For non-linear control methods, the duty cycle of the buck converter is saturated immediately after the load transient occurs. The delay time is ignored. As a result, the inductor current slew rate at the transient is determined by the inductance value, as shown in (3-2):

Vin −Vo SRL = for step up Leq (3-2) V SR = o for step down L L eq

90 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

where Vin and Vo are the input voltage and output voltage, respectively, and Leq = L/N is the equivalent transient inductance. L is the inductance of each phase, and N is the number of phases of the VR. For a CPU VR, since the input voltage is much larger than the output voltage, the voltage spike for the load step-down is dominant. The equivalent load step-down circuit is shown in Figure 3.22(a), and current waveforms for different output inductances are shown in

Figure 3.22(b), where SRo is the load current slew rate. Then voltage deviation during the transient can be calculated as:

t 1 dic (t) ΔV (t) = R ⋅i (t) + ⋅ i (t) ⋅ dt + ⋅ (3-3) o c c C ∫ c Lc o 0 dt where Rc is the total ESR of the output capacitors, Co is the total output capacitance and Lc is the total ESL of output capacitors.

iL vo

Leq ic Lc Rc io

Co

(a)Equivalent load step-down circuit

Slope: SR0 Slope: SR0 ΔI ΔI io o io o

Slope: SRL1 Slope: SRL2 iL iL

Slope: SRo-SRL1 Slope: SRL1 Slope: SRo-SRL2 Slope: SRL2 ic ic

L1 > L2

(b) Current waveforms at load step-down for different values of L

Figure 3.22 Inductance determines the transient performance for non-linear control

Figure 3.22(b) shows that the smaller the output inductance is, the lower the capacitor current will be. This means fewer output capacitors are required to have the same voltage spike,

91 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

so the output capacitance is determined by the output inductance instead of the switching frequency. Figure 3.23 shows a calculated inductance-capacitance curve for a three-phase laptop CPU VR based on (3-2), where L is the inductance per phase, and the bulk capacitor is a low- profile 330µF/7mohm SP capacitor from Panasonic. It shows that lower inductance results in fewer capacitors needed to meet the same transient requirement. For example, in current industry practice, L is 600nH, so eight bulk capacitors are needed to meet the transient requirement. If the inductance is reduced by half (L=300nH/phase), the bulk capacitors can be reduced to five . If the inductance is reduced to 150nH, the bulk capacitors can be reduced to two. If the inductance is reduced to 100nH, the bulk capacitors can be eliminated!

Once the number inductors and capacitors are determined, the switching frequency should be determined correspondingly. For constant on-time control, the minimum switching frequency needed to meet the output voltage ripple requirement should be selected. Figure 3.24 shows that the switching frequency should be increased when the output inductors are reduced. If the output inductance is further reduced, although the output capacitance can be reduced as well, the required minimum frequency should be higher, which results in a lower VR efficiency. Therefore there is a trade-off between the reduction of the output inductance (capacitance) and the VR efficiency.

14 12 10 8 6 4

Num of Cap of Num 2 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 L (µH)

Figure 3.23 Number of bulk capacitors vs. inductance for CPU VR

(Vin = 6V, Vo = 1.2V, three phases, ∆Io =45A, ∆Vo = 65mV)

92 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Of course, as explained above, the real frequency should be chosen to be larger than the switching frequency to maximize VR efficiency.

1000 900 800 700 600 500 400 300 fs (kHz) fs 200 100 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 L(µH)

Figure 3.24 Minimum switching frequency to meet the output ripple requirement

( Vin = 6V, Vo = 1.2V, three phases, Io = 0.1A, constant on-time control)

3.2.3. Evaluation of Two-Stage VR for Laptop

To evaluate the proposed solution, a two-stage evaluation board was built, as shown in Figure 3.25. A CPU VR and a DDR VR are chosen as design examples. The MAX1545, which uses constant on-time control, is used as the VR controller. For simplicity, the efficiency is evaluated for a 3-cell battery, the input voltage of which is about 12V.

Voltage divider With SO-8 device

DDR VR (18W) CPU VR (50W) 2-phase or single-phase 2-phase or selectable 3-phase Buck

DDR load emulator CPU load emulator

Figure 3.25 Evaluation board for two-stage solution

93 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

To evaluate the proposed power architecture from different angles, two designs are compared, as shown in Figure 3.26. Design #1 focuses on the reduction of the footprint of the output filters and reducing the cost, so the VR frequency is almost doubled and the output inductance and capacitance are further reduced. Design #2 focuses on efficiency improvement and the output filter size reduction. The two designs have the same switching frequency but the second design use the best devices which have higher cost than the first design.

Benchmark: Single-stage

8.7V~12.6V for battery fs=300kHz, L=600nH CPU VR

DDR VR fs=300kHz, L=1000nH …

(a) Benchmark: single-stage solution (Single-stage: 3 phases, each phase: 1 IRF7821 + 2 IRF7822)

8.7V~12.6V for battery fs=550kHz, L=150nH Voltage divider CPU VR Po=70W

DDR VR fs=550kHz, L=300nH …

(b)Two-stage Design #1: reduce the cost and output filter size (similar devices as benchmark)

(1st stage: 70W voltage divider, 4*Si7104,2nd stage: 3 phases, each phase: 2 IRF7821 + 1 BSC024N025S)

8.7V~12.6V for battery fs=550kHz, L=150nH Voltage divider CPU VR Po=70W

DDR VR fs=550kHz, L=300nH …

(c) Two-stage Design #2: improve efficiency and reduce output filter size (best devices are used)

(1st stage: 70W voltage divider, 4*Si7104,2nd stage: 3 phases, each phase: 1 CSD16402 + 1 CSD13301)

Figure 3.26 Two different two-stage designs

94 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

The CPU VR efficiency of Design #1 is shown in Figure 3.27. Although the frequency is doubled, it can still achieve slightly higher efficiency than a single-stage solution. The CPU VR efficiency of Design #2 is shown in Figure 3.28. It clearly shows that the efficiency of the CPU sleep states can be improved by 3% and the efficiency of the CPU active states can be improved by 3-5%.

89% 88% 87% Single stage Two-stage overall 86% 85% 84% Efficiency 83% 82% 81% 80% 0.1 1.0 Io (A) 10.0 100.0 DCM, Vo=0.9V CCM, Vo=1.2V

Figure 3.27 CPU VR Efficiency comparison between single-stage and two-stage Design #1

92% 90% Single stage 88% Two-stage overall 86% Efficiency 84% 82% 80% 0.1 1.0 10.0 100.0 Io (A) DCM, Vo=0.9V CCM, Vo=1.2V

Figure 3.28 CPU VR Efficiency comparison between single-stage and two-stage Design #2

The transient response of the CPU VR is shown in Figure 3.29. It demonstrates that both of the designs can meet the transient and output voltage ripple requirements and both of them can reduce the number of capacitors from eight to two.

95 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Since the voltage divider is the additional stage for the two-stage solution, it should be counted together with the output filters. For two-stage design #1 and #2, the footprint reduction from the CPU VR is much larger than the additional footprint from the voltage divider. In our design example, the footprint reduction can be as high as 33%.

Io (20A/div) Io (20A/div) Io (10A/div)

Vo (50mV/div) Vo (50mV/div) Vo (50mV/div) 16mV

SP cap: 330uF each

(a) Single-stage CPU VR transient waveforms. (c) Two-stage CPU VR transient waveforms.

Figure 3.29 Transient response of CPU VR

Single-stage: CPU VR output inductors and caps

same scale Two-stage: CPU VR output inductors and caps

Figure 3.30 Footprint comparison for CPU VR

Figure 3.31 shows the efficiency comparison of single-stage and two-stage for DDR VR. It shows that with the doubled frequency, two-stage DDR VR can even achieve higher efficiency than single-stage DDR VR.

96 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

y 90.00% two-stage overall 89.00% single stage

88.00%

87.00% Efficiency 86.00%

85.00% 051015 Io(A)

Figure 3.31 DDR VR efficiency comparison

Singe stage: single phases, 1 IRF7821 + 1 IRF7822, L=1000nH, fs=300kHz

Two-stage: single phase, 1 IRF7821 + 1 Si4864, L=300nH, fs=550kHz

Table 3.5 gives a comparison between Design #1 and Design #2. It verifies that while Design #1 is footprint and cost-oriented and Design #2 is efficiency-oriented.

Table 3.5 Comparison between two two-stage designs for laptop

CPU VR Light load CPU VR Full- Footprint for Cost Efficiency load Efficiency output filters

Design #1 1% higher Similar ~33% reduction 5% reduction

Design #2 3% higher 3~5% higher ~33% reduction Similar cost

3.2.4. System Level Two-Stage Power Architecture for High-End Servers

As introduced in Chapter 1, Intel and other companies are now considering the Z-Axis VRM (ZVRM) structure, as shown in Figure 3.32 [16]. The concept is to put VR as close as the microprocessor to minimize the power delivery path.

Figure 3.32 ZVRM proposed by Intel

97 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

In this arrangement, the VR must be very small to put it on (or around) the OLGA. As we know, the bulky elements of the VR are the output inductors and output capacitors. The output inductors can be reduced by raising the VR switching frequency. The output capacitors of the VR, normally referred to as bulk capacitors due to their large capacitance and volume, can be reduced by raising the VR control bandwidth. As described in Chapter 1, when the VR control bandwidth is pushed to be around 350kHz~390kHz, the bulk capacitors can be eliminated. Recent research shows that with a 1MHz switching frequency and coupled-inductor structure, 350kHz bandwidth can be achieved [69].

Server microprocessors consist of three major loads: core, cache and uncore. Previously, only one VR was designed to feed these loads. However, since the load current is higher and thus the transient speed becomes faster, a single VR cannot satisfy all the loads. Hence, three separate VRs are required to power the three loads. Although the sum of the VRs’ power is around 190W, there is a thermal design power (TDP) requirement for the three loads, which means the total power cannot exceed a certain value, e.g. 130W. By taking advantage of the TDP requirement of the server microprocessor, we can apply the two-stage architecture, as shown in Figure 3.33, to reduce the power rating of the first stage for further cost and size reduction. In this structure, the first stage only needs to be designed at 150W power for the three VRs, assuming 87% VR efficiency.

Vin=12V Voltage step-down converter Vbus ≅ 6V High frequency Core VR 1st stage 100W High efficiency and High power density High frequency Cache VR 50W

High frequency Uncore VR 38W

TDP:130W

Figure 3.33 Proposed two-stage power architecture for future server microprocessors

98 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.2.5. Evaluation of Two-Stage Architecture for High-End Servers

In our two-stage design, the high-frequency, high-power-density voltage divider is used as the first stage. It can achieve over 97.5% efficiency for the entire load range.

With a high-frequency first stage design and a high-efficiency second stage design using low-voltage-rating devices, the overall efficiency comparison of a core VR is shown in Figure 3.34. Compared with a 600kHz single-stage VR, which is the state-of-art design for server applications, a 1MHz two-stage VR can achieve similar efficiency.

Figure 3.35 shows the control bandwidth [69] and the tested transient waveforms of the two-stage VR without any bulk capacitors. For a typical 600kHz single-stage VR, the output bulk capacitors are about five 100µF MLCC capacitors. Moreover, due to a higher switching frequency operation, the output inductor size can also be greatly reduced. Overall, 1MHz two- stage solution can achieve 45% output filter footprint reduction comparing with 600khz single- stage solution. This is very important for the ZVRM.

As for the cost, although the two-stage architecture adds additional cost for the first stage, it can reduce the bulk capacitors for the VRs. As a result, it has a similar cost to the single-stage VR solution.

1st stage

2nd stage

(a) Prototype – No bulk capacitors

99 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

92%

87%

82%

77% Single Stage 600kHz Two-stage overall 1MHz Efficiency 72%

67% 0 50Io (A) 100 150

(b)Efficiency

Figure 3.34 1MHz two-stage VR prototype and efficiency (No bulk capacitors)

(1st stage: voltage divider: 4*CSD13301; 2nd stage: 6-phases, (1 CSD16402+1 CSD13301)/ phase)

60 fs=1MHz 40

20 0 100A/div 20 Io

Magnitude (dB) Magnitude 40 3 4 5 6 100 1 . 10 1 . 10 1 . 10 1 . 10

90

) 0 o

90 50mV/div Vo φ =60o 180 m Phase ( 270 fc=350kHz 360 3 4 5 6 100 1 . 10 1 . 10 1 . 10 1 . 10 Frequency (Hz)

(a) Control bandwidth (b) Load transient waveforms

Figure 3.35 Control bandwidth and transient waveforms

100 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.3. Light Load Efficiency Improvement

3.3.1. Introduction

Laptop computers are gaining a greater share of the market share and are expected to outsell desktop computers in the near future. Laptop computers utilize mobile microprocessors, whose performance has greatly improved in recent years. For mobile devices, battery life is very important. Figure 3.36 shows a typical power consumption breakdown in a laptop [52], in which the power supply loss is around 14% of the total power. Therefore, the reduction of the power supply loss is very important for battery life extension. For a typical design, a 30mW power loss saving means a one-minute battery life extension. Since the CPU’s voltage regulator (VR) needs to provide 57% of the power to the CPU, a method for reducing CPU VR power loss becomes a very hot subject now.

Figure 3.36 A typical power consumption breakdown in laptop

In laptops, there are two categories of CPU VR efficiency requirements: one category of requirement is full-load efficiency at the AC adapter voltage (19V) (the worst case in terms of efficiency), as is required to meet thermal design; another category of efficiency requirement is the efficiency for the entire load range at the battery voltage (16V for 4-cell battery or 12V for 3- cell battery), which is important for battery life. To reduce the CPU’s average power, the CPU goes into sleep mode frequently, and the VR spends 80% of the time at the light-load condition [53], which is defined as all of the load conditions except for the full load.

The state-of-the-art VR design is based on the multi-phase buck converter. It is well known that at a very light load of less than 20% of the full load power, the power loss of buck is dominated by switching-related losses, e.g. gate driving loss and control switch turn-off loss

101 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

[54]. The most effective way of improving the efficiency for this load range is to apply variable- frequency control, e.g. variable-frequency current-mode control [54], burst mode control [55] and constant on-time control [56]. The idea is to decrease the switching frequency, or equivalent switching frequency in the case of burst mode control, to reduce the switching-related losses. In terms of the implementation, constant on-time control is the simplest method since it does not need to sense the current. Hence constant on-time control is widely used in laptop VR controllers.

Figure 3.37 shows how the inductor current changes with different loads for constant on-

time control. When the load current is larger than critical current (Icrit), the buck converter works

at CCM and the switching frequency is constant. When the load current is below Icrit, the buck converter works at DCM and keeps the same on-time for the control switch. The voltage gain at DCM can be easily derived using (3-4):

2 M = 8 ⋅ L (3-4) 1 + 1 + 2 RTon f s where M=Vo/Vin, L is the output inductance, R is the load resistance, Ton is the on-time of

control switch, and fs is the switching frequency of the buck converter.

iL

Io

Ts 0 t iL

Icrit Ts 0 t iL

Icrit/2 0 Ton 2Ts t

Figure 3.37 Inductor current waveforms for constant on-time control with different loads

From (3-4), we know that if Ton is constant, the switching frequency should be inversely proportional to the load resistance to regulate the output voltage. This means that if the load current decreases, the switching frequency decreases correspondingly. Since gate driving loss

102 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

and turn-off loss are the most dominant losses at light load, the light-load efficiency can be improved by applying constant on-time control. Figure 3.37 shows that when the load current is

half of Icrit, the switching period is doubled, which indicates that the switching frequency decreases by half.

Figure 3.38(a) shows the efficiency comparison between CCM operation and DCM operation with constant on-time control from Maxim [56]. Figure 3.38(b) shows how the switching frequency decreases with the loads during DCM operation.

350

300 (kHz) 250

200

150

100

Switching Frequency 50

0 0 1020304050 Io(A)

DCM CCM

(a) Efficiency vs. load current (b) Switching frequency vs. load current (Solid line: CCM; Dash line: DCM at light load)

Figure 3.38 Improved light load efficiency with constant on-time control

From Figure 3.38(a), it can be seen that although the efficiency at very light load is improved, there is no improvement in efficiency when the load is between 20% and 100%. Especially, the efficiency drops very quickly when the load decreases from the peak efficiency point (around 40% load), which indicates there is room for efficiency improvement during CCM. Figure 3.38(b) shows that the switching frequency remains unchanged during CCM operation, so we have opportunity to improve the efficiency by slightly varying the switching frequency. To improve CCM efficiency, adaptive on-time control is proposed, which is described in 3.3.2. Based on the constant on-time control, the non-linear inductor and its corresponding control method is proposed to further boost DCM efficiency, as described in 3.3.3. It should be mentioned that the proposed methods can be applied for both single-stage VR and two-stage VR.

103 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.3.2. Proposed Adaptive On-time Control to Improve Buck CCM Efficiency

This section investigates how to improve CCM efficiency with constant on time control. Figure 3.39 shows the scheme of the synchronous buck converter. Constant on-time control means that the on-time for control switch Q1 is fixed and the off-time is variable to achieve voltage regulation. In truth, the on-time does not need to be constant at CCM since the transient performance is mainly determined by the output inductance instead of the on-time [57]. The following section will describe how to select an on-time for higher efficiency with an example.

Q1 A Vo L Vin DCR Processor Gate Co Q2 Load Driver

Figure 3.39 Synchronous buck converter

First let us review how to select an on-time for the constant on-time control. There are two constraints to select an on-time [56]:

(a) Select an on-time to meet the output voltage ripple requirement. When Ton increases, the inductor current ripple increases, which results in a larger voltage ripple. Figure 3.40 shows the relationship between the output voltage peak-to-peak ripple vs. the on-time. For a laptop VR application, the maximum voltage ripple is 20mVpp [53], so the maximum allowed on-time is about 1µs. To leave some design margin, we select Ton(max) = 0.7µs.

30

25 (mV)

20 Ripple

15

10

5 Output Voltage

0 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Ton (µs)

Figure 3.40 Output voltage ripple vs. on-time

Vin = 6V,Vo = 1.2V,L = 150nH/phase, 3 phases, Co = 2*330µF SP Cap + 18*22µF MLCC

104 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

(b) Select a maximum on-time to avoid inductor core saturation and to limit the root mean

square (RMS) current. The equation for inductor peak current Ioff is shown in (3-5), while the inductor RMS current is shown in (3-6):

(V −V )T I = I + in o on (3-5) off L 2L

(V −V )2 2 2 in o 2 (3-6) I rms = I L + 2 Ton 12L

where IL is the average inductor current. These equations clearly show that a larger on-time leads to a higher RMS current and peak current. The inductor size for a low-voltage, high-current application is mainly determined by the saturation current and the winding loss, which is 2 proportional to Irms . Since the volt-second on the inductor is very small, the core loss is negligible for the given inductor. Based on the calculation tool provided in [58], the core loss is only about 20mW, while the winding loss is as high as 150mW for a 20A load current, 600kHz switching frequency and 150nH inductance.

Based on (3-5) and (3-6) , we can derive the maximum on-time to meet both the saturation current and RMS current requirements, as shown in (3-7):

2L(I Lsat − Io / N) 2L 2 2 Ton max = min( , 3[I Lrms − (Io / N) ]) (3-7) Vin −Vo Vin −Vo where ILsat is the saturation current of output inductor, ILrms is the maximum RMS current of

inductors, Io is the total load current, and N is the phase number of the multi-phase buck

converter. Both ILsat and ILrms can be found in the inductor manufacturer datasheet. For example,

FP2-V150 from Cooper Bussmann has ILsat = 25A and ILrms = 37A [59]. Figure 3.41 draws the maximum on-time with different load currents for a three-phase Buck converter. It can be seen the maximum allowed on-time Combining both voltage ripple and inductor requirements, the maximum on-times for different load currents are shown in Figure 3.42. This figure provides the upper limit for the on-time.

105 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

1.4 0.8

1.2 0.7 1.0 (µs) (µs)

0.6 0.8 time time ‐ ‐ 0.5 0.6 0.4 0.4 0.3 Maximum on 0.2 Maximum on

0.0 0.2 0 1020304050 0 1020304050 Io (A) Io (A)

Figure 3.41 Maximum on-time vs. load current to Figure 3.42 Maximum on-time vs. load current after meet inductor requirement considering both ripple and inductor requirement

(Vin=6V,Vo=1.2V, N=3, Inductor: FP2-V150, L=150nH, ILsat = 25A, ILrms = 37A

Co=2*330µF SP cap + 18*22µF Ceramic )

Now let’s look at how to select an optimal on-time to achieve higher efficiency with different loads. Since the three phases of the buck converter are the same, here one phase of the buck converter’s power losses is considered.

For conduction losses, we have:

2 P = (D ⋅ R + (1− D) ⋅ R + DCR) ⋅ I cond ds(on) _ top ds(on) _ bot rms (3-8) (V −V )2 I 2 = I 2 + in o T 2 rms o1 12L2 on where Rds(on)_top and Rds(on)_bot is on-resistance for the top and bottom switch, respectively, and

DCR is the DC resistance of the output inductor. Irms is the inductor RMS current, D is the duty

cycle, Io1 = Io/N, and N is the phase number.

For given Vin, Vo, L and Io:

2 Pcond = Kc1 + Kc2Ton (3-9) where Kc1, Kc2 are constant values.

106 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

The total switching loss (Psw) consists of two parts: Q1 turn-off loss (PQ1_off), and the other

part of the switching loss (Psw1) [50].

Psw = PQ1 _ off + Psw1 (3-10)

Since

Vo 1 f s = (3-11) Vin Ton

Psw1 can be calculated as:

K s1 Psw1 = (Qg _ topVg + Qg _ botVg + 0.5Qoss _ topVin + 0.5Qoss _ botVin + I o1VF Tdead + QrrVin ) f s = (3-12) T on

where Qg_top and Qg_bot are the total gate charges of the top and bottom switch, respectively, and

Qoss_top and Qoss_bot are the switch output charges of the top and bottom switch, respectively. Vg is the gate-driving voltage, Tdead is the dead time between top switch gate signal and bottom switch

get signal to avoid shoot-through, and Qrr is the reverse recovery charge of the bottom switch.

1 Qgd 1 Qgs2 PQ1 _ off = ( Ioff × ×Vin × f ) + ( Ioff × ×Vin × f ) (3-13) 2 i g 2 i g where Ioff is the turn-off current, Qgd is the voltage-rising-related charge, Qgs2 is the current-

decreasing-related charge, and Ig is gate current.

(V −V )T I = I + in o on Q = C I / g off o1 2L gs2 iss off m (3-14)

where Ciss is the input capacitance of the top switch, and gm is the trans-conductance of the top switch.

Replacing (3-11) and (3-14) into (3-13), we can finally get:

K s2 PQ1_ off ≈ + K s3 + K s4Ton (3-15) T on Finally, based on (3-12) and (3-15) we can get:

Ks1 + Ks2 Psw = PQ1_off + Psw1 ≈ + Ks3 + Ks4Ton (3-16) T on where Ks1, Ks2, Ks3 and Ks4 are constant values for given Vin, Vo, L and Io. 107 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Figure 3.43(a-b) verified (3-9) and (3-16) using a very accurate analytical model developed in 3.1.4.1. The figure indicates that when Ton increases, which means a decrease in fs, conduction losses increase too. However, switching losses do not necessarily decrease. As a result, there is an optimal Ton to achieve the lowest power losses, as shown in Figure 3.43(c). The optimal on- time is about 0.5µs, which is lower than the maximum on time (0.7µs) when Io=30A (from Figure 3.42), so we can select Ton = 0.5µs to achieve higher efficiency.

1.0 1.8

0.9 1.7 (W) (W)

0.8 1.6 Loss Loss

0.7 1.5 Power Power 0.6 1.4 0.5

0.4 1.3 0 0.2 0.4 0.6 0.8 1 1.2 0 0.2 0.4 0.6 0.8 1 1.2 Ton (µs) Ton (µs)

(a) Conduction loss vs. on-time (b) Switching loss vs. on-time

1.2

1.1 (W)

Loss 1.0 Power 0.9

0.8 0 0.2 0.4 0.6 0.8 1 1.2 Ton (µs)

(c)Total power loss vs. on-time

Figure 3.43 Power losses vs. Ton at CCM for one-phase of a three-phase buck

(Vin = 6V, Vo = 1.2V, L = 150nH, Io = 30A for three-phase)

Figure 3.44 shows the test results by varying Ton at CCM for different load currents. The experimental setup is as follows:

108 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Vin = 6V, Vo = 1.2V, three phases.

Each phase configuration: Q1: 2*IRF7821, Q2:1*BSC024N025S, Inductor type: FP2-V150 (L=150nH).

Total output capacitance: 2*330µF SP cap from Panasonic+ 18*22µF ceramic cap from TDK.

It can be clearly seen that for each load current there is an optimal on-time that offers the highest efficiency. Based on the allowed maximum on-time for each current, it can be seen that the optimal on-time for efficiency is lower than maximum on-time for all three of the load conditions. Thus we can choose the optimal on-time to achieve higher efficiency. Another observation is that different loads require different optimal on-times, which indicates that the optimal on-time must be adaptively adjusted with load current.

0.8

0.7

0.6

0.5 Power Loss (W) 0.4 0.2 0.4 Ton (µs) 0.6 0.8

(a) Io = 9A

1.5

1.4

1.3

Power Loss (W) 1.2 0.2 0.4 0.6 0.8 Ton (µs)

(b) Io = 30A

109 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.2

2.7

2.2 Power Loss (W) 1.7 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Ton (µs)

(c) Io = 45A

Figure 3.44 Tested power loss vs. on-time with different load currents

Figure 3.45(a) illustrates the proposed adaptive on-time control concept. Ton is adjusted according to the load current to achieve the highest efficiency. The optimal Ton has an almost linear relationship with the load current, so it is easily implemented by hardware. Up to 2% higher light-load efficiency can be achieved by the proposed method, as demonstrated in Figure 3.45(b).

0.45 0.40 Constant on‐time 0.35 0.30

Ton (µs) Ton 0.25 Adaptive on‐time 0.20 0.15 0.10 5 10 15 20 25 30 35 40 45 50 Io (A)

(a) Optimal on time vs. load current

110 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

92.0%

91.0%

90.0%

89.0% Constant on time 88.0% Adaptive on time Efficiency 87.0%

86.0% 5 10 15 20 25 30 35 40 45 50 Io (A)

(b) Efficiency with adaptive on-time control

Figure 3.45 Proposed adaptive on-time control to improve CCM efficiency- Case 1

3 phases, Vin = 6V, Vo = 1.2V

Figure 3.46 shows another example of adaptive on-time control with 12V input voltage. The efficiency can be improved by up to 2% with the proposed adaptive on-time control. The experimental setup is as follows:

Vin = 12V, Vo = 1.2V, three phases.

Each phase configuration: Q1: 1*IRF7821, Q2:2*IRF7822, Inductor type: Sumida CDEP134H-0R6 (L=600nH)

Total output capacitance: 8*270µF SP cap from Panasonic+ 18*22µF ceramic cap from TDK.

0.55 0.50 0.45 Adaptive on‐time 0.40 0.35 0.30 Ton (µs) Constant on‐time 0.25 0.20 0.15 0.10 5 10 15 20 25 30 35 40 45 50 Io (A)

(a) Optimal on-time vs. load current

111 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

90.0%

89.0%

88.0%

87.0% Constant on-time Efficiency 86.0% Adaptive on time

85.0% 5 10 15 20 25 30 35 40 45 50 Io (A)

(b) Efficiency with adaptive on-time control

Figure 3.46 Proposed adaptive on-time control to improve CCM efficiency – Case 2

3 phases, Vin = 12V, Vo = 1.2V

It is also worthwhile to mention that Figure 3.45(a) shows that the on-time increases with the load current while Figure 3.46(a) shows that the on-time decreases with the load current. This is due to different circuit conditions.

As mentioned above, the transient performance of VR is mainly determined by the output inductance instead of the on-time. As a result, changing on-time should not affect the transient performance. Figure 3.47(a) shows the simulation results of the transient response with the SIMPLIS software. It can be seen that adaptive on-time control can achieve the same transient performance as the constant on-time control and both of them can meet the load-line transient

requirement from Intel with 2mΩ load-line resistance (RLL) [53]. Figure 3.47(b) shows that at the beginning of the load step-down, the duty cycle of VR is saturated to zero to achieve the smallest output voltage variation. After that, the on-time of the adaptive on-time control changes from 0.33 µs to 0.5 µs, which helps to improve the light load efficiency.

112 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

30 10

I(I1-pos) / A -10 1.2 1.16

Vo / V Vo 1.12 1.08 1.2 1.16

Vo / V Vo 1.12 1.08 1.4 1.45 1.5 1.55 1.6 1.65 1.7 1.75 1.8

time/mSecs 50uSecs/div

(a)Output voltage comparison during the load transient

( Upper: Io Middle: Vo with constant on-time control Lower: Vo with adaptive on-time control)

40

20

I(I1-pos) / A / I(I1-pos) 0 6

D / V 2

-2 6

D / V 2

-2 2.21 2.215 2.22 2.225 2.23 2.235

time/mSecs 5uSecs/div

(b)PWM signal comparison during the load step-down

Upper: Io ( from 45A to 10A)

Middle: PWM with constant on-time control ( on-time is constant: 0.33 µs)

Lower: PWM with adaptive on-time control (on-time for 10A: 0.5 µs; on-time for 45A: 0.33 µs)

Figure 3.47 Transient performance comparison between constant on-time control and adaptive on-time control

(Vin = 12V, L = 600nH/phase, 3-phase, RLL = 2mohm

Total output capacitance: 8*270µF SP cap from Panasonic+ 18*22µF ceramic cap from TDK)

113 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

3.3.3. Improve DCM Efficiency with Non-linear Inductor

(A) Proposed Non-Linear Inductor Concept to Improve DCM Efficiency

As mentioned before, when the load current is below the critical current (Icrit), the buck converter runs under DCM operation and the switching frequency begins to decrease. Since it is a synchronous buck converter, the current will become negative if we do not turn off the gate signal of bottom switch Q2 when the inductor current decreases to zero. Thus the inductor current must be sensed. When the inductor current decreases to zero, the bottom switch is turned off and the buck converter will run under DCM operation.

Figure 3.48 shows a loss breakdown at DCM operation with an accurate loss model from 3.1.4.1. It clearly shows that the dominant loss is still the switching losses. Therefore determining a way to effectively lower the switching frequency is the key to further improve light-load efficiency.

Gate driving

turn-on

turn-off

HS cond

LS cond

Driving

core loss Turn-off

Figure 3.48 Loss breakdown at DCM Vin = 12V,Vo = 0.9V, Io = 2A, Ton = 0.5µs, fs = 200kHz

A very straightforward idea is to increase the on-time at light load to further decrease switching frequency, as shown in Figure 3.49. It shows that for a given load current, when the on time increases (as shown in dashed curve), the switching frequency can be decreased. However, the current ripple increases, which increases the output voltage ripple.

iL

Icrit/2 0 t

Figure 3.49 Inductor current with increased on-time ( Dashed line is with larger on-time )

114 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

To better understand the voltage ripple at DCM, Figure 3.50(a) shows the three-phase buck current on the output side. During CCM operation, the output current ripple decreases due to interleaving operation and current cancellation, as shown in Figure 3.50(b). This means the AC current on the output capacitor is small, which leads to a small voltage ripple. However, with the DCM operation shown in Figure 3.50(c), the current cancellation effect is lost, so the AC current on the output capacitor increases in relation to the CCM operation, which leads to a higher voltage ripple.

3 Phases VR

iL_t

i c ESR

Co

ESL

(a) Three-phase Buck on the output side

iL DCM iL CCM

0 0 t iL_t t iL_t 0 0 t t ic 0 i c 0 t t

(b) CCM operation (c) DCM operation

Figure 3.50 Output current ripple increases under DCM operation, which increase the voltage ripple.

Figure 3.51 illustrates the conflict between efficiency and voltage ripple with 12V input. In terms of efficiency, the on-time needs to be large during DCM. However, to meet the requirements for the output voltage ripple, the maximum on-time needs to be decreased. As a result, we must choose the maximum on-time instead of the optimal on-time for efficiency.

115 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Ton Maximum on time for voltage ripple

Optimal on time for efficiency

Io DCM CCM

Figure 3.51 Conflict between efficiency and voltage ripple at DCM

To solve the conflict, it is desirable to keep a similar inductor current ripple or to reduce the output filter corner frequency, which means the inductance at DCM should be larger.

The on-time at DCM can be obtained from:

L⋅ I T = pk (3-17) on (1/ M −1)V o where Ipk is the peak inductor current at DCM. From (3-17) we can see that when the peak inductor current at DCM is fixed, Ton is proportional to L. From (3-4) and (3-17) we can derive:

1 f (3-18) s ∝ R ⋅ L

This means the switching frequency can be further decreased by enlarging the output inductance at light load. In Figure 3.52, the inductance is doubled. To keep the same inductor peak current, the on-time is also doubled; therefore the switching frequency can be decreased by half. At the same time, since the inductor current ripple does not change, a similar voltage ripple is expected.

iL Io/2 L Ipk 0 2Ts Ton1 t iL

Io/2 Ipk 2L 0 2Ton1 4Ts t

Figure 3.52 Switching frequency is reduced by increasing output inductance at during DCM

116 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Laptop CPUs use different operating states to save power [53]. During the CPU active state, because there is a fast transient, the VR should also have a fast response to meet the transient requirement. The VR should run at CCM operation for the active state. As a result, a small inductance is required to meet the transient requirement at active state [57]. However, since there is no stringent transient requirement at CPU sleep state (DCM operation), a large inductance is preferred for light-load efficiency. Figure 3.53 shows the requirement of the non- linear output inductance at different CPU states.

Lo

LL

Ls

Io Sleep state Active state

Figure 3.53 Preferred output inductance to improve light-load efficiency

(B) One Simple Non-Linear Inductor Implementation

There are several ways to create a non-linear inductor. One way is to use controlled magnetics, e. g. a magnetic [60]. However, this is a little complicated since it requires the control signal to switch between large inductance and small inductance. The second way is to use a saturable inductor [61], [62]. However, it is not easy to accurately control the inductance since we depend on the permeability of the material to get the inductance, which varies greatly with different magnetics. The third way is to use low permeability magnetic material such as MPP, iron power core or LTCC material [63]. The permeability of these materials decreases when the current increases, which decrease the inductance. However, the inductance changes with load current very gradually, no as required in Figure 3.53.

The simple proposed structure to achieve the non-linear inductor is to use a saturable core, as shown in Figure 3.54. When the inductor current is larger than the given saturation current (

Isat ), a small part of the core is saturated. Hence the equivalent air gap is lg which creates small inductance Ls. When the inductor current is lower than Isat, the small part of the core is not

117 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

saturated, and the equivalent air gap is lg1, which creates the large inductance LL. The large

inductance (LL) and the small inductance (Ls) can be designed using (3-19):

2 2 = n = n Ls − LL − − (3-19) l g + le l g l g l g1 + le l g + l g1 ()− u0 A1 A2 u0 ur A1 u0 ur A2 u0 ur A1 u0 A2

The parameters are described in Figure 3.54: A1 is the main magnetic area, A2 is the magnetic area of the small part, le is the effective magnetic length, n is the number of turns, and µr is the relative permeability of the core.

There are other structures that can realize a non-linear inductor. As long as the structure can achieve the required inductance shown in Figure 3.53, it can be used to improve light load efficiency. Figure 3.54 just provides a simple example which can be easily implemented.

A1 lg lg1 le

AC flux A2

(a) iL > Isat

A1 lg lg1 le

AC flux A2

(b) iL < Isat

Figure 3.54 Proposed saturable core to achieve desired inductance

(C) Proposed Control Method 118 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

The control method is very simple. Since the CPU can send out a signal (PSI#) to indicate its current state (active state or sleep state) [53], the on-time can be adjusted immediately according to this signal. At CPU active state, constant on-time or adaptive on-time control can be applied, as shown in Figure 3.55 (for constant on-time control). At CPU sleep state, on-time can be increased to get the same current ripple as CCM operation. The benefit of this control method is noise immunity, because it can be achieved by using available controllers (e.g. MAX1545 from Maxim [56]) without current sensing.

-V /L iL o s iL

(Vin-Vo)/Ls

Isat Isat T t Ton1 t on1

(a) Control at CPU active state (CCM)

Ton2> Ton1

-Vo/Ls (Vin-Vo)/Ls -Vo/LL Isat

(Vin-Vo)/LL

Ton2 t

(b) Control at CPU sleep state (DCM)

Figure 3.55 Increased on-time for DCM operation with the proposed non-linear inductor

The calculated switching frequency of one design example is shown in Figure 3.56. It clearly demonstrates that the proposed control method can further reduce the frequency to improve light-load efficiency.

119 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

o2

6 1 .10

Constant on-time control with regular 5 ⎛ Io2⎞1 .10 inductor f ⎜ ⎟ s1 ⎝ 3 ⎠ Proposed control ⎛ Io2⎞ f fs (Hz) method with non- s2 ⎜ ⎟ . 4 ⎝ 3 ⎠1 10 linear inductor

3 1 .10 0.1 1 10 100 I Io (A)o2

Figure 3.56 Calculated switching frequency with proposed control with non-linear inductor

(Ls = 600nH, LL = 3uH, 3 phases, Isat = 3A, Iomax = 45A, Ton1 = 330ns, Ton2 = 3Ton1)

Another concern is the core loss, especially foe the small saturable part. Figure 3.57(a) shows the core loss estimation based on the core loss density curve in magnetic material datasheet. It can be seen the maximum core loss is only about 17mW, which is negligible. Figure 3.57(b) shows the core loss of the small saturable part is only about 1mW. So there should have no hot spot for it.

0.1 0.1 non-linear inductor Main part

17mW 10mW0.01 0.01

Pc_non()Io2 Pc_main()Io2 . 3 Core loss Core 1mW1 10 Small (W)Pcb()Io2 Pc_small()Io2 3 loss 1 .10 Regular core saturable part 4 1 .10

4 5 1 .10 1 .10 0.1 1 10 100 0.1 1 10 100

I Io2 o2I IL L

(b) Main part vs. saturable part for non-linear (a) Non linear inductor vs. regulator inductor inductor

Figure 3.57 Core loss estimation (Material: 3F3)

(D) Design Guideline with the Proposed Control Method

The design parameters for this control method are: Ton2, Isat and LL. Ls and Ton1 are already determined by the transient requirement and switching frequency at active state, so they are not

120 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

design parameters. Since at CCM the preferred inductance is the small inductance for all of the

instant inductor current, Isat can be determined by (3-20):

I as _ min I ripple I sat = − (3-20) N 2 where Ias_min is the minimum CPU current at active state, N is the phase number of the VR, and

Iripple is the peak-to-peak inductor current at active state, which can be determined by (3-21):

(Vin −Vo )Ton1 I ripple = (3-21) L s

If Isat as calculated by (3-20) is too small, there will be no benefit from the large inductor at

DCM. In this case, we cannot determine Isat based on (3-20) . Therefore the large inductance will be seen in CCM, as shown in Figure 3.58. However, since the period for large inductance is very short, it will not hurt the transient performance that much. For example, if the inductor valley

current is 1.6A and Isat is 2A, the time interval for the inductor current rising from 1.6A to 2A is

about 100ns (assume LL = 3uH). Equivalently, there is 100ns additional delay in the system when load step-up happens. This 100ns delay only adds 2mV more voltage spike with 45A full load current and 8*270µF output capacitance.

iL

Isat

Ton1 t

Figure 3.58 Large inductance can be seen at CCM if Isat is large

Ton2 and LL can be designed by the desired inductor peak current during DCM operation. As mentioned, to meet the output voltage ripple requirement, the inductor peak current during DCM operation should be equal to the inductor peak-to-peak current during CCM operation. Based on the inductor current waveform shown in Figure 3.55(b), (3-22) can be derived:

LL Isat Ls (Iripple − Isat ) Ton2 = + (3-22) V −V V −V in o in o

121 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Once LL is determined, Ton2 can be determined correspondingly. If a larger LL is chosen, the switching frequency can be further decreased, as shown in Figure 3.59(a). Although the output-capacitor ESR caused voltage ripple is the same with the same inductor current ripple, the capacitor-discharge caused voltage ripple increases with a larger LL due to lower switching frequency. As result, larger LL causes larger voltage ripple, which is shown in Figure 3.59(b). Hence the trade-off between the switching frequency and the output ripple should be considered when choosing LL. In our design example, LL=5Ls is chosen to be a good trade-off since the output voltage ripple is similar to the constant on-time control with regular inductor case.

10 16 9 14 8 12 7 6 10 5 8 4 6 3 4 fs (kHz) fs 2

1 (mV) Voripple 2 0 0 05101520 0 5 10 15 20 LL:Ls LL:Ls

(a) Switching frequency vs. LL (b) Output voltage ripple vs. LL

Figure 3.59 Trade-off design for LL

Ls=600nH, 3 phases, Vin=12V, Vo = 0.9V, Isat = 2A, Ipk =Iripple = 6A, Co = 8*270µF SP cap+18*22µF MLCC

(E) Experimental Results

Based on the above design guidelines, a design example is provided, and its efficiency and ripple test results based on an industry laptop VR demo board [56] are shown in Figure 3.60. The experimental setup is as follows:

Vin=12V, Vo=1.05V, single-phase operation of three-phase Buck VR

Each phase configuration: Q1: 1*IRF7821, Q2:2*IRF7822, Inductor type:

Sumida CDEP134H-0R6 (L=600nH) for regular inductor.

Customized EI-18 core with 3F3 material (Ls=600nH, LL=3.1uH) as the non-linear inductor.

Total output capacitance: 8*270µF SP cap from Panasonic+ 18*22µF ceramic cap from TDK.

Ton1 = 330ns, Ton2 = 3Ton1 = 990ns

122 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

Figure 3.60(a) shows that the non-linear inductor VR can achieve 1%~4% higher efficiency at light load and the similar efficiency at heavy load as the regular inductor VR. There is an efficiency jump at 2A since Buck enters DCM operation and switching frequency starts to decrease, as shown in Figure 3.60(b).

It should be mentioned that the efficiency shape at light load is different from Figure 3.38(a). This is because our test focuses on how the power stage efficiency itself is affected by the proposed method. So the quiescent power consumption of the controller (MAX1545) is not included. From Figure 3.60(c) we can see that this design also meets the 20mVpp output ripple requirement with enough design margins.

The similar method can be applied to 2nd stage of two-stage VR, as shown in Figure 3.61. It can be seen that the light load efficiency can be improvement by about 4% with the non-linear inductor design.

92% 90% 88% 86% 84% ` 82% L=600nH regular inductor Efficiency 80% LL=3.1uH non-linear inductor 78% 0.1 1 10 100 Io(A)

(a) Efficiency comparison

1000

100

L=600nH regular inductor Fs (kHz) LL=3.1uH non-linear inductor

10 0.1 1 10 100 Io(A)

(b) Switching frequency comparison

123 Chapter 3. Two-Stage Architecture with Improved Second-Stage Efficiency

14 12 10 8 6 L=600nH regular inductor 4 L=3.1uH non-linear inductor 2 Vorippe (mV) Vorippe 0 0.1 1 10 Io(A)

(c) Output voltage ripple comparison

Figure 3.60 Experimental results with the non-linear inductor (Vin = 12V, Vo = 1.05V)

90% 88% 86% 84%

82% `

Efficiency 80% L=150nH regulator inductor 78% LL=600nH non-linear inductor 0.1 1 10 Io(A)

Figure 3.61 2nd stage of two-stage VR efficiency with non-linear inductor

Ls=150nH, LL = 600nH, Vin=6V, Vo = 0.9V, Ton1 = 250ns, Ton2 = 2Ton1 = 500ns

124 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

4.1. Introduction

It was demonstrated in Chapter 3 that a two-stage VR can achieve high efficiency and high power density. However, since two stages are in series, each stage should be able to handle the full power. The overall efficiency of the two-stage VR is the multiple of the first stage and the second stage, which decreases the second-stage efficiency. For example, with a 600kHz VR design, the second-stage efficiency is more than 91%. Although the first stage can achieve 98% efficiency, the overall efficiency still drops 2% due to the series connection.

Another two-stage approach is known as factorized power architecture (FPA) [27], which can be used for 48V to 1.2V VR applications. The idea behind FPA is to apply the unregulated converter (DCX) to serve as the second stage to achieve higher efficiency. It is well-known that the unregulated converter can be designed at an optimal point to achieve highest efficiency. For example, if an LLC resonant converter works at the resonant point, it can achieve zero voltage switching (ZVS) and almost zero current switching (ZCS) operation [72]. Thus the efficiency is much higher than a hard-switching or ZVS PWM converter. However, since an unregulated converter cannot regulate the output voltage, a pre-regulated first stage is required to achieve voltage regulation.

The two converters of the previous two-stage structures are in series; thus each stage should be able to handle the full power. A more efficient way to deliver power was proposed in [24], as shown in Figure 4.1. The basic concept here is to deliver the power to the load in parallel, and use a high-efficiency unregulated converter to handle most of the power. The unregulated cell is a Class-E inverter plus a rectifier. It can achieve high output impedance and thus acts as the current source. The regulated cell is a switched-mode converter or a . Low output impedance is required for the regulated cell to control dynamics of the system.

125 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Figure 4.1 Parallel power architecture proposed by MIT

Unfortunately, this architecture cannot be applied to VR applications due to the very fast load transient of the CPU. During the transient, most of the current is offered by the regulated cell, which is only designed at very low power at steady state.

4.2. Proposed Sigma VR Structure

To achieve high efficiency and low cost, a sigma VR structure is proposed [25], as shown in Figure 4.2 It is composed of a regulated converter (buck) and a high-efficiency unregulated converter (DC/DC transformer or DCX). The inputs of the two converters are in series, while the outputs are in parallel. That is why it is also named the quasi-parallel VR. There are three aspects in the proposed structure:

(1) The input current is equal: Iin1 = Iin2

(2) The input voltage can be designed by n: Vin1 = nVo, Vin2 = Vin - nVo , where n is the DCX converter turns ratio.

(3) The current (power) ratio from the unregulated converter to the regulated converter is

determined by: Io1/Io2 = Vin1/Vin2.

As a result, the power ratio can be controlled by n. For example, if n=6, and Vo = 1.2V, then Vin1 = 8V (considering there will be some voltage drop in the DCX converter), Vin2 = 4V,

Vin1/Vin2 = 2:1. This means a high-efficiency unregulated converter handles 67% of the output power, while the regulated converter only needs to provide 33% of the output power.

126 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

isolation I in1 Io1 ** + n 1 Vin1- DCX

Vin Buck I Vo in2 Io2 + Vin2 - d Io

Av

Figure 4.2 Proposed sigma VR architecture

The voltage gain of the proposed sigma VR is:

V D M = o = (4-1) V 1+ Dn in where D is the duty cycle of the buck converter. The higher the duty cycle is, the higher the voltage gain is.

Figure 4.3 shows how the sigma structure achieves voltage output regulation during the transient. There are two feedback mechanisms:

(1) Feedback from the power stage. For example, if there is a load step-up, the output voltage will drop correspondingly. Since an unregulated DCX is like a voltage sensor,

Vin1 drops immediately. As a result, Vin2 increases, and hence increases Vo. (2) Feedback from the closed-loop control. This works the same way as the conventional control method. The closed-loop control increases the duty cycle of the regulated converter to increase the output voltage and to achieve tight voltage regulation.

One important concept is to regulate the output voltage by changing the input voltage. This is different from the conventional structures.

127 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

n:1 ** n 1

+ Vin1↓ -

Vin Vo=Vin*d/(1+nd) ↓ ↑ ↑ + V Vo Vo V ↑ o in2 - i ↑( load change) d↑ o

Av

Figure 4.3 Voltage regulation of sigma VR

The benefits of the proposed structure include:

(A) High efficiency

The overall efficiency of the quasi-parallel VR is:

Vin1 Vin2 η = *η1 + *η2 (4-2) V V in in

where η1 and η2 are the efficiency of the DCX and buck converters, respectively. Equation (4-2)

clearly shows that the sigma VR efficiency is mainly determined by the DCX efficiency if Vin1

>> Vin2. It is well-known that the DCX can achieve higher efficiency than a buck converter since it can run at optimal operation points. It will be demonstrated later that the unregulated converter efficiency can be as high as 90% at 600kHz switching frequency. The efficiency of the regulated converter is also higher (~91% at 600kHz) due to its low input voltage (3V-4V). Thus, the overall efficiency is about 90%, which is about 4% to 5% higher than the state-of-the-art multi- phase buck VR solution, which has 85%-86% efficiency with the same switching frequency.

(B) Output capacitor reduction

With the help of the extra low output impedance of the unregulated converter [76], [78], the transient performance of the proposed VR is better than that of a buck converter. The reason for this is that high-voltage input capacitors can equivalently serve as the output capacitors with n2 times higher capacitance. As a result, there is no need to put as many output capacitors there to achieve transient requirement. This feature will be illustrated in detail later.

128 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

4.3. Design Sigma VR with Scalability

4.3.1. Buck Converter Design in Sigma VR

For the buck converter, it is better to design the duty cycle to be 50% to have both good step-up and step-down performance [57]. Since the output voltage is about 1.2V, the input voltage should be around 2.4V. After considering the input voltage variation (11V~13V), the nominal input voltage is chosen to be 4V, as shown in Figure 4.4. Based on the study in Chapter 3, high efficiency is expected for the buck converter due to low switching loss and low-voltage- rating devices.

However, this buck converter is different from the previous 5V-6V buck converter used for the second stage of the two-stage VR. The sigma VR can reach the input voltage (12V) at startup and light-load operation (these will be explained later). Fortunately, the output current is very low for these conditions. So there would be less resonance on the devices than with a 12V buck converter running at full load. A 20V MOSFET can be selected instead of a 30V MOSFET.

As for the current, since the current 12V input VR has 20A for each phase, we just follow the current practice.

isolation

** + n 1 Vin1- DCX 8V Vin 11V~13V Buck 20A 1.2V + Vin2- 3V~5V Io

Figure 4.4 Buck converter in sigma VR

4.3.2. DCX Design in Sigma VR

Since the buck current has been determined, the DCX current can be calculated based on the power ratio between the DCX converter and the buck converter. For example, if a single- phase buck is chosen with a 20A output current, the DCX should deliver about 40A. A high-

129 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

efficiency DCX should be selected. The DCX converter can be any PWM converter or resonant converter, as long as it can achieve high efficiency. For high-switching-frequency applications, resonant converters are preferred to reduce the switching loss. Figure 4.5 shows an example of an unregulated converter [75]. The leakage inductor (Lk) of the transformer resonates with the

output capacitor (Co) to achieve almost ZCS operation. Meanwhile, Q1 and S1 can achieve ZVS by resonance between the magnetizing inductor of the transformer and the output junction capacitor of the MOSFET. Therefore, the switching loss can be almost eliminated. The voltage ratio can be designed by the transformer turns ratio. For example, if the input voltage is 8V and output voltage is 1.2V, the transformer turns ratio should be 6 to leave some margin for the

voltage drop on Q1, S1 and the transformer windings.

It can be seen that this converter has the same number of devices and magnetic components as the buck converter. As a result, it is reasonable to have a 20A output for this converter. Ideally, if we need 40A, we can use two DCX converters in parallel. However, since the DCX converter is unregulated, the current sharing cannot be guaranteed. Therefore, two DCX converters are not recommended to be in parallel directly. To get 40A, we propose to use a quasi-parallel structure again for two DCX converters, as shown in Figure 4.5. The inputs of the two DCX converters are in series, while the outputs are in parallel. Based on the same theory as the sigma VR, we can assume that since the input voltage is the same, the current of the two DCX converters can be automatically shared.

LK One Transformer DCX1 1.2V * ip * 4V LM Co 20A is1 4V DCX2 S1 20A Q1

Figure 4.5 DCX example 1: ZVS quasi-resonant converter proposed by CPES

130 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Although the input voltage for each DCX is only 4V, Q1 should use a 30V MOSFET due to

the resonance between the output junction capacitors of Q1 and the magnetizing inductor of the

transformer [75]. As for S1, the voltage peak is about 10V. Thus a high-performance 12V MOSFET can be used to get better efficiency. The transformer turns ratio is 3:1 for each DCX converter. It should be mentioned that an additional auto-transformer is required to average the output voltage of the two single-phase DCX and to reduce the output voltage ripple [25].

Another possibility is to find a DCX converter that can deliver 40V without further paralleling. The LLC resonant converter is a good candidate [72], as shown in Figure 4.6. The LLC resonant converter has an input voltage of 8V, n=3, and all switches are 12V MOSFETs. Since it works at the resonant frequency, ZVS and almost ZCS can be achieved to get high efficiency.

DCX 1.2V 8V 40A

Figure 4.6 DCX example 2: LLC resonant converter working at fixed frequency

There is a special configuration for ZVS-QR DCX in order to reduce the output voltage ripple, as shown in Figure 4.7. An additional auto-transformer is required to average the output voltage of the two 40A DCX and to reduce the output voltage ripple. Figure 4.7 also provides the

test results. The voltage ripple of each DCX is high since Lk needs to resonate with output capacitors. However, due to the interleaving and averaging effect of auto-transformer, the final output ripple is very small. Another observation is that the volt-second on the auto-transformer is

very low (determined by Vo1-Vo2 and switching frequency). Therefore, the size of the auto- transformer is very small. For LLC DCX, there is no such configuration since the output voltage ripple is relatively low.

131 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Vo1 DCX1-2 40A

Vo1 Vo2 * 1 Vo

Co Vo 1* 5mV DCX3-4 40A V o2

Figure 4.7 Autotransformer is required to reduce output voltage ripple for ZVS-QR DCX

Figure 4.8 shows the efficiency comparison between ZVS-QR DCX and LLC DCX. It shows that LLC DCX can get slightly higher efficiency but it needs 4 more MOSFETs.

2-phase LLC DCX (80A) 4-phase ZVS-QR DCX (80A) 12*CSD13301, total devices: 12 4*RJK0301 + 4*CSD13301 Total devices: 8 fs=530kHz, duty cycle: 50% fs=600kHz, duty cycle: 75% 4 EIR14, 3F3, total magnetic components: 4 4EIR11 + 1EIR9.5, 3F3, Total magnetic components: 5

132 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

95%

90%

85% 4-P ZVS-QR DCX

Efficiency 80% 2-P 40A LLC DCX

75%

70% 0 20406080100 Io(A)

Figure 4.8 Efficiency comparison between ZVS-QR DCX and LLC DCX (fs = 600kHz)

Figure 4.9 shows the loss breakdown comparison between the two DCXs. Although LLC DCX has more synchronous (SR), the SR conduction loss is still higher than ZVS-QR DCX. The reason is ZVS-QR DCX has 75% duty cycle for SR conduction while LLC DCX only has 50%, and larger duty cycle results in lower RMS current. In the following design example, ZVS-QR DCX is chosen to reduce the VR cost.

12

10 4‐P ZVS‐QR DCX

8 2‐P 40A LLC DCX (W)

6

4

Power loss 2

0

Figure 4.9 Loss break down of ZVS-QR DCX and LLC DCX @ 80A

133 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

4.3.3. Sigma Cell Concept for Scalability

The above sigma VR can deliver 60A current. Comparing with a three-phase buck VR, which also has 60A current, the sigma VR has the same number of devices and magnetic components, which means a similar cost, as illustrated in Figure 4.10.

3-phase Buck -60A Sigma Cell -60A

12V 12V Buck DCX 4V 20A 20A

1.2V 1.2V Buck 4V DCX 20A 20A 60A 60A Buck Buck 4V 20A 20A

Figure 4.10 Comparing 60A sigma cell with 3-phase buck

The above structure can be treated as one sigma cell. Based on this cell, a scalable sigma VR can be built to deliver more current. Figure 4.11 shows the basic concept, which has the same basis as the multiphase buck VR. The only concern is current sharing among cells. Since there is a fixed power ratio between the buck converter and the DCX converter for each sigma cell, as long as all the buck converters in every cell can share the current, all the DCX converters can share the current too. Moreover, the commercial VR controllers used to control the multi- phase buck VR can be used to control all buck converters in the sigma VR.

For example, DCX1 and DCX2 which are ZVS quasi-resonant converters are the DCX

converters for different sigma cells. Assuming the on-resistance of Q1 and S1 differs by 20% for two DCX converters in different cells, the DCX converter currents are different for the two DCX converters, as shown in Figure 4.12(a). By slightly adjusting the buck duty cycle of the two DCX converters, the current can be shared, as shown in Figure 4.12(b).

134 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

12V Cell #1 DCX 4V 20A

DCX 4V 20A

Buck 4V 20A . 1.2V . . Cell #N DCX 60A*N 4V 20A

DCX 4V 20A

Buck 4V 20A

Figure 4.11 Scalable VR with sigma Cell

d1=46%, d2=46% d1=45.3%, d2=48%

4.5 4 4.5 3.5 4 V 3.5 / 3 T V U 2.5 / 3 -O T 6 2 U 2.5 U -O 6 1.5 U 2 1 1.5 0.5 1 0.5 0 0

4 4

V 3 / 3 V T / U T O 2 - U 2 4 -O 1 4 U 1 1 U 1

0 0

-1 4.9385 4.939 4.9395 4.94 4.9405 4.941 -1 4.9385 4.939 4.9395 4.94 4.9405 4.941 tim e /m S e c s 500nS ecs/ div tim e / mSecs I = I 500nS ecs/div IDCX1 > IDCX2 DCX1 DCX2

100 100

80 80

60 60 A A

40 40

20 20

0 0 4.951 4.952 4.953 4.954 4.955 4.956 4.957 4.958 4.959 4.96 9.941 9.942 9.943 9.944 9.945 9.946 9.947 9.948 9.949 9.95

tim e / mSecs 1uS ecs/div tim e /m S e c s 1uS ecs/ div

(a) Without current sharing (b) With current sharing

Figure 4.12 Current of DCX can be shared by controlling buck duty cycle

With the above configuration, the total load current is the multiple of 60A (e.g. 60A, 120A, 180A, etc). So if we need 120A, which is required for the VR11, two sigma cells should be

135 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators used, as shown in Figure 4.13. The prototype has a two-phase buck converter and a four-phase DCX converter, and the prototype has the same number of components as a six-phase buck converter, which also delivers 120A.

12V Cell #1 DCX1 4V 20A

DCX2 4V 20A

Buck1 4V 20A 4-phase

Cell #2 1.2V 2-phase DCX DCX3 4V 20A Buck

DCX4 4V 20A 120A Buck2 4V 20A

Figure 4.13 One design example: 120A VR for VR11

Figure 4.14 shows the test efficiency of the sigma VR and its comparison with a six-phase buck VR. The circuit setup is shown in Table 4.1.

Table 4.1 Circuit parameters for sigma VR and 6-phase buck VR

Switching Configurations Devices Magnetic components frequency

(1*RJK0305 + 6-phase buck VR 6 phases 600kHz 200nH/phase 1*IRF6635) /phase

Buck: 150nH/phase Buck: (1*Si7106 + 2-phase buck 1*BSC019N02)/phase DCX: EIR11/phase, 3F3 Sigma VR and 4-phase 600kHz DCX: (1*RJK0301 + Auto-transformer: DCX 1*CSD13301)/phase EIR9.5, 3F3

The experimental results show that 90% efficiency can be achieved by the sigma VR, which is about 3~4% higher efficiency than a single-stage VR. Thus it can meet the efficiency requirement from IBM and Intel. 136 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

95%

90%

85%

Efficiency 80% Sigma VR 6-phase Buck VR

75%

70% 0 20406080100120140 Io (A)

Figure 4.14 Tested efficiency for sigma VR

The previous sigma cell current was 60A. Thus the sigma VR can only get: 60A, 120A, 180A, 240A, etc. Scaling down the cell current provides more flexibility. Figure 4.15 shows how to scale down a 60A sigma cell to a 30A sigma cell.

Sigma Cell -60A Sigma Cell -30A 12V DCX 4V Scale Down 12V 20A DCX 8V 20A 1.2V 1.2V 4V DCX 20A 60A Buck Buck 4V 4V 20A 10A 30A

Figure 4.15 30A sigma VR cell

This figure shows a sigma VR cell composed of one 20A DCX converter and one 10A buck converter. The DCX design is slightly different with a 60A sigma cell since both the transform turns ratio and the primary-side switch need to be changed, as shown in Figure 4.16. As for a 10A buck, we can just borrow the design for another POL converter that has similar current. For example, the DDR VR normally has about 10A current, so we can use that design. Basically, compared with the 20A buck design, the 10A design just needs to have the die size of the devices reduced by half and have the inductor

137 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

size reduced by half as well. Table 4.2 shows an example of a 10A buck design, which can also achieve about 91% efficiency at 10A, which means that the 30A sigma cell can achieve about 90% efficiency.

n:1 * * Sigma Cell -30A Co is1 12V DCX 8V S1 20A Q1 1.2V Vin=8V n=6

Buck Q1: 60V MOSFET 4V 10A 30A S1: 12V/20V MOSFET

Select devices with half of the die size; inductor with half of the size comparing with 20A Buck

Figure 4.16 30A sigma VR cell design

Table 4.2 An example of 10A buck design

20A Buck 10A Buck

SiA430DJ (Rds=14.5mohm, Top Switch (20V) Si7106 (Rds=6mohm, Qgd=2.8nC) Qgd=1.4nC)

Bottom Switch BSC019N02 (Rds=1.6mohm, IRF6636 (Rds=3.5mohm, (20V) Qg=60nC) Qg=21nC)

FP4-200, L=200nH. FP2-D100, L=400nH Inductor Size: 10.8mm*6.2mm*5mm Size: 7.2mm*6.7mm*3mm

4.4. Modeling and Transient Analysis of Sigma VR

To close the loop and achieve AVP, first the small-signal model of the sigma VR must be studied. Figure 4.17 shows the small-signal model of the power stage of the proposed converter,

where Rout and Lout represent the output impedance of the DCX converter [76].

138 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Lout Rout ^ i1

** n C1 1

+ ^ ^ dVin/D ^ vin - L DCR i2

^ ** Rc I d ^ L 1D ^ io vo C2 C

Figure 4.17 Small-signal model for the power stage

Basically the power stage itself has closed-loop feedback from the DC/DC transformer, which attenuates the gain of the open-loop transfer functions, in contrast with the buck converter.

Assume Rout and Lout is small enough, then the major transfer functions are shown in (4-3):

^ v V 1+ s /ω G = o ≈ in2 zc vd ^ 1+ Dn s s2 d 1+ + ω ⋅Q ω 2 o o

^ i − nV s(1+ s /ω ) G = 1 ≈ in2 z1 id1 ^ (1+ Dn)2 (n2C − D⋅C) s s 2 d in 1+ + 2 ωo ⋅Q ωo

^ (4-3) i V s(1+ s /ω ) G = 2 ≈ in2 z2 id 2 ^ (1+ Dn)2 (C + n2C ) s s2 d in 1+ + 2 ωo ⋅Q ωo

1 1 2 Q = ⋅ ω = (1+ Dn) / L(C + n C ) ω (1+ Dn)2 ⋅ R ⋅C + DCR(C + n2C ) o in o c in

1 nC − D⋅C C + n2C ω = ω = in ω = in C = C + C zc R C z1 R CnC z2 R Cn2C in 1 2 c c in c in From (4-3) it can be observed that the double pole is moving with the duty cycle. However, since other input voltage and output voltage range is narrow, the duty cycle does not change too

139 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

much at steady state. All of the other transfer functions can be derived based on the model and they have the similar characteristics.

A CPU VR must achieve AVP to meet the Intel load line specifications. Generally the current injection control method is applied to achieve AVP [77], as shown in Figure 4.18. However, it is not easy to sense the DCX current since resonant type of DCX is used to improve the efficiency.

n:1 iin1 i v DCX o1 o Zo1c Vin1 + + Vin Io + RLL + i i in2 Buck o2 Vin2 Zo2c Co ic Zc io

AV PWM

Vref

Figure 4.18 Current injection control to achieve AVP from Intel

As explained before, the current ratio between DCX and buck is equal to the input voltage ratio of them. As a result, we have:

Vin i = i (4-4) o V o2 in2 Based on (4-4), a simpler way to achieve AVP is to only sense the buck current, as shown in Figure 4.19(a). Figure 4.19(b) and (c) shows the desired output impedance to achieve AVP,

where Zo1c and Zo2c are the closed-loop impedance of the DCX and buck converters, respectively; Zc is output capacitor impedance; and Zoc = Zo1c||Zo2c||Zc is the total closed-loop output impedance of the sigma VR. The impedance of the DCX converter should be much lower than the impedance of the buck converter within a wide frequency range to ensure that it also

handles most of the AC current during the load transient. Zoc should meet the impedance requirement from Intel, which specifies that the impedance should be equal to or lower than RLL, up to 2MHz [7].

140 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

iin1 v HF DC/DC io1 o transformer Zo1 Vin2 Vin Vin1 + Vin + Vin Rdroop = RLL Vin2 iin2 High Freq. io2

V Buck Zc i in2 Co o Zo2

AV

PWM

Vref

(a) Control diagram

Desired impedance v |Z| io1 io2 ic o |Zc| |Zo2c| |Zo1c|

|Z ||Z | Z Z o1c o2c o1c o2c Zc io

RLL |Zoc|

2MHz f

(b) Equivalent circuit (c) Desired impedance

Figure 4.19 Control diagram and desired impedance by sensing buck current only

Figure 4.20 shows the small signal block diagram [77]. The definition of Tv, Ti and T2 is as follows:

Tv T = A F G Ti = AV Rdroop FmGid 2 T2 = (4-5) v v m vd 1+T i

141 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

^ io Zo ^ + vo

+ Gvd

Tv

^ Gii2 + i2 + + Gid2 Rdroop ^ Ti - d - FM Av

Small Signal Block

Figure 4.20 Small single block diagram of the sigma VR

Since the transfer functions of sigma VR is very similar to buck VR, we can just follow the design guideline of buck VR to achieve constant output impedance [77]. The basic idea is to

push the current loop bandwidth higher than T2 bandwidth, so the inductor current of buck converter can be treated as a current source, as shown in Figure 4.21.

L out Rout ^ i1

** n 1 C1 L out Rc ^ ^ v ^ i2 R o out io ** Rc ^ 1 D ^ i 2 C vo o C *n C 2 C in

(a) Inductor can be treated as a current source (b) Output impedance Zoi

Figure 4.21 Output impedance when the current loop closed (Zoi)

142 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Figure 4.21 clearly shows that with the help of low output impedance of DCX (Rout and 2 Lout is low), the input capacitors is equivalently parallel with output capacitance with n times capacitance. This explains why sigma VR has better transient performance than buck VR.

Figure 4.22 shows the benefit from sigma VR. Assume both buck VR and sigma VR has the same output impedance, since sigma VR can get help from the input capacitors, Zoi of sigma

VR is lower than buck VR. Since the close-loop output impedance Zoc = Zoi/(1+T2), the required

T2 bandwidth of sigma VR is only 100kHz while the required T2 bandwidth of buck VR should

be 210kHz, by assuming the phase margin of T2 is 60 degree.

20

Zoi of multi-phase

30 Buck VR

40

gain() Zc, 2i⋅π⋅fnn 2 1/(sCinn )+Rout gain() Zx, 2i⋅π⋅fnn 50 Impedance gain() Zo1_HF , 2i⋅π⋅fnn Zoi of sigma VR RLL=1mohm60

70

80 3 4 5 6 7 1 .10 1 .10 T2 bandwidth:1 .10 T2 bandwidth:1 .10 1 .10 100kHz fnn 210kHz

Figure 4.22 Reduced T2 bandwidth of sigma VR - with the same output capacitance with buck

( Co = 5 100µF + 18 22µF MLCC, Cin = 88uF for sigma VR, Rout = 1.3mΩ, Lout = 150pH)

20 Zoi of multi-phase

30 Buck VR with 5*100uF Bulk cap

40

gainZ()c, 2i⋅π⋅fnn 50 gainZ()x2 , 2i⋅π⋅fnn Zoi of sigma VR Impedance with No Bulk Cap RLL=1mohm60

70

80 3 4 5 6 7 1 .10 1 .10 1 .10 1 .10 1 .10

fnn T2 bandwidth: 210kHz

Figure 4.23 Reduced output capacitance of sigma VR - with the same T2 bandwidth with buck

( Co = 5 100µF + 18 22µF MLCC for buck VR;

no bulk capacitor for sigma VR, Cin = 88uF, Rout = 1.3mΩ, Lout = 150pH)

143 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Figure 4.23 shows the benefit from another angel. If the bandwidths of sigma VR and buck VR are the same, sigma VR can eliminate all bulk capacitors with 210kHz control bandwidth, while buck VR needs five 100 µF with the same bandwidth.

Figure 4.24 shows the SIMPLIS frequency-domain simulation results. Figure 4.25 shows the time-domain simulation results. The circuit parameters are:

DCX: n=6, Rout = 1.3mΩ, Lout = 150pH, (DCX impedance) fs = 600kHz; Buck: fs =

600kHz, L=60nH. Vin = 12V, Vo = 1.2V, No Bulk Capacitors, decoupling capacitor: Co =

18*22µF, C1 = C2 = 2*22µF.

This suggests that the bulk capacitors can be eliminated, while a 600kHz buck converter needs at least five 100µF ceramic capacitors to meet the transient requirement. It also shows that the DCX converter takes care of most of the DC and AC current in the load transient condition.

40

20 T2 (Gain) 0 -20 100 io -40 70 100 T2 (phase) 40 0 -100 fc=230kHz, PM=75 degree 1.26 V 1k 2k 4k 10k 20k 40k 100k 400k 1M 2M 4M 10M 1.2 o

-20 1.14 I ( DCX -25 80 o1 -30 Zc current) -35 50 -40 Zo2c(Buck) 20 -45 4.4 4.5 4.6 4.7 4.8 4.9 -50 Z ( DCX) -55 o1c Io2 (Buck -60 time/mSecscurrent) 100uSecs/div -65 Zoc 1k 2k 4k 10k 20k 40k 100k 200k 400k 1M 2M 4M 10M

Figure 4.25 Transient waveforms (Vo initial spike Figure 4.24, Loop gain and closed-loop impedance can be ignored)

The above Lout and Rout value is from Vicor’s DCX [76]. Since they have advanced packaging technology, both Lout and Rout are very small: Rout = 1.3mΩ, Lout = 150pH. In our sigma VR prototype, all of the devices and magnetic components are discrete, which result in larger Lout and Rout: Rout = 1.8mΩ, Lout = 3nH. Since Rout and Lout serves as ESL and ESR of the

144 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators equivalent input capacitors, the input capacitors cannot help too much for the transient performance anymore. Figure 4.26 shows four 390µF SP capacitors are required to meet the impedance requirement with 210kHz control bandwidth.

20

30 Zoi of sigma VR with 4*SP Cap 40

gain() Zx2 , 2i⋅π⋅fnn 50 Impedance

RLL=1mohm 60

70

80 3 4 5 6 7 1 .10 1 .10 1 .10 1 .10 1 .10

fnn fc=210kHz

Figure 4.26 Bulk capacitors are required with large Lout and Rout

( Co = 4 330µF SP Cap + 18 22µF MLCC, Cin = 88uF, Rout = 1.8mΩ, Lout = 3nH)

Figure 4.27 shows the transient test results for sigma VR. It can achieve AVP requirement from Intel.

Vo:100mV/div

Io: 30A/div

(a) Load transient waveforms

145 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Vo:100mV/div Vo:100mV/div

Buck PWM Buck PWM

Io: 30A/div Io: 30A/div

(b) Load step up waveforms (c) Load step down waveforms

Figure 4.27 Transient performance of sigma VR

( Co = 4 330µF SP Cap + 18 22µF MLCC, Cin = 88uF, Rout = 1.8mΩ, Lout = 3nH)

4.5. Startup and Light Load Efficiency Improvement of Sigma VR

From the information in the VR11 design guideline [7], the VR starts with a no-load condition, as shown in Figure 4.28. After VR sends out VR_READY signal, CPU clock begins to work, which means there is CPU current after that.

Figure 4.28 VR startup with no-load condition

146 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

For the startup of the sigma VR, the most challenging issue is how to let the DCX converter soft-start. It is known that the DCX converter must have very small output impedance, so the sigma VR cannot allow too much difference between the input and output voltages. Figure

4.29 shows that if there is a difference between Vin and the equivalent Vo on the primary side

(n*Vo where n is the transformer turns ratio), huge current can be observed.

10 Vin 8

6

4 LK Vo 2

* * 0 LM Co More than 300A i s1 200 150 S1 Q1 100 is1 50 0 -50 -100 -150 0 5 10 15 20 25 30 35 40 45

time/uSecs 5uSecs/div

Figure 4.29 Huge current observed if there is input and output voltage difference for DCX

The proposed soft-start method is shown in Figure 4.30. The basic idea is to soft-start the output voltage reference. As a result, Vin1 will follow Vo with the fixed voltage ratio n. There is no difference between the input voltage and equivalent output voltage for the DCX converter. Therefore, no surge current can be observed. It also can be seen that the buck converter will see 12V input voltage for the soft-start case. However, since it is a no-load condition, there is no high-voltage spike on the devices.

Figure 4.31 shows the experimental results for the soft start. It is not easy to get the current information. However, based on the output voltage and input voltage from the buck converter information, we can see there should be no surge current in the sigma VR during start-up.

147 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Vref VID 0

+ * * S1 Vgs(Q1) Vin1- 0

Vgs(S1)

V 0 in Q1 Vgs(Q2) V o 0

+ Vin Vin2 Q - 2 S2 Vin2 0 G - cv + Vin1 n*Vo 0 Vref Vo 0

Figure 4.30 Proposed start-up method for sigma VR

Vin2 (Buck input)

Vo

Figure 4.31 Experimental result for soft-start method

Another issue is the light-load efficiency of the sigma VR. In Figure 4.32, it can be seen that the sigma VR has higher efficiency than the buck VR for all of the load conditions. A 60A sigma cell is the minimum configuration. However, for the buck converter, the phase-shedding strategy can be used to improve light-load efficiency. If we compare the sigma VR with the phase-shedding buck VR, the light-load efficiency of the sigma VR is lower, as shown in Figure 4.32. Hence the light load efficiency must be improved.

148 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

95.0%

90.0%

85.0%

Sigma VR Efficiency 80.0% 3-phase Buck VR 2-phase buck VR 75.0% 1-phase buck VR Buck with phase-shedding

70.0% 0 10203040506070 Io (A)

Figure 4.32 Light load efficiency comparison

If we look at a loss breakdown at light load for the sigma VR, it can be seen that the DCX converter power loss is dominant, as shown in Figure 4.33. Therefore the key to improving light- load efficiency is to improve the DCX converter efficiency.

1.6 1.4 1.2 1.0 0.8 0.6 0.4 0.2 DCX loss Buck loss

Power Loss Power Loss (W) 0.0 0 2 4 6 8 10 12 14 16 18

Io (A)

Figure 4.33 Loss break at light load for sigma VR

The strategy for improving the light-load efficiency of the sigma VR is to change the position of Vin1 and Vin2. At light load, if we can make the DCX converter input voltage zero,

149 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

which means the buck converter takes all of the input voltage, the DCX converter will have no power and the buck will take all of the power. As a result, the light-load efficiency can be improved. Moreover, all of the methods used to improve the buck converter’s light-load efficiency can also be used for the sigma VR. The sigma VR can achieve the same light-load performance as the multi-phase buck VR.

Figure 4.34 shows the control strategy of shutting down the DCX at light load. Actually, the CPU will send out a Power State Indicator (PSI#) signal to the VR to indicate that the CPU will go into sleep mode and the VR should work at light-load condition. When the VR receives the PSI# signal, it will always turn on Q1 and always turn off S1. Therefore, Vin2 will keep increasing until it reaches the input voltage.

n:1 CPU Active state CPU Sleep State

* * PSI# Signal + S1 Vin1- 0

Vgs(Q1) 0 V in Q1 Vgs(S1) Vo 0

V + in2- Q2 S2 i s1 0 - G V s cv Vo + 0

Vref Vin Vin2 0

Figure 4.34 Shut-down DCX at light load to improve light load efficiency

However, when the sigma VR has a heavy load, we need to activate the DCX converter to deliver more power, since the buck converter is designed to deliver a small part of the output power. Figure 4.35 shows the control strategy of activating the DCX at heavy load. When the

VR receives the PSI# signal that the CPU will go into the active state, Q1 is turned off. Since

there is no energy transferred from the input side, Vin2 decreases quickly, and Vin1 increases

correspondingly. When Vin1 hits n times Vo, we can let Q1 and S1 work at normal condition and

150 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

there is no surge current, since the input voltage and equivalent output voltage of the DCX is the same.

CPU Sleep State CPU Active state Lr n:1 PSI# Signal + * * LM1 S1 Vin1- 0

Vgs(Q1) 0

Vin Q 1 Vgs(S1) Vo 0

+ Vin2 Q i s1 - 2 S2 0

G - Vin cv + Vin2 0 n*Vo Vref Vin1 0

Figure 4.35 Activate DCX at heavy load for sigma VR

Figure 4.36 shows the light-load efficiency comparison again. It can be seen that the sigma VR can achieve similar light-load efficiency with a buck VR, while having higher efficiency at heavy load.

95.0%

90.0%

85.0%

80.0% sigma VR with DCX sheding Efficiency buck VR with phase shedding 75.0%

70.0% 0 10203040506070 Io (A)

Figure 4.36 Light-load efficiency comparison by shutting down DCX at light load

151 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

4.6. Comparison between Sigma VR and Two-Stage VR

If we only compare the sigma VR and the two-stage VR, the sigma VR can achieve higher efficiency, since the power is delivered to the load in parallel instead of in series. However, this does not mean the sigma VR can replace the two-stage VR. Each of these VR designs can be used in different applications.

1) The sigma VR cannot be used in laptop applications due to its wide input voltage range, as shown in Figure 4.37. Two-stage power architecture is a better solution for laptop computers.

isolation

** + n 1 Vin1- DCX 7V Vin

9V~19V Buck 1.2V + Vin2- 2V~12V Io

Figure 4.37 Wide input voltage range for buck converter if the input voltage range is wide

2) To power the Itanium CPU which is for high-end servers, since there are three VRs to power three different loads, using the sigma VR would be more complicated than using the two-stage VR. As a result, the two-stage VR is preferred.

3) To power the Xeon CPU, since there is only one VR required and high efficiency is very important, the sigma VR can be used to improve the efficiency.

4) For desktop computers, since the cost is still the most important thing, sigma VR is more preferred due to its ability to reduce the output capacitors. More research is required if sigma VR is applied to desktop computers.

152 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

4.7. Three Other Implementations of Sigma VR

The above just shows one implementation of the sigma VR, where the regulated converter has the same ground for both the input and the output, and the DCX converter is isolated. Both the DCX converter and the buck converter deliver the power to the load. We call this first implementation, Implementation #1.

The most challenging part of implementing the proposed concept is the high-side converter selection, since the input and output of the high-side regulated converter do not share the same ground. One possible alternative implementation is to use an inverting buck- to serve as the regulated converter in Implementation #2, as shown in Figure 4.38. However, the drawback of Implementation #2 is that the unregulated converter should provide more current than the load current, since the buck-boost converter is circulating power. It is not an efficient implementation, especially for high-current applications.

Inverting Av Buck- + Boost Vin2 converter Io2 - V in n:1 I Unregulated o1 Vo + converter Vin1 I - o

Figure 4.38 Implementation #2 of sigma concept

To solve the grounding problem, it is better to use an isolated converter as the high-side regulated converter, as shown in Figure 4.39. This third implementation works exactly the same as the concept in Implementation #1, so it is an efficient converter. The regulated converter can be a half-bridge PWM converter, a forward converter, etc.

153 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

isolation Av Isolated + D2D Vin2 Io2 -

Vin n:1 Io1 Vo + Unregulated V in1 converter - Io

Figure 4.39 Implementation #3 of sigma concept

There is one more implementation, which is shown in Figure 4.40. It has a similar working principle as Implementation #2, so the efficiency is not good. However, Implementations #2 and #4 do not require the transformer to perform isolation, so they may be used for low power POL applications.

Io1 Unregulated + Converter Vin1 - n:1

Vin Io2 Vo Boost + Converter Io Vin2 - d Av

Figure 4.40 Implementation #4 of sigma concept

4.8. Other Applications for Sigma Architecture

4.8.1. Sigma Architecture for Low Current POL Converters

For low-power POL converters, high power density is required, since they are mostly for portable applications. The simplest way to improve the power density is to raise the switching frequency and further integrate the converter. Figure 4.41 shows that even when we push the

154 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

POL converter frequency up to 5MHz, the inductor size dominates the overall size. It is possible to further reduce the inductor size by raising the switching frequency even higher, but the efficiency would drop too much.

LTM460X EN536X Integrated L Integrated L L Die Fs=1.5MHz Die Fs=5MHz L L Volume L Volume ≈16*Die ≈ 13*Die

The size of the magnetic component dominates the overall size

Figure 4.41 Inductor size is too large for POL converters

Let us determine which sigma structure is suitable for the POL converter. Since it is low power, isolation is not preferred. Thus we select Implementation #2.

Figure 4.43 shows the detailed circuit diagram. Two voltage dividers are in series to get 4:1 voltage gain. As discussed in Chapter 2, the power density of the voltage divider is very high because it doesn’t include any magnetic components. Although the regulated buck-boost converter includes one inductor, it only needs to handle 2.5A current, which is 25% of the load current. As a result, the inductor size can be greatly reduced compared with 10A buck POL converters.

Inverting Av Buck- + Boost Vin2 converter Io2 - V in n:1 I Unregulated o1 Vo + converter Vin1 I - o

Figure 4.42 Implementation #2 is selected for POL converters

155 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

Q1 Q2

V in2 L

2.5A Vin 0.5A

Q1 Q1 Vo C C VD 1 1 Q Q2 C 2 C 2 + 2 + Vin1 Q Vin1 Q 10.5A 10A Vin1 3 3 C3 C3 Q Q4 - 4 - Co 4:1

Figure 4.43 Detailed circuit diagram for POL application

Figure 4.44 shows the inductor size comparison. The benchmark POL converter is from Linear Technology (LTC3418) with a 1MHz switching frequency [79]. It shows that the sigma POL converter can achieve 68% footprint reduction and 80% volume reduction. Figure 4.45 shows the efficiency comparison.

Inductor for Sigma Converter Inductor for Buck Converter Coilcraft Tok o L=0.33uH, Isat=5A L=0.2uH, Isat=16A DCR=23mohm DCR=4.5mohm Dimension: 4.0*4.0*1.2mm Dimension: 6.7 x 7.4 x 2.0 mm 68% footprint reduction 80% volume reduction

Figure 4.44 Inductor size comparison of sigma POL and buck POL converters

156 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

90% Sigma POL converter 85% 80% 75% 70% Benchmark: LTC3418 Efficiency 65% single-phase Buck, fs=1MHz 60% 0246810 Io (A)

Figure 4.45 Efficiency comparison

It should be mentioned that although the sigma POL converter can greatly reduce the inductor size, it also increases the silicon size significantly. As shown in Figure 4.43, a total of ten MOSFETs are used in the sigma POL converter while only two MOSFETs are used for the benchmark buck POL converter. However, based on the information from Figure 4.41, which indicates that the inductor size is much larger than the silicon size, the sigma POL converter can make a better trade-off between the silicon size and the size of the magnetic component.

To better understand the above argument, let us take EN536X as an example. The inductor size of EN536X is about 13 times as the die size. With sigma configuration, the die size will increase about 4 times, but the inductor size can be reduced by 80%, assuming the above ratio still holds. Finally, the total size can still be reduced by around 50% with sigma power architecture.

4.8.2. Sigma Architecture for Server and Telecom Applications

To embrace the traits of the LLC resonant converter running at the resonant point and achieve the output voltage/current regulations, the sigma DC/DC architecture is proposed for server and telecom applications [80], as shown in Figure 4.46. It features:

(1) High overall efficiency which is determined by the DCX at full load. (2) High light load efficiency by shutting down DCX at light load.

157 Chapter 4. High Efficiency Quasi-Parallel Two-Stage Architecture – Sigma Voltage Regulators

(3) Simple SR driving of the DCX due to the fixed switching frequency at resonant frequency. (4) With the help of the regulated converter, the output current can be easily controlled for the modular paralleling, soft start and output current limiting/protection.

isolation I in1 Io1 ** ~98% I + n 1 O Vin1 DCX - Vin

Iin2 I Vo ** o2 + n 1 Vin2 ~2% I - O Io d PWM Av

Figure 4.46 Sigma architecture for server and telecom applications

158 Chapter 5. Conclusions and Future Work

Chapter 5. Conclusions and Future Work

This dissertation tackles two major challenges in VR design: achieving high efficiency and achieving high power density. Two different power architectures are studied for this purpose.

5.1. Improvements of Series Two-Stage Power Architecture

The first power architecture is a two-stage architecture, and it has been investigated before. Using the ‘divide and conquer’ concept, the single-stage VR is split into two stages to get even higher efficiency and higher power density.

The first stage just reduces the input voltage from 12V to 6V, and the second stage can be used to raise the frequency to reduce the output filter size.

In this dissertation, some improvements to the standard two-stage VR architecture are made to make the two-stage design more attractive:

1. A higher-efficiency and higher-power-density switched-capacitor voltage divider is proposed as the first stage to replace the buck converter first stage used previously. • By analyzing the switching losses and conduction losses carefully and considering parasitic inductors in the real circuit, 2000W/in3 power density voltage dividers are designed with more than 97% efficiency. The light-load efficiency can be as high as 99%. • Transient performance of voltage divider is analyzed in detail to make sure there are no instability or interactions when the voltage divider is used as the first stage of two-stage VR. • Start-up and protection methods are proposed to make sure the safe operation of the proposed voltage divider. • Besides the switched-capacitor voltage divider, some other related topologies are also studied. The three-level resonant buck converter can achieve similar efficiency as the switched-capacitor voltage divider, but with lower power density. However,

159 Chapter 5. Conclusions and Future Work

with the help of the small inductor, voltage regulation can be achieved, which may be helpful for current-sharing if paralleling is necessary. 2. Some improvements are made to the second stage. First, low-voltage-rating devices are applied in the second stage, which leads to a 89% efficiency with 1MHz frequency. Then, an adaptive on-time control method is proposed to improve the buck converter’s efficiency at CCM operation. Up to 2% higher CCM efficiency can be achieved with the proposed control method. A non-linear inductor with advanced control strategy is proposed to improve the buck converter’s efficiency at DCM operation. Up to 4% higher DCM efficiency can be achieved with the proposed method. 3. According to new situations, the two-stage VR architecture is no longer cost-effective if one first stage is only used to power one second stage. To get better performance, the two-stage power architecture is applied to laptop systems and high-end server systems, and we call them as system level two-stage architecture. For laptop applications, it has been demonstrated that a two-stage VR can achieve 5% lower cost and 45% footprint reduction if the power density is the major concern. If efficiency is the primary concern, it can achieve 3% higher efficiency with a similar cost. For high-end server microprocessors, since high power density is required in the VR, 1MHz two-stage VRs are designed to achieve similar efficiency as the single-stage VR, but with a 45% footprint reduction.

5.2. Analysis and Design on Proposed Sigma Power Architecture

The two stages of the above architecture are in series, so the power is processed two times. If the two stages are in parallel, the efficiency should be even higher. Sigma architecture is based on this concept. The proposed sigma VR takes advantage of the high-efficiency, fast- transient unregulated converter and relies on this converter to deliver most of the output power, while it uses a low-power buck converter to achieve voltage regulation. My contributions for this part include: 1. We proposed the sigma power architecture. 2. Ninety percent efficiency, which is required by IBM and Intel, has been demonstrated for the sigma VR. Different DCX converter structures are discussed in the dissertation to provide more flexibility for the sigma VR.

160 Chapter 5. Conclusions and Future Work

3. The sigma cell concept is introduced in the dissertation for scalability of sigma VR. 4. Modeling and design on sigma VR is illustrated in the dissertation and AVP is achieved with experimental results. It demonstrated that the proposed sigma VR can achieve 33% output capacitor reduction. 5. Some other possible applications for sigma architecture are also investigated in the dissertation. One example is that sigma architecture can play better trade-off between the die size and the inductor size to reduce the total size of POL converters. I do not assert that the sigma VR can replace the series two-stage VR. The drawback for the sigma VR is that the input voltage range should be narrow (+-10%). Additionally, DCX converter design is more complicated than buck converter design. As a result, both the sigma VR and the two-stage VR can be applied in applications that can best make use of their respective strengths.

5.3. Future Work

There are still some remaining works to be done which are related to this dissertation:

1. The detailed analysis on three-level resonant buck. This topology is only slightly touched in this dissertation, thus thorough study on it, e.g. how to control three- level resonant which can achieve both high efficiency and output voltage regulation, should be a very good topic. 2. Evaluation on low-voltage lateral MOSFET for the bottom switch. It has been demonstrated that lateral MOSFET can achieve higher efficiency than trench MOSFET for the top switch, but due to the reverse-recovery problem of the body diode, it cannot be used as the bottom switch of the buck converter. As body diode performance is improved, the benefits of lateral MOSFET as the bottom switch needs to be investigated. 3. Magnetic integration of DCX for sigma VR to reduce the leakage inductance. It has been verified by simulation that with about 150pH equivalent output inductance for DCX, the output bulk capacitors of the sigma VR can be eliminated with only 600kHz switching frequency. However, the current discrete version has about 3nH output inductance for DCX and needs about four SP capacitors. Some magnetic integration work may be done to reduce the leakage inductance of DCX.

161 Chapter 5. Conclusions and Future Work

4. Investigation of sigma VR for desktop computers. In this dissertation, the major applications are laptop and server computers. Ideally, sigma VR is more suitable than two-stage VR for desktop VR since the cost is its most concern. With careful design, the sigma architecture could reduce the VR cost significantly while achieving similar efficiency.

162

References

Chapter 1

[1] Intel document, “Moore's Law: Made Real by Intel® Innovation,” www.intel.com/technology/timeline.pdf.

[2] Intel presentation, “On the road to concurrency…: Reading the Terrain Ahead,” December 2007.

[3] R. Gonzalez, B. M. Gordon, and M. A. Horowitz, “Supply and Threshold Voltage Scaling for Low Power CMOS,” IEEE Journal of Solid State Circuits, 32(8), August 1997.

[4] S. Saewong and R. Raikumar, “Practical Voltage Scaling for Fixed-Priority RT-Systems,” Ninth IEEE Real-Time and Embedded Technology and Application Symposium, May 2003, pp. 106-115.

[5] Kisun Lee, “Advanced Control Schemes for Voltage Regulators,” Ph. D. dissertation, Virginia Tech, 2008.

[6] Xunwei Zhou, Pit-Leong Wong, Peng Xu, Fred C. Lee, Alex Q. Huang, “Investigation of Candidate VRM Topologies for Future Microprocessors,” IEEE Transactions on , Nov 2000, pp. 1172 - 1182.

[7] Intel document, “Voltage Regulator-Down (VRD) 11.0 Processor Power Delivery Design Guidelines for Desktop LGA775 Socket,” November 2006.

[8] Intel document, “ (VRM) and Enterprise Voltage Regulator- Down (EVRD) 11.0 Design Guidelines,” November 2006.

[9] A.Waizman and C.Y. Chung, "Resonant Free Power Network Design using Extended Adaptive Voltage Positioning (EAVP) Methodology," IEEE Trans. Advanced Packaging, vol. 24, pp. 236-244, Aug. 2001.

163

[10] Yuancheng Ren, Kaiwei Yao, Ming Xu and Fred C. Lee, “The Analysis of the Power Delivery Path from 12V VR to the Microprocessor,” IEEE Trans. on Power Electronics, Vol. 19, No. 6, Nov 2004.

[11] IDC, “The Impact of Power and Cooling on Data Center Infrastructure”, Document #201722, May 2006.

[12] Tomm Aldridge, “Enterprise Platform Power Efficiency for Future Server Systems,” IBM Technology Symposium, 2004.

[13] Annabelle Pratt, Pavan Kumar, Kevin Bross, Tomm Aldridge, “Powering Compute Platforms in High Efficiency Data Centers,” IBM Technology Symposium, 2006.

[14] Urs Hoelzle and Bill Weihl, “High-efficiency Power Supplies for Home Computers and Servers,” from Google website.

[15] Jan Janick, “IBM Systems and Technology Update,” IBM Technology Symposium, 2007.

[16] Kaladhar Radhakrishnan, et al, “Optimization of Package Power Delivery and Power Removal Solutions to Meet Platform Level Challenges,” IDF 2005.

[17] Molex, “Z-Axis Power Delivery,” IBM Technology Symposium 2004.

[18] “Power Pod Design Specification,” Intel Document No. 10956.

[19] Xin Zhang, Alex Q. Huang and Nick X. Sun, “Microprocessor Integration by VRB-CPU Approach,” in PESC 2003.

[20] Alex Q. Huang, “New Unipolor Switching Power Device Figures of Merit,” IEEE Electron Device Lett., vol. 25, pp. 298-301, 2004.

[21] Yuancheng Ren, Ming Xu, Jinghai Zhou and Fred C. Lee, “Analytical Loss Model of Power MOSFET,” IEEE Transactions on Power Electronics, VOL.21, No. 2, Mar. 2006, pp. 310-319.

[22] Yuancheng Ren, “High Frequency, High Efficiency Two-Stage Approach for Future Microprocessors,” Ph.D. Dissertation, Virginia Tech 2005.

[23] Jia Wei, “High Frequency High-Efficiency Voltage Regulators for Future Microprocessors”, Ph.D. Dissertation, Virginia Tech 2004.

164

[24] Juan M. Rivas, Riad S. Wahby, John S. Shafran, and David J. Perreault, “New Architectures for Radio-Frequency DC–DC Power Conversion,” IEEE Transactions on Power Electronics, March 2006.

[25] Julu Sun, Ming Xu, David Reusch and Fred C. Lee, “High Efficiency Quasi-Parallel Voltage Regulators,” in IEEE APEC 2008.

Chapter 2

[26] Martin F. Schlecht, “High efficiency power converter,” United State Patent No. 6,222,742.

[27] Factorized Power Architecture and V•I Chips”, http://www.vicorpower.com

[28] Maxim MAX1545 datasheet, www.maxim-ic.com.

[29] Linear Technology, LT1054 datasheet, www.linear.com.

[30] K. D. T. Ngo, R. Webster, “Steady-State Analysis and Design of Switched-Capacitor DC/DC converter,” IEEE Trans. on aerospace and electronics systems, Jan. 1994.

[31] Fan Zhang, Lei Du, Fang Zheng Peng, and Zhaoming Qian, “A New Design Method for High-Power, High-Efficiency Switched-Capacitor DC/DC converters,” IEEE Transactions on Power Electronics, Vol. 23, No. 2, March 2008.

[32] Jonathan W. Simball, Philip T. Krein, “Analysis and Design of Switched Capacitor Converters,” in APEC 2005.

[33] Ming Xu, Julu Sun and Fred C. Lee, “Voltage Divider and its Application in the Two-stage Power Architecture”, IEEE APEC 2006, pp. 499-505.

[34] Fangzheng Peng, Fan Zhang and Zhaoming Qian, “A magnetic-less DC-DC converter for dual-voltage automotive systems” IEEE Transactions on Industry Applications, Volume 39, Issue 2, March-April 2003 pp. 511–518.

[35] Yungtaek Jang, Milan M. Jovanović, and Yuri Panov, “Multi-Phase Buck Converters with Extended Duty Cycle”, APEC 2006.

[36] Julu Sun, Ming Xu and Fred C. Lee, “High Power Density, High Efficiency System Two- stage Power Architecture for Laptop Computers,” in IEEE PESC 2006.

165

[37] Julu Sun, Ming Xu, Fred C. Lee and Yucheng Ying, “High Power Density Voltage Divider and Its Application in Two-Stage Server VR,” in IEEE PESC 2007.

[38] Masahito Shoyama, et al, “Resonant Switched Capacitor Converter with High Efficiency,” in PESC 2004.

[39] Vahid Yousefzadeh, et al, “Three-Level Buck Converter for Envelope Tracking Applications,” in IEEE Transactions on Power Electronics, Mar. 2006..

[40] C.M. Wildrick, et al, “A method of Defining Load Impedance Specifications for a Stable Distributed Power System,” in IEEE Transactions on power Electronics, May 1995.

[41] G. W. Wester, and R. D. Middlebrook, “Low-Frequency Characterization of Switched DC- DC converters,” IEEE Transactions on Aerospace and Electronic Systems, May 1973, pp. 376-385.

[42] R. D. Middlebrook, and S. Cuk, “A General Unified Approach to Modeling Switching- Converter Power Stages,” IEEE PESC, 1976, pp. 18-34.

[43] Taiyo Yuden capacitor datasheet. Part number: LMK432BJ226MM-T.

[44] Synqor product: www.synqor.com.

Chapter 3

[45] Yuming Bai, “Optimization of Power MOSFET for High-Frequency Synchronous Buck Converter,” Ph. D. dissertation, Virginia Tech, 2003.

[46] Z. John Shen, , David N. Okada, Fuyu Lin, Samuel Anderson, and Xu Cheng, “Lateral Power MOSFET for Megahertz-Frequency, High-Density DC/DC Converters,” IEEE Transactions on Power Electronics, Vol. 21, No. 1, January 2006.

[47] K.R. Varadarajan, A. Sinkar, T.P. Chow, “Novel Integrable 80V Silicon Lateral Trench Power MOSFETs for High Frequency DC-DC Converters,” in IEEE PESC 2007.

[48] Alex Q. Huang, “New Unipolar Switching Power Device Figures of Merit,” IEEE Electron Device Lett., vol. 25, pp. 298-301, 2004.

[49] Yucheng Ying, “Device Selection Criteria-- Based on Loss Modeling and Figure of Merit”, master thesis, 2008, CPES, Virginia Tech.

166

[50] L. Spaziani, “A study of MOSFET Performance in Processor Targeted Buck and Synchronous rectifier buck converters,” in Proc. HFPC, 1996, pp. 123–137.

[51] Yuancheng Ren, Ming Xu, Jinghai Zhou, and Fred C. Lee, “Analytical Loss Model of Power MOSFET”, in IEEE Transactions on power Electronics, March, 2006.

[52] G. Chinn, S. Desai, E. DiStefano, K. Ravichandran and S. Thakkar, “Mobile PC Platforms Enabled with Intel® Centrino™ Mobile Technology,” Intel Technology Journal Vol.7 Issue 2, May 21, 2003.

[53] Intel Core Duo Processor and Intel Core solo processor Datasheet, Intel document number: 309221-004.

[54] H. Arbetter, R. Erickson, and D. Maksimovid, “DC-DC Converter Design for Battery- Operated Systems,” in IEEE Proc. PESC, 1995.

[55] Linear Technology LTC3732 Datasheet, http://www.linear.com.

[56] Maxim MAX1545 datasheet, http://datasheets.maxim-ic.com/en/ds/MAX1519- MAX1545.pdf.

[57] P. L. Wong, F. C. Lee, P. Xu, and K.Yao, “Critical Inductance in Voltage Regulator Modules,” in IEEE Transactions on power Electronics, July, 2002.

[58] Coilcraft design tools. http://www.coilcraft.com/apps/loss/loss_1.cfm.

[59] Cooper Bussmann FP2-V150 datasheet. http://www.cooperbussmann.com.

[60] K. Harada, and T. Nabeshima, “Applications of magnetic to high-frequency DC- to-DC converters,” in Proceedings of the IEEE, Volume 76, Issue 4, April 1988, pp. 355 – 361.

[61] G. Hua, F. C. Lee, and M. Jovanovic, “An Improved Full-Bridge Zero-Voltage-Switched PWM converter using a Saturable Inductor,” in IEEE Transactions on Power Electronics, Volume 8, Issue 4, Oct. 1993.

[62] B.-H. Kwon, J.-H. Kim, and G.-Y. Jeong, “Full-Bridge Soft Switching PWM Converter with Saturable Inductors at the Secondary Side,” in IEE Proceedings on Electric Power Applications, Jan. 1999.

167

[63] M. H. Lim, J. D. van Wyk, F. C. Lee, and K. D. T. Ngo, “ A Class of Ceramic-Based Chip Inductors for Hybrid Integration in Power Supplies,” in IEEE Transactions on Power Electronics, Volume 23, Issue 3, May 2008, pp. 1556 – 1564.

[64] Jai Wei, Fred C. Lee, “Two-Stage Voltage Regulator for Laptop Computer CPUs and the corresponding Advanced Control Schemes to Improve Light-Load Performance,” in IEEE Proc. APEC, 2004.

[65] X. Zhou, M. Donati, L. Amoroso, and F. C. Lee, “Improved Light-Load Efficiency for Synchronous Rectifier Voltage Regulator Module,” IEEE Trans. Power Electron., vol. 15, pp. 826–834, Sep. 2000.

[66] Y. Panov and M. M. Jovanovic, “Adaptive Off-Time Control for Variable-Frequency, Soft- Switched Flyback Converter at Light Loads,” IEEE Transactions on Power Electronics, vol. 17, no. 4, July 2002

[67] M. D. Mulligan, B. Broach, and T. H. Lee, “A Constant-Frequency Method for Improving Light-Load Efficiency in Synchronous Buck Converters,” IEEE Power Electronics Letters, vol. 3, no. 1, MARCH 2005

[68] Bruce Rose, “EBL Power Delivery in Mobile Platforms,” Intel Technology Symposium, September, 2003.

[69] Yang Qiu, et al, “High-Bandwidth Designs for Voltage Regulators with Peak-Current Control,” in APEC 2006.

[70] Julu Sun, Yuancheng Ren, Ming Xu and Fred C. Lee, “Light Load Efficiency Improvement for Laptop VRs,” in IEEE APEC 2007.

[71] Julu Sun, Ming Xu, Fred C. Lee and Yucheng Ying, “High Power Density Voltage Divider and Its Application in Two-Stage Server VR,” in IEEE PESC 2007.

Chapter 4

[72] Bo Yang, Fred C. Lee, Alpha J. Zhang and Guisong Huang, “LLC Resonant Converter for Front-End DC/DC Conversion,” in IEEE APEC 2002.

168

[73] Amol Bhinge, Ned Mohan, Ramesh Giri and Raja Ayyanar, “Series-Parallel Connection of DC-DC Converter Modules with Active Sharing of Input Voltage and Load Current,” in IEEE APEC 2002.

[74] Phitip T. Krein and Jonathan W. Kimball, “Series-Parallel Approaches and Clamp Methods for Extreme Dynamic Response with Advanced Digital Loads,” IEEE Workshop on Computers in Power Electronics, 2004.

[75] Yuancheng Ren, Ming Xu, Julu Suu and Fred C. Lee, “A family of high power density unregulated bus converters,” IEEE Transactions on Power Electronics, Sept, 2005.

[76] Vicor’s VTM product, http://www.vicorpower.com

[77] Kaiwei Yao, et al, “Adaptive Voltage Position (AVP) Design,” in IEEE APEC 2004.

[78] Eric X. Yang, Fred C. Lee and Milan M. Jovanovic, “Small signal model of series and parallel resonant converters”,in IEEE PESC 1992.

[79] LTC3418 datasheet. www.linear.com.

[80] Ming Xu, Ya Liu, Julu Sun and Fred C. Lee, “Σ/Sigma DC/DC Conversion for Computing and Telecom Applications”, in IEEE PESC 2008.

169