Ultra Low Power, High Performance Negative Edge Triggered Ecrl Energy Recovery Sequential Elements with Power Clock Gating

Flip flops form an indispensable building block in the design of digital systems. In this study, adiabatic switching technique is used in the design of low-power negative edge triggered energy recovery flip-flops. In particular, the negative edge triggered D, SR and JK flip-flop designs based on the Quasi-adiabatic ECRL (Efficient Charge Recovery Logic) architecture are proposed. The projected design is illustrated with an 8 bit Serial Input and Parallel Output shift register (SIPO). The setup time and hold time are minimal in negative edge triggering compared to the pulsed and positive edge triggering which contributes to the high performance and the signal integrity of the design. Additionally, glitching is avoided due to the edge triggering which in turn reduces the soft error rate in the flip-flops. The simulation results have shown improvements in power efficiency by 95 and 75%, respectively for the D flip-flop and 8 bit shift registers than their CMOS counter parts. Due to its several merits, this design can find many real time applications such as digital communication (cryptography), memories and shift registers in microcontrollers. The four phase power clocks are utilized in the pipelining of the stages of the shift register, which mitigates all the relevant timing problems addressed in the literature of flip flops. For simulation, SPICE EDA simulation environment using the 350 nm process technology library from Austria micro systems have been used.


INTRODUCTION
With the rapid growth of CMOS technology, the density and complexity of the integrated circuit increases.This aggravates the power dissipation problem and temperature control becomes difficult thus enhancing the cooling costs.Moreover, the need for reliable, low cost, high performance portable devices adds power as one of the design metrics.The conventional approaches to achieve low power design are to reduce: • The supply voltage • The capacitance of the gate • Loading capacitance of the gate • The switching activity One of the promising techniques in the design of low power digital circuits is to apply the concept of adiabatic switching or energy recovery.Energy recovery principle achieves low energy dissipation by recycling the energy through the use of an AC type (oscillating) power supply voltage (Hamid et al., 2009).In this method, the supply clock is a trapezoidal signal that pumps energy to the circuit during the first phase and retrieves the energy during the third phase.
This study identifies the following problems and the solutions: • Major portion of total power in highly synchronous systems is dissipated in the clock network.Hence, energy recovery clocking is a much needed low power solution.• In adiabatic or energy recovery logic, the clock generator circuit continuously provides clock signal.Generally, a clock system may be driving sequential elements like flip-flops and latches (Cooke et al., 2003).As the clock travels utmost distances, the design of power and performance efficient sequential elements with adequate concern on the clock skew and clock jitter is also necessary.• Power gating can also be made at the clock nodes for reducing the clock switching power dissipation during the idle mode of the circuits.• Both positive and negative edge triggered flip-flops are required in the sequential circuit.However, there are no negative edge triggered energy recovery flip-flops in the literature so far.
Several adiabatic logic architectures have been reported in the literature.Most of these adiabatic logic families use either diodes for pre charge (Dickinson and Denker, 1995;Lau and Liu, 1997), or a Pair of Cross-Coupled PMOS transistors for both pre charge and evaluation (Moon and Jeong, 1996;Peiyi et al., 2011).These circuits overcome the CV 2 f barrier posed by the conventional CMOS logic and achieve extremely low power consumption.This study presents the ECRL (Efficient Charge Recovery Logic), a promising architecture for low power design of the negative edge triggered energy recovery flip-flops.Further, four phase clocking used for pipelining in the stages of the ECRL architecture improves the performance also.

Structure for the ECRL circuit:
The ECRL circuits use a differential signaling scheme so that an inverter can also be used as a buffer.The supply clocks comprises of 4 operational phases.The 1 st phase of ECRL circuit is the Evaluation, during which the input values are sampled and the output nodes are charged or discharged depending on the inputs.The sampled input values are held/stored during the Hold phase for the next stage and the stored charge is recovered during the Recovery phase.
The input signal can be changed during the Wait phase.The PMOS transistors are cross-coupled and are used for pre-charge and evaluation (Fig. 1a).The NMOS transistors, however, are used to tie one of the differential output nodes to ground, when the inputs are in the hold phase.Four power clocks are used for the cascading stages in sequence.Here, the power clocks are represented by trapezoidal waveforms (Fig. 1b).Although the ECRL is an adiabatic logic style, a small fraction of non-adiabatic energy dissipation still exists.Hence it is said to be belonging to 'quasi-adiabatic' logic family.When in = 1 during the evaluate phase of the clock cycle, outb is tied to ground (as the power clock Pclk ramps from 0 to Vdd).When Pclk reaches │Vtp│, (where Vtp is the threshold voltage of the PMOS transistor), P2 turns on and the load capacitance at the output node (out) will begin to charge up to logic '1'.This causes an initial, non-adiabatic dissipation of approximately 1/2 C L Vtp2 where C L is the load capacitance at the output nodes (Ng et al., 2000).
During the recovery phase, as the voltage of the power clock Pclk approaches │V tp │, the PMOS transistor P 2 gets turned off and the recovery path to the power clock is disconnected.The charge left on the load capacitance at the end of the recovery phase will either leak away slowly through the turned-off devices, or discharge through N2, if inb changes value (from '0' to '1') for the next logic evaluation.Thus, the nonadiabatic dissipation during the recovery phase, (less than or equal to ½ C L V 2 tp ) is inevitable.However, it can be minimized as discussed below.As the PMOS Fig. 1a: An ECRL inverter/buffer Fig. 1b: Power clock waveform body is connected to the Pclk instead of a DC power line, better recovery is achieved through the parasitic diodes formed between the source/drain and the body of the PMOS transistors as shown in Fig. 1a.By doing so, the output voltage can be further recovered to Vd (0.5V), the forward voltage drop across the diode, even if the PMOS transistor is already switched off when Pclk decreases below V TP during the recovery phase.This improves the energy recovery process and realizes a substantial improvement in terms of power dissipation.
However, this method only applies during the recovery phase, as the diodes are reverse biased during the pre-charge/evaluate phase.Subsequently, the proposed circuit designs of D, SR, JK flip flops and the 8 bit serial input parallel output shift register are designed using the modified ECRL structure The use of four clocks enables each of the cascaded stages to be simultaneously active during the same clock cycle.This pipelining greatly enhances the performance of the design.Furthermore, the several pulsed flip-flops in the literature are found to have less setup time but have many drawbacks like larger hold time and hence larger Clk-Q delay.On the other hand, the proposed edge triggered flip-flops have lesser setup time, lesser hold time and hence less Clk-Q delay overcoming all the

RESULTS AND DISCUSSION
The circuits are designed based on the Austria micro systems 350 nm process CMOS technology and are simulated using SPICE tool.Four-phase sinusoidal power clocks with peak-to-peak voltage of 3.   To evaluate the performance of the proposed flipflops, CMOS transmission gate based flip-flops powered at 3.3 V DC supply are also designed and simulated.The pulsed flip-flops, namely, CDMFF, SDFF (Athas et al., 1994), use less number of clocked transistors for power improvement and incur lesser setup time.And, it has large hold time, thus increasing the total propagation delay.However, the proposed design of flip-flops, whose setup time is the rise time of the 'negtrigg' signal is extremely minimum (t setup = 490 ps) and the hold time is also less, of the order of t h = 500 ps.The CLK to Q delay was found to be 240 ps (for f = 125 MHz) when simulated in the 350 nm technology.The above results clearly signify the performance enhancement of the proposed design, inn comparison against those published in the literature.Further, a 4-bit shift register implemented with the proposed D flip-flop shows a power dissipation of 84 µW showing 75% power efficiency (Fig. 5c).

CONCLUSION
This study presented the performance and power improvement of the quasi adiabatic sequential structures.Negative edge triggering realizes additional merits of reduced clock skew and jitter, when compared against the positive triggered and the pulsed flip-flops.This study presented a negative edge triggered ECRL energy recovery D, SR, JK flip-flops.From the simulations, these flip-flops have shown significant improvement in terms of power consumption and D-Q delay compared to the CMOS based flip-flops.The simulation results precisely shows that the proposed design of D flip flop is highly power efficient with an energy dissipation of 12.75 fJ for a load of C L = 150 fF.Further, it is proved that the SR flip-flop is dissipating energy of 13.5 fJ for the same load.The design is validated through 4-bit and 8-bit shift register constructed with the proposed D flip-flop.Hence, the ECRL based negative edge triggered flip-flops can be used as building blocks in the design of adiabatic/energy recovery systems both in terms of power and delay.

Fig. 3c :
Fig. 3c: Cascaded blocks of ECRL negative edge triggered SR flip-flop and buffers clock generation is overcome due to the use of the pulse generation circuitry shown in Fig. 2a and b, which by itself generates the four phase signal.The transient waveforms in Fig. 2b precisely explain the generation of the negative trigger signal (negtrigg).This negtrigg signal is used in the proposed flip-flops.Figure 2c shows the cascaded blocks of ECRL negative edge triggered D fip-flop and buffers Proposed D flip-flop: The proposed D ECRL flip-flop and the simulated transient waveforms are as shown in Fig.2d and e.Besides the PMOS loads (P1, P2) there are four NMOS pull down transistors (N1-N4).During the evaluate phase, when D = 1 and the negtrigg is high, node QB becomes low.This will turn P2 'on' and we have Pclk (high) in the Q output.The output is held during the hold phase.The energy delivered during the evaluate phase is recovered during the recovery phase by the discharging current through P1.When D = 0 and the negtrigg is high, node Q takes low value and this will turn P1 'on' and we have Pclk in the QB output and the energy is recovered during the recovery phase.Such a flip flop can be cascaded with four stages of buffers operating with the four phases of clock Pclk1, Pclk2, Pclk3 and Pclk4, respectively.

Fig. 4c :
Fig. 4c: ECRL negative edge triggered 4 bit shift register 3 V are used to power up the ECRL circuits.A clock frequency of 125 MHZ is used.Simulated results of the D flip flop, SR flip-flop, the JK flip-flop at 125 MHz power clock frequency are shown in the tabulation.Furthermore, the designs are also simulated for the frequencies 20, 40, 50, 90, 100 MHz, respectively and the results were analyzed for optimum performance.The power reduction achieved by the ECRL-based negative edge triggered D flip-flop, ranges to 96% at 125 MHz compared to its CMOS counterpart as depicted in Fig.5a.The power delay product has also been found to insist the tradeoff between the power and delay as depicted in Fig.5b.The comparison of the flipflops is shown in the Fig.5cwhich illustrates the major three design metrics of the IC design.Besides, the other parameters determining the functionality, performance and area are also tabulated in Table1and compared for all the designed flip-flops.The energy dissipation of the designs at light and medium load are also evaluated.The power delay product calculated serves as the figure of merit to compare all the designs.

Fig. 5a :
Fig. 5a: Power comparison of the ECRL negative edge triggered flip-flops with its CMOS counterparts

Table 1 :
Simulation results of the ECRL negative edge triggered flip- flops Parameter ECRL D-FF ECRLSR-FF ECRL JK-FF