VLSI Lec 4

download VLSI Lec 4

of 54

Transcript of VLSI Lec 4

  • 7/22/2019 VLSI Lec 4

    1/54

    VLSI DesignVLSI Design

    By Dr. Yaseer A. DurraniDept. of Electronics Engineering

    University of Engineering & Technology, Taxila

    Outline Introduction to Combinational & Sequential

    Circuits

    De Morgans Law & Boolean Algebra Rules

    Static & Dynamic Logic Circuits

    Memory Logic Circuits

    2

  • 7/22/2019 VLSI Lec 4

    2/54

    Combinational Vs. Sequential Logic Combinational logic circuits

    Output depends only on present input Sequential logic circuits

    Output depends on present input and present state of circuit

    3

    Circuit Circuit

    Combinational Circuit

    Sequential Circuits

    State

    Output=f(input)

    Output=f(Input, Previous Input)

    Design Procedure Combinational logic design procedure

    Start with the problem statement Determine the number of inputs variables & required number of output variables Derive a truth table that defines the required relationship between I/O Simplify each output function (Karnaugh maps) Draw the logic diagram

    Sequential logic design procedure Derive a state/output table from problem specification Minimize number of states in state/output table by eliminating equivalent states Choose a set of state variables. Assi n to each state a uni ue combination from set

    4

    derived above Create a transition/output table

    Choose a flip-flop type & construct its excitation table Using excitation table fill the values for input excitation function columns on

    transition/output table Derive the excitation & output equations Draw logic diagram

  • 7/22/2019 VLSI Lec 4

    3/54

    De Mogans Law Commutative law of addition for two variables is written as: A+B = B+A

    Commutative law of multiplication for two variables is written as: AB = BA

    Associative law of addition for 3 variables is written as: A+(B+C) = (A+B)+C

    A

    B A+B

    B

    A B+A

    A

    B AB

    B

    A B+A

    A A A+B

    5

    Associative law of multiplication for 3 variables is written as: A(BC) = (AB)C

    Distributive law is written for 3 variables as follows: A(B+C) = AB + AC

    B

    C

    B(A+B)+C

    CB+C

    A

    B

    A(BC)

    C

    A

    B(AB)C

    CBC

    AB

    B

    C

    A

    B+C

    A

    B

    C

    A

    XX

    AB

    AC

    X=A(B+C) X=AB+AC

    Rules of Boolean Algebra

    DeMor ans theorems rovide mathematical verification of:

    1.6

    .5

    1.4

    00.3

    11.2

    0.1

    =+

    =+

    =

    =

    =+

    =+

    AA

    AAA

    AA

    A

    A

    AA

    BCACABA

    BABAA

    AABA

    AA

    AA

    AAA

    +=++

    +=+

    =+

    =

    =

    =

    ))(.(12

    .11

    .10

    .9

    0.8

    .7

    A, B, and C can represent a single variable or a combination of variables

    6

    Equivalency of NAND & negative-OR gates

    Equivalency of NOR & negative-AND gates

    ZYXW

    ZYX

    ZYX

    ZYX

    ++

    ++

    Complement of two or more ANDed variables is equivalent to OR ofcomplements of individual variables

    Complement of two or more ORed variables is equivalent to AND ofcomplements of individual variables

    YXYX +=

    YXYX =+

    Exercise

    )(

    )(

    FEDCBA

    EFDCBA

    DEFABC

    DCBA

    +++

    ++

    +

    ++

  • 7/22/2019 VLSI Lec 4

    4/54

    Boolean function is a expression formed with binary variables, OR, AND, NOToperators

    F=xyz Boolean function can be algebraically expressed from truth table by taking OR all

    minterms that produce 1 as output in function

    Boolean function can be algebraically expressed from truth table by taking ANDall maxterms that produce 0 as output in function

    Boolean Function

    7

    Digital Design Digital design engineers always tries to build circuit using universal gates

    (NAND or NOR) as they are easy to fabricate

    Rules of obtaining NAND gate implementation of Boolean function Convert all AND gates into NAND gates by using AND-NOT symbol Convert all OR gates into NAND gates by using NOT-OR symbol For every bubble that is not counteracted by another bubble along same line,

    insert a NOT gate or complement the literal from its original appearance

    Rules of obtaining NOR gate implementation of Boolean function

    8

    - Convert all AND gates into NOR gates by using NOT-AND symbol For every bubble that is not counteracted by another bubble along same line,

    insert a NOT gate or complement the literal from its original appearance

  • 7/22/2019 VLSI Lec 4

    5/54

    Examples

    9

    Simplification of Boolean Algebra

    10

  • 7/22/2019 VLSI Lec 4

    6/54

    Random Logic Random Logic: Circuit design using NAND, NOR & NOT gates (often

    called AOI Logic gate represents = AND-OR-Inverter Logic)

    11

    Random logic: Adder

    Logic Families Logic families are classified as: Static Logic

    Do not use clock signal Slower Easy to design

    Dynamic Logic Use clock signal More faster Difficult to design

    12

  • 7/22/2019 VLSI Lec 4

    7/54

    Static CMOS Circuit At every point in time (except during switching transients) each gate output is

    connected to either VDD or VSS via low-resistive path Outputs of gates assume at all times the value of Boolean function,

    implemented by circuit (ignoring, once again, transient effects during switchingperiods)

    PUN consists PMOS transistors & PDN consists NMOS transistors PUN & PDN implementations are complimentary to each other

    PMOS NMOS Series topology Parallel topology VDD Make a connection from VDD to

    F when F(In1, In2,Inn)=1

    13

    VSS

    PUN

    PDN

    In1

    In2

    In3

    F =G

    In1In2In3

    PUN and PDN are Dual NetworksPUN and PDN are Dual Networks

    PMOS Only

    NMOS Only

    Make a connection from VDD toVSS when F(In1, In2,Inn)=0

    No steady state path b/w VDD & VSS (no staticpower consumption)

    Delay a function of load capacitance &transistor resistance

    Comparable rise & fall times (underappropriate transistor sizing conditions)

    High noise margins Full rail to rail swing VOH & VOL are at VDD & VSS Low output impedance, high input impedance

    Design Method NMOS devices pull the output to 0 when the gate inputs are 1

    PMOS devices pull the output to 1 when the gate inputs are 0

    Consider a function to be realized: F(A,B,C, . . .)

    NMOS pull-down network must realize the pull-down function

    FPD = F(A,B,C,)

    PMOS pull-up network must realize the pull-up function

    FPU = F(A,B,C) The literals in FPU have to be inverted, because the p-channel transistors

    conduct, if their gate input is 0 (low)

    14

  • 7/22/2019 VLSI Lec 4

    8/54

    Rules for Logic Formation Rule 1: NMOS transistors in series implement the AND operation Rule 2: NMOS transistors in parallel implement the OR operation Rule 3: Logic functions in series are ANDed together

    Rule 4: Parallel nMOS branches OR the individual branch functions Rule 5: Parallel connections of NMOS transistors have to be transformed to

    serial connections of PMOS transistors. The input literals applied to the PMOStransistors are identical with the gate inputs of the NMOS transistors (noinversion needed)

    Rule 6: Serial connections of NMOS transistors have to be transformed to.

    Rule 7: Parallel connected logic blocks of the NMOS network Serial connection in the PMOS network

    Rule 8: Serial connected logic blocks of the NMOS network Parallel connection in the PMOS network

    15

    PUN/PDN Rules NMOS is ON when Vg=1, OFF when Vg=0. For PMOS vice versa NMOS produces strong Zeros while PMOS produces strong Ones NMOS connected in Series corresponds to AND function while in Parallel OR

    function. PMOS connected in Series corresponds to NOR function & Parallel toNAND function

    Using De Morgans Theorem usingPUN/PDN of complementary CMOS structure are dual network. This meansParallel connection in PUN corresponds to Series connection in PDN & vice

    ( )BABA =+)( BABA +=

    16

    Complementary gates is naturally inverting, implementing only functions such asNAND, NOR and XNOR. Noninverting Boolean function such as AND, OR, XORin a single stage is not possible & requires addition of an extra inverter stage

    Number of transistors required to implement an N-input logic gate is 2N

  • 7/22/2019 VLSI Lec 4

    9/54

    NMOS/PMOS Serial/Parallel Connection Transistors can be thought as a switch controlled by its gate signal NMOS switch closes when switch control input is high PMOS switch closes when switch control input is low

    NMOS: 1 = ON, PMOS: 0 = ON Series: both must be ON, Parallel: either can be ON

    X Y

    A B

    X Y

    A B

    A C A C

    17

    Y = X if A and B

    NMOS Serial Connection

    X Y

    A

    B

    Y = X if A OR B

    NMOS Transistors pass astrong 0 but a weak 1

    NMOS Parallel Connection

    Y = X if A AND B = A + B

    PMOS Serial Connection

    X YB

    Y = X if A OR B = AB

    PMOS Transistors pass astrong 1 but a weak 0

    PMOS Parallel Connection

    B D

    A

    B

    C

    D

    A B C DA B

    C D

    B

    D

    Y

    A

    C

    A

    C

    B D

    B

    D

    Y

    Construction of PDN NMOS devices in series implements NAND function NMOS devices in parallel implements NOR function

    A

    B

    A B

    A B

    A + B

    18

  • 7/22/2019 VLSI Lec 4

    10/54

    CMOS Inverter

    A Y

    0

    1 0

    V DD

    A= 1 Y= 0

    GND

    ON

    OFF

    A YA Y

    0 1

    1 0

    V DD

    A= 0 Y= 1

    OFF

    ON

    19

    GND

    CMOS NAND Gate

    A B F

    0 0 1

    0 1 11 0 1

    BAC =

    20

    1 1 0

    A

    B

    Y

    C

    3 input NAND gate 4 input NAND gate

    In3

    In1

    In2

    In4

    In1

    In2 In3 In4

    VDD

    Out

  • 7/22/2019 VLSI Lec 4

    11/54

    CMOS NOR Gate

    BAC +=

    A B C

    0 0 1

    A

    B

    C

    D

    Y

    21

    1 0 0

    1 1 0

    4 input NOR gate

    2 input NOR gate

    AND/OR Gate

    22

    BAC +=

  • 7/22/2019 VLSI Lec 4

    12/54

    XOR Gate

    A B C

    0 0 0

    0 1 1

    1 0 1

    A

    B

    A

    A

    B

    B

    A

    B

    A A

    A

    B

    BABAY +=

    BAABY +=

    23

    1 1 0

    A B

    A

    B

    A

    BA B

    A B

    BB

    XNOR Gate

    A B C

    0 0 1

    0 1 0

    A

    B

    A

    B

    A B

    A

    B

    A A

    A

    B

    BB

    BABAY +=

    BAABY +=

    24

    A B

    A

    B

    A

    BA B

    1 0 0

    1 1 1

    A B

  • 7/22/2019 VLSI Lec 4

    13/54

    Complex NMOS Logic

    25

    2-input NMOS NOR gate

    N-input NMOS NOR gate

    2-input NMOS NAND gate

    Complex NMOS Logic

    26

    XOR gateXNOR gate

  • 7/22/2019 VLSI Lec 4

    14/54

    Relatively it is easy to obtain PDN, simply because we already had Y in terms ofuncomplemeted inputs

    To obtain PUN, we had to manipulate the given Boolean expression to express

    Y as a function of complemented variables

    )( CDBAY +=

    )( CDBAY +=

    Complex Gate

    )(

    CDBA

    CDBAY

    ++=

    +=

    )( DCBA ++=

    +=

    27

    Systematic Function ConstructionF= A ( B C + D)

    28

    F=AB+(A+B)C

  • 7/22/2019 VLSI Lec 4

    15/54

    Examples

    BCAF +=

    BCAF +=

    BCAF

    BCAF

    =

    +=

    )C(ABDAF ++=

    Start from innermost term

    29

    BC)A(F +=

    Examples))C(ABDA()D(EF +++=

    Start from the innermost term

    VDD

    A

    B

    C

    D

    F = D +A (B+C)

    30

    D

    A

    B C

  • 7/22/2019 VLSI Lec 4

    16/54

    Examples( )Y A B C D= + +

    Y

    DC

    B

    A

    VDD

    C

    ( )( )

    ( )( )( )( )( )( ) DCBAY

    DCBAY

    DCBAY

    DCBAY

    +=

    ++=

    +=

    ++=

    ( )( ) DCBAY ++= F = (D + A (B + C))

    D

    A

    B

    C

    31

    A B C

    D

    Y

    A

    A

    B

    B

    C

    D

    D

    D

    A

    B C

    Full Adder

    32

  • 7/22/2019 VLSI Lec 4

    17/54

    AOI (AND-OR-INVERT) CMOS Gate AOI complex CMOS gate can be used to directly implement a sum-of-products

    Boolean function Pull-down N-tree can be implemented as follows:

    Product terms yield series-connected NMOS transistors Sums are denoted by parallel-connected legs

    Complete function must be an inverted representation

    Pull-up P-tree is derived as the dual of the N-tree

    33

    OAI (OR-AND-INVERT) CMOS Gate An Or-And-Invert (OAI) CMOS gate is similar to the AOI gate except that it is an

    implementation of product-of-sums realization of a function N-tree is implemented as follows:

    Each product term is a set of parallel transistors for each input in the term All product terms (parallel groups) are put in series The complete function is again assumed to be an inverted representation

    P-tree can be implemented as the dual of N-tree Note: AO & OA gates (non-inverted function representation) can be

    implemented directly on the P-tree if inverted inputs are available

    34

  • 7/22/2019 VLSI Lec 4

    18/54

    Full AdderV

    VDD

    DD

    VDD

    VDD

    A B

    Ci

    S

    X

    B

    A

    Ci A

    BBA

    Ci

    A B Ci

    Ci

    A

    Ci

    A

    B

    BA

    28 transistors

    35

    Co B

    C o = AB + C i (A+B)

    VDD

    Ci

    A

    BBA

    B

    A

    A B

    Kill

    Generate"1"-Propagate

    "0"-Propagate

    V DD

    Ci

    A B Ci

    Ci

    B

    A

    Ci

    A

    BBA

    V DD

    SCo

    24 transistors

    Transistor Sizing Transistor sizing is the ratios of the Width/Length (W/L) of the transistor

    These ratios provide the gate with current-driving capability in both directionsequal to that of the basic inverter

    (W/L)=n and (W/L)p=P, n is usually 1.5 to 2 & for matched design p=(n/p)n

    We should find the input combinations that result in lowest output current & thenchoose sizes that will make this current equal to that of the basic inverter

    We consider the parallel/series connections & find equivalent W/L ratios

    Transistors connected in series:

    Transistors connected in parallel:

    ...)/(

    1

    )/(

    1)/(

    21

    ++

    =

    LWLW

    LW eq

    ...)/()/()/( 21 ++= LWLWLW eq

    VDD

    GND

    NMOS (2/.24 = 8/1)

    PMOS (4/.24 = 16/1)

    metal2

    metal1 polysilicon

    InOutmetal1-poly via

    metal2-metal1 via

    metal1-diff via

    pdiff

    ndiff

    36

  • 7/22/2019 VLSI Lec 4

    19/54

    Example Two identical MOS transistors with individual W/L ratios of 4 result in an

    equivalent W/L of 2 when connected in series & of 8 when connected in parallel Worst case (lowest current) for PDN is obtained when only one of NMOS

    transistors is conducting We therefore select W/L of each NMOS transistor to be equal to that of NMOS

    transistor of basic inverter, namely, n For PUN, worst-case situation is when all inputs are low & four series PMOS

    transistors are conducting Since equivalent W/K will be one-quarter of that of each PMOS device, we

    P

    basic inverter, that is ,4p

    37n& pdenote the (W/L) ratios of QN&QP, of the basic inverter

    Example Provide transistor W/L ratios for the logic circuit . Assume that for the basic

    inverter n = 1.5 and p =5 and that the channel length is 0.25m? Solution: Consider PDN: In worst case, when QNB is ONB & either QNC or QND is ON Worst case, we have two transistors in series Therefore, we select each of QNB, QNC, and QND to have twice the width of the n-

    channel device in the basic inverter, thus Q

    NB

    : W/L = 2n = 3 =0.75/0.25 QNC: W/L = 2n = 3 =0.75/0.25

    ND . .

    38

  • 7/22/2019 VLSI Lec 4

    20/54

    Example For transistor QNA, select W/L to be equal to that of the n-channel device in the

    basic inverter: QNA: W/L = n = 1.5 = 0.375/0.25

    Next, consider the PUN Here, we see that in the worst case, we have three transistors in series: QPA,

    QPC, and QPD. Therefore, we select the W/L ratios of each of these to be three times that of

    QP in the basic inverter, that is, 3p, thus QPA: W/L = 3p = 15 =3.75/0.25

    PC = = = . . QPD: W/L = 3p = 15 =3.75/0.25

    39

    CMOS Transmission Gates (TG) CMOS TG can be constructed by parallel combination of NMOS & PMOS

    transistors, with complementary gate signals TG input signal transmitted to output without threshold voltage attenuation

    C

    A B

    C

    BA

    CB

    C=0, CB=1

    A B

    C=1, CB=0A

    0 Strong 0

    Input Output

    C=1, CB=0

    C=1, CB=0

    40

    C

    BA

    C

    C

    0V Vdd 0V

    0V

    |VTP|

    Vdd

    Vdd 0V Vdd

    Vdd-VTN

    Vdd

    Vdd

    A

    B

    F

    B

    A

    B

    B

    M1

    M2

    M3/ M4

    XOR

  • 7/22/2019 VLSI Lec 4

    21/54

    CMOS Transmission Gate

    B

    B

    41

    A

    F

    B

    A

    B

    M1

    M2

    M3/M4

    A

    A

    B

    B

    BABA +

    XOR

    Half Adder

    TG based Full Adder

    42

    Full Adder

    A

    B

    P

    Ci

    VDD A

    A A

    VDD

    Ci

    A

    P

    AB

    VDD

    VDD

    Ci

    Ci

    Co

    S

    Ci

    P

    P

    P

    P

    P

    Sum Generation

    Carry Generation

    Setup

  • 7/22/2019 VLSI Lec 4

    22/54

    S

    S

    D0

    D1

    YS

    2x1 MUXOnly 4 transistors

    Multiplexers (MUX)

    2x1 MUX

    S

    S

    A

    B

    SBAS+

    S0

    A

    B

    C

    S0 S1 S1

    OUT

    43

    AM 2

    M 1

    B

    S

    S

    S F

    V DD

    Pass Transistor based MUX4x1 Multiplexor

    D

    Two Level MUX

    TG Tristates Tristate buffer produces Z when not enabled Non-restoring Tristate

    TG acts as tristate buffer Only two transistors

    But nonrestoring Noise on A is passed on to Y

    EN A Y

    0 0 Z

    0 1 Z

    1 0 0

    1 1 1

    A Y

    EN

    A Y

    EN

    EN

    A Y

    EN

    EN

    Tristate inverter produces restored output

    Violates conduction complement rule

    44

    A

    Y

    EN

    A

    Y

    EN = 0

    Y = 'Z'

    Y

    EN = 1

    Y = A

    A

    EN

  • 7/22/2019 VLSI Lec 4

    23/54

    Multiplexers (MUX) 2:1 multiplexer chooses between two inputs

    S D1 D0 Y

    0 X 0 0

    0 X 1 1

    1 0 X 0

    1 1 X 1

    0

    1

    S

    D0

    D1

    Y

    Gate level MUX Desi n

    45

    01 DSSDY +=

    4

    4

    D1

    D0

    SY

    4

    2

    2

    2 Y

    2

    D1

    D0

    S

    20 transistors

    Inverting MUX Use compound AOI Or pair of tristate inverters Essentially the same thing

    Noninverting MUX adds an inverter

    S

    D0 D1

    Y

    S

    D0

    D1

    Y

    0

    1S

    Y

    D0

    D1

    S

    S

    S

    S

    S

    S

    InvertingMUX

    Multiplexers (MUX)

    46

    c ooses one o npu s us ng wo se ec s Two levels of 2:1 MUXes Or four tristates

    S0

    D0

    D1

    0

    1

    0

    1

    0

    1Y

    S1

    D2

    D3

    D0

    D1

    D2

    D3

    Y

    S1S0 S1S0 S1S0 S1S0

    4x1 MUX

  • 7/22/2019 VLSI Lec 4

    24/54

    Fast Complex Gate - Design Techniques Design techniques to reduce delay of large fan-in circuits Transistor Sizing Increase transistor size, while larger parasitic capacitances & propagation delay

    Progressive Transistor Sizing Uniform sizing, reduces the dominant resistance

    Input Reordering Some signals might critical than others. Not all inputs of a gate arrive at

    same time Logic Restructuring

    Mani ulatin lo ic e uations can reduce fan-in & reduce ate dela

    47

    CL

    In1

    InN

    In3

    In2

    Out

    C1

    C2

    C3

    M1 > M 2 > M3 > MN

    M1

    M2

    M3

    MN

    Distributed RC-line

    Can Reduce Delay with more than 30%!

    In1

    In3

    In2

    C1

    C2

    CL

    M1

    M2

    M3

    In3

    In1

    In2

    C3

    C2

    CL

    M3

    M2

    M1

    critical pathcritical path

    Buffering: Isolate Fan-in from Fan-out

    CL CL

    Improved Logic Design

    Input reording

    Fan-In & Fan-Out Fan-In = Number of inputs to a logic gate

    4 input NAND has a FI = 4 2 input NOR has a FI = 2, etc.

    Fan-Out = Number of gate inputs which are driven by a particular gate output FO = 4 in Fig. b below shows an output wire feeding an input on four

    different logic gates Circuit delay of a gate is a function of both Fan-In & Fan-Out

    Ex. m-input NAND: tdr

    = (Rp

    /n)(m.n.Cd

    + Cr

    + k.Cg

    ) = tinternal-r

    + k.toutput-rwhere n=width multiplier, m=fan-in, k=fan-out, Rp=resistance of min inverter P

    , , ,capacitance

    48

  • 7/22/2019 VLSI Lec 4

    25/54

    Fan-In & Fan-Out Circuit fall delay can be written in a similar manner

    Ex. m-input NAND: tdf = m(Rn/n)(m.n.Cd + Cr + k.Cg) = tinternal-f + k.toutput-fwhere n=width multiplier, m=fan-in, k=fan-out, Rn=resistance of min inverter

    NMOS Tx, Cg=gate capacitance, Cd = source/drain capac, Cr = routing (wiring)capacitanceIf we set tdr=tdf for the case of symmetrical rise & fall delay, we get Rp = m Rn

    pWp = (nWn)/m

    49

    CMOS Logic Families Complementary Logic

    Standard CMOS Clocked CMOS (C2MOS) BICMOS (CMOS logic with Bipolar driver)

    Ratio Circuit Logic Pseudo-NMOS Saturated NMOS Load Saturated PMOS Load Depletion NMOS Load (E/D) - -

    Dynamic Logic: CMOS Domino Logic NP Domino Logic (also called Zipper CMOS) NORA Logic Cascade voltage Switch Logic (CVSL) Sample-Set Differential Logic (SSDL) Pass-Transistor Logic

    50

  • 7/22/2019 VLSI Lec 4

    26/54

    Clock Signal Used to synchronize data flow through a digital network

    Clocked static or dynamic circuits Problems: clock skew (delay caused by clock distribution wires)

    51

    Ideal nonoverlapping 2-phase clocks 2-phase clocking

    Single & Multiple Clock Signals For nonoverlapping clock phases and fine tuned and well designed delay

    lines (realized as Transmission gates) have to be inserted in order to avoidoverlapping of and

    Generation ofinverted clock phase TG delay circuit

    52Psuedo 2- clocking

    Clocked shift register circuitShift register

    Clocked Static Logic

  • 7/22/2019 VLSI Lec 4

    27/54

    Dynamic CMOS Circuits Dynamic logic is temporary (transient) in that output levels will remain valid

    only for a certain period of time Static logic retains its output level as long as power is applied

    Dynamic logic is due to charging & discharging capacitance Precharge clock to charge the capacitance Evaluate clock to discharge the capacitance depending on condition of

    logic inputs

    Precharge Evaluate Precharge

    1

    2A Y

    1

    1

    A

    Y

    4/3

    2/3

    AY

    53

    Y

    Dynamic NMOS Inverter Dynamic PMOS Inverter

    Mp

    Me

    VDD

    PDN

    f

    In1In2In3

    Out

    Me

    Mp

    VDD

    PUN

    f

    In1In2In3

    f

    f

    Out

    CL

    CL

    f p networkfnnetwork

    Static DynamicPseudo-NMOS

    Dynamic CMOS Circuits Advantages over static logic:

    Use of low leakage of FETs to store charge instead of moving current.Provides higher density, faster operation at cost of reduced noise immunity

    Avoids duplicating logic twice as both N-tree & P-tree Faster gates & used in very high performance applications

    Best for wide OR/NOR gates (e.g. bit-lines), providing 50% delayimprovement over static CMOS

    Simple sequential memory circuits; amenable to synchronous logic

    Transistor count is reduced from 2n (static CMOS) to n+2 for dynamicrechar ed CMOS but now: 2 hases of o eration

    54

    High density achievable Consumes less power (in some cases)

    Consumes 2xpower due to its phase activity (unconditional pre-charging), not counting clock power

    Disadvantages compared to static logic: Problems with clock synchronization & timing Design is more difficult

    Increased sensitivity to noise Capacitive crosstalk, Charge sharing, Power supply, Feed through noise

  • 7/22/2019 VLSI Lec 4

    28/54

    Single clock is to accomplish both pre-charge & evaluation operations When is low: PMOS pre-charge transistor Mp charges Vout to Vdd, since it

    remains in its linear region during final pre-charge

    During this time, logic inputs A1 B2 are active & Me is OFF, no chargewill be lost from Vout

    When is high: Mp is turned OFF & NMOS evaluate transistor Me is turned ON,allowing for Vout to be selectively discharged to GND depending on logic inputs If A1 B2 inputs are such that a conducting path exists b/w Vout & Me,

    then Vout will discharge to 0, Otherwise, Vout remains at Vdd No DC current flows durin rechar e or evaluate hase

    Dynamic CMOS Circuits

    Power is dynamic P = CN Vdd2 f where CN represents an equivalent total

    capacitance on the output, f=clock frequency,=logic repetition rate

    55

    Dynamic Logic

    56

    Dynamic NMOS inverterPrecharge & Evaluate

    Precharge n/w for worse case

    Evaluation discharge n/wComplex Dynamic Logic

  • 7/22/2019 VLSI Lec 4

    29/54

    Cascading Problem in Dynamic CMOS Logic If several stages of dynamic circuit are cascaded together using same clock, a

    problem in evaluation involving a built-in race condition will exist

    Duringpre-charge, both Vout1 & Vout2 are pre-charged to Vdd

    When goes high to beginevaluate, all inputs at stage-1 require some time toresolve, but during this time charge may erroneously be discharged from Vout2

    Assume 1st-stage NMOS logic conducts & fully discharges Vout1, but allinputs are not resolved, it takes some time to discharge Vout1 to GND

    If, during this time delay, 2nd-stage has input condition shown with bottomNMOS transistor gate at logic 1, then Vout2 will start to fall & discharge itsCL until Vout1 finall evaluates & turns OFF the to series NMOS transistorin stage-2

    Result is an error in output of 2nd-stage Vout2

    57

    Possible Solutions: Two phase clocks

    Use of inverters to createDomino Logic

    NP Domino Logic Zipper/NORA logic

    Dynamic Cascades PMOS & NMOS blocks have to be installed alternated in order to avoid glitches

    58

    Cascaded NMOS-NMOS glitch problemDynamic cascades

  • 7/22/2019 VLSI Lec 4

    30/54

    Pass Transistor Logic Transistor are used as switches which are controlled by input literals do logic Inputs drive diffusion terminals as well as gates Limited fan-in & excessive fan-out

    Noise vulnerability (not restoring) Supply voltage offset/bias vulnerability Decode exclusivity (else short-circuit!) Poor high voltage levels if NMOS-only Pass-logic may consume half the power of static logic. But be careful of Vt drop

    resulting in static leakage Pass- ate lo ic is not a ro riate when lon interconnects se arate lo ic sta es

    Pass Transistor Model

    or when circuits have high fan-out load (use buffering)

    59Complementary switchPass Transistor NXOR

    A =5 VB

    C =5V

    CL

    A =5 V

    C =5 V

    BM2

    M1

    Mn

    VB

    NMOS only switchDoes not pull up to 5V, but 5V Vth loss causes static power consumption

    VTN

    Pass Transistor Logic Criteria for Clocked CMOS & Pass-Gate Logic Clocked CMOS logic

    Mitigates hot electron effects Pass-gate logic

    Fast, if few pass gates in series Good for complex functions Small area, low power

    60

  • 7/22/2019 VLSI Lec 4

    31/54

    Clocked CMOS logic

    Mitigates hot electron effects Pass-gate logic

    Fast, if few pass gates in series

    Good for complex functions

    ,

    Pass Transistor Circuits Pass transistor (MP) is NMOS device, but could also be implemented with TG

    Cx is equivalent capacitance of input gate of second NMOS (part of inverteror logic gate) & PN junction capacitance of MPs drain (source)

    When clock (CK ) goes high, MP is turned ON & allows Vin to be placed oncapacitor Cx

    When CK goes low, MP is turned OFF, trapping charge on Cx Operation for 1 or 0:

    If Vin is high (say VOH), then MP will allow current to flow into Cx, charging it

    up to Vdd Vtn (assume CK up level Vdd) If Vin is low sa GND then MP will allow current to flow out of Cx

    discharging it to GND Due to leakage from drain (source) of MP, Cx can only retain charge Q for a

    given period of time (called soft node) If MP is NMOS, Cx will discharge to GND If MP is PMOS, Cx will discharge to VDD If MP is a TG, Cx could discharge in either direction

    62

  • 7/22/2019 VLSI Lec 4

    32/54

    Pass Transistor Logic with PMOS Pull-Up For reduction of device count and area an NMOS version with PMOS pull-up

    can also be useful (kind of pseudo NMOS)

    63

    Pass Transistors

    Vdd-VTN Vdd-VTNVdd-VTN

    Vdd-2VTN

    Vdd-3VTN

    NMOS Pass GateConfiguration

    64

    |VTP| |VTP|

    |VTP|

    2|VTP|

    3|VTP|PMOS Pass GateConfiguration

  • 7/22/2019 VLSI Lec 4

    33/54

    Complexity of CMOS pass-gate logic can be reduced by dropping PMOStransistors & using only NMOS pass transistors called CPL CMOS inverters must be used periodically to recover the full VDD level since NMOS

    pass transistors will provide VOH of VDD

    VTn in some cases CPL circuit requires complementary inputs & generates complementary outputsto pass on to next CPL stage

    Complimentary Pass Transistor Logic (CPL)

    FPass-Transistor

    Network

    A

    ABB

    B B

    A

    A =

    FPass-Transistor

    Network

    A

    ABB

    Inverse

    B

    S

    S

    S

    S

    A

    B

    A

    Y

    YL

    L

    A

    A F=A

    65

    Boolean Function CPL Pass-transistor logic used to design NOR, XOR, NAND, AND & OR gates

    depending upon P1-P4 inputs: P1,P2,P3,P4 = 0,0,0,1 gives F(A,B) = NOR P1,P2,P3,P4 = 0,1,1,0 gives F(A,B) = XOR

    P1,P2,P3,P4 = 0,1,1,1 gives F(A,B) = NAND P1,P2,P3,P4 = 1,0,0,0 gives F(A,B) = AND P1,P2,P3,P4 = 1,1,1,0 gives F(A,B) = OR

    Circuit can be operated with clocked P pull-up device or inverter-based latch Output inverter provides F(A,B) and can be used to latch node F high (-F low)

    66

  • 7/22/2019 VLSI Lec 4

    34/54

  • 7/22/2019 VLSI Lec 4

    35/54

  • 7/22/2019 VLSI Lec 4

    36/54

    Mixing Domino Logic with Static Logic We can add even number of static inverting logic gates after Domino logic stage

    prior to next Domino logic stage

    Even number of inverting stages guarantees that inputs to second Domino

    logic stage experience only 0-to-1 transitions In cascaded Domino logic structure, evaluation of each stage ripples through

    cascaded stages similar to a chain of Dominos

    Evaluate cycle must be sufficient duration to allow all cascaded logic stages(between latches) to complete their evaluation process within clockevaluation interval

    71

    NP Domino/NORA Logic In dynamic logic erroneous evaluation problem is to use NP Domino

    Logic/NORA logic Alternate stages of N logic with stages of P logic:

    N logic stages use true clock, normal pre-charge & evaluation phases, withN logic tree in the pull down leg

    P logic stages use a complement clock, with P logic stage tied above theoutput node

    During pre-charge clk is low (-clk is high) & P-logic output pre-charges to

    ground while N-logic outputs pre-charge to Vdd Durin evaluate clk is hi h -clk is low and both t e sta es o throu h

    evaluation; N-logic tree logically evaluates to ground while P-logic treelogically evaluates to Vdd

    Inverter outputs can be used to feed other N-blocks from N-blocks, or to feedother P-blocks from P-blocks

    72

    Mp

    Me

    VDD

    PDN

    In1In2In3

    Me

    Mp

    VDD

    PUN

    In4

    Out1

    Out2

    Only 10 transitions allowed at inputs of PUN

  • 7/22/2019 VLSI Lec 4

    37/54

    NP-CMOS Logic

    VDD

    B1

    A1

    VDD

    A1 B1

    Ci2

    VDD

    B1 Ci1

    B1

    A1A1

    VDD

    S1

    Ci1

    73

    VDD

    Ci0

    A0 B0 B0

    A0 Ci1

    Ci0Ci0

    B0

    A0B0

    S0

    A0

    VDD

    VDD

    Carry PathNP CMOS AdderNP CMOS Adder

    NORA Logic NO Raace:

    Signal race can arise, when both TG conduct at same time. If new inputfrom TG1 reaches the input of TG2 while TG2 is still transmitting theoutput, output information will be lost. Imperfect TG synchronization occursbecause of normal transition intervals or clock skew

    NORA is very insensitive to clock delay One clock signal & inverted clock signal with short slopes rise times are

    sufficient No inverter is needed b/w logic stages due to use of n-& p-type blocks Last stage is a clocked inverter, a C2MOS latch

    74Race Problem

  • 7/22/2019 VLSI Lec 4

    38/54

    Pipelined NORA Logic With pipelined NORA CMOS logic design

    One can alternate N and P stages betweenC2MOS latches where high is used for

    evaluation as shown in (a) Or, one can alternate N and P stages similarly

    between C2MOS latches with high used forevaluation as in (b)

    sections may be alternately cascaded with sections as shown in (c)

    ,each stage in succession up to next C2MOS latch

    75

    NORA Logic During low ( high), each stage pre-charges

    N logic stages pre-charge to Vdd; P logic stages pre-charge to GND When goes high ( low), each stage enters the evaluation phase

    N logic evaluates to GND; P logic stages evaluate to Vdd All NMOS & PMOS stages evaluate one after another in succession, as in

    Domino logic Logic below:

    Stage 1 is X = (A B) Stage 2 is G = X + Y

    76

  • 7/22/2019 VLSI Lec 4

    39/54

    Single-Phase NP Dynamic Logic Structures Combines NP Domino logic sections with C2MOS latch

    n-logic block can drive p-logic block or another n-logic block with a static inverter Similarly for a p-logic block

    Must end in a C2

    MOS latch Clk logic: (a) prechrg on clk=0, eval clk=1 -Clk logic: (b) pre on clk=1, eval on clk=0 Clk logic can feed clk logic & vice-versa Can mix static logic with NP domino logic Rules to avoid race conditions:

    During precharge, logic blocks are OFF

    During eval, internal inputs make only one transition

    Pipeline design: Even # of inversions between C2MOS, or at least 1 dynamic stage and even # inversions prior to it

    77

    f

    inputs

    Y

    2/3 2/3

    Pseudo-NMOS Logic Consists of a single PMOS load per gate & NMOS PDN It is a ratio circuit where DC current flows when N PDN is conducting Must design the ratio of N devices W/L to P load device W/L so that when N pull

    down leg with max resistance is conducting, output is at a sufficiently low VOL Needs ratioed devices Dissipates static power, when PDN is ON Provides a method of emulating NMOS circuits in CMOS Reduced noise margin

    VDD

    A B C D

    F

    CL

    Inverter NAND2NOR2

    4/3

    2/3

    A

    Y

    8/3

    8/3

    B

    AA B 4/34/3

    Y

    78

    r er a or sue o og c Fully-complementary CMOS logic

    Immune to noise Virtually zero static power Many stages required for high fan-in functions

    Pseudo nMOS logic Good for high fan-in Nor function

    ROM PLA Adder carry look-ahead

  • 7/22/2019 VLSI Lec 4

    40/54

    Zipper Dynamic Logic Zipper CMOS logic is for improving charge leakage & charge sharing problems Pre-charge transistors receive a slightly modified clock where clock pulse

    (during pre-charge off time) holds the pre-charge transistor at weak conduction

    in order to provide a trickle pre-charge current during the evaluation phase PMOS pre-charge transistor gates are held at Vdd - |Vtp| NMOS pre-charge transistor gates are held at Vtn above GND

    79

    Source-Follower Pull-up Logic (SFPL) SFPL is a variation on pseudo-NMOS whereby load device is an N pull-down

    transistor & N source-follower pull-ups are used on inputs N pull-up transistors can be small limiting input capacitance N transistors are also duplicated as pull-down devices in order to improve

    fall time Rise time is determined by P1 inverter pull-up transistor when all low inputs

    SFPL is useful for high fan-in NOR logic gates

    80

  • 7/22/2019 VLSI Lec 4

    41/54

    Ratioed Logic

    VDD

    VDD

    VDD

    Ratioed logic is an attempt to reduce number of transistors required toimplement a given function, often at cost reduced robustness & extra powerdissipation

    Purpose of PUN is to provide conditional path b/w Vdd & output when PDN isOFF. In ratioed logic, entire PUN is replaced with load device that pulls up theoutput for high output

    Psuedo NMOS gate reduced number of transistors (N+1) Vs. N

    81

    VSS

    PDN

    In1

    In 2In

    3

    F

    RLLoad

    VSS

    In1

    In 2In

    3

    F

    VSS

    PDN

    In1

    In 2In

    3

    F

    VSS

    PDN

    Resistive Depletion

    Load

    PMOS

    Load

    (a) resistive load (b) depletion load NMOS (c) pseudo-NMOS

    VT

    < 0

    Goal: to reduce the number of devices over complementary CMOS

    Differential Cascode Voltage Switch Logic(DCVSL)

    VDD

    Out

    VDD

    Out

    M1 M2

    Out

    Out

    82

    VSS

    PDN1

    VSS

    PDN2

    A

    AB

    B

    Even Better Noise Immunity/Density

    A A

    XOR-NXOR gate

  • 7/22/2019 VLSI Lec 4

    42/54

    CMOS Circuit Styles - Summary

    83

    On-Chip Internal VDD Generator CMOS and Pseudo-NMOS circuitry often require an internally-generated Vdd

    supply voltage or a bias voltage (above Vss) for on-chip use Requirements:

    track supply voltages Vdd and Vss temperature compensation highly regulated de-coupled from power supply noise with use of on-chip capacitance

    Circuit below utilizes a reference voltage generator comprised of series-connected saturated P devices feeding a current mirror to obtain Internal VDD 0

    0 -> 1

    1 -> 1

    But not 1 -> 0 AX

    Y

    Precharge Evaluate

    X

    Precharge

    A = 1

    Y

    A

    Monotonicity Woes

    86

    Precharge Evaluate

    Y

    Precharge

    A

    Output should rise but does not

    violates monotonicityduring evaluation

    But dynamic gates produce monotonically falling outputs during evaluation

    Illegal for one dynamic gate to drive another!

    AX

    Y

    PrechargeEvaluate

    X

    Precharge

    A = 1

    Y should rise but cannot

    Y

    X monotonically falls during evaluation

  • 7/22/2019 VLSI Lec 4

    44/54

    Leakage & Charge Sharing Dynamic node floats high during evaluation

    Transistors are leaky (IOFF 0)

    Dynamic value will leak away over time Formerly miliseconds, now nanoseconds!

    Use keeper to hold dynamic node

    Must be weak enough not to fight evaluation

    A

    H

    2

    2

    1 k XY

    weak keeper

    87

    Dynamic gates suffer from charge sharing

    B = 0

    A

    Y

    x

    Cx

    CYA

    x

    Y

    Charge sharing noise

    Yx Y DD

    x Y

    CV V VC C

    = =+

    Secondary Precharge Solution: add secondary precharge transistors

    Typically need to precharge every other node

    Big load capacitance CY helps as well

    Y

    secondaryprechargetransistor

    88

    B

    x

  • 7/22/2019 VLSI Lec 4

    45/54

    Principle of CMOS MemoryLD=1 QD Q DLD=0 store current state

    89

    Dynamic Flip0Flops:Pseudo 2-Phase Clocking

    90

    Clock skew

    Slow Clock Edges

    Pseudo 2-Phase LatchCharge distribution problem in (b)

  • 7/22/2019 VLSI Lec 4

    46/54

    Reduced Transistor Count Latch

    With high impedance sustainer transistors

    91

    Dynamic D-Latches

    Dynamic D-Latches

    92

    Pseudo 2-Phase dynamic logic

    Pseudo 2-Phase domino logic

  • 7/22/2019 VLSI Lec 4

    47/54

    2-Phase Memory

    93

    2-Phase Static D flip-flops

    2-Phase Static D flip-flops

    Static Memory

    Static D flip-flop with set and reset

    94

  • 7/22/2019 VLSI Lec 4

    48/54

    Latch These sequential devices differ in way their outputs are changed:

    Output of a latch changes independent of a clocking signal

    Output of flipflop changes at specific times determined by clocking signal

    SR Latch based on NOR gate S input sets Q output to 1 while R reset it to 0

    When R=S=0 then output keeps previous value

    When R=S=1 then Q=Q=0, and the latch maygo to an unpredictable next state

    95

    S input sets Q output to 1 while R reset it to 0

    When R=S=1 then output keeps previousvalue

    When R=S=1 then Q=Q=1, & latch may go toan unpredictable next state

    D latch

    This latch eliminates the problem that occurs in SRlatch when R=S=0

    C is an enable input:

    When C=1 then output follows the input D &latch is said to be open

    When C=0 then output retains its last value &latch is said to be closed

    Flip Flop D flip-flop is made out of two D latches. First latch is master, & second slave

    When Ck = 0, master is open & slave is closed. Qm & Ds follow Dm When Ck = 1 master is closed, slave is open & Qm is transferred to Qs Note: Qs does not change because master latch is closed leaving Qm fixed

    96

    J-K flip-flop is similar to S-R flip-flop. Difference arises when J & K are assertedsimultaneously. In this situation output of J-K flip-flop inverts its current state

  • 7/22/2019 VLSI Lec 4

    49/54

    Flip-Flop T flip-flop

    When input T = 0, output Q retain its previous value

    When input T = 1, output Q inverts on every tick of clock

    When inputs J & K of a J-K flip-flop are connected together, J-K flip-flop willbehave like a T flip-flop

    97

    Flip-flops/Latches Flip-flops occupy a special place in conventional digital design

    Always Dynamic Behavior

    Allow time coherence across large parts of the circuit

    Preserve data across synchronization boundaries

    Inherently asynchronous design

    98

  • 7/22/2019 VLSI Lec 4

    50/54

    D Latch When CLK = 1, latch is transparent

    D flows through to Q like a buffer When CLK = 0, the latch is opaque

    Q holds its old value independent of D Transparent latch or level-sensitive latch

    CLK

    D Qtch D

    CLK

    CLK = 1

    D Q

    Q

    D Q

    Q

    99

    L Q

    Multiplexer chooses D or old Q

    1

    0

    D

    CLK

    QCLK

    CLKCLK

    CLK

    DQ Q

    Q

    D Latch Operation

    Advanced Latches

    100

    Tri-state based static Latch

    Modified Svensson Latch 21064

  • 7/22/2019 VLSI Lec 4

    51/54

    D Flip-flop Design When CLK rises, D is copied to Q

    At all other times, Q holds its value

    a.k.a. positive edge-triggered flip-flop,

    master-slaveflip-flop

    Flop

    CLK

    D Q D

    CLK

    Q

    CLK = 1

    D

    CLK = 0

    Q

    D

    QM

    QMQ

    101

    QM

    CLK

    CLKCLK

    CLK

    Q

    CLK

    CLK

    CLK

    CLK

    D

    Latch

    Latch

    D QQM

    CLK

    CLK

    Built from master and slave D latches

    D

    CLK

    Q

    D-F/F Operation

    Sense Amplifier-Based Flip-Flop

    ! "#$% & % %"'"" (!

    102

    Driver transistors are large

    Keeper transistors are small

    & disengaged during transitions

  • 7/22/2019 VLSI Lec 4

    52/54

    Asymmetric Read/Write Ports

    103

    Multi porting

    Dynamic Latches with a Single Clock Dynamic latches eliminate dc feedback leg by storing data on gate capacitance

    of inverter (or logic gate) and switching charge in or out with a transmissiongate Minimum frequency operation is 50-100 KHz, not to lose data due to junction or gate

    leakage from the node

    Can be clocked at high frequency since very little delay in latch elements

    Examples: (a) or (b) show simple transmission gate latch concept

    (c ) tri-state inverter dynamic latch holds data on gate when clk is high (d) and (e) dynamic D register

    104

  • 7/22/2019 VLSI Lec 4

    53/54

    Dynamic Registers with two Phase Clocks Dynamic register with pass gates & two phase

    clocking is shown

    Clocks phi1 and phi2 are non-overlapping

    When phi1 is high & phi2 is zero,

    1st pass gate closed & D data chargesgate capacitance C1 of 1st inverter

    2nd pass gate is open trapping priorcharge on C2

    en p s ow an p s g ,

    1st pass gate opens trapping D data onC1

    2nd pass gate closes allowing C2 tocharge with inverted D data

    If clock skew or sloppy rise/fall time clockbuffers cause overlap of phi1 and phi2 clocks,

    Both pass gates can be closed at the sametime causing mixing of old and new dataand therefore loss of data integrity!

    105

    Two Phase Dynamic Registers (Compact Form) Compact implementation of of two phase

    dynamic registers using a tri-state buffer form

    Transmission gate & inverter integratedinto one circuit

    Two versions:

    Pass devices closest to output

    Inverter devices closest to output

    wo p ase ynam c reg s ers og c s o enpreferred over single phase because:

    Due to finite rise/fall times, CLK & CLK arenot truly non-overlapping

    Clock skew often is a problem due to CLKis usually generated from CLK using aninverter circuit & also due to practicalproblem of distributing clock lines withoutany skew

    106

  • 7/22/2019 VLSI Lec 4

    54/54

    Dynamic Shift Registers with Enhancement Load Dynamic shift register implemented with a

    technique named ratioed dynamic logic. 1 and 2 are non-overlapping clocks

    When 1 is high, Cin1 charges to Vdd Vt if Vinis high or to GND if Vin is low

    When 1 drops and 2 comes up, the input datais trapped on Cin1 and yields a logic output onCout1 which is transferred to Cin2

    When 2 drops and 1 comes up again, the logicoutput on Cout1 is trapped on Cin2, which yieldsa lo ic out ut on Cout2 which is transferred toCin3, etc.

    To avoid losing too much voltage on the logic highlevel, Coutn >> Cinn+1 is desired

    Each inverter must be ratioed to achieve adesired VOL (e.g. when 2 is high on 1

    st inv)

    Dynamic shift register is a ratioless dynamiclogic circuit When 2 is high transferring data to stage 2, 1

    has already turned off the stage 1 load transistor,allowing a VOL = 0 to be obtained without a ratiocondition between load and driver transistors.

    107