DESIGN OF MULTI-GIGABIT SERIAL LINK TRANSCEIVER...

1

DESIGN OF MULTI-GIGABIT SERIAL LINK TRANSCEIVER USING BANDWIDTH-EFFICIENT HALF-SYMBOL-RATE-CARRIER OFFSET QUADRATURE PHASE SHIFT

KEYING MODULATION

By

HYEOPGOO YEO

A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT

OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY

UNIVERSITY OF FLORIDA

2007

2

© 2007 Hyeopgoo Yeo

3

To my parents and family

4

ACKNOWLEDGMENTS

I express my sincere gratitude to my supervisory committee chair, Dr. Jenshan Lin, for his

support throughout the course of this work. I also thank Professors William R. Eisenstadt,

Rizwan Bashirullah, and Li-Chien Shen for their advice on this work and their willing service

and guidance on my research. Without their invaluable support and encouragement, my

exploration in the research could not have come to fruition. I appreciate their interest in my work

and valuable suggestions and comments from the research proposal to its realization.

I thank my fellow colleagues (Tien-Yu Chang, Mingqi Chen, Lance Covert, Sangwon Ko,

Changzhi Li, Zehn Ning Low, Yachi Liu, Zivin Park) in the Radio Frequency Silicon on Chip

(RFSOC) Group for all the help they offered. My thanks also go to my other colleagues in

Electrical and Computer Engineering for their helpful discussion, advice, and friendship (Jongsik

Ahn, Kooho Jung, Sudeep, Jeashin Kim, Jeaseok Kim, Kwangchun Jung, Eunyoung Seok,

Dongha Sim, Kyujin Oh, Minsun Hwang, Seon-Ho Hwang). Their support and advice have

contributed immensely to my work. Also, I thank all of the friends who made my years at the

University of Florida such an enjoyable chapter of my life (Jangsup Yoon, SeungHwan Kim,

Sanghoon Choi, Jiwoon Yang, Choongeol Cho, Okjune Jeon). I acknowledge TSMC and UMC

for the technical and state-of-art fabrication support. I thank Inphi Corp. and Agilent

Technologies for their help on test equipments.

I dedicate this work and my deepest love to my parents who have given me utmost trust

and support throughout the years. I express my profound thanks to my wife, Seungyun Lee, for

her endless and unconditional love and support, and my dearest children, One and Myoung.

Without them, it would not have been possible to pursue my graduate studies.

5

TABLE OF CONTENTS page

ACKNOWLEDGMENTS ...............................................................................................................4

LIST OF TABLES...........................................................................................................................7

LIST OF FIGURES .........................................................................................................................8

ABSTRACT...................................................................................................................................13

CHAPTER

1 INTRODUCTION ..................................................................................................................15

1.1 Motivations .......................................................................................................................15 1.2 Compensation Techniques of High Frequency Signal .....................................................19

1.2.1 Pre-Emphasis Signal...............................................................................................19 1.2.2 Broadband Circuit Technique.................................................................................20

1.3 Modulation Techniques ....................................................................................................22 1.3.1 Conventional Modulation Techniques ...................................................................22 1.3.2 Modulation Technique Using Half-Symbol-Rate-Carrier (HSRC)........................24 1.3.2.1 Why Not Carrier Modulation?.............................................................................25

2 HSRC PHASE SHIFT KEYING (PSK) MODULATIONS ..................................................27

2.1 Binary Modulation............................................................................................................27 2.2 Quadrature Modulations ...................................................................................................28

2.2.1 HSRC Quadrature PSK (QPSK) Modulation.........................................................28 2.2.2 HSRC Offset QPSK (OQPSK) Modulation ...........................................................28 2.2.3 HSRC Minimum Shift Keying (MSK) Modulation ...............................................30

2.3 Signal Spectrum................................................................................................................31 2.4 Bit Error Rate (BER) Performance...................................................................................33 2.5 DC-Free Signaling Based on HSRC-OQPSK Modulation...............................................41 2.5 Measurement of the HSRC-OQPSK Signal .....................................................................45 2.6 Summary...........................................................................................................................51

3 HSRC-OQPSK TRANSMITTER ..........................................................................................53

3.1 Transceiver Architecture ..................................................................................................53 3.1.1 A Conventional Serial Link Transceiver Architecture...........................................53 3.1.2 A Conceptual HSRC-OQPSK Transceiver Architecture .......................................53

3.2 HSRC-OQPSK Transmitter Architecture.........................................................................55 3.2 Circuit Implementation.....................................................................................................57

3.2.1 Current-mode-logic (CML) Circuit ........................................................................57 3.2.2 CML Double-Edge Triggered D flip-flop ..............................................................61 3.2.3 Resistive Load Gilbert Mixer .................................................................................62

6

3.2.4 Quadrature Phase Clock Generator ........................................................................63 3.2.5 I/Q Channel Signal Combining ..............................................................................66 3.2.6 Output Buffer..........................................................................................................66

3.3 Chip Design ......................................................................................................................69 3.3.1 Rev. 1 Transmitter ..................................................................................................69 3.3.2 Rev. 2 Transmitter ..................................................................................................71

3.4 Measurement.....................................................................................................................76 3.4.1 Rev.1 transmitter ....................................................................................................76 3.4.2 Rev. 2 transmitter ...................................................................................................81

4 HSRC-OQPSK RECEIVER DESIGN ...................................................................................91

4.1 Receiver Architecture .......................................................................................................91 4.2 HSRC-OQPSK Receiver (Rev. 1) ....................................................................................91 4.3 HSRC-OQPSK Receiver (Rev. 2) ....................................................................................93

4.3.1 Polarity-Type Costas Loop for Carrier Synchronization........................................94 4.3.2 A New Clock and Data Recovery (CDR) based on the Modified Costas Loop.....96

4.3.2.1 Phase detector characteristics.......................................................................97 4.3.2.2 Loop analysis..............................................................................................102 4.3.2.3 Noise characteristics...................................................................................104

4.3.3 Behavioral Model Simulations .............................................................................106 4.3.3.1 Phase error..................................................................................................106 4.3.3.2 Time domain simulation.............................................................................107

4.4 Chip Design (Rev. 2) ......................................................................................................110 4.4.1 Circuit Implementation.........................................................................................111

4.4.1.1 Sample/hold circuit ....................................................................................111 4.4.1.2 Quadrature VCO (QVCO) .........................................................................112 4.4.1.3 Voltage-to-current (V/I) converter .............................................................114

4.4.2 Circuit Simulations...............................................................................................115 4.4.3 Layout...................................................................................................................119

4.5 Measurement (Rev.2) .....................................................................................................120 4.6 Summary.........................................................................................................................125

5 SUMMARY AND SUGGESTIONS FOR FUTURE WORK .............................................127

5.1 Summary.........................................................................................................................127 5.2 Four-fold Ambiguity Issue .............................................................................................130

LIST OF REFERENCES.............................................................................................................133

BIOGRAPHICAL SKETCH .......................................................................................................137

7

LIST OF TABLES

Table page 2-1 Summary of components used in the measurement...........................................................48

4-1 Transceiver (Rev.2) performance summary ....................................................................124

5-1 Performance comparison of the different modulations....................................................129

8

LIST OF FIGURES

Figure page 1-1 Characteristics of a PCB trace channel with various lengths. ...........................................16

1-2 Basic information of eye diagram......................................................................................16

1-3 Eye diagram of the 10Gbps signal at the far-end after tracing (a) 0.1” (b) 10” (c) 20” PCB channel.......................................................................................................................17

1-4 High data-rate transmission with (a) a conventional parallel bus (b) a serial link. ...........18

1-5 Pre-emphasis of NRZ signal and received signal with/without pre-emphasis. .................19

1-6 Frequency domain representation of the pre-emphasis technique of the transmitted signal. 20

1-7 Active inductor (a) realized with NMOS source follower (b) small signal equivalent circuit (c) simplified model................................................................................................21

1-8 Comparison of PAM-2 and PAM-4 signal spectrum.........................................................23

1-9 Basic digital modulation schemes (a) baseband data (b) ASK (c) FSK (d) PSK. .............24

1-10 Simplified spectrums of carrier modulation signals (a) using carrier signal higher than data-rate (b) using half-symbol-rate-carrier signal. ...................................................26

2-1 HSRC-BPSK modulation: baseband data sequences and modulated signal. ....................27

2-2 HSRC-QPSK and OQPSK (a) modulation scheme, (b) QPSK time domain waveforms, (c) OQPSK time domain waveforms. ............................................................29

2-3 HSRC-MSK (a) modulation scheme (b) time domain waveforms. ...................................30

2-4 Normalized power spectral density for HSRC modulations..............................................32

2-5 Modulated time domain signals of HSRC-OQPSK and its symbol energies. ...................33

2-6 Gaussian distribution of different energy symbols for HSRC-OQPSK modulation. ........36

2-7 Comparison of theoretical and simulated BER performance of HSRC modulations........37

2-8 The demodulated signal (a) demodulation process of I (or Q) channel (b) demodulated peak signal (Es,p) of the HSRC-OQPSK. .....................................................38

2-9 Comparison of the BER performance between the symbol time (matched filter) and the bit time integration of HSRC-OQPK signal. ...............................................................39

9

2-10 Frequency response of the band-limited channel modeled with a one pole low-pass filter. ...................................................................................................................................40

2-11 Comparison of simulated BER performance of the PAM-2 (NRZ) and HSRC-OQPSK modulation with band-limited channel. ...............................................................41

2-12 DC-free modulation based on HSRC-OQPSK (a) modulation scheme (b) time-domain waveforms.............................................................................................................43

2-13 Comparison of the spectra between NRZ and the dc-free signals. ....................................44

2-14 A prototype HSRC-OQPSK transmitter and measurement setup......................................45

2-15 A 500MHz branch-line hybrid quadrature power splitter structure for HFSS simulation...........................................................................................................................46

2-16 Simulated and measured characteristics of the 500MHz branch-line hybrid quadrature power splitter (a) S-parameters (b) phases. .....................................................46

2-17 Measured characteristics of the HSRC-OQPSK modulation (equivalent 2Gbps random data input) (a) time domain waveform (b) spectrum. ...........................................49

2-18 Comparison of the theoretical and measured waveforms of HSRC-OQPSK modulation. ........................................................................................................................50

2-19 Comparison of the measured and theoretical spectrum of the HSRC-OQPSK signal. .....50

3-1 A simplified conventional transceiver for a serial data link. .............................................54

3-2 A conceptual HSRC-OQPSK transceiver architecture. .....................................................54

3-3 A HSRC-OQPSK transmitter architecture.........................................................................55

3-4 A structure of the DET F/F and data and clock synchronization by inserting (a) delay unit (b) a MUX as a delay unit...........................................................................................56

3-5 A fully differential (a) CML buffer and (b) differential input voltage versus output voltages. .............................................................................................................................58

3-6 Characteristics of a differential pairs versus differential input voltage (a) drain currents (b) transconductance. ...........................................................................................60

3-7 CML Circuits (a) D-latch (b) analog multiplexer (c) double-edge triggered flip-flop. .....61

3-8 A resistive load Gilbert mixer............................................................................................63

3-9 Two different delay control circuit (a) a current-starved inverter (b) a shunt capacitive inverter. .............................................................................................................64

10

3-10 A fully differential (a) shunt capacitor inverter with cross-coupled PMOS active load and (b) two-stage ring oscillator. .......................................................................................65

3-11 Injection-locked LC QVCO...............................................................................................66

3-12 Combining I and Q channel signal by direct connecting outputs. .....................................67

3-13 Three stage output buffer with open drain output stage. ...................................................67

3-14 Output buffer (a) differential three-stage output buffer (b) doubly terminated structure..............................................................................................................................68

3-15 HSRC-OQPSK transceiver chip using TSMC 0.18μm CMOS technology. .....................69

3-16 HSRC-OQPSK modulation signal (a) time domain waveforms (b) signal spectrum........70

3-17 Transmitter die photo implemented by UMC 0.18μm CMOS technology. ......................71

3-18 Linearity simulation of resistive Gilbert mixer using UMC 0.18μm CMOS technology (a) conversion gain vs. LO power, (b) conversion gain vs. RF input frequency, (c) input referred 1dB compression. ................................................................73

3-19 HSRC-OQPSK modulation signal (a) time domain waveforms (b) signal spectrum........74

3-20 Simulated dc-free signaling (a) time domain waveforms (b) signal spectrum ..................75

3-20 Test board for the Rev. 1 HSRC-OQPSK transmitter (transceiver) implemented by TSMC 0.18μm CMOS Technology. ..................................................................................76

3-21 Measured spectrums and phase noise of the transmitter’s QVCO implemented by TSMC 0.18μm CMOS technology (center frequency of 2.5GHz with 5MHz span and 47 KHz RBW) (a) free-running mode (b) injection-locked mode (c) comparison of the phase noises between free-running and injection-locked modes. ................................77

3-22 Simplified test setups for (a) 2.5Gbps (5Gbps equivalent) (b) 10Gbps random input for the transmitter...............................................................................................................79

3-23 Eye-diagram of the HSRC-OQPSK transmitted signal implemented by TSMC 0.18μm CMOS technology. ...............................................................................................81

3-24 Test board for the Rev. 2 transmitter implemented by UMC 0.18μm CMOS technology..........................................................................................................................82

3-25 Measured spectrums and phase noise of the transmitter’s QVCO implemented by UMC 0.18μm CMOS technology (center frequency of 2.25GHz with 100MHz span and 910 KHz RBW) (a) free-running mode (b) injection-locked mode (c) comparison of the phase noises between the free-running and the injection-locked modes. ................83

3-26 Characteristics of channels used in the measurement........................................................85

11

3-27 Eye-diagram of HSRC-OQPSK transmitted signal implemented by UMC 0.18μm CMOS technology after (a) 2” PCB trace, (b) 5” PCB trace, (c) 10” PCB trace, (d) 20” PCB trace, (e) 19” SATA cable, in response to 4.86Gbps (both I and Q channel input with 2.43Gbps pseudo random bit stream (PRBS) sequence of 231-1).....................86

3-28 Spectrum of 4.86Gbps dc-free signal.................................................................................88

3-29 Eye-diagram of the HSRC-OQPSK transmitted signal with 9.72Gbps PRBS sequence of (a) 27-1 (ideal eye-opening is depicted with blue line), (b) 231-1. .................89

3-30 Signal spectrum in response to9.72Gbps PRBS sequence of 27-1.....................................53

4-1 HSRC-QOPSK receiver (Rev. 1) architecture incorporated with quarter-rate PD............92

4-2 Quarter-rate phase detector (a) architecture (b) waveforms (for 40Gbps NRZ). ..............92

4-3 HSRC-OQPSK receiver architecture incorporated with a CDR........................................94

4-4 Polarity-type Costas loop for QPSK signal carrier recovery. ............................................95

4-5 A modified Costas loop for the HSRC-OQPSK signal clock and data recovery. .............97

4-6 Early and late sampling time of I/Q data. ........................................................................100

4-7 Averaged phase detector characteristic of the proposed CDR loop. ...............................101

4-8 Equivalent linear model of proposed CDR for HSRC-OQPSK. .....................................103

4-9 Open loop gain characteristics of the proposed CDR loop..............................................104

4-10 Phase error characteristic with SNR (S-curve) of the proposed CDR loop. ....................105

4-11 Phase error simulations of the CDR for the HSRC-OQPSK signal with MatLab Simulink behavioral models. ...........................................................................................106

4-12 Behavioral model simulation result of the phase error for the proposed CDR using MATLAB.........................................................................................................................107

4-13 HSRC-OQPSK Transceiver Model with QVCO for Time-Domain Simulation. ............108

4-14 Time-domain response of phase error signal for the VCO frequency control.................108

4-15 Time-domain response of phase error signal for the VCO frequency control with 10° I/Q mismatch....................................................................................................................109

4-16 Normalized settling time and peak-peak ripple voltage in locking state vs. I/Q mismatch. .........................................................................................................................110

4-17 A high-speed differential sample/hold (S/H) circuit........................................................111

12

4-18 A ping-pong structure differential sample/hold circuit....................................................112

4-19 LC quadrature VCO (LC-QVCO). ..................................................................................113

4-20 Simulated QVCO’s tuning range and its gain..................................................................114

4-21 A differential to single-ended V/I converter. ...................................................................115

4-22 A detailed receiver architecture. ......................................................................................115

4-23 Time-domain simulation setup with a HSRC-OQPSK transmitter and 10cm transmission line with characteristic impedance of 50Ω. ................................................116

4-24 Simulation of the locking behavior of the proposed CDR loop.......................................117

4-25 I/Q data of the transmitter and the receiver in the proper phase locked state. .................118

4-26 HSRC-OQPSK receiver chip fabricated with UMC 0.18μm CMOS technology. ..........119

4-27 Receiver test board...........................................................................................................120

4-28 Simplified receiver (transceiver) measurement setups (a) for 2.5Gbps input (equivalent data-rate of 5Gbps), (b) for 10Gbps data-rate...............................................121

4-29 Phase noise performance of the receiver’s VCO in free-running and locking states. .....122

4-30 Measured jitter of recovered clock in response to 4.86Gbps (both I and Q channel input with 2.43Gbps PRBS sequence of 231-1)................................................................123

4-30 Recovered I (or Q) channel eye-diagrams in response to 4.86Gbps (both I and Q channel input with 2.43Gbps PRBS sequence of 231-1). .................................................123

5-1 A differentially coded HSRC-OQPSK transceiver architecture to resolve the fourfold ambiguity issue (a) the receiver includes a 2:1 multiplexer for serializing I/Q channel data (b) alternative architecture without using a multiplexer. .........................................130

5-2 An example of a conceptual architecture for resolving four-fold ambiguity issue using eight bits SYNC field. ............................................................................................132

13

Abstract of Dissertation Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

DESIGN OF MULTI-GIGABIT SERIAL LINK TRANSCEIVER USING BANDWIDTH-EFFICIENT HALF-SYMBOL-RATE-CARRIER OFFSET QUADRATURE PHASE SHIFT

KEYING MODULATION

By

Hyeopgoo Yeo

August 2007

Chair: Jenshan Lin Major: Electrical and Computer Engineering

My research introduces new quadrature phase-shift-keying (QPSK) modulation techniques

for high-speed data communication systems that use two orthogonal half-symbol-rate-carrier

(HSRC) signals by which channel bandwidth requirements are reduced compared to that of the

conventional non-return-to-zero (NRZ) modulation. The proposed HSRC offset-QPSK (HSRC-

OQPSK) improves spectral efficiency by reducing the side lobes of the signal spectrum. In

addition, HSRC minimum-shift keying (HSRC-MSK) modulation is also introduced. The

performances and the simulation results of the proposed modulation techniques are studied and

compared with those of the conventional ones.

Using the proposed HSRC-OQPSK modulation, a prototype transmitter generating the

HSRC-OQPSK signal was designed and built. Measurement results confirm the theory that the

proposed HSRC-OQPSK modulation improves spectral efficiency by reducing the second lobe

of the signal spectrum by 10dB. Furthermore, the HSRC-OQPSK modulation reduces the first

null bandwidth by 25% compared to the standard Non-Return-to-Zero (NRZ) modulation. Like

NRZ, HSRC-OQPSK uses a 2-level data decision which enables a simpler transceiver

14

architecture than multi-level pulse amplitude modulations (PAM), such as 4-PAM and

duobinary.

My research also examines a modified Costas loop for the clock and data recovery (CDR)

involving HSRC-OQPSK modulation. Behavioral model simulation has verified analysis of the

proposed CDR. The carrier frequency of the HSRC-OQPSK signal is quarter data rate. Hence the

proposed CDR is comparable to quarter-rate CDR using quadrature-phase VCO (QVCO), which

relaxes the timing constraints and allows a simple structure as well.

The HSRC-OQPSK transceivers are implemented and simulated with TSMC and UMC

0.18μm CMOS technology to prove the theoretical performance. The theory and the

measurement results show that it is feasible to increase the data-rate in wire-line communications

using low-cost channels.

15

CHAPTER 1 INTRODUCTION

1.1 Motivations

These days, a demand for high data rate transmission through low cost band-limited

channels (e.g., copper trace on FR4 PWBs and CAT-5 cables) is increasing. Data transmission

with binary format, (represented by non-return-to-zero (NRZ) format) to receive data looks very

simple and clear. However, it is a real challenge to high data rate communications over the band-

limited channel because of impaired signal loss, reflections, and crosstalk. Although parallel I/O

data transmission is efficient for short length data communications, it suffers from a large data

skew and jitter for the aforementioned reasons. Obviously, those problems will be more severe as

data rates and channel lengths increase. The high speed point-to-point link, a serial link, can

overcome these kinds of bottlenecks. It offers high data transmission, up to multi-gigabit data

transmission, over a long PCB trace, and a cable line. The transmitted signal bandwidth for the

point-to-point links increases directly proportional to the data rate. Since a low cost channel,

such as a PCB trace (e.g., copper cable), has low-pass characteristics, the high frequency loss of

the transmitted data is unavoidable, as shown in Figure 1-1. This frequency dependent low-pass

characteristic largely comes from the skin effect of the material as well as dielectric loss of the

material. Figure 1-1 also shows the frequency characteristics of the FR4 PCB with various

lengths. 10 GHz signal attenuated about 20dB in 50Ω characteristic impedance of 20” length

with a tangential loss of 0.023. Besides board trace, a board attachment, a connector, and IC

package make parasitics. Unfortunately, these cause nonlinear effects such as reflection,

resonance, and ripple.

16

2 4 6 8 10 12 140 16

-25

-20

-15

-10

-5

-30

0

freq, GHz

dB(S

(2,1

))5"

10"

20"

15"

Figure 1-1 Characteristics of a PCB trace channel with various lengths.

The non-return-to-zero (NRZ) signal, which is the same as PAM-2, is commonly used for

digital systems including high speed serial links. The NRZ signal, defined as binary data, can be

very simple because the signal has 2-levels and is easily implemented by digital circuits. An eye

diagram is usually used to estimate signal quality. The signal is chopped into equal periods and

accumulated onto one plot. The eye diagram gives visual information regarding signal usefulness

in data communication systems [1].

Zero-crossing jitter

Unit Interval

SNR at the sampling point

ISI

Ideal sampling point

Figure 1-2 Basic information of eye diagram.

17

Figure 1-3 shows an example of the eye diagrams for various trace lengths. The 10Gbps

random binary data is applied to the input port and the output signal is monitored. The eye

diagram has no jitter components and its opening is very clear with 0.1” channel trace. However,

as the PCB trace increases, distortion and jitter components are introduced to the signal and the

eye opening is almost closed after 20” trace, as shown in Fig. 1-3(a).

0.1"

0 20 40 60 80 100 120 140 160 180 200-20 220

0.0

0.2

0.4

0.6

0.8

1.0

-0.2

1.2

time, psec

0 20 40 60 80 100 120 140 160 180 200-20 220

0.0

0.2

0.4

0.6

0.8

1.0

-0.2

1.2

time, psec

10"

0 20 40 60 80 100 120 140 160 180 200-20 220

0.0

0.2

0.4

0.6

0.8

1.0

-0.2

1.2

time, psec

20"

(a) (b) (c)

Figure 1-3 Eye diagram of the 10Gbps signal at the far-end after tracing (a) 0.1” (b) 10” (c) 20” PCB channel.

Conventional parallel buses are used for short range data links such as system buses

between CPU and the main memory to increase the data-rate. However, there are several crucial

limitations to increase data-rate to infinity. First, in high-speed parallel buses, signal jitter and

data skew occur due to the crosstalk between signals, which causes a synchronization problem

between the signal lines at the receiver-end. The synchronization mismatch prevents the

appropriate data transmission between the transmitter and the receiver even though a precise

clock signal is used for the data fetch. Figure 1-4 shows the example of data skew between the

signal lines and the misaligned data arrival at the receiver-end, which causes the wrong data

transfer. Moreover, the skew problem is getting more severe as the clock frequency goes high [2].

Instead of using parallel buses, point-to-point communication called a serial link is used to

resolve the skew problem. This communication uses a low-swing signal instead of a

conventional large swing signal with the terminations which prevent the signal reflection. The

18

low-swing signaling can also reduce the power dissipation. However, to convert the parallel-bit

data into a serial data, the clock frequency should be increased by the parallel bit times for

maintaining the data-rate. However, this method can resolve the data skew between the data in

parallel bits, hence, the synchronization problem can be fixed even if the jitter is introduced to

the signal. Of course, the duration of the data fetch timing is reduced due to the jitter in the signal.

Figure 1-4(b) shows the example of the serial link.

Tx RxParallel Bus

Serial Link

Crosstalk, Jitter, Skew

(Backplane PCB, USB, Ethernet, Optical link)ex) USB2 : 480Mb/s

(a)

(b)

Tx RxParallel Bus

Serial Link

Crosstalk, Jitter, Skew

(Backplane PCB, USB, Ethernet, Optical link)ex) USB2 : 480Mb/s

(a)

(b)

Figure 1-4 High data-rate transmission with (a) a conventional parallel bus (b) a serial link.

Since a serial link uses a very high frequency clock compared to a parallel bus system, the

signal quality gets more influenced from the limitation of the data bandwidth due to the loss of

high frequency energy in the band-limited channel which has normally a low-pass characteristic

as investigated before. Therefore, it remains a challenging problem in the high-speed serial link

communications as to how to compensate the high-frequency energy of the signal and recover

19

the transmitted data at the receiver, including clock and data recovery (CDR) using the

transmitted signal.

1.2 Compensation Techniques of High Frequency Signal

1.2.1 Pre-Emphasis Signal

There are two primary approaches to improving high speed data link limitation over the

low-cost channel. First is a direct compensation of the signal’s high frequency component during

transmission and reception. Pre-emphasis of the PAM-4 transmitted signal with an equalizer

implemented by a multi-tap finite impulse response (FIR) filter has been demonstrated in [3-4].

The equalization distorts the transmitted signal by giving the signal high frequency energy,

hence, the signal arriving at the receiver-end has more power compared to the non pre-emphasis

signal, which are shown in Figure 1-5. However, it is still challenging to implement the equalizer

in the GHz range with CMOS technology [5].

Pre-emphasis Signal

Received Signal

Received Signal without pre-emphasis

Figure 1-5 Pre-emphasis of NRZ signal and received signal with/without pre-emphasis.

Figure 1-6 depicts the pre-emphasis in the frequency domain. The band-limited channel is

characterized as a low-pass characteristic as discussed in the previous section. The FIR filter for

20

the pre-emphasis of the signal should have a high-pass characteristic to compensate the energy of

high-frequency signal which will be attenuated more than the lower frequency signal energy.

However, this technique is very challenging in GHz range with CMOS technology as well as

requires high speed sampling DAC, ADC for the PAM-4 signal, which increase the system

complexities [3-4].

+Channel

Equalizing

Received Signal

Transmitted Signal

+Channel

Equalizing

Received Signal

Transmitted Signal

Figure 1-6 Frequency domain representation of the pre-emphasis technique of the transmitted signal.

1.2.2 Broadband Circuit Technique

Besides the direct compensation of high frequency components, broadband circuit

techniques to broaden the limitation of the operation frequency of the circuit, such as an inductor

peaking, capacitive degeneration, a Cherry-Hooper limiting amplifier, and a fT doubling, can be

classified as compensation techniques of the high frequency signal [6].

Inductive peaking is one of the broadband techniques widely used to increase the

bandwidth of the amplifier. However, a passive inductor integrated in chip occupies a large chip

area. An active inductor implemented by an active device such as a MOS can be used to save the

21

chip area. Detailed active inductor analysis is presented in [6]. Figure 1-7 shows the active

inductor implemented with a PMOS device and its equivalent models.

M1

VDD

RS

Zout

R1R2 Zout

LRS

CGS

IXVX

+-

+- V1

Figure 1-7 Active inductor (a) realized with NMOS source follower (b) small signal equivalent circuit (c) simplified model [6].

The small-signal equivalent circuit is obtained as (1-1), (1-2), and the impedance of the output

node is derived as (1-3).

jωCGSV1 + gmV1 = -IX (1-1)

jωCGSV1RS + V1 = -VX (1-2)

GSm

SGSout Cjg

RCjZ

ωω+

+=

1 (1-3)

Note that if RS >> 1/gm, absolute value of the Zout is increasing with frequency, which behaves

like an inductor and is modeled as depicted in Figure 5-1(c) where R1= RS – 1/gm, R2 = 1/gm.

Then the value of the inductor, L is obtained as (1-4).

⎟⎟⎠

⎞⎜⎜⎝

⎛−=

mS

m

GS

gR

gC

L 1 (1-4)

To get a high quality (Q) factor of the active inductor, we must maximize R1 and minimize

R2. Since the Q of the parallel combination of R1 and L is represented by R1/ωL and R1/L =

gm/CGS, Rs does not affect the Q factor significantly. However, the active inductor mainly

suffers from its voltage headroom for the operation. To relieve voltage headroom, a modified

22

active inductor with NMOS has been developed; however, the achievable bandwidth of the

circuit is limited compared to those using a passive inductor.

Other broadband techniques such as a capacitive degeneration, a Cherry-Hooper limiting

amplifier, are studied well in [6].

1.3 Modulation Techniques

1.3.1 Conventional Modulation Techniques

Pulse amplitude modulation (PAM) -strictly baseband PAM- signal is most often used in

high-speed data communications, such as a serial link communication because it is easy to define

its level of “1” or “0” and implement with a simple digital circuit. However, it needs a relatively

large bandwidth requirement for the data transmission. For high-speed serial link over the band-

limited channel, it is important to reduce the bandwidth requirement of the transmitted signal

because the channel has low-pass characteristics, as discussed in the previous section. Multi-

PAM (e.g. 4-PAM) which reduces signal bandwidth requirement has been introduced and

demonstrated to increase the data-rate [3]. Duobinary, which is characterized as a 3-level PAM,

has also been introduced [7]. However, in these multi-PAMs, it is difficult to maintain the linear

spacing between levels in low-voltage and low-power application and as a result the system’s

performances are degraded. The level spacing also causes complexity in the transceiver design

not only because the received signal needs to be linearly amplified but also because the multi-

PAM signal requires accurate reference voltages.

Second is a modulation technique that reduces the transmitted bandwidth signal, such as

multi-level pulse-amplitude modulation (PAM). Four-level PAM (known as PAM-4) signaling

for serial links using CMOS technology has been proposed and demonstrated over the band-

limited channels [3], [8]. Obviously, the PAM-4 data rate is double that of a non-return-to-zero

(NRZ) with the same bandwidth signal because the former uses four-levels instead of two-levels.

23

Figure 1-8 shows the signal spectrums of baseband PAM-2 and PAM-4. The PAM-4 signal

occupies exactly half of the bandwidth of PAM-2 signal. However, the multi-level signal reduces

the signal energy hence degrading the BER performance. Moreover, electrical limitations of the

system such as low supply voltage resulting from device scaling make the system design more

difficult. For example, the level spacing of the PAM-M signal is inversely proportional to (M-1).

Therefore the level is much smaller in low voltage systems, which are vulnerable to noise and

also require a more precise ADC for the reliable communications. Recently, duobinary signaling,

a 3-level PAM, for the backplane serial link has been introduced and demonstrated for serial

links applications [9]. The duobinary signal has only three signal levels which offer higher signal

energy than the PAM-4 signal within the same voltage system.

0 0.5 1 1.5 2 2.5 3-60

-50

-40

-30

-20

-10

0

10

f/BR (Hz/bit/s)

Nor

mal

ized

Pow

er S

pect

ral D

ensi

ty (d

B)

PAM-2PAM-4

Figure 1-8 Comparison of PAM-2 and PAM-4 signal spectrum.

24

1.3.2 Modulation Technique Using Half-Symbol-Rate-Carrier (HSRC)

On the other hand, a carrier modulation such as a phase-shit-keying (PSK) modulation is

difficult to be used in the serial link data communications because the carrier signal makes

baseband data to the passband signal. Figure 1-9 shows basic digital modulation schemes using

carrier signals. Digital modulation is using an analog carrier signal to modulate the binary digital

sequences. The basic digital modulation schemes are depicted in Figure 1-5. Amplitude-shift-

keying (ASK) is the modulation technique that mixes the baseband binary data with a carrier

signal. The carrier signal is generated when the data is high, otherwise the amplitude of the

carrier goes zero. Frequency-shift-keying (FSK) modulation has two different frequency carrier

signals representing one or zero data. Baseband data is used for the information of the frequency

to be generated from the carrier signal generator which will be usually implemented by a voltage

controlled oscillator (VCO). Phase-shift-keying modulation uses one carrier frequency signal

different from the FSK. The phase of the carrier signal is changed as the baseband data changes.

1 0 0 1

(a)

(b)

(c)

(d)

1 0 0 1

(a)

(b)

(c)

(d)

Figure 1-9 Basic digital modulation schemes (a) baseband data (b) ASK (c) FSK (d) PSK.

25

1.3.2.1 Why Not Carrier Modulation?

These carrier modulation techniques shift the spectrum of the baseband data to the carrier

frequency, as shown in Figure 1-10(a). Usually the carrier frequency is much higher than the

baseband data-rate. Applied to the baseband data communications, the carrier modulation shifts

the baseband data information to the carrier frequency so that the required bandwidth to transfer

the baseband information would be increased as shown in Figure 1-10(a). Consequently, it is

inevitable to waste the required bandwidth of the channel with this carrier modulation.

What happens in the PSK signal if the carrier signal is sub-symbol-rate? Figure 1-10(b)

shows the spectrum where a half-symbol-rate carrier is used as a carrier modulation signal. Both

main lobes of spectrums at positive and negative frequency are overlapped. The modulated

signal is characterized as a passband signal due to the carrier modulation; however, the spectrum

of the signal looks like that of a baseband signal, as depicted in Figure 1-10(b). This signal

would be called the pseudo-baseband signal. Quadrature modulation using two orthogonal carrier

signals can be defined. With this quadrature modulation – quadrature PSK (QPSK) - technique,

the bandwidth requirement of the baseband data can be reduced by half compared to the PSK

modulation using a single carrier signal [10]. What if the quadrature modulation uses the HSRC

signal? As mentioned earlier, the spectrums are overlapped as shown in Figure 1-10(b) so that

the modulation reduces the first-null bandwidth of the spectrum by 25% than that of non-return-

zero (NRZ) modulation.

My research mainly focuses on a signal modulation technique to reduce the bandwidth of

the transmitted signal. Modulations based on PSK are proposed and analyzed. The conventional

PSK type modulations are well-studied in [10-11]. The proposed high speed data links signal

modulation techniques modulating digital data with a half-symbol-rate carrier using phase-shift

keying (named as HSRC-PSK).

26

fc-fc

Signal Bandwidth

Required Bandwidth

fc=1/4Tb

Signal Bandwidth

Required Bandwidth

-fc=-1/4Tb

(a) (b)

Figure 1-10 Simplified spectrums of carrier modulation signals (a) using carrier signal higher than data-rate (b) using half-symbol-rate-carrier signal.

Like binary phase-shift keying (BPSK) and quadrature phase-shift keying (QPSK) used in

wireless systems using radio frequency (RF) carriers, this HSRC-PSK modulation technique can

be extended to quadrature type modulations using two orthogonal carrier signals, such as QPSK,

offset-QPSK (OQPSK), minimum shift keying (MSK) modulations popularly used in wireless

communications. These quadrature type PSK modulations can increase the data rate with nearly

equal signal energy [9]. The proposed half-symbol-rate carrier modulation technique is similar to

conventional modulation techniques. However, their properties are not identical and the analysis

is given in this report. Effectively, the half-symbol-rate-carrier modulation performs waveform

shaping on the data bits when transmitted through band-limited channels. The signal spectrums

of proposed modulations are derived in analytical form and their bit error rate performances are

simulated. The results are compared to conventional modulations.

The HSRC-OQPSK signaling is especially chosen as a possible modulation candidate for

high speed serial links, implemented and simulated by TSMC and UMC 0.18um CMOS

technology.

27

CHAPTER 2 HSRC PHASE SHIFT KEYING (PSK) MODULATIONS

2.1 Binary Modulation

A binary PSK (BPSK) signal whose sinusoidal carrier amplitude is Ac can be represented

by (2-1) where m(t) is the binary data signal, Tb is the bit period, fc, is the frequency of carrier,

and θc is the phase of the carrier [12].

( ) bccc TttfAtmts ≤≤+⋅= 02cos)()( θπ (2-1)

Let us now consider a special case where fc=1/2Tb (Nyquist bandwidth) and θc=-π/2. With

this condition, the BPSK signal can be represented by (2-2).

bb

c TttT

Atmts ≤≤⎟⎟⎠

⎞⎜⎜⎝

⎛⋅= 0sin)()( π

(2-2)

Since the carrier signal frequency is within the baseband, this modulation is similar to

signal waveform shaping rather than modulation. The baseband data are mixed with a carrier

whose frequency is the same as one half of the symbol rate, fc=1/2Ts, where Ts is the symbol

period, and Ts=Tb in BPSK. Figure 2-1 shows the concept of half-symbol-rate-carrier BPSK

(HSRC-BPSK) modulation.

-1

1m(t)

1 1s(t)

11 -1 1

sin(π/Tb)t

-1

1m(t)

1 1-1

1m(t)

1 1s(t)

11 -1 1

s(t)1

1 -1 1

sin(π/Tb)t

Figure 2-1 HSRC-BPSK modulation: baseband data sequences and modulated signal.

28

2.2 Quadrature Modulations

2.2.1 HSRC Quadrature PSK (QPSK) Modulation

HSRC-QPSK modulation is similar to a conventional QPSK modulation. The conventional

QPSK signal can be represented in (2-3), where mI and mQ are the data sequences in their in-

phase (I) and the quadrature-phase (Q) components, respectively, and fc is the carrier frequency

which is generally larger than the symbol rate.

tftmtftmts cQcI ππ 2sin)(2

12cos)(2

1)( ⋅+⋅= (2-3)

However, for multi-Gb/s data communications (i.e. backplane serial link), it is often

unpractical to generate a carrier frequency higher than the symbol rate. To address this issue, the

HSRC-QPSK modulation is proposed, as shown in Figure 2-2(a). The HSRC-QPSK signal is

obtained by substituting QPSK’s carrier frequency, as fc=1/(4Tb), where Tb is the bit-period,

which is the half-symbol-rate-carrier frequency, as shown in (2-4).

tT

tmtT

tmtsb

Qb

I ⎟⎟⎠

⎞⎜⎜⎝

⎛⋅+⎟⎟

⎠

⎞⎜⎜⎝

⎛⋅=

2sin)(

21

2cos)(

21)( ππ (2-4)

Since the carrier frequency is lower than the data-rate, the phase of the carrier signal can

affect the modulated signal’s properties. Therefore, the phase of the carrier signal is fixed to

define this modulation. The theoretical time domain HSRC-QPSK waveforms are shown in

Figure 2-2(b).

2.2.2 HSRC Offset QPSK (OQPSK) Modulation

The HSRC-OQPSK shares the same architecture as HSRC-QPSK shown in Fig. 2-2(a). Its

signal is obtained by staggering (offsetting) I and Q by Tb, as shown in Figure 2-2(c). The

HSRC-OQPSK signal can also be defined by (2-3) but using offset sequences of mI and mQ [13].

29

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

(a)

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

m (t)m1

m2

m3 m4

m5 m8

m6 m7

0 2Tb 4Tb 6Tb 8Tb

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

( )tTb2cos2

1 π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin2

1 π

(a)

s I( t)

0 2 T b 4 T b 6 T b 8 T b

s I( t)

0 2 T b 4 T b 6 T b 8 T b

s Q ( t)

0 2 T b 4 T b 6 T b 8 T b

s Q ( t)

0 2 T b 4 T b 6 T b 8 T b

s ( t)

0 2 T b 4 T b 6 T b 8 T b

s ( t)

0 2 T b 4 T b 6 T b 8 T b

(b) (c)

m5

m7mI(t) m3

0 2Tb 4Tb 6Tb 8Tb

m1

Tb 9Tb3Tb 5Tb 7Tb

mQ(t)

m2

m4 m6

m8

ssI(t)

0 2Tb 4Tb 6Tb 8Tb

sQ(t)

Tb 9Tb3Tb 5Tb 7Tb

m5

m

0 2Tb 4Tb 6Tb 8Tb

mI(t) m1 m3 m7

mQ(t)

m2

m4 m6

m8

0 2Tb 4Tb 6Tb 8Tb

s ( t)s ( t)

Tb 9Tb3Tb 5Tb 7Tb

s I( t)

0 2 T b 4 T b 6 T b 8 T b

s I( t)

0 2 T b 4 T b 6 T b 8 T b

s Q ( t)

0 2 T b 4 T b 6 T b 8 T b

s Q ( t)

0 2 T b 4 T b 6 T b 8 T b

s ( t)

0 2 T b 4 T b 6 T b 8 T b

s ( t)

0 2 T b 4 T b 6 T b 8 T b

(b) (c)

m5

m7mI(t) m3

0 2Tb 4Tb 6Tb 8Tb

m1

m5

m7mI(t) m3

0 2Tb 4Tb 6Tb 8Tb

m1

Tb 9Tb3Tb 5Tb 7Tb

mQ(t)

m2

m4 m6

m8

Tb 9Tb3Tb 5Tb 7Tb

mQ(t)

m2

m4 m6

m8

ssI(t)

0 2Tb 4Tb 6Tb 8Tb

ssI(t)

0 2Tb 4Tb 6Tb 8Tb

sQ(t)

Tb 9Tb3Tb 5Tb 7Tb

sQ(t)

Tb 9Tb3Tb 5Tb 7Tb

m5

m

0 2Tb 4Tb 6Tb 8Tb

mI(t) m1 m3 m7

m5

m

0 2Tb 4Tb 6Tb 8Tb

mI(t) m1 m3 m7

mQ(t)

m2

m4 m6

m8

0 2Tb 4Tb 6Tb 8Tb

mQ(t)

m2

m4 m6

m8

0 2Tb 4Tb 6Tb 8Tb

s ( t)s ( t)

Tb 9Tb3Tb 5Tb 7Tb

s ( t)s ( t)

Tb 9Tb3Tb 5Tb 7Tb

Figure 2-2 HSRC-QPSK and OQPSK (a) modulation scheme, (b) QPSK time domain waveforms, (c) OQPSK time domain waveforms.

This is the same as the direct combination of conventional MSK’s I/Q channels without the

mixing of the carrier frequency. As shown in Figure 2-2(c), the combined signal is free from any

discrete transitions. If the in-phase signal is at its peak of either positive or negative value, the

quadrature-phase signal is zero at every 2Tb. On the other hand, if the quadrature-phase signal is

at its peak of either positive or negative value, the in-phase signal is zero at every 2Tb. Therefore,

the combined signal has no discrete transitions.

30

2.2.3 HSRC Minimum Shift Keying (MSK) Modulation

The HSRC-MSK modulation can be obtained by applying the HSRC signal to the

conventional MSK modulation format as described in (2-5) and shown in Fig. 2-3(a). As shown

in Fig. 2-3(b), the data sequences of I/Q channels are the same as those of the OQPSK as well as

the proposed HSRC-OQPSK. The resulting HSRC-MSK time domain data sequences are the

same as the original data sequences.

tT

tmtT

tmtsb

Qb

I ⎟⎟⎠

⎞⎜⎜⎝

⎛+⎟⎟

⎠

⎞⎜⎜⎝

⎛=

2sin)(

2cos)()( 22 ππ (2-5)

sI(t)

0 2Tb 4Tb 6Tb 8Tb

sQ(t)

Tb 3Tb 5Tb 7Tb 9Tb

s(t)

Tb 3Tb 5Tb 7Tb 9Tb

m4

m6mI(t) m0 m2

0 2Tb 4Tb 6Tb 8Tb

mQ(t)

m1

m3 m5

m7

Tb 3Tb 5Tb 7Tb 9Tb

( )tTb2cos π

( )tTb2sin π

( )tTb2cos π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin π

m (t)m0

m1

m2 m3

m4 m7

m5 m6

0 2Tb 4Tb 6Tb 8Tb

(a)

(b)

sI(t)

0 2Tb 4Tb 6Tb 8Tb

sI(t)

0 2Tb 4Tb 6Tb 8Tb

sQ(t)

Tb 3Tb 5Tb 7Tb 9Tb

sQ(t)

Tb 3Tb 5Tb 7Tb 9Tb

s(t)

Tb 3Tb 5Tb 7Tb 9Tb

s(t)

Tb 3Tb 5Tb 7Tb 9Tb

m4

m6mI(t) m0 m2

0 2Tb 4Tb 6Tb 8Tb

m4

m6mI(t) m0 m2

0 2Tb 4Tb 6Tb 8Tb

mQ(t)

m1

m3 m5

m7

Tb 3Tb 5Tb 7Tb 9Tb

mQ(t)

m1

m3 m5

m7

Tb 3Tb 5Tb 7Tb 9Tb

( )tTb2cos π

( )tTb2sin π

( )tTb2cos π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin π

( )tTb2cos π

( )tTb2sin π

( )tTb2cos π

mI(t)

mQ(t)

Serial to

Parallel

s(t)

sI(t)

sQ(t)

m(t)

( )tTb2sin π

m (t)m0

m1

m2 m3

m4 m7

m5 m6

0 2Tb 4Tb 6Tb 8Tb

m (t)m0

m1

m2 m3

m4 m7

m5 m6

0 2Tb 4Tb 6Tb 8Tb

(a)

(b)

Figure 2-3 HSRC-MSK (a) modulation scheme (b) time domain waveforms.

31

The modulated signal described in (2-5) has the same data sequences to those of the

original binary data which is shown in Figure 2-3(a). This is similar to raised-cosine

approximation (RCA) signaling [14].

2.3 Signal Spectrum

Since the frequency of the carrier signal is lower than the data-rate, the carrier signal can

be effectively considered as a spectral pulse-shaping. The spectral pulse-shaping function of the

proposed modulations is derived as shown in (2-6) by applying the similar approach described in

[10]. Note that the pulse-shaping function of HSRC-QPSK and HSRC-OQPSK are different,

although the pulse-shaping functions of the conventional QPSK and OQPSK are the same.

( )

⎪⎪⎪⎪

⎭

⎪⎪⎪⎪

⎬

⎫

⎪⎪⎪⎪

⎩

⎪⎪⎪⎪

⎨

⎧

<<⎟⎟⎠

⎞⎜⎜⎝

⎛

<<⎟⎟⎠

⎞⎜⎜⎝

⎛

<<⎟⎟⎠

⎞⎜⎜⎝

⎛+

=

elsewhere

TtTt

TtTt

TtTt

tg

bb

bb

bb

;0

MSK-HSRCfor 20;2

sin

OQPSK-HSRCfor 20;2

sin

QPSK-HSRCfor 20;42

sin

2 π

π

ππ

(2-6)

The normalized power spectral densities, S(f), are derived by the Fourier transforms of (2-

6), S(f)=|G(f)|2/T, where G(f) is the Fourier transform of any given g(t) [10]. Using this equation

and combining with trigonometric identities of sin(x)=(ejx-e-jx)/2j, cos(x)=(ejx+e-jx)/2, the final

forms of S(f) can be equated as (2-7). Their plots are shown in Figure 2-4.

( )

( )

( ) ⎪⎪⎪⎪

⎭

⎪⎪⎪⎪

⎬

⎫

⎪⎪⎪⎪

⎩

⎪⎪⎪⎪

⎨

⎧

⎟⎟⎠

⎞⎜⎜⎝

⎛

−

⎟⎟⎠

⎞⎜⎜⎝

⎛

−

⎟⎟⎠

⎞⎜⎜⎝

⎛

−+

=

MSK-HSRC;412

2sin

OQPSK-HSRC;161

2cos16

QPSK-HSRC;161

2cos1618

2

22

2

222

222

22

bb

bb

b

bb

b

bbb

TffTfTT

TffTT

TffTTfT

fS

ππ

ππ

ππ

(2-7)

32

0 0.5 1 1.5 2 2.5 3-60

-50

-40

-30

-20

-10

0

10

f/BR (Hz/bit/s)

Nor

mal

ized

Pow

er S

pect

ral D

ensi

ty (d

B)

NRZ (PAM-2)HSRC-MSKHSRC-QPSKHSRC-OQPSK(MSK)

Figure 2-4 Normalized power spectral density for HSRC modulations.

The first null point of the HSRC-QPSK and HSRC-OQPSK spectrum are located at

0.75Tb. Therefore, their first-null bandwidths are reduced by 25% compared to the first-null

bandwidth of the conventional NRZ, alternatively known as baseband binary pulse-amplitude-

modulation (PAM-2). However, the first-null bandwidth of the HSRC-MSK is the same as that of

the conventional NRZ.

For the side lobes, HSRC-OQPSK shares the same pulse-shaping with the conventional

MSK. Therefore, the HSRC-OQPSK power spectral density should also be identical to that of

MSK, which is presented in [11]. Also, they fall off more rapidly (1/f4) than those of the

conventional NRZ (1/f-2) [11]. The side lobes of the HSRC-MSK fall off even more rapidly (1/f6)

and therefore the high frequency components can be further suppressed [14].

33

Recent semiconductor technologies scaled down the transistor size, which results in the

reduction of supply voltage. In low voltage circuits, PAM-4 modulation, that requires linear

amplification, is more difficult to maintain the same spacing between levels. The HSRC-OQPSK

is expected to have advantages over this because it does not need to space levels. Especially, for

the amplifiers to be operating at power saturation, MSK has superior performance to QPSK [13].

For bandwidth efficiency, the spectrum of HSRC-OQPSK, which is the same as MSK, contains

99% of the total signal power within the bandwidth, B≈(1.2/Tb), while for QPSK and PAM-4, the

99% bandwidth increase to B≈(8/Tb) [13], [15]. Therefore, it is expected that the HSRC-OQPSK

should have the effective signal spectrum in band-limited channel, such as backplane serial link.

2.4 Bit Error Rate (BER) Performance

The BER performances are characterized using a coherent demodulation with a matched

filter in Additive-White-Gaussian-Noise (AWGN) channel for the proposed HSRC modulations

are simulated via MATLAB. The HSRC-QPSK signal has only one symbol energy, Es.

Therefore, it is expected to follow the same BER performance of the conventional QPSK, as

shown in Figure 2-7.

s(t)Es,t Es,t

Es,t Es,tEs,f Es,p

Es,p Es,f

s(t)Es,t Es,t

Es,t Es,tEs,f Es,p

Es,p Es,f

Figure 2-5 Modulated time domain signals of HSRC-OQPSK and its symbol energies.

For the HSRC-OQPSK, the BER performance result is different from that of the

conventional OQPSK, due to the existence of three different symbol energies – namely, Es,t, Es,f,

34

Es,p. For the case where the quadrature symbols are evenly transmitted, an example of s(t)

illustrating the three different energies is described in Figure 2-5.

The upper sequences are for the I channel while the lower sequences are for the Q channel.

The different three symbol energies can be calculated by integrating the square of the modulated

signal (∫|s(t)|2dt) over a 2 bit-time (2Tb), as shown in (2-8)~(2-10). Es,t is the symbol energy for

the case where only one of the 2 bits transits between its high and low points. This symbol

energy is the same as that of the conventional QPSK, Es. Es,f is for the case where both 2 bits

remain in its high or low points (remains flat). Finally, Es,p is when the signal peaks by

consecutive transitions. Note that the final terms in (2-9), (2-10), Es,f, Es,p are expressed in scalar-

multiples of Es,t by kf, kp, respectively.

dtTtAE bT

bcts

22

0

2, 42

sin ∫ ⎟⎟⎠

⎞⎜⎜⎝

⎛+=ππ (2-8)

⎟⎠⎞

⎜⎝⎛ +

≡=⎟⎟⎠

⎞⎜⎜⎝

⎛+⋅= ∫ π

πππ 2 ; 42

sin2 ,

2

0

2, ftsf

T

bcfs kEkdt

TtAE b (2-9)

⎟⎠⎞

⎜⎝⎛ −

≡⋅=⎟⎟⎠

⎞⎜⎜⎝

⎛+⋅= ∫ π

πππ 2 ; 42

sin2 ,

222

, ptsp

T

Tb

cps kEkdtTtAE b

b

(2-10)

The average probability of errors of s(t) can be calculated using (2-10), where M is the

number of different types of symbols [10], [12]. As was the case in Figure 2-5, this equation

assumes that M symbols are evenly transmitted, where si represents one of M possible symbols

of the modulation signal.

( ) ( ) ( ) ( )∑=

==M

iieiiee sP

MsPsPP

1

|1| εεε (2-11)

35

If we define noise energy to be N, the BER can be expressed in terms of energy-per-

symbol (or bit) to the noise-density ratio of the three energies, as shown in (2-12)~(2-14). Note

that in the final terms of (2-13) and (2-14), the noises are scaled to an effective values, so that the

signal energies can be considered as a function of Es,t.

NE

NE sts =, (2-12)

f

tstsffs

kNE

NEk

NE

/,,, =

⋅= (2-13)

p

tstspps

kNE

NEk

NE

/,,, =

⋅= (2-14)

In the AWGN channel, the noise N, has a Gaussian distribution with the mean being 0 and

the variance being ½No. Hence, the scaled effective noises of N/kf and N/kp also have Gaussian

distributions where the means are 0 and variances are ½No/kf2 and ½No/kp

2, respectively. With

these parameters, the probability of errors for each symbol can be obtained by integrating the

Gaussian distribution functions over the error decision section as shown in Figure 2-6. Therefore,

the BER of HSRC-OQPSK in AWGN channel can be derived as (2-15), where Eb=Es/2 and

( ) ( )∫∞

−=x

dxxxQ 2/exp21 2

π.

⎟⎟⎠

⎞⎜⎜⎝

⎛⎟⎠⎞

⎜⎝⎛ −

⋅+⎟⎟⎠

⎞⎜⎜⎝

⎛⎟⎠⎞

⎜⎝⎛ +

⋅+⎟⎟⎠

⎞⎜⎜⎝

⎛⋅=

000

224122

412

21

NE

QNE

QNE

QP bbbe π

ππ

π (2-15)

The 1/2 weight in the first term and the 1/4 in the second and last terms account for the fact

that Es,t occurs twice as often as Es,f or Es,p, in an evenly transmitted quadrature symbols. As

shown in Figure 2-7, (2-15) fits well with the results obtained from MATLAB.

36

Nominal EnergyLow EnergyHigh Energy

sEsE−

0

22

21

2N⎟

⎠⎞

⎜⎝⎛−

=ππσ

0

22

21

2N⎟

⎠⎞

⎜⎝⎛+

=ππσ

02

21 N=σ

( ) ( ) ⎟⎠⎞⎜

⎝⎛ −= 22

2exp2

1 σπσ sExxf

Es,t/N

Es,p/NEs,f/N

Figure 2-6 Gaussian distribution of different energy symbols for HSRC-OQPSK modulation.

The HSRC-MSK also has different BER from that of the conventional MSK, due to its

existence of two different symbol energies – namely, Es,t,m, Es,f,m. The analysis approach is the

same as that of the HSRC-OQPSK’s case and the results symbol energies and the BER of HSRC-

OQPSK are shown in (2-16), (2-17), and (2-18), respectively.

bc

T

bbcmts TAdt

Tt

TtAE b 2

22

0

222,, 2

cos2

sin =⎟⎟⎠

⎞⎜⎜⎝

⎛−⎟⎟

⎠

⎞⎜⎜⎝

⎛= ∫

ππ (2-16)

bc

T

bbcmfs TAdt

Tt

TtAE b 2

22

0

222,, 2

2cos

2sin =⎟⎟

⎠

⎞⎜⎜⎝

⎛+⎟⎟

⎠

⎞⎜⎜⎝

⎛= ∫

ππ . (2-17)

⎟⎟⎠

⎞⎜⎜⎝

⎛⋅+⎟

⎟⎠

⎞⎜⎜⎝

⎛⋅=

00

232

212

34

21

NE

QNE

QP bbe (2-18)

The analytical results are compared with those obtained from the MATLAB simulation as

shown in Figure 2-7. From the results, we observe that the HSRC-QPSK modulation has no

degradation of BER performance as compared to the conventional QPSK, which has almost the

37

same BER performance as that of the NRZ modulation. Therefore, HSRC-QPSK can be used as

a modulation technique for high-speed broadband communications with less bandwidth

requirement while keeping similar BER performance of NRZ. On the other hand, the HSRC-

OQPSK and HSRC-MSK modulations have less BER performances but having even more

efficient signal spectrum characteristics as described in the previous section.

0 5 10 15 2010-6

10-5

10-4

10-3

10-2

10-1

100

Eb/N0 (dB)

BE

R

Theoretical QPSKTheoretical HSRC-MSKTheoretical HSRC-OQPSKSimulated HSRC-QPSKSimulated HSRC-MSKSimulated HSRC-OQPSK

Figure 2-7 Comparison of theoretical and simulated BER performance of HSRC modulations.

The BER performances of the proposed HSRC PSKs’ are calculated by demodulation with

the matched filer. Generally, the demodulation with a matched filter enables the system to have

the best performance. However, a different result is obtained in the HSRC-OQPSK modulation.

Figure 2-8 shows demodulation process of I (or Q) channel at the HSRC-OQPSK receiver which

is presented in Chapter 4 in detail and the demodulated peak signal of the HSRC-OQPSK, which

has a low symbol energy, Es,p. The carrier signals are mixed with the incoming signals, as shown

in Figure 2-8. The demodulated signals at the first half bit time and the last half bit time periods

38

have negative values while the signal at the middle bit time period is positive. Therefore, the

integration of the demodulated HSRC-OQPSK signal over the period of 2Tb is less than that over

the period of Tb, as shown in Figure 2-8.

+

- -Bit Period (Tb)

Symbol Period (2Tb)

(a)

(b)

Carrier Signal

Input Signal

Figure 2-8 The demodulated signal (a) demodulation process of I (or Q) channel (b)

demodulated peak signal (Es,p) of the HSRC-OQPSK.

Since a matched filter is not used in the actual realization of the receiver (the level decision

is made by a clocked flip-flop (FF)) which is discussed in Chapter 4, the integrated value of the

demodulated signal over the period of Tb produce more realistic BER performance. The signal

energy is calculated as (2-19).

21 ;

2sin

42cos22 '

,'

2

2', ≡⋅=⎟⎟

⎠

⎞⎜⎜⎝

⎛⋅⎟⎟

⎠

⎞⎜⎜⎝

⎛+⋅= ∫ ptsp

TT

bbcps kEkdt

Tt

TtAE b

b

πππ (2-19)

Since the BER performance of the HSRC-OQPSK modulation is mainly determined by

third term of (2-15), the BER performance with the symbol energies of (2-19) can be represented

as (2-20) approximately.

39

⎟⎟⎠

⎞⎜⎜⎝

⎛⎟⎠⎞

⎜⎝⎛⋅≈

0

221

41

NE

QP be (2-20)

Figure 2-9 shows the theoretical BER performance using symbol energies of (2-19) and

simulated BER performance. From the simulation results, the difference of the BER

performances between the theoretical PAM-2 (NRZ) and the HSRC-OQPSK is approximately

4dB instead of 8dB (shown in Figure 2-7). The difference between the theoretical and simulated

BER performance is caused the noise amount integrated during the bit time period is, Tb, is

different from the total noise introduced in the AWGN channel. Although, the absolute values of

the BER performance using an AWGN channel would be different from that using a band-

limited channel, the difference of the BER performance is comparable to those in the band-

limited channel.

0 5 10 15 2010-6

10-5

10-4

10-3

10-2

10-1

100

Eb/N0 (dB)

BE

R

Theoretical PAM-2 (NRZ)Theoretical HSRC-OQPSK (Tb time integration)Theoretical HSRC-OQPSK (2Tb time integration)Simulated HSRC-OQPSK (Tb time integration)

Figure 2-9 Comparison of the BER performance between the symbol time (matched filter) and

the bit time integration of HSRC-OQPK signal.

40

To estimate the BER performance in the band-limited channel, a simple low-pass filter

having one pole, which is characterizing a band-limited channel, is added to the channel. The

transfer function is (2-21) and Figure 2-10 shows the frequency response of (2-21). The

characterized channel is comparable to the band-limited channel which has the loss of

0.75dB/GHz approximately.

10

10

103103)(⋅+

⋅=

ssH (2-21)

0 2 4 6 8 10 12 14 16-12

-10

-8

-6

-4

-2

0

Frequency (GHz)

H(s

) (dB

)

Figure 2-10 Frequency response of the band-limited channel modeled with a one pole low-pass

filter.

Figure 2-11 shows the simulated results of the 10Gbps system. Approximately, 5dB more

signal power is needed with the band-limited channel modeled as (2-21) to get the same BER

performance, which is caused energy loss of the transmitted signal in the band-limited channel

modeled as (2-21). From the frequency response depicted in Figure 2-10, 5GHz signals which is

comparable to 10Gbps data rate are losing their energy of 4dB by the band-limited channel.

41

However, the difference (4dB) of the signal to noise ratio per bit (Eb/N0) between the NRZ and

the HSRC-OQPSK is almost the same as the simulation result without the band-limited channel.

If the signal power is enough compared to noise introduced in the channel, the energy loss of the

signal in the band-limited channel mainly determines the BER performance of the system.

0 5 10 15 2010-5

10-4

10-3

10-2

10-1

100

Eb/N0 (dB)

BE

R

Simulated PAM-2 (NRZ)Simulated HSRC-OQPSK

Figure 2-11 Comparison of simulated BER performance of the PAM-2 (NRZ) and HSRC-

OQPSK modulation with band-limited channel.

2.5 DC-Free Signaling Based on HSRC-OQPSK Modulation

The ac coupled interconnect gives several advantages over the dc coupled interconnect.

First, the ac coupled channel would provide more flexible interconnection between the various

signal standards [16]. Second, the ac coupled interconnect allows high density and low-power

chip-to-chip communication. Recently, an ac coupled interconnect (ACCI) for the chip-to-chip

communication has been introduced and its performance demonstrated [17]. It enables low

power properties as well as high density I/Os. Moreover, an on-chip capacitor formed under pad

42

blocks the dc levels of the signal line which allows communication between the chips using a

different voltage level. However, capacitive coupling interconnect suffers from the “zero

wander” effect [16]. The capacitive coupling characterized as a high pass filter tends to cut off

the dc or low frequency information. However, it is very difficult to compensate the loss of the

signal where an ac coupled interconnect is used in a long line channel, such as a high-speed

serial links because both low and high frequency signal information should be compensated due

to the high frequency signal loss in the channel being characterized as a low pass filter.

Therefore, a modulation technique that removes the dc or low frequency component seems to be

more effective in this long line channel communication to relieve this “zero wander” problem

effectively.

This section investigates a modulation technique that removes the dc component in the

signal, which is relied on half-symbol-rate-carrier offset QPSK (HSRC-OQPSK) modulation.

The proposed dc-free signaling can not only remove the dc and low frequency components but

also maintain the (first-null) bandwidth of non-return-to-zero (NRZ) signal which is use in data

communication of the most conventional digital systems. In telecommunications, the transmitted

data are often encoded with 8B/10B [18] that converts 8-bit symbols to 10-bit symbols for proper

dc balance to guarantee clock and data recovery (CDR) operation at the receiver. This encoding

scheme will increase the transmitted data bandwidth by 20%. However, the 8B/10B encoding

might not be necessary for this modulation because the modulated signal includes the carrier

signal and no dc components. Consequently, this dc-free signaling will decrease the required

bandwidth by 20% effectively. As analyzed in section 2.4, approximately 4dB more signal to

noise ratio per bit is required to get the same BER performance of the NRZ modulation.

43

Decreasing data-rate by half, TB=2Tb, with maintaining data offset of Tb of (2-4), we can

get another modulation which can be characterized as a half bit time, TB/2, offset QPSK using a

quadrature symbol-rate-carrier (HRC) signal. The signal can be obtained as (2-22) simply by

substituting 2Tb=TB of (2-4). Figure 2-12 shows the modulation scheme and its time domain

waveforms.

tT

tmtT

tmtsB

QB

I ⎟⎟⎠

⎞⎜⎜⎝

⎛⋅+⎟⎟

⎠

⎞⎜⎜⎝

⎛⋅=

ππ sin)(2

1cos)(2

1)( (2-22)

(a)

(b)

0

mI(t)

2TB 4TB 6TB 8TB

m1 m3m5

m7

0.5TB(=Tb) 2.5TB 6.5TB4.5TB 8.5TB

mQ(t)

m2

m4 m6m8

sI(t)

0 2TB 4TB 6TB 8TB0

0.5TB 2.5TB 4.5TB 6.5TB

sQ(t)

8.5TB

s(t)

0.5TB 2.5TB 4.5Tb 6.5Tb 8.5Tb

00 2TB

m(t)m1

m2

m3 m4

m5

m6 m7

m8

Serial to

Parallel

1 ( )tTBcos2

π2

mI(t)

mQ(t) sQ(t)

sI(t)

s(t)

4TB 6TB 8TB

T( )tsin21 π2

B

(a)

(b)

0

mI(t)

2TB 4TB 6TB 8TB

m1 m3m5

m7

0

mI(t)

2TB 4TB 6TB 8TB

m1 m3m5

m7

0.5TB(=Tb) 2.5TB 6.5TB4.5TB 8.5TB

mQ(t)

m2

m4 m6m8

0.5TB(=Tb) 2.5TB 6.5TB4.5TB 8.5TB

mQ(t)

m2

m4 m6m8

sI(t)

0 2TB 4TB 6TB 8TB0

sI(t)

0 2TB 4TB 6TB 8TB0

0.5TB 2.5TB 4.5TB 6.5TB

sQ(t)

8.5TB0.5TB 2.5TB 4.5TB 6.5TB

sQ(t)

8.5TB

s(t)

0.5TB 2.5TB 4.5Tb 6.5Tb 8.5Tb

s(t)

0.5TB 2.5TB 4.5Tb 6.5Tb 8.5Tb

00 2TB

m(t)m1

m2

m3 m4

m5

m6 m7

m8

Serial to

Parallel

1 ( )tTBcos2

π2

mI(t)

mQ(t) sQ(t)

sI(t)

s(t)

4TB 6TB 8TB

T( )tsin21 π2

B

00 2TB

m(t)m1

m2

m3 m4

m5

m6 m7

m8

Serial to

Parallel

1 ( )tTBcos2

π21 ( )tTBcos2

π2

mI(t)

mQ(t) sQ(t)

sI(t)

s(t)

4TB 6TB 8TB

T( )tsin21 π2

BT( )tsin21 π2

B

Figure 2-12 DC-free modulation based on HSRC-OQPSK (a) modulation scheme (b) time-

domain waveforms.

Unlike conventional OQPSK modulation, data offset of the dc-free modulation using SRC

signal is half bit time not 1-bit time as shown in Figure 2-10. As shown in Fig 1(b), there are no

discrete transitions in the signal like HSRC-OQPSK, from which we can expect the side lobes of

44

the signal spectrum to be suppressed like those of MSK. The pulse shaping function of the signal

can be represented as (2-23).

( )⎪⎩

⎪⎨

⎧<<⎟⎟

⎠

⎞⎜⎜⎝

⎛=

elsewhere

TtT

ttg B

B

0

20sin π (2-23)

The normalized power spectral density, S(f), is derived by the Fourier transform of (2-23),

S(f)=|G(f)|2/T, where G(f) is the Fourier transform of g(t). The resulting power spectral density is

represented as (2-24).

( )2

222 412sin4

⎟⎟⎠

⎞⎜⎜⎝

⎛

−=

B

BB

TffTTfS π

π (2-24)

Figure 2-13 shows the theoretical spectrums of NRZ and dc-free signals. The first null

point of the spectrum of the dc-free signal is located at fB=1/TB, which is the same as that of the

conventional NRZ.

0 0.5 1 1.5 2 2.5 3-60

-50

-40

-30

-20

-10

0

10

f/BR (Hz/bit/s)

Nor

mal

ized

Pow

er S

pect

ral D

ensi

ty (d

B)

NRZdc-free

Figure 2-13 Comparison of the spectra between NRZ and the dc-free signals.

45

The carrier signal moves the dc information of the data to the other frequency range

without increasing bandwidth of the signal transmitted. Moreover, the side lobes of the dc-free

modulation fall off more rapidly (1/f4) than conventional NRZ (1/f-2).

2.5 Measurement of the HSRC-OQPSK Signal

A prototype HSRC-OQPSK transmitter was built and tested because the HSRC-OQPSK is

the most feasible modulation to be implemented for high-speed wire-line data communications,

such as a backplane serial link. The block diagram of the transmitter with test setup is shown in

Figure 2-14.

CK

I Data

I/Q Modulator

Spectrum Analyzer

Bit Pattern Generator

500MHz ClockSignal Generator

90o

90° Power Splitter

180° Power Splitter

180° Power Splitter

180o

180o

Oscilloscope

Branch Line

5MHz Pulse Function Generator

Power Combiner

Q Data

M1

M2

M3

External Clock In

1GHz Clock

1Gbps Data

Frequency Doubling

Figure 2-14 A prototype HSRC-OQPSK transmitter and measurement setup.

Two wide-band mixers, with a bandwidth of 50~4200MHz, are used as I/Q channel

mixers. A 500MHz quadrature branch-line hybrid power splitter which is constructed with PCB

is used to generate quadrature carrier signals. Figure 2-15 shows the structure of the branch-line

hybrid power splitter. The port2 and port3 received the signal power split equally from port1

with a phase offset of 90°. Of course, the symmetrical S-parameter characteristic can be observed

46

due to the symmetric structure. The characteristics of S-parameters are simulated with Ansoft

HFSS and compared to the measured data in terms of their magnitude and phase information, as

shown in Figure 2-16. The splitter can be integrated into a single chip using the passive elements

approach [19].

Figure 2-15 A 500MHz branch-line hybrid quadrature power splitter structure for HFSS simulation.

200 300 400 500 600 700 800-45

-40

-35

-30

-25

-20

-15

-10

-5

0

Frequency (MHz)

Mag

nitu

de (d

B)

S12

S13

S23

Simulated S12 Simulated S13 Simulated S23 Measured S12 Measured S13 Measured S23

(a)

Figure 2-16 Simulated and measured characteristics of the 500MHz branch-line hybrid quadrature power splitter (a) S-parameters (b) phases.

47

200 300 400 500 600 700 8000

30

60

90

120

150

Frequency (MHz)

Pha

se D

iffer

ence

(deg

ree)

SimulatedMeasured

(b)

Figure 2-16 (continued).

The two quadrature output signals are split again by using 180° power splitters. The 0°

signals of the 180° power splitters are used for the quadrature carriers while the 180° signals are

fed into the mixer M3 to generate a 1GHz clock signal for the digital bit pattern generator’s

external clock. The bit pattern generator generates 1Gbps pseudo random data for the

transmitter’s I channel. The HSRC-OQPSK modulation requires a serial-to-parallel conversion

with 1-bit time offset. To simplify the test, a 5MHz pulse is injected into the Q channel input to

represent fixed-pattern data bits. Time delay from the bit pattern generator is adjusted to

synchronize the I channel data and the carrier signal. Consequently, the overall transmitter data

rate is equivalent to 2Gbps.

Agilent N4906A 3.6Gbps serial Bit Error Rate Tester (BERT) is used to generate 1Gbps

pseudo random data. Agilent 54832D 4Gsa/s oscilloscope and Agilent E4448A spectrum

analyzer are used to monitor the time domain waveforms and output signal spectrum,

respectively. Table I summarizes the components used in the measurement.

48

Table 2-1 Summary of components used in the measurement.

Device Function Model Frequency Range

Mixer (M1, M2) I/Q channel mixer Mini-Circuits ZX05-42MH-S 5 ~ 4200MHz

Mixer (M3) Frequency Doubler Mini-Circuits ZX05-30W 300 ~ 4000MHz

90° Power Splitter Dividing I/Q channel carrier Quadrature Hybrid (Branch Line) 500MHz

180° Power Splitter Frequency double Mini-Circuits ZFSCJ-2-4 50 ~ 1000MHz

Power Combiner/Divider I/Q channel signal combine Agilent 11636A DC ~ 18GHz

Signal Generator 500MHz carrier source Agilent E8254A 250KHz ~ 40GHz

Function Generator 5MHz pulse for Q channel Agilent 33120A 15MHz

BERT 1Gbps I channel data Agilent N4906A 3.6Gbps Serial BERT

Oscilloscope Monitoring time domain waveform Agilent 548320 1GHz / 4Gsa/s

Spectrum Analyzer Monitoring signal spectrum Agilent E4448A 3Hz ~ 50GHz

Figure 2-17 shows the measured HSRC-OQPSK time domain waveform and its spectrum.

Since the HSRC-OQPSK signal follows the MSK signal spectrum, the first null point of the

spectrum must be located at 1.5GHz because the equivalent data rate is 2Gbps. Time domain

waveform is also well-matched with the theoretical waveform shown in Figure 2-2(c).

Both time domain waveforms and frequency domain spectrum were measured and

compared to the theoretical predictions. Figure 2-18 shows the measured HSRC-OQPSK time

domain waveform and the theoretical prediction.

For comparison purposes, a fixed pattern of sequences ‘110’ was used for the data pattern

instead of random bit patterns. Therefore, the data sequence of the I channel is

‘110110110……’. Measured waveform matches well with the theoretical waveform as shown in

49

Figure 2-12. A phase mismatch between the data and the carrier signal caused the difference in

waveform.

100mV/div

2ns/div

100mV/div

2ns/div

(a)

(b)

Figure 2-17 Measured characteristics of the HSRC-OQPSK modulation (equivalent 2Gbps random data input) (a) time domain waveform (b) spectrum.

Random data sequences were fed into the I channel to measure the signal spectrum. As

shown in Figure 2-19, the measured broadband spectrum matches very well with the theoretical

spectrum of a 2Gbps random bit stream. The spectrum’s first null is located at 1.5GHz which is

50

the same as the theoretical value. Other null points match the theoretical predictions as well. The

difference between the measured spectrum’s main lobe and second lobe is approximately 23dB,

which agrees with the theoretical value.

10 15 20 25 30 35-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

0.15

0.2

Time (ns)

Am

plitu

de (V

)

Theoretical WaveformMeasured Waveform

Figure 2-18 Comparison of the theoretical and measured waveforms of HSRC-OQPSK

modulation.

0 1 2 3 4 5-90

-80

-70

-60

-50

-40

-30

-20

-10

Frequency (GHz)

Pow

er S

pect

ral D

ensi

ty (d

Bm

)

Theoretical Spectrum

23dB

Figure 2-19 Comparison of the measured and theoretical spectrum of the HSRC-OQPSK signal.

51

2.6 Summary

The HSRC-PSK modulations are proposed to optimize spectral efficiency for high data-

rate transmission over band-limited channels. The analysis and simulation results show that the

proposed modulations can be used in high-speed data communications, such as a backplane

serial link.

In the past, a multi-PAM signal (e.g., PAM-4) has been demonstrated to increase the data-

rate in band-limited channels [3]. However, in PAM-4 modulation, it is difficult to maintain the

linear spacing between levels in low-voltage and low-power application and as a result the

system’s performances are degraded. The level spacing also causes complexity in the transceiver

design not only because the received signal needs to be linearly amplified but also because PAM-

4 signaling requires accurate reference voltages.

The proposed HSRC modulations not only reduced the bandwidth requirement but also can

be easily implemented in deep submicron integrated circuit technologies with low supply

voltages. The proposed HSRC-QPSK signal has irregular timing transitions and therefore a new

phase detector design for clock and data recovery (CDR) and a matched filter for the

demodulation are required. On the other hand, since such irregular transition does not exist in

HSRC-OQPSK, the HSRC-OQPSK signaling can be used effectively in band-limited wire-line

applications with maximum spectral efficiency. This modulation might be able to optimize the

spectral efficiency for baseband high data rate transmission over band-limited channels.

Compared to the conventional NRZ modulation, the proposed modulation greatly reduces the

bandwidth requirements. For bandwidth efficiency, the HSRC-OQPSK spectrum, which is the

same as the MSK spectrum, contains 99% of the total signal power within the bandwidth of B ≈

(1.2/Tb). In comparison, PAM-4 has much larger 99% bandwidth of B ≈ (8/Tb) [13], [15].

Therefore, it is expected that the proposed HSRC-OQPSK modulation should have an efficient

52

signal spectrum in band-limited channels. And this spectrum efficiency will reduce the high

frequency crosstalk noise between the signal lines which may improve the performance of the

multi-port serial communication links. In addition, dc-free signaling based on the HSRC-OQPSK

modulation has been introduced. The dc-free signaling is expected to have good performance in

ac coupled channel applications including a flip-chip ACCI.

Measurement results verified that the HSRC-OQPSK modulation can be effectively used

in band-limited wire-line applications requiring spectral efficiency, such as the backplane serial

link. Moreover, because it requires only a two-level decision, the HSRC-OQPSK transmitter can

be implemented in a simpler architecture than PAM-4 and is suitable for low-voltage systems. In

addition, a QPSK carrier recovery structure can be used for the HSRC-OQPSK modulation

which enables flexible design of the receiver, which will be discussed in Chapter 4.

For the HSRC-MSK modulation, it can greatly minimize crosstalk noise compared to the

conventional ones, since the high frequency components are further suppressed [14].

53

CHAPTER 3 HSRC-OQPSK TRANSMITTER

3.1 Transceiver Architecture

3.1.1 A Conventional Serial Link Transceiver Architecture

Prior to the HSRC-OQPSK transmitter detail, the conventional transceiver and a

conceptual HSRC-OQPSK transceiver need to be investigated. Conventional quadrature

modulation transceiver architectures are investigated in [20-21]. Figure 3-1 shows the simplified

transceiver structure of a conventional serial link. The transmitter has a serializer which converts

the parallel data into serial data using a clock generated from a phase-locked loop (PLL). An

output buffer amplifies the converted serial data for driving a channel. The receiver has a clock

and data recovery (CDR) circuit that includes a phase and frequency detector (PFD). Besides a

CDR, the receiver includes a retimer, and a deserializer. Passing through the channel, the

transmitted signal is amplified with a limiting amplifier allowing the signal to travel through a

CDR. In the CDR circuit, the incoming signal restores a clock used as the clock of the decision

circuit. After the decision, the data may convert to parallel data by a deserializer. Typically, a

reference clock for precise frequency detection is also used to avoid false locking conditions,

which is not depicted in Figure 3-1.

3.1.2 A Conceptual HSRC-OQPSK Transceiver Architecture

Figure 3-2 shows a conceptual HSRC-OQPSK transceiver structure. The signal modulation

uses two mixers and one combiner circuit while two mixers, matched filters and decision circuits

are used for demodulation. This system architecture is basically the same as conventional QPSK

modulation and demodulation system. The quarter-rate clock and data recovery (CDR) is

incorporated with the HSRC-OQPSK receiver because the frequency of the carrier signal is half-

symbol-rate which is comparable to the quarter-data-rate of the system.

54

n

Transmitter

LimiterJittered

DataPLL CDR

F/Fn

Receiver

Phase Detector

Low-Pass Filter VCO

Data CK

Serializer Deserializer

Channeln

Transmitter

LimiterJittered

DataPLL CDR

F/Fn

Receiver

Phase Detector

Low-Pass Filter VCOVCO

Data CK

Serializer Deserializer

Channel

Figure 3-1 A simplified conventional transceiver for a serial data link.

Though a matched filter for the QPSK signal demodulation improves system performance,

it is hard to implement the matched filter operating at giga-hertz frequency range. So, the

matched filter should be replaced by other circuits which allow the high-speed operation. A high-

speed flip-flop (F/F), and appropriate retiming circuit for the incoming data can replace the

matched filter. The detailed transmitter architecture is discussed in this chapter and the detail

receiver architecture is presented in Chapter 4.

Serial to

Parallel

I

Q

LO

90°

T

Q

LO

90°

T

In

Out

Transmitter

Receiver

I dt

dt

Parallelto

Serial

Quarter-Rate CDR

Quarter-Rate CDR

RxOut

TxIn

Channel

Channel

Linear Amplifier

Serial to

Parallel

I

Q

LO

90°

T

I

Q

LO

90°90°

TT

Q

LO

90°90°

TT

In

Out

Transmitter

Receiver

I dtdt

dtdt

Parallelto

Serial

Quarter-Rate CDR

Quarter-Rate CDR

RxOut

TxIn

Channel

Channel

Linear Amplifier

Figure 3-2 A conceptual HSRC-OQPSK transceiver architecture.

55

3.2 HSRC-OQPSK Transmitter Architecture

Figure 3-3 shows the simplified HSRC-OQPSK transmitter architecture which is basically

the same as the conventional OQPSK modulation system architecture. The received data are

separated into the I and Q channels with 1 bit-time offset by a serial-to-parallel logic which is

comprised of two double-edge-triggered (DET) F/Fs, as shown in Figure 3-3. The separated data

are mixed with the quadrature HSRC signals of each channel and then combined by wiring

outputs of the I/Q mixers for generating the modulated signal, as shown in Fig. 2. For the bit

error rate (BER) test purposes, the HSRC signal is generated by an injection-locked LC voltage

controlled oscillator (VCO) whose injection clock is also used in the clock of the external 2:1

multiplexer (serializer) of the receiver, which will be discussed in Chapter 4.

Data In

Injection-Locked VCO

90o

I Channel

Q Channel

DET F/F

Delay

Delay

Transmitted Signal Out

DET F/F

Injection Clock

Serial to Parallel with 1-bit offset

Figure 3-3 A HSRC-OQPSK transmitter architecture.

In this transmitter architecture, the phase synchronization of the clock and data is crucial to

produce an undistorted modulation signal. To maximize the modulated signal spectrum

bandwidth efficiency, the data should be synchronized with the carrier signal of each channel.

Synchronization mismatch caused from the clock to data output delay of the DET F/F should be

cancelled out by inserting a delay component between the VCO outputs and the mixer inputs, as

shown in Figure 3-4(a). Most of the circuits in this transmitter design have been implemented

with current mode logic (CML) circuits which are discussed in section 3.2.1 in more detail. CML

56

circuits offer low delay variation due to its supply independent low-swing voltage characteristic

[22]. From the simulation results, the delay variation is less than 10% of the various supply

voltages and processes. Synchronized delay lines or RC delay circuits could be reasonable

solutions, as shown in Figure 3-4(a). However, a delay line for several tens of pico-seconds is a

long line to be integrated and makes severe signal attenuation. Rev.1 transmitter implemented

TSMC 0.18μm CMOS technology uses a RC delay circuit for the synchronization. However,

delay mismatch might occur because the flip-flop delay is not fixed over the various voltage,

temperature, and process conditions during the operation. In general, a buffer as a delay unit can

be an effective solution for the compensation of the delay mismatch problem [6]. Rev. 2

transmitter using UMC 0.18μm CMOS technology employs a buffer as a delay unit. Since a

delay of the DET F/F is mainly determined by a MUX, a matching delay buffer can be

implemented by inserting the same MUX to the signal path, as shown in Figure 3-4(b). The

MUX input ports are tied together and the selection port is logically fixed to select one of the two

inputs. Consequently, equal delays can be inserted in the signal paths.

F/F

Clock (Sinusoidal)

I/Q Data

Delayed Data

Delay

F/F

Clock (Sinusoidal)

I/Q Data

Delayed Data

Delay

(a)

Figure 3-4 A structure of the DET F/F and data and clock synchronization by inserting (a) delay unit (b) a MUX as a delay unit.

57

D

Clock (sinusoidal)

DET F/F

Q

Logical 0 or 1

I/Q Channel Signal

MUX

F/F

F/F

MUX

(b) Figure 3-4 (continued).

3.2 Circuit Implementation

3.2.1 Current-mode-logic (CML) Circuit

A fully differential CML type circuit is widely used for the lower signal voltage and

high-speed digital system. As discussed and analyzed in [22], the CML logic has several

advantages over conventional digital logic. The logic has supply voltage independent output

swing. The differential pair with low input and output voltage swing offers high-speed operation.

Figure 3-5 shows a CML buffer circuit and its characteristic of output voltage versus input

voltage. The detailed analysis of the basic differential pair with resistive loads is presented in

[23]. The maximum output voltage swing is determined by tail current ISS and load resistance

RD, which varies VDD to VDD-RDISS as the differential input varies -∞ to ∞, as shown in Figure 3-

4. The NMOS logic parts are pulled up with load resistors.

The minimum and maximum input common mode level which allows M1, M2 to stay in

the saturation region is analyzed in (3-1) [23].

58

Vin1 Vin2

Vout2Vout1

ISS

RD RD

P

VDD

(a)

VDD Vout1 Vout2

VDD-RDISS

RDISS

Vin1 – Vin2

(b)

Figure 3-5 A fully differential (a) CML buffer and (b) differential input voltage versus output voltages.

( ) ⎥⎦

⎤⎢⎣

⎡ +−≤≤++ DDTHSS

DDDCMinGSGSGS VVI

RVVVVV ,2

min,331 (3-1)

We have Vout1=VDD-RDID1 and Vout2=VDD-RDID2 and Vout1-Vout2=RD(ID2-ID1). Since the virtual

ground node, P, is equal to Vin1-VGS1 and Vin2 – VGS2, we get equation (3-2).

Vin1 – Vin2 = VGS1 – VGS2 (3-2)

59

The relationship between VGS and ID is represented as (3-3) for a square-law device, where

μn is the mobility of the NMOS device, Cox is the gate oxide capacitance, W is the device width,

and L is the device length.

( )

LWC

IVVoxn

DTHGS

μ21

2 =− (3-3)

Using (3-3), (3-2) can be represented as (3-4).

LWC

I

LWC

IVV

oxn

D

oxn

Dinin

μμ

2121

22−=− (3-4)

Squaring two sides of (3-4) with the constraints of ID1 + ID2 = ISS, we get (3-5).

( ) ( )212

21 22DDSS

oxn

inin III

LWC

VV −=−μ

(3-5)

Squaring the two sides again and then, we finally get the equation which represents the

relationship between the output current difference, (ID1 – ID2), and the input voltage difference,

(Vin1 – Vin2), given in (3-6).

( ) ( )2212121

421

inin

oxn

ssininoxnDD VV

LWC

IVV

LWCII −−−=−

μμ (3-6)

By differentiating the two sides of (3-6), transconductance, Gm, is obtained as (3-7) where ΔVin =

(Vin1 – Vin2).

2

2

4

24

21

inoxn

SS

inoxn

SS

oxnin

Dm

VLWC

I

VLWC

I

LWC

VI

GΔ−

Δ−=

Δ∂Δ∂

=

μ

μμ (3-7)

60

Equation (3-7) implies that Gm drops to zero for ( )LWCIV oxnSSin μ2=Δ . What if ΔVin

exceeds the value which makes Gm to be zero? In this case, one transistor drives the total tail

current, ISS, input because the other transistor is approaching the turn-off mode. Thus, ID1 = ISS

and ΔVin1 = VGS1 - VTH where ( )LWCIV oxnSSin μ21 =Δ and M2 is turned off for ΔVin > ΔVin1.

Figure 3-6 shows the relationship between the ID and ΔVin and the characteristic of Gm.

+ΔVin1

Gm

-ΔVin1 ΔVin

(a) (b)

Figure 3-6 Characteristics of a differential pairs versus differential input voltage (a) drain currents (b) transconductance [19].

As analyzed so far, the differential pair shown in Figure 3-4(a) can be used for a small

signal amplifier whose maximum differential input voltage is ∆Vin1. Within ∆Vin1, the differential

amplifier could have a high small gain which is dependant on RD and ISS. In case of using it as a

digital logic which is called CML, it is better to guarantee ∆Vin > ∆Vin1 for the maximum signal

output. Typically, the value of ∆Vin1 does not exceed several-hundred mV for the high-speed

operation. Therefore, the differential CML logic uses a smaller input signal than a conventional

logic. Since the differential structure cancels the even mode harmonic terms, it is obvious that

the differential CML buffer is more linear than conventional CMOS logic and not easily affected

by common mode noise, which becomes more important in the low-voltage systems. The

differential structure gives us a one-stage buffer while the conventional CMOS needs a two-

61

stage, greatly reducing the delay of the buffer. Moreover, the characteristics of the CML logic

are strongly related to the tail current ISS and use a smaller logic signal. Delay variation due to

voltage, temperature, and process could be very small compared to conventional CMOS logic.

3.2.2 CML Double-Edge Triggered D flip-flop

A CML is widely used in high-speed digital logic because its low-swing voltage enables

high-speed operation [22], [24]. For this reason, most of the circuits used in this transceiver

design are CML circuits.

A CML DET F/F consists of a CML latch and a CML-style analog MUX, as shown in

Figure 3-7(a). The basic structure of the CML DET F/F is the same as a conventional one. Two

latches are selected by an analog multiplexer with a clock signal, shown in Figure 3-5(c). Due to

its high-speed and low delay variation characteristics, the CML latch offers good system

performance. Input transistors sense and track the input data differentially and cross-coupled

transistors store that data [24]. The clock signal selects tracking modes and storing modes. The

input transistors are tracking the input signal when the clock is high and cross-coupled transistors

are storing the input data when the clock signal is low.

CLK-CLK+

Vout+Vout-

Vin+

Vin-

RD RD

ISS

sel-sel+

Vout+Vout-

Vin1+

Vin1-

Vin2-

RD RD

ISS

Vin2+

(a) (b)

Figure 3-7 CML Circuits (a) D-latch (b) analog multiplexer (c) double-edge triggered flip-flop.

62

MUX

D Q

D QData

CLK

Out

(c)


The MUX also can be implemented with CML style digital logic which offers a more

flexible interface with CML latches than a conventional full-swing high-speed logic as well as

high-speed operation. The clock signal of the MUX selects one of the latches which stores the

previous data, so that the DET F/F can transfer the data when both the rising-edge and the

falling-edge clock signals occur.

3.2.3 Resistive Load Gilbert Mixer

One major difference between the conventional serial link transmitter and the HSRC-

OQPSK transmitter is mixing the data with the carrier signal using an analog mixer shown in

Figure 3-3. I and Q channel mixers generate an analog signal instead of a digital signal like NRZ.

Since the data input encompasses wide-band signals up to several GHz, a wide band mixer

should be used in this system. A resistive load Gilbert mixer has been chosen for this system

because it has wide bandwidth operation. It offers a direct output-to-input interface without any

voltage level shifting because it has basically the same structure as a CML circuit. Moreover, I

and Q channel signal combining can be obtained by connecting the outputs of the mixer in the

channels, which is discussed in section 3.2.2.5. The load resistor, RD uses the value of 2.5KΩ for

both Rev. 1 and Rev.2 transmitter. Figure 3-8 shows a resistive load Gilbert mixer.

63

Vin1-Vin1+

Vout+Vout-

Vin2+

Vin2-

Vin2+

RD RD

ISS

Figure 3-8 A resistive load Gilbert mixer.

3.2.4 Quadrature Phase Clock Generator

Quadrature-phase carrier signals whose frequency is half-symbol-rate are needed for the

modulation. This can be simply implemented by using a two stage differential ring oscillator as

shown in Figure 3-9(b). To control the delay of each stage to varying the clock frequency, there

are two ways of controlling the propagation delay. One is the current starved approach and the

other is the shunt capacitive approach [25]. Figure 3-8 shows the current-starved and the shunt

capacitive inverter. In the current-starved inverter, Vctl controls the resistance of M4 through

current mirroring. This variable resistance controls the charging and discharging timing. In the

shunt capacitor inverter, control voltage, Vctl, adjust the resistance of M3 which is connected to

the output of the inverter. The other output of M3 is connected to a load capacitor. Therefore, the

shunt resistance of M3 controls the effective load capacitance seen by the output node of the

inverter. Decreasing the resistance of M3, the effective load capacitance seen by the output node

becomes large, producing more delay. From [25], the shunt capacitance topology has better

64

linear and noise rejection characteristics than the current-starved topology. However, the shunt

capacitance topology occupies a larger area due to the lumped capacitor.

Vctl

Vin

M1

M2

M3

M4

Vout

M1

Vctl

Vin VoutM2

M3

(a) (b)

Figure 3-9 Two different delay control circuit (a) a current-starved inverter (b) a shunt capacitive inverter.

Figure 3-10 shows fully differential a shunt capacitive type inverter and two stage ring

oscillator. It is common that the fully differential circuit has better power supply insensitivity.

Each differential inverter has cross-coupled PMOS load and shunt capacitor with a control

NMOS. The phase difference of this adjacent node is 90°.

It is known that LC oscillators allow large output swing at higher frequency with lower

voltage and make less phase noise than ring oscillators described in this section do [6]. For these

reasons an LC VCO, especially, an injection-locked quadrature-phase LC QVCO has been

chosen in this work. For the bit-error-rate (BER) test, the HSRC-OQPSK receiver needs an

external 2:1 mux for serializing I and Q channel data, which will be discussed in detail in

Chapter 4. To synchronize both the transmitter and the receiver outputs for the BER test, the

injected clock signal can also be used as an external reference clock. Moreover, the injection

locked clock signal also helps to overcome failure in locking the receiver’s CDR loop that comes

65

from the frequency mismatch between the transmitter and the receiver due to the design variation

(e.g., process variation).

M1

Vin+

+Vout

-M2

M3

Vin-

Vctl

M1

M2

M3

(a)

Vin+

Vin-

Vout+

Vout-

Vctl

Vin+

Vin-

Vout+

Vout-

VctlVctl

90o 270o 180o 0o

(b)

Figure 3-10 A fully differential (a) shunt capacitor inverter with cross-coupled PMOS active load and (b) two-stage ring oscillator.

An injection-locked LC divider is introduced as a high frequency divider for the phase-

locked loop or low-power quadrature LO generation [26]. An external 5GHz clock is injected

into the VCO for generating 2.5GHz quadrature clock signals by which data modulation

performed. Figure 3-8 shows the structure of an injection-locked quadrature phase VCO [26].

The Q of the inductor for the LC tanks is approximately 8 and its value is 3.8nH. The varactor

value is 677fF with a fixed MIM capacitor of 329fF which offers approximately 20% tuning

66

range with a simulated 2mA tail current. The outputs of each stage are followed by a voltage

follower buffer not shown in Figure 3-11.

Vinj+

VQout-VIout-

Vcnt

VIout+ VQout+

-

Figure 3-11 Injection-locked LC QVCO.

3.2.5 I/Q Channel Signal Combining

The basic architecture of the HSRC-OQPSK transmitter follows that of a QPSK

modulator. Therefore, both I and Q channel signals must be combined together to generate the

HSRC-OQPSK signal. Since the mixer is operated in current mode, a combined signal of the I

and Q channels can easily be implemented by wiring both outputs of the I/Q channel mixers, as

shown in Figure 3-12. This makes the transmitter structure simple to implement.

3.2.6 Output Buffer

An output buffer of Rev. 1 transmitter has been designed with an open-drain structure as

shown in Figure 3-13. The output buffer has a differential three-stage cascaded structure

enabling the output buffer to have enough current to drive the 50Ω load. The last-stage of the

open drain buffer is pulled up with a 50Ω external resistor for the measurement. The resistor

values of each stage, R1, R2 are 550Ω, 140Ω, respectively which are implemented with on-chip

poly resistors. The DC bias currents, I1, I2, I3, of the buffer are 1.5mA, 6mA, 15mA

respectively.

67

Vin1-Vin1+

Vout+Vout-

Vin2+

Vin2-

Vin2+

RD RD

ISSVin1-Vin1+

Vout+Vout-

Vin2+

Vin2-

Vin2+

RD RD

ISS

- Signal Out +

Figure 3-12 Combining I and Q channel signal by direct connecting outputs.

Vin

Vout

I1

R1 R1

I2

R2 R2

+

-

I3

-

+

Figure 3-13 Three stage output buffer with open drain output stage.

The high impedance output of the output buffer used in Rev.1 transmitter may cause the

inter-symbol-interference (ISI) due to the mismatches [6], [27]. In order to minimize the

mismatch between the near-end (transmitter) and the far-end (receiver) and improve the signal

quality, an on-chip resistor of 60Ω is used in the Rev. 2 transmitter. Figure 3-10 shows the

differential output buffer with an on-chip terminated resistor. A drawback of the double

termination is unavoidable increasing power consumption. It would double the power dissipation

in order to deliver the same voltage swing at the receiver end because the buffer needs twice the

tail current compared to the open-drain structure.

68

For the HSRC-OQPSK signal, linear amplification (without limiting output with limiting

amplifier [28]) is needed for generating the modulated signal since the modulated signal should

be kept undistorted. Therefore, a buffer is designed with the constraints of limited voltage gain

and maximum current gain. The resistor values of each stage, R1, R2, and R3 are 550Ω, 140Ω,

and 60Ω, respectively which is implemented with on-chip poly resistors. The DC bias currents,

I1, I2, I3, of the buffer are 1.5mA, 6mA, 15mA respectively. The DC coupled output signal is

directly interfaced with the inputs with the pull-up of the receiver which enables double

termination. The architecture of the transceiver and detail test setup is discussed in Chapter 4.

Vin

Vout

I1

R1 R1

I2

R2 R2

+

-

I3

-

+

R3 R3

(a)

Transmitter

Channel

Receiver

+Vin-

ISS

RD RD

VDD

RL RL

(b)

Figure 3-14 Output buffer (a) differential three-stage output buffer (b) doubly terminated structure.

69

3.3 Chip Design

3.3.1 Rev. 1 Transmitter

An integrated HSRC-OQPSK transmitter (Rev. 1) was designed and fabricated using

TSMC 0.18um CMOS technology. The chip includes a serial-to-parallel logic, an injection-

locked LC VCO, mixers, and output buffers. Total chip size is 2000μm X 2000μm which

includes a transmitter and a receiver. The transmitter core without pads occupies 598μm X

575μm. Figure 3-15 shows the entire HSRC-OQPSK transceiver chip. The transmitter is

located at the lower right corner of the chip. The design includes a transmitter which modulates

the binary signal into HSRC-OQPSK signal. The transmitter consists of DET F/Fs, mixers, an

injection-locked VCO, and output buffers.

Receiver (Demodulator)

Transmitter (Modulator)

Receiver (Demodulator)

Transmitter (Modulator)

Figure 3-15 HSRC-OQPSK transceiver chip using TSMC 0.18μm CMOS technology.

The time domain simulation of the Rev.1 transmitter is performed using Cadence Spectre.

For the transmitter simulation, a random bit stream in the Cadence adhl library is used for

generating 10Gbps random data.

70

1 2 0 1 2 1 1 2 2 1 2 3 1 2 4 1 2 5-0 . 8

-0 . 6

-0 . 4

-0 . 2

0 . 0

0 . 2

0 . 4

0 . 6

0 . 8

Vou

t (V

)

T i m e ( n s )

(a)

0 5 10 15 20-100

-90

-80

-70

-60

-50

-40

-30

-20

-10

Frequency (GHz)

Spe

ctru

m (d

B)

SimulationTheoretical

(b)

Figure 3-16 HSRC-OQPSK modulation signal (a) time domain waveforms (b) signal spectrum.

A 5GHz reference clock is injected into the transmitter for locking the VCO. The

generated 10Gbps serial data fed into the transmitter are separated into I and Q channel data. The

clock frequency is half-symbol-rate of the channel data, therefore, a double-edge triggered flip-

flop (DETFF) is used as a serial-to-parallel logic for each channel. The simulated time domain

71

waveforms and their spectrum without an external load are shown in Figure 3-16. The

spectrum’s first generated signal null is approximately 6.7GHz which is a little bit lower than

theoretical value of 7.5GHz while the difference between the main lobe and the second lobe is

less than 20dB which is higher than the theoretical value of 23dB as shown in Figure 3-16(b).

3.3.2 Rev. 2 Transmitter

Rev. 2 transmitter has been designed and fabricated in UMC 0.18um CMOS technology.

The chip occupies 1130μm X 1240μm. Figure 3-11 shows a simulation structure for the

transmitter. As discussed earlier, the design includes a transmitter which modulates the binary

signal into the HSRC-OQPSK signal. The transmitter consists of flip-flops, mixers, an injection-

locked VCO, and output buffers.

Modulator Logic

Varactors

Injection-Locked LC VCO

DC Blocking Cap.

gndvbmod Din+ Din-

Out+

Out-

Osc- Osc+vbovboc

ckin+

ckin-

vbIL vbvco vcnt

gnd

gnd

gnd

gnd

gnd

vdd

vdd

vdd

vdd

vdd

Figure 3-17 Transmitter die photo implemented by UMC 0.18μm CMOS technology.

72

The power and ground rings made by metal 5 are placed around the chip. To protect

circuits, the electrostatic discharges (ESD) circuits implemented by MOS devices are attached to

the DC bias lines. Octagon shape pads are used for the high speed signal. Pads for output signals

depicted as “Out+” and “Out-‘ are placed as close to the modulator logic as possible.

The architecture of the injection-locked LC VCO for Rev. 2 transmitter also has the same

architecture as shown in Figure 3-11. The Q of the inductor for the LC tanks is approximately 8

and its value is 3.8nH. The varactor value is 677fF with a fixed metal-insulator-metal (MIM)

capacitor of 329fF which offers approximately 20% tuning range with a simulated 2mA tail

current.

Similarly, the simulation has performed for the Rev.2 transmitter implemented by UMC

0.18μm CMOS technology. The conversion gain and input referred 1dB compression point of

the mixer are simulated using the Cadence Spectre [29] with the port resistance of 2.5KΩ

because a high impedance logic interface, rather than a 50Ω, is employed in the digital system.

Up-conversion gain and 1dB gain compression are simulated as shown in Figure 3-7.

Conversion gain is less than -1dB if the LO power is larger than -20dBm which roughly

corresponds to a single-ended peak-peak voltage amplitude of 230mV in a 2.5KΩ system. Figure

3-18(b) shows the conversion gain as the input RF frequency increases. For the 4GHz input, the

conversion gain is -1.03dB. The input referred 1dB compression is -23.08dBm with the port

resistance of 2.5KΩ, LO frequency of 2.5GHz, RF input frequency of 3GHz, and output

frequency of 5.5GHz.

For the transmitter simulation, a random bit stream in the Cadence adhl library is used for

generating 10Gbps random data. A 5GHz reference clock is injected into the transmitter for

locking the frequency of VCO. The generated 10Gbps serial data fed into the transmitter are

73

separated into I and Q channel data. The clock frequency is half-symbol-rate of the channel data,

therefore, a double-edge triggered flip-flop (DETFF) is used as a serial-to-parallel logic for each

channel.

-45 -40 -35 -30 -25 -20 -15 -10 -5 0 5-20

-18

-16

-14

-12

-10

-8

-6

-4

-2

0

Port Resistance=2.5Kohm

RF Input Frequncy=3GHz

Con

vers

ion

Ga

in (d

B)

LO Power (dBm)

1.0 1.5 2 .0 2.5 3.0 3.5 4.0-1.1

-1.0

-0.9

-0.8

-0.7

-0.6

-0.5

-0.4

-0.3

-0.2

LO Power=-10dBm (2.5GHz)(Port Resistance=2.5Kohm)

Up-

Con

vers

ion

Gai

n (d

B)

RF Input Freqeuncy (GHz) (a) (b)

-50 -40 -30 -20 -10-55

-50

-45

-40

-35

-30

-25

-20

Output Freqeuncy=5.5GHzLO Freqeuncy=2.5GHz

Port Resistance=2.5Kohm

Input Referred 1dB Compression=-23.80dBm

Out

put

Pow

er (

dBm

)

Input Power (dBm) (c)

Figure 3-18 Linearity simulation of resistive Gilbert mixer using UMC 0.18μm CMOS technology (a) conversion gain vs. LO power, (b) conversion gain vs. RF input frequency, (c) input referred 1dB compression.

The time domain simulation of the HRSC-OQPSK transmitter is performed using the

Cadence Spectre. The simulated time domain waveforms and their spectrum without an external

load and channel are shown in Figure 3-19. The spectrum’s first generated signal null is 7.5GHz

74

while the first null point of 10Gbps NRZ data spectrum is 10GHz. The signal spectrum

bandwidth is reduced and the side lobes of the spectrum are greatly suppressed as well. The

difference between the main-lobe and the second-lobe of the spectrum is more than 20dB, which

agrees with the theoretical analysis of the spectrum of HSRC-OQPSK.

1 0 1 1 0 2 1 0 3 1 0 4 1 0 5 1 0 6

- 0 . 4

- 0 . 3

- 0 . 2

- 0 . 1

0 . 0

0 . 1

0 . 2

0 . 3

0 . 4

Vou

t (V)

T im e ( n s ) (a)

(b)

Figure 3-19 HSRC-OQPSK modulation signal (a) time domain waveforms (b) signal spectrum.

Figure 3-20 shows that the time domain waveforms of the dc-free signal using HSRC-

OQPSK transmitter. The spectrum’s first generated signal null is 5GHz which agrees with the

75

value of the theoretical spectrum. DC components of the signal have been reduced more than

theoretical values because the I and the Q channel data have been correlated every other bit due

to the fact that the 2.5Gbps data are being injected to I and Q channel simultaneously with just

half bit time data offset by serial-to-parallel logic. However, it is quite well-matched with

theoretical values to be considered 5Gbps signal effectively. The difference between the main

lobe and the second lobe is approximately 20dB while the difference is 13dB for the NRZ, which

agrees well with the analytical form of the dc-free spectrum derived as (2-21).

5 0 . 0 5 0 . 5 5 1 . 0 5 1 . 5 5 2 . 0

-0 . 4

-0 . 3

-0 . 2

-0 . 1

0 . 0

0 . 1

0 . 2

0 . 3

0 . 4

Am

plitu

de (

V)

T i m e ( n s ) (a)

0 5 10 15 20-90

-80

-70

-60

-50

-40

-30

-20

Frequency (GHz)

Sig

nal S

pect

rum

(dB

)

SimulationTheoretical

(b)

Figure 3-20 Simulated dc-free signaling (a) time domain waveforms (b) signal spectrum

76

3.4 Measurement

3.4.1 Rev.1 transmitter

Figure 3-12 shows the simplified evaluation board structure and an actual photograph. A

chip has attached to the pad on the PCB board using conductive epoxy as shown in Figure 3-20.

The pad is connected to the backside ground plane through via hole. And the signals of the chip

are connected directly to the PCB trace via wire-bond. All the ground pads are down bonded to

the pad. Thermal epoxy covers the chip for the purpose of protection. Rev. 1 transmitter’s output

buffer structure is open drain, hence, external chip resistors of 62Ω are attached to the output

signal lines, as shown in Figure 1-13. 110Ω chip resistors are attached to differential signal lines

of data input and injection clock inputs.

Chip

Ground Down Bonding

PCB FR4 BoardVia Hole

Chip PadSignal Trace

Ground Plane

Thermal epoxy(encapsulating)

Signal wire bonding

(a)

(b)

Figure 3-20 Test board for the Rev. 1 HSRC-OQPSK transmitter (transceiver) implemented by TSMC 0.18μm CMOS technology.

77

From the measurement results, the measured frequency tuning range of the VCO is

2.28GHz ~ 2.58GHz, which is approximately 12.3% from the center frequency 2.43GHz. The

center frequency of the VCO has shifted lower from the simulated center frequency

approximately 2.8%, which is caused from the layout parasitics such as routing metals, which are

not considered in the simulation phase. Figure 3-21 shows the QVCO’s signal spectrums for both

free-running and injection-locked states. Figure 3-21(a) shows the spectrum of the VCO in the

free running state. The center frequency is 2.5GHz with 5MHz span. Figure 1-14(b) shows the

spectrum of the VCO when the 5GHz clock is injected for the injection-locking of the QVCO.

The injection clock signal is generated from the Agilent E8254A signal generator. The carrier

power is measured approximately -5dBm with single-ended output. However, the carrier power

estimated -3dBm if the PCB trace and cable loss are taken into account. The characteristic of the

noise floor in the injection-locked mode is much lower than that in the free-running mode.

(a)

Figure 3-21 Measured spectrums and phase noise of the transmitter’s QVCO implemented by TSMC 0.18μm CMOS technology (center frequency of 2.5GHz with 5MHz span and 47 KHz RBW) (a) free-running mode (b) injection-locked mode (c) comparison of the phase noises between free-running and injection-locked modes.

78

(b)

10K 100K 1M-160

-140

-120

-100

-80

-60

-40

Frequency Offset (Hz)

Pha

se N

oise

(dB

c)

free-runninginjection-locked

(c)

Figure 3-21 (continued)

The spectrums show that the noise floor characteristic has been improved more than -

30dB. The phase noise has been measured by the Agilent E4448A spectrum analyzer that offers

the phase noise measurement mode. Figure 3-21(c) shows the phase noise comparison of the two

79

states which are the free-running and the injection-locked state. Measured results show that the

phase noise of free-running and injection-locked QVCO signal is -111dBc and -136dBc at 1MHz

frequency offset, respectively.

Agilent 86100B wideband oscilloscope has been used to get the eye-diagram of the signal.

The simplified test setup for 2.5Gbps and 10Gbps BER test are shown in Figure 3-22(a), (b),

respectively. Since 2.5Gbps random input is injected into both I and Q channel simultaneously,

the transmitted signal can be characterized as a 5Gbps signal equivalently even though the data

of I and Q channel are correlated to every other bit.

Wideband Oscilloscope

CK

Data In

Transmitter

Signal Out

Channel

Trigger In

50O

2.5GHz VCO Signal

Spectrum Analyzer

Signal Generator

VCO OutExternalClock In

5GHz Clock

2.5Gbps

Trigger Out

BERT

(a)

Wideband Oscilloscope

CK

Data In

Transmitter

Signal Out

Channel

Trigger In

50OSpectrum Analyzer

VCO Out

5GHz Clock

10Gbps

Trigger Out 1/2 sub clock

BERT

Phase Shifter

(b)

Figure 3-22 Simplified test setups for (a) 2.5Gbps (5Gbps equivalent) (b) 10Gbps random input for the transmitter.

80

For the 2.5Gbps measurement, 5GHz clock signal from the signal generator has been

injected to the transmitter for the VCO locking and a half frequency signal generated from the

VCO of the chip is fed into the Agilent 4903A BERT for the external clock. Then BERT

generated the 2.5Gbps synchronized to the clock inside the chip. 10Gbps test setup is more

simple because the 1/2 sub-rate clock from the BERT can be used as an injection clock of the

transmitter. Since the sub clock output of the BERT has the fixed phase with the data output, a

phase shifter between the sub clock signal output of the BERT and the clock input of the

transmitter for the timing offset between the input data and the clock.

The eye-diagram of the transmitter signal is shown in Figure 3-23. However, the

measurement could not get the proper eye-opening of the transmitted signal. Despite the

reasonable simulation results, the transmitted signal suffers from huge jitter components and two

eye-openings depicted in Figure 3-23 have different shapes even in 2.5Gbps data-rate due to the

delay mismatch between the data and the clock. The measured eye-diagram of the modulated

signal is shown in Figure 3-23. The delay mismatch is mainly caused from a delay unit

composed by a RC. The RC delay unit shown in Figure 3-4(a) does not compensate the delay

mismatch effectively because its characteristic varies much with the temperature and the voltage

and the process conditions. ISI noise due to the reflection of the high impedance node of open

drain output buffer could be another reason for this result. For the 10Gbps transmitted signal,

jitter components are introduced to the modulated signal, something that is not shown in this

dissertation. The eye-diagram of the HSRC-OQPSK signal is different from that of the NRZ

signal. As analyzed in Chapter 2, the low-energy symbol signal that practically determines the

BER performance also affects the eye-opening size directly.

81

Two eye-openings in Figure 3-23 are not identical to each other because the delay

mismatch between the clock and the data signals. A RC delay unit which is used in Rev. 1

transmitter offers a fixed delay. The limitation of the delay matching comes from the variation of

the resistive value due to the process conditions or the delay variations of the F/F over various

voltage, temperature, and process conditions.

45mV/div

50ps/div

45mV/div

50ps/div

Figure 3-23 Eye-diagram of the HSRC-OQPSK transmitted signal implemented by TSMC 0.18μm CMOS technology.

3.4.2 Rev. 2 transmitter

Rev. 2 transmitter has been designed and fabricated separately with the receiver. Figure 3-

14 shows a die photo of the transmitter test board. The chip is attached to the chip pad which is

also used down bond ground. The signals are connected to the PCB signal traces directly via

wire-bond. The down bonded grounds are connected to the backside ground plane. Thermal

epoxy covers the chip for the protection purpose. Surface mount type capacitor of

approximately100μF as bypass capacitors are attached between the power and the ground as

82

shown in Figure 3-24. Data and clock input signals are terminated with the external 50Ω

resistors. The board has oscillator signal outputs for the purpose of monitoring as well as

modulated signal outputs.

Figure 3-24 Test board for the Rev. 2 transmitter implemented by UMC 0.18μm CMOS technology.

Figure 3-25 shows signal spectrum of the Rev. 2 transmitter’s VCO. The measured tuning

range of the VCO is approximately 2.18GHz ~ 2.44GHz, which is representing the tuning range

of 12% frequency. The center frequency of the VCO has been shifted to approximately 10%

lower frequency than simulation. It is caused from the parasitics such as a routing metal

capacitance and resistors during the layout, which are not considered in the simulation phase.

Post-layout simulation with equivalent circuits modeled layout parasitics would improve the

mismatch problem. Figure 3-25(a), (b) show the frequency spectrum of the VCO both free-

running and injection-locked states, respectively. As is the case of Rev. 1, we can investigate the

noise floor of the spectrum of the injection-locked state that has been lowered than that of the

free-running state. The measured phase noise of the VCO is shown in Figure 3-25(c). To

measure the phase noise of the VCO, Agilent E4448A spectrum analyzer is used. The phase

noise performance of the VCO in the free-running state is approximately -110dBc/Hz at 1MHz

83

offset, while the -140dBc/Hz at 1MHz offset when the VCO is locked with the injection clock.

The phase noise performance of the VCO of the Rev. 2’s transmitter is almost the same as that of

Rev.1’s VCO implemented using TSMC 0.18μm CMOS technology.

(a)

(b)

Figure 3-25 Measured spectrums and phase noise of the transmitter’s QVCO implemented by UMC 0.18μm CMOS technology (center frequency of 2.25GHz with 100MHz span and 910 KHz RBW) (a) free-running mode (b) injection-locked mode (c) comparison of the phase noises between the free-running and the injection-locked modes.

84

10K 100K 1M-160

-140

-120

-100

-80

-60

-40

Offset Frequency (Hz)

Pha

se N

oise

(dB

c)

free-runninginjection-locked

(c)


Figure 3-26 shows the measured eye-diagram of the transmitted signal where the 2.43Gbps

231-1 pseudo random bit streams (PRBS) are injected into the transmitter that generates 4.86Gbps

transmitted signal equivalently. From the measurement results, the clock and data

synchronization using a buffer insertion depicted in Figure 3-4 can effectively be working to

generate the HSRC-OQPSK modulated signal. Compared to Rev.1 transmitter using a RC delay

unit for compensating delay mismatch, a clear eye-opening has been obtained. The buffer

insertion as a delay unit discussed in section 3.2.1 can effectively make good compensation of

the delay mismatch while the RC delay unit used in Rev.1’s transmitter did not compensate the

delay mismatch effectively.

SMA connectors are used to connect the channels in the test board shown in Figure 3-24.

The characteristic impedance of 50Ω channel 6.2mil FR-4 PCB board and tangential loss is

approximately 0.023. Since the PCB channel has a low-pass characteristic as discussed in

85

Chapter 1, the detected signal power at the receiver end drops as the channel length increases.

Therefore, the eye-opening is getting smaller as the channel increases and eye-opening would be

closed after tracing a long channel.

Figure 3-26 shows the measured channel characteristics used in the measurement. Three

different lengths of PCB trace and one SATA of 19” length cable are used to compare the

performances. The measured results show that 20” trace has a loss of 13dB at 6GHz while the 5”

channel is approximately 3dB.

0 2 4 6 8 10 12-50

-40

-30

-20

-10

0

10

Frequency (GHz)

S21

(dB

)

PCB-5"PCB-10"PCB-20"SATA-19"

Figure 3-26 Characteristics of channels used in the measurement.

The measured peak-to-peak small eye-opening of the transmitted signal after tracing 2”

PCB trace with a SMA cable is about 300mV without equalization of the signal at the trace end.

Peak-to-peak eye-opening of the flat signal which has the largest symbol energy (Es,f) is

approximately 400mV. The eye-opening after tracing the 10” channel is reduced to 150mV. And

less than 80mV eye-opening has been obtained after 20” trace. Another serial link channel,

86

serial-ATA (SATA) cable, has been used for this experiment to evaluate the performance of the

transmitter. A 19” SATA cable with 2” PCB trace to connected transmitter is used to obtain the

eye-diagram. Figure 3-16(d) shows the eye-diagram after 19” SATA cable trace. More balanced

eye-diagram has been obtained compared to PCB trace because the differential signals are

strongly coupled than PCB channels used in the measurement.

60mV/div

50ps/div

60mV/div

50ps/div

(a)

60mV/div

50ps/div

60mV/div

50ps/div

(b)

Figure 3-27 Eye-diagram of HSRC-OQPSK transmitted signal implemented by UMC 0.18μm CMOS technology after (a) 2” PCB trace, (b) 5” PCB trace, (c) 10” PCB trace, (d) 20” PCB trace, (e) 19” SATA cable, in response to 4.86Gbps (both I and Q channel input with 2.43Gbps pseudo random bit stream (PRBS) sequence of 231-1).

87

60mV/div

50ps/div

60mV/div

50ps/div

(c)

60mV/div

50ps/div

60mV/div

50ps/div (d)

60mV/div

50ps/div

60mV/div

50ps/div (e)


88

Figure 3-28 shows spectrum of the 4.86Gbps dc-free signal. High frequency components

of the signal are suppressed due to the signal loss at the test board and the limitation of the

operating frequency of the circuit. However, the nulls at dc and the other frequency of the

spectrum are quite well matched with the theoretical values.

0 5 10 15-100

-90

-80

-70

-60

-50

-40

-30

-20

-10

Frequency (GHz)

Sig

nal S

pect

rum

(dB

m)

MeasurementTheoretical

Figure 3-28 Spectrum of 4.86Gbps dc-free signal.

Figure 3-29 shows the measured eye-diagram of the HSRC-OQPSK signal in response to

9.72Gbps PRBS sequence of 27-1. The measured eye-diagram shows that the HSRC-OQPSK

signal generates the similar eye-diagram shape to that of the duobinary signal [30]. A major

difference of the signal eye-diagram between the HSRC-OQPSK and duobinary signals is that

HSRC-OQPSK has no decision references because it uses only a two-level decision while the

duobinary signal needs the two decision level to decide the logical value of the signal [9], [30-

31]. Therefore, it is better for the HSRC-OQPSK signal to open the side eye-diagram wide

enough depicted with a blue diamond shape (ideal eye-opening). Actually, the HSRC-OQPSK

89

signal can be obtained based on the duobinary decoding scheme assuming the high frequency

components of the signal are filtered out at the band-limited channel. However, every 3rd and 4th

bit data should be reversed to get the proper HSRC-OQPSK signal.

60mV/div

20ps/div

ideal eye-opening

60mV/div

20ps/div

60mV/div

20ps/div

ideal eye-opening

(a)

100mV/div

100ps/div

100mV/div

100ps/div

(b)

Figure 3-29 Eye-diagram of the HSRC-OQPSK transmitted signal with 9.72Gbps PRBS sequence of (a) 27-1 (ideal eye-opening is depicted with blue line), (b) 231-1.

As discussed in Chapter 2, the consecutive transition signal which has low signal energy

makes the diamond shape eye-diagram depicted with a blue line in Figure 3-29(a). The signals

90

suffer from severe attenuation of the signal energy as shown in Figure 3-30. The small eye-

opening is less than 60mV which is smaller than the simulation results which will affect the BER

performance of the transceiver. The BER performance will be discussed in Chapter 4. The

attenuation of the signal mainly comes from the limit of the circuit’s operating frequency. And

the delay mismatch also prevents enlarging the eye-opening of the signal. Broadband circuit

techniques such as a fT doubling of the output buffer are needed to increase the signal quality and

maximize the eye-opening. The spectrum shown in Figure 3-27(c) also represents the high

frequency attenuation of the signal and there are no side lobes of the signal above 10GHz which

are supposed to follow the theoretical spectrum corresponding to the red line. The performance

of the HSRC-OQPSK transceiver associated with the transmitter is discussed in Chapter 4.

Figure 3-30 Signal spectrum in response to9.72Gbps PRBS sequence of 27-1.

91

CHAPTER 4 HSRC-OQPSK RECEIVER DESIGN

4.1 Receiver Architecture

A conceptual HSRC-OQPSK receiver architecture which is based on that of a QPSK has

been depicted in Figure 3-2. Different from the conventional serial link receiver that uses a PAM

signal, the HSRC-OQPSK receiver uses a carrier signal which is quarter data-rate frequency for

the demodulation. Therefore, a quarter-rate CDR must be incorporated with the receiver. A

quarter-rate PD as a CDR for the conventional PAM signal has been introduced and

demonstrated [32]. The HSRC-OQPSK receiver (Rev. 1) designed with TSMC 0.18μm CMOS

technology uses the same PD architecture introduced in [32].

To improve the performance of the proposed HSRC-OQPSK receiver, a new CDR

architecture has been proposed for the Rev. 2 receiver and implemented using UMC 0.18μm

CMOS technology. The proposed CDR has been modified from a Costas loop which is often

used for the carrier recovery loop for the BPSK and QPSK signal. The details are discussed in

section 4.3.

4.2 HSRC-OQPSK Receiver (Rev. 1)

Figure 4-1 shows the HSRC-OQPSK receiver architecture with a quarter-rate CDR. The

input buffer amplifies the incoming signal followed by I and Q channel mixers. As described in

the conceptual receiver architecture, the receiver has I and Q channels. I and Q channel mixers

are demodulating the received signal using the quarter data-rate carrier signals recovered by a

CDR loop. DET F/Fs followed by I/Q mixers determine the retimed data of I and Q channels. To

recover the clock which is quarter data-rate, a conventional quarter-rate PD [32] has been

adopted for the Rev. 1 receiver. Originally, the quarter-rate PD has been proposed for the

40Gbps NRZ signal relaxing timing requirements and reducing the cost of fabrication [32].

92

Signal in

VCO

DET F/F

DET F/F

90o

I

Q

e

IData

QData

LF

CI

CQ

Quarter-rate PD

Figure 4-1 HSRC-QOPSK receiver (Rev. 1) architecture incorporated with quarter-rate PD.

Figure 4-2(a) shows the phase detector (PD) architecture employed in the Rev. 1 receiver.

The PD is introduced in [32]. F/Fs strobe the data by using multi-phase VCO signal and

determine the polarity of the phase error by using XOR gates. The outputs of two consecutive

XOR gates are connected to the differential V/I converters which determine the phase error.

Figure 4-2(b) shows the timing diagram of the PD.

(a) (b)

Figure 4-2 Quarter-rate phase detector (a) architecture (b) waveforms (for 40Gbps NRZ) [32].

The Rev. 1 receiver has been fabricated with TSMC 0.18μm CMOS technology. The

simulated maximum current of the PD in the Rev.1 receiver is approximately 200μA.

93

In the measurement, however, the PD has failed to appraise the performance of the receiver

for the HSRC-OQPSK signal. There might be several reasons. First, the transmitter signal itself

was not good enough for the receiver due to the limited performance of the transmitter. Second,

the PD employed in the receiver is originally for the NRZ signal. The PD has the best

performance when the signal has a sharp transition like the NRZ signal and not like the HSRC-

OQPSK signal. Since the meta-stable behavior of the F/F will lead to the finite gain of the PD

[32], the HSRC-QOPSK signal having no discrete transition like NRZ would increase the meta-

stable behavior in F/Fs. Consequently, the PD gain might be decreased significantly.

4.3 HSRC-OQPSK Receiver (Rev. 2)

Figure 4-3 shows the proposed HSRC-OQPSK receiver structure. The receiver is based on

a Costas loop (often used as a carrier recovery loop of a conventional QPSK demodulator) [33-

34]. Two mixers separate the incoming modulated signal into the I and Q channel for the signal

demodulation. After passing through the mixer, the I/Q channel signals are fed into the DET F/Fs

clocked with quarter data-rate frequency carrier signals recovered by the CDR loop. The loop

also allows the retimed I/Q channel data. Generally, a carrier recovery loop as well as a Costas

loop has LPFs in order to get rid of the high frequency components in the I and Q channel path

and get the phase error to control the VCO of the receiver. Unlike a conventional carrier

recovery loop, the proposed CDR does not include LPFs. Since the carrier recovery of the

HSRC-OQPSK receiver is basically the same as the symbol synchronization of the QPSK

demodulation process, it is expected that the proposed CDR loop (algorithm) can be effectively

applied for the symbol synchronization loop of the QPSK demodulation as well. The detailed

analysis of the proposed receiver integrated with a CDR loop will be discussed.

94

Signal In

VCO

DET F/F

PP S/H

DET F/F

PP S/H

90o

LF

I

Q

e-

+

Is

Qs

I Data

Q Data

Figure 4-3 HSRC-OQPSK receiver architecture incorporated with a CDR.

4.3.1 Polarity-Type Costas Loop for Carrier Synchronization

A Costas loop is commonly used as a carrier recovery loop for the QPSK type modulation

signal. The Costas loop was first introduced in [35]. A couple of modified Costas loops have

been proposed and their performances analyzed in [33-34]. Other structures for the carrier

recovery loops for the QPSK signals are introduced and analyzed in [36-41].

Figure 4-4 shows a polarity-type Costas loop for the QPSK type signal proposed in [33].

The polarity-type Costas loop is a kind of hard-limited carrier recovery loop which is known to

have a better performance than that of the non-limited type carrier recovery loops in a higher

signal-to-noise ratio (SNR) situation [10], [34]. Input signals of the polarity-type Costas loop are

mixed with quadrature carriers generated from a quadrature VCO (QVCO) and separated into the

I and the Q channel by the mixers. The I/Q channel signals pass through low-pass filters (LPFs)

to get rid of the high-frequency components which have no significance for generating the

control signal. These signals are mixed again with the limited signals from the other channel, as

shown in Figure 4-4.

95

Signal in

VCO90o

LPF

LPF

Loop Filter

-1

+1

-1

+1

-

+

Figure 4-4 Polarity-type Costas loop for QPSK signal carrier recovery.

In QPSK modulation, the signal coming into the Costas loop can be defined as (4-1),

where S is the average received signal power, mI(t), mQ(t) are the data sequences of ±1 of I/Q

channel, ωc is the carrier frequency and θi is the phase of the signal.

( ) ( ) ( ) ( ) ( )[ ]icQicI ttmttmSts θωθω +⋅++⋅= cossin (4-1)

The phase error signal controlling the VCO frequency is generated by subtracting these

two signals from each channel. Assume VCO signal has an amplitude of S1 and a frequency

of ωc, then the low frequency output of the polity-type Costas loop can be represented using

trigonometric operations as (4-2), where Ф=θi-θo, θo is the phase of the VCO signal [42].

( ) ( ) ( )[ ] ( ) ( )[ ]( ) ( )[ ] ( ) ( )[ ]φφφφ

φφφφ

cossinsgnsincos

sincossgncossin21

tmtmtmtm

tmtmtmtmte

QIQI

QIQI

+−−

−+= (4-2)

Then the low frequency phase error can be rewritten as (4-3).

96

( )

⎪⎪⎪

⎩

⎪⎪⎪

⎨

⎧

≤<−

≤≤−

−<≤−

=

43

4cos

44sin

443cos

πφπφ

πφπφ

πφπφ

te (4-3)

4.3.2 A New Clock and Data Recovery (CDR) based on the Modified Costas Loop

Figure 4-5 shows the proposed CDR loop which is basically the same as the HSRC-

OQPSK receiver structure. Since the CDR offers retimed data similar to other CDRs do, it can be

used as a receiver. The proposed CDR loop for the HSRC-OQPSK modulation and its analysis

and simulation results are presented in this section.

Note that the HSRC-OQPSK modulation inherits the properties of the QPSK modulation

even though its carrier frequency is lower than the data-rate. Therefore, it is expected that the

polarity-type Costas loop shown in Figure 4-4, can be used as a clock (carrier) recovery loop for

the HSRC-OQPSK signal. However, there are a couple of limitations in implementing a CDR

loop and demodulating the HSRC-OQPSK signal. First, since the data-rate HSRC-OQPSK

signal is higher than its carrier frequency, LPFs in I/Q channels selecting low frequency

components cannot be used for the phase detector output in the Costas loop. Second, a coherent

demodulation of the QPSK signal with a matched filter consisting of an integrate/dump and a

decision circuit is very difficult to implement in the GHz range. Also, a bit-time delay between

the I and the Q channel data should be properly compensated for the phase error detection as

well as the demodulation. Consequently, the conventional Costas loop should be modified for the

CDR of HSRC-OQPSK signal. A new CDR loop for HSRC-OQPSK signal modified from the

Costas loop is proposed and shown in Figure 4-5.

97

Signal in

VCO

DET F/F

PP S/H

DET F/F

PP S/H

90o

I

Q

e-

+

IS

QS

IF

QF

LF

CI

CQ

Figure 4-5 A modified Costas loop for the HSRC-OQPSK signal clock and data recovery.

The modulated signals fed into the CDR are split into the I/Q channels and mixed with

quadrature carrier signal generated from QVCO. The LPFs shown in Figure 4-2 are removed and

the limiters are replaced by DET F/Fs. Sample/hold (S/H) circuits sampling both clock edges

hold the signal of each channel by 2 bit-time for the proper evaluation of the phase error. The

demodulated signals, I and Q, are sampled by sample/holds and DET F/Fs. The sampled signals,

IS and QF, QS, and IF, are mixed for the final evaluation of the phase error. The sampling time of

the I and Q channel are offset by 1 bit-time. Therefore, the CDR loop evaluates the phase error

every 1 bit-time. The details are analyzed and a behavioral model simulation will be presented.

4.3.2.1 Phase detector characteristics

The received HSRC-OQPSK signal can be represented as (4-4) [43], where Tb is a bit time

and θi is an arbitrary phase of the incoming signal.

( ) ( ) ( ) ⎟⎟⎠

⎞⎜⎜⎝

⎛+⋅+⎟⎟

⎠

⎞⎜⎜⎝

⎛+⋅= i

bQi

bI T

ttmTttmts θπθπ

2cos

21

2sin

21 (4-4)

98

For the QVCO signals of the CDR, they can be assumed as (4-5), where θo is the initial phase of

the QVCO signals.

( ) ( ) ⎟⎟⎠

⎞⎜⎜⎝

⎛+=⎟⎟

⎠

⎞⎜⎜⎝

⎛+= o

bQo

bI T

ttcTttc θπθπ

2cos2,

2sin2 (4-5)

Then I/Q signals can be obtained as (4-6), (4-7) by multiplying two signals represented as (4-4),

(4-5) using trigonometric operations.

( ) ( ) ( )

( ) ( )⎥⎥⎦

⎤

⎢⎢⎣

⎡−−⎟⎟

⎠

⎞⎜⎜⎝

⎛++⋅+

⎥⎥⎦

⎤

⎢⎢⎣

⎡−−⎟⎟

⎠

⎞⎜⎜⎝

⎛++⋅−=

oioib

Q

oioib

I

Tttm

TttmtI

θθθθπ

θθθθπ

sinsin21

coscos21

(4-6)

( ) ( ) ( )

( ) ( )⎥⎥⎦

⎤

⎢⎢⎣

⎡−+⎟⎟

⎠

⎞⎜⎜⎝

⎛++⋅+

⎥⎥⎦

⎤

⎢⎢⎣

⎡−+⎟⎟

⎠

⎞⎜⎜⎝

⎛++⋅=

oioib

Q

oioib

I

Tttm

TttmtQ

θθθθπ

θθθθπ

coscos21

sinsin21

(4-7)

Since the samplings of S/Hs and F/Fs are taking place at the zero crossing points of carrier

signals defined in (4-5), one can obtain the sampling time of each channel by setting

cI(tQ)=cQ(tI)=0. Therefore, the sampling time for the I/Q channels are defined as (4-8).

⎟⎠⎞

⎜⎝⎛ ⋅−=⎟

⎠⎞

⎜⎝⎛ ⋅−=

πθ

πθ o

bQo

bI TtTt 2,21 (4-8)

Now, one can define the sampled signals using (4-8). The sampled signals, IS(t) and QS(t) shown

in Figure 4-3, can be calculated as (4-9), (4-10) by substituting (4-8) into (4-6), (4-7)

respectively, where Ф=θi-θo.

99

( ) ( ) ( ) φφ sincos ⋅−⋅= IQIIS tmtmtI (13)

( ) ( ) ( ) φφ cossin ⋅+⋅= QQQIS tmtmtQ (14)

Similarly, IF and QF signals can be determined by limiting the sampled signals which are

obtained by taking sgn[Is(tI)], sgn[Qs(tQ)]. For these signals, the phase difference between the

received signal and the carrier signal determines the values of IF and QF. Therefore, IF and QF are

represented as (4-11), (4-12), respectively.

( )

( )

( )

( )⎪⎪⎪

⎩

⎪⎪⎪

⎨

⎧

<≤−

<≤−

−<≤−

=

43

4

44

443

πφπ

πφπ

πφπ

IQ

II

IQ

IF

tm

tm

tm

tI (4-11)

( )( )( )( )

⎪⎪⎪

⎩

⎪⎪⎪

⎨

⎧

<≤

<≤−

−<≤−−

=

43

4

44

443

πφπ

πφπ

πφπ

QI

QQ

QI

QF

tm

tm

tm

tQ (4-12)

The data sequences of mI and mQ should be specified for determining the phase error, e(t).

Figure 4-4 illustrates the early and late sampled I/Q channel data when the phase difference, Ф, is

either positive or negative. When Ф<0, the sampling time is earlier than the ideal sampling time

while the sampling time is later than the ideal sampling time when Ф<0, as shown in Figure 4-6.

Consequently, data constraints related to the phase difference can be stated as (4-13).

( ) ( )( ) ( )⎩

⎨⎧

<=≥=

00

φφ

QIII

QQIQ

tmtmtmtm

(4-13)

100

mI

mQmQ(tI) mQ(tQ)

mI(tI) mI(tQ)Ф<0Ф>0

mI

mQmQ(tI) mQ(tQ)

mI(tI) mI(tQ)Ф<0Ф>0

Figure 4-6 Early and late sampling time of I/Q data.

Now, the phase error e(t), represented as (4-14), can be estimated. The CDR loop evaluates

the phase error every bit-time because the IS(tI) and QF(tQ) or QS(tQ) and IF(tI) are overlapped by

a bit time.

e(t)= QS(t)IF(tI) - IS(t)QF(tQ) (4-14)

With the data constraints in (4-13), the phase error, e(t), is obtained as (4-15) if it is assumed that

the amplitude of the mI, mQ are unity.

( )

( ) ( )( )( ) ( )( )( ) ( )( )( ) ( )( )⎪

⎪⎪⎪

⎩

⎪⎪⎪⎪

⎨

⎧

≤<⋅⋅+−

≤≤⋅⋅+

<≤−⋅⋅+

−<≤−⋅⋅+

=

43

4cos1

40sin1

04

sin144

3cos1

πφπφ

πφφ

φπφ

πφπφ

QIII

QIII

QQIQ

QQIQ

tmtm

tmtm

tmtm

tmtm

te (4-15)

Still, the values of mQ(tI)·mQ(tQ) when Φ<0 and mI(tI)·mI(tQ) when Φ>0 cannot be fixed to

finalize e(t) because the random data sequences are uncorrelated. The values can be either -1 or 1

which is dependant on the data sequences. The undetermined value of -1 or 1 leads to two

101

possible results of e(t). The coefficients of each term of (4-15) will be zero or ±2 when the value

is -1 or 1, respectively. Then, the averaged phase error can be rewritten as (4-16) with the

assumptions that I/Q channel data sequences occur equally likely with symbol time and the phase

error process is changing slowly over the large number of symbol periods. Consequently, the

averaged phase error is equivalent to that of the polarity-type Costas loop. Its plot is shown in

Figure 4-7.

( )

⎪⎪⎪

⎩

⎪⎪⎪

⎨

⎧

≤<−

≤≤−

−<≤−

≅

43

4cos

44sin

443cos

πφπφ

πφπφ

πφπφ

te (4-16)

π/4 π/2

e

3π/4-π/4π/2-3π/4 Фπ/4 π/2

e

3π/4-π/4π/2-3π/4 Ф

Figure 4-7 Averaged phase detector characteristic of the proposed CDR loop.

The proposed CDR loop based on the polarity-type Costas loop can be characterized as a

linear phase detector, as shown in Figure 4-5. A linear phase detector (PD) generates an error

signal linearly proportional to phase error. The output of the PD goes to zero when the loop is

locked while a non-linear PD is pumping the charge even with the loop in a locked state. It is

known that a linear PD produces lower jitter compared to non-linear PD due to less charge pump

102

activities [44-45]. Therefore, the proposed CDR is expected to generate less clock jitter noise

compared to non-linear type PD. Moreover, the carrier frequency of the proposed CDR loop for

HSRC-OQPSK signal is quarter data-rate, hence, the loop is equivalent to quarter-rate CDR for

NRZ signal. As a result, the proposed CDR can relax the timing constraints of the receiver

system. In addition, the proposed CDR allows retimed I/Q channel data as shown in Figure 4-1,

similar to that of other CDRs [6]. In addition, although the proposed CDR has been developed

for the clock recovery of the serial link transceiver system, the concept can also be applied to the

symbol synchronization of the QPSK signal in wireless communications.

One drawback of the proposed CDR is that there are four stable locking points over the π

radian period, only one of which has proper phase information. A differential encoding in the

transmitter can resolve this four-fold phase ambiguity problem at the cost of 3dB reduced signal

power [10]. And, other methods for phase ambiguity resolution have been introduced in [46].

However, this four-fold ambiguity issue is for future work.

4.3.2.2 Loop analysis

The modified Costas loop can be represented by the equivalent model, as shown in Figure

4-8. The Ka is the gain of the combiner, Kv is the VCO gain, and F(S) is the function of the loop

filter. Transient response of the loop cannot be characterized easily; however, the loop can be

modeled as a linear system with assumptions that the phase error, Ф, changes slowly over a large

amount of the symbol period [40].

Assuming no noise is added in this loop, then the overall transfer function of the closed

loop is calculated as (4-17).

( ) ( )( )

( )( )sFKKs

sFKKss

sHva

va

i

o

+==

θθ

(4-17)

103

VCO

Ka F(s)+

-

Kv/s

ni

e

0o

0i O/

Figure 4-8 Equivalent linear model of proposed CDR for HSRC-OQPSK.

Since the transfer function of the loop filter can affect the overall transfer function of the

system, the loop filter should be carefully chosen to stabilize the system. A lead/lag network is

widely used as a loop filter because it offers an increased open loop phase margin of the system

by inserting zero to the transfer function [6]. The transfer function can be rewritten by using the

parameters from circuit implementation. Ka=180μA/V is obtained from the circuit simulation and

Kv=120MHz/V, as shown in Figure 4-18. A lead/lag network with Rp=7KΩ and Cp=30pF is used

for the simulation purpose to reduce the simulation time and the required memory. With these

parameters, the closed loop gain of the system is represented as (22). The zero of the closed loop

of the transfer function is located at 1/RpCp. In real design for the measurement, the closed loop

bandwidth of 1MHz has been chosen which is examined in a later section.

( )( )

z

vapva

ppz

va

CKKsRKKs

sCRC

KK

sH++

+=

2

1 (4-18)

104

The stability issues of the type I and type II PLL are discussed thoroughly in [6], [47-48].

The phase margin of the system’s open loop transfer function is often used to examine the

stability of the system shown in Figure 4-7. The characteristics of the open loop transfer function

show an approximately 90° phase margin at the gain crossover which will lead to stable locking

of the system.

100 102 104 106 108 1010-100

0

100

200

300

400

Ope

n Lo

op G

ain,

|Hop

en| (

dB)

100 102 104 106 108 1010

-180

-160

-140

-120

-100

-80

Frequency (Hz)

Pha

se, ∠

Hop

en (d

eg)

-40dB/dec

-20dB/dec

Figure 4-9 Open loop gain characteristics of the proposed CDR loop.

4.3.2.3 Noise characteristics

The phase error characteristic, S-curve, which is expressed in terms of signal-to-noise ratio

(SNR), can be calculated by averaging the value of the phase error as (4-19) [33]. Obviously, the

loop of the system includes the noise term depicted in Figure 4-6.

( ) ( )[ ]φφ |teEg = (4-19)

As we have discussed in the previous section, the phase error of the proposed CDR loop

based on the modified Costas loop is equivalent to that of the polarity-type Costas loop.

105

Therefore, the averaged phase error of the proposed CDR can also be considered to have the

same value as that of the polarity-type Costas loop. Consequently, the phase error characteristic

can be represented as the equivalent form as derived in [33]. The S-curve for the QPSK signal,

that is M=4 in [33], is represented as (4-20), where R=S/2N0B, S is the signal power, N0 is the

Gaussian noise spectral density, B is the noise bandwidth, am=sin[(2m-1)π/4], bm=cos[(2m-

1)π/4], and ( ) ( )∫∞

−=x

dxxxQ 2/exp21 2

π.

( ) ( )

( )[ ] ( )[ ] ( )

( )[ ] ( )[ ] φφφφ

φφπφφφφ

φφπφ

sincos2sincos2

cossin4

sin41

sincos2sincos2

cossin4

sin41,

4

1

4

1

llll

lll

llll

lll

baRQbaRQ

ba

baRQbaRQ

baRg

+−−−⋅

−⎟⎠⎞

⎜⎝⎛+

−−+−⋅

+⎟⎠⎞

⎜⎝⎛=

∑

∑

=

=

(4-20)

The phase error gain of the CDR loop is dependant on the SNR, as shown in Figure 4-8. In low

SNR, the S-curve performs like as sin4Ф [33].

-150 -100 -50 0 50 100 150-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

Phase Error (deg)

Nor

mal

ized

S-C

urve

0dB10dB20dB30dB∞dB

Figure 4-10 Phase error characteristic with SNR (S-curve) of the proposed CDR loop.

106

The S-curve gives the ability to estimate the characteristics of the CDR loop when the

noise is injected. However, the HSRC-OQPSK signal is used as a wide-band signal utilizing a

band-limited channel such as a PCB board trace. The band-limited channel having low-pass

characteristics may affect the S-curve behaviors.

4.3.3 Behavioral Model Simulations

4.3.3.1 Phase error

To verify the functionalities of the proposed CDR, the proposed CDR is designed by

behavioral models using MATLAB Simulink. A transmitter generating the HSRC-OQPSK

signal is also modeled, as shown in Figure 4-11.

HSRC-OQPSK TransmitterModified Costas Loop Structure

Integrator

HSRC-OQPSK TransmitterModified Costas Loop Structure

Integrator

Figure 4-11 Phase error simulations of the CDR for the HSRC-OQPSK signal with MatLab Simulink behavioral models.

The transmitter and the receiver structures modeled by behavioral components are

basically the same as described in the previous section. The transmitter generates HSRC-OQPSK

signals using random data. The generated HSRC-OQPSK signal is fed into the receiver directly.

A F/F in the receiver is modeled by a S/H and a limiter, as shown in Figure 4-11.

For the phase error simulation, the QVCO signals modeled with ideal sinusoidal sources

with phase offset of 90° are used in both the transmitter and the receiver. An integrator of the

107

CDR accumulates the phase error generated from the phase mismatch of the carrier signals

between the transmitter and the CDR loop. The phase mismatch can easily be modeled by

changing the phase of the carrier signal. The simulation result of the averaged phase error fits

well with the analytical form obtained in (4-16), as shown in Figure 4-12.

- 2.0 - 1.5 -1. 0 - 0.5 0.0 0. 5 1.0 1.5 2.0

- 0.8

- 0.6

- 0.4

- 0.2

0.0

0.2

0.4

0.6

0.8

e (N

orm

aliz

ed P

hase

Gai

n)

T heta (Phas e D if ferenc e)

Figure 4-12 Behavioral model simulation result of the phase error for the proposed CDR using MATLAB.

4.3.3.2 Time domain simulation

The system is modeled with data-rate of 1bps and the frequency of the carrier is 0.25Hz.

The reduced data-rate and the carrier frequency can be considered as 10Gbps and 2.5GHz

respectively in actual implementation.

Two VCOs with a phase difference of 90º replaced the ideal carrier signals in order to

model a QVCO, as shown as Figure 4-13. And the loop filter (LF) is characterized with a

transfer function which is equivalent to the characteristics of a series RC network given in the

loop analysis section.

108

The phase error signal, e(t), changes the frequency of the modeled QVCO. Figure 4-14

shows the simulation result of the phase error, e(t), which controls the frequency of the QVCO.

The initial frequency of the QVCO is 0.247Hz with a gain of 0.012Hz/V which are comparable

to 2.47GHz and 120MHz/V in a 10Gbps system, respectively.

Quadrature VCO

Loop Filter

Quadrature VCO

Loop Filter

Figure 4-13 HSRC-OQPSK Transceiver Model with QVCO for Time-Domain Simulation.

Figure 4-14 Time-domain response of phase error signal for the VCO frequency control.

109

In the locking state, approximately after 3700 time steps in Figure 4-14, the control voltage

is not changing because no error signal is generated from the loop. Ideally, the phase detector

(PD) of the loop would not generate an error signal while the non-linear PDs, such as a bang-

bang PD, have a charge injection even in the locking state, which causes large control voltage

transitions in a type II PLL [6], [47]. However, high frequency components might be injected at

this control signal because of unavoidable mismatches in circuits, such as a device and an I/Q

mismatches in actual circuits.

Phase differences are added to one of the VCOs in order to observe the I/Q mismatch-

effects. Figure 4-15 illustrates the simulation of phase error, e(t), when the case of the phase

mismatch is 10°. The locking time and ripple voltage of the phase error signal increase, as shown

in Figure 4-15. The ripple voltage directly affects the clock jitter making it degrade the system

performance.

Figure 4-15 Time-domain response of phase error signal for the VCO frequency control with 10° I/Q mismatch.

110

Figure 4-16 shows the normalized locking time and peak-peak ripple voltage in the locking

process. Since the locking time changes for random data sequences, the simulation results are

obtained from the average value of 5-time repeated simulation results. t0 is the average locking

time of control voltage when no I/Q mismatch has occurred. From the simulation results shown

in Figure 4-16, the locking time and peak-peak ripple voltage are almost proportional to the

amount of I/Q mismatch. Needless to say, it is important to reduce the ripple voltage because it

directly affects the QVCO’s phase noise.

0 2 4 6 8 10

0

1 0

2 0

3 0

4 0

5 0

I/Q Mismatch (deg)

Nor

mal

ized

Lo

ckin

g T

ime

(t-t 0)

/t 0

0.00

0.01

0.02

0.03

0.04

Ripple V

oltage in locking S

tate (V)

Figure 4-16 Normalized settling time and peak-peak ripple voltage in locking state vs. I/Q mismatch.

4.4 Chip Design (Rev. 2)

Since all the components can be characterized as a linear model in the behavioral model

simulation, the simulation is well-matched with the theoretical results. However, there are many

non-linear factors in circuit implementation such as a VCO gain, a clock jitter, or a clock feed

through of a S/H circuit.

111

In this chapter, the proposed receiver combined with a CDR is implemented and simulated

with UMC 0.18μm CMOS technology. The receiver consists of an input buffer, mixers, S/Hs,

F/Fs, a V/I converter, and a QVCO. Most of the circuits are designed with CML style circuits;

this is much the same for the transmitter design.

4.4.1 Circuit Implementation

4.4.1.1 Sample/hold circuit

[49] proposed a high speed S/H circuit, sampling bandwidth up to 7GHz with 0.25μm

CMOS technology, as shown in Figure 4-17. In tracking mode where the clock is high, the S/H

circuit operates as a differential amplifier. In the sampling mode when the clock falls, PMOS

loads and tail current source are turned off making the output nodes isolated so that the S/H

circuit is holding the sampled signal. M2 provides a low resistance differential load for wide

bandwidth operation. M1 pulls up the tail node to quickly turn off the input transistors, which

allows the amplifier bandwidth to be close to the sampling bandwidth [49].

Vin

clk

bias

Vref

outPoutM

clkb

M2

M1

Figure 4-17 A high-speed differential sample/hold (S/H) circuit [49].

As mentioned in the previous section, a S/H circuit for the proposed CDR loop should

sample the input signal at both clock edges and maintain the signal until the next clock edge

112

(either rising or falling) occurs because the clock frequency of the loop for each channel is half-

symbol-rate. A ping-pong (PP) structure S/H circuit [50] can be used for this purpose. An analog

MUX selects one of S/Hs which are tracking the input signal alternately.

Figure 4-18 shows a differential double clock edge sampled S/H circuit based on the PP

structure S/H. The structure of the S/H shown in Figure 4-18 is equivalent to that of DET F/F.

However, the analog multiplexer of the S/H should linearly amplify the sampled signal without

limiting the output. Since the analog multiplexer shares the same structure of the mixer shown in

Figure 3-6, the linearity characteristics can be applied for the analog MUX.

Analog MUX

S/H

S/HData

CLK

Out

Figure 4-18 A ping-pong structure differential sample/hold circuit.

4.4.1.2 Quadrature VCO (QVCO)

As discussed in Chapter 3, a two-stage ring oscillator using a current-starved inverter or

shunt capacitor inverter can generate the quadrature clock signal. However, a LC QVCO has

been designed for the receiver to improve the phase noise of the carrier signal, as is the same

reason for that of the transmitter design. The proposed CDR loop is comparable to the quarter-

rate CDR of the NRZ signal. Typically, a quarter-rate CDR needs multi-phase clock signals for

the proper phase error estimation [32]. However, the proposed CDR for the HSRC-OQPSK

modulation uses quadrature-phase clock signals. Figure 4-19 shows the LC-QVCO structure [6].

A cross-coupled NMOS generates a negative resistance increasing the Q of LC tanks. The

113

outputs of one VCO, I+ and I-, are fed into the inputs of the other VCO, which generates

quadrature-phase outputs; Q+ and Q-. The Q+ and Q- are also fed into the inputs of the other

VCO. Detail analysis of the QVCO is found in [6].

Vb

I-

Q-

I+

c cQ+

Q-

I+

Q+

c cI-

Vc

Figure 4-19 LC quadrature VCO (LC-QVCO).

The Q of inductor is approximately 8 with a value of 3.9nH. A higher value inductor has

chosen to get a large voltage swing to provide a clock signal for the appropriate operation of the

S/H circuit. The varactor value of 730fF with a fixed MIM capacitor of 320fF is used. The

frequency of VCO is controlled by a voltage of node vc, which changes the capacitances of the

varactors.

Figure 4-20 shows simulated QVCO’s tuning range and its gain. The QVCO achieves a

tuning range of approximately 10% of the center frequency, 2.5GHz, and a maximum gain of

120MHz/V in the Cadence Spectre simulation. The VCO gain is relatively small, but a varactor

diode is chosen to get more linear tuning performance instead of using a MOS varactor. The

pull-in range is limited due to the small QVCO tuning range and its gain. However, the linear

tuning performance of the QVCO and low phase noise improve the system reliability as well as

114

reduce the jitter noise. The tail current of each NMOS is 2mA, therefore, each VCO stage

consumes 4mA. A voltage follower is used as an output buffer of the QVCO, which is not shown

in Figure 4-19. The phase noise of -117dBc/Hz at 1MHz offset is achieved in the Cadence

Spectre simulation.

-0. 2 0.0 0.2 0.4 0.6 0. 8 1 .0 1 .2 1 .4 1 .6

2 .42

2 .44

2 .46

2 .48

2 .50

2 .52

2 .54

2 .56

2 .58

2 .60

2 .62

Kvco=120MHz/V

Freq

uenc

y (G

Hz)

Contro l Voltage (V)

Figure 4-20 Simulated QVCO’s tuning range and its gain.

4.4.1.3 Voltage-to-current (V/I) converter

A simple differential input to single ended output V/I converter is designed which is

illustrated in Figure 4-21. The simulated current gain is approximately 180μA/V. A lead/lag

network as a loop filter is attached to the output node of the V/I converter. For more flexibility

to choose the loop filter, the VCO control line can be externally accessed. Therefore, the value of

the components of a loop filter can easily be replaced.

115

Vin+

bias

Vin-

Iout

Figure 4-21 A differential to single-ended V/I converter.

4.4.2 Circuit Simulations

Figure 4-22 shows the detailed receiver structure. A buffer is inserted as a delay unit to

compensate the delay between the mixer output and the clock signal of the PP S/H, DET F/F.

The I/Q mixer outputs are tied directly together for combining the signal and followed by a

differential to single-ended V/I converter. The lead/lag type loop filter is attached to the control

node of the VCO.

Signal in QVCO

DET F/F

PP S/H

DET F/F

PP S/H

90o

I

Q

Vc

IS

QS

IF

QF

CI

CQ

V/I Converter

wired

delaydelay

Input Buffer

I data

Q data CpCz

Rz

Figure 4-22 A detailed receiver architecture.

116

For the receiver simulation, the transmitter and the receiver are connected via a channel of

10cm transmission line with 50Ω characteristic impedance modeled with RLC elements. Figure

4-23 shows the simplified simulation setup.

CK

Data In

Transmitter

Signal Out D

Receiver

Channel 50O I

QQVCO

Rz

CzCp

CDR

Figure 4-23 Time-domain simulation setup with a HSRC-OQPSK transmitter and 10cm transmission line with characteristic impedance of 50Ω.

Since the output buffer of the Rev. 2 transmitter has been terminated with the on-chip

resistor, the external 50Ω pull-up resistor has been attached to the receiver input. Both the source

and the end termination structure can minimize the signal reflections; however, the power would

be increased to transfer the signal compared to the open-drain structure which is used in Rev.1’s

transmitter. As used in the behavioral model simulation, a LF with Rz=7KΩ, Cz=30pF and

Cp=2pF is attached to the VCO control node for the simulation purpose. To decrease the ripple of

the control voltage distorting the phase of VCO, a small value of a capacitor is added to the node

in parallel. A 2pF capacitor is attached for this reason, as shown in Figure 4-23. Note that this

would not much change the characteristics of the closed loop time and frequency responses [6].

However, the much lower loop bandwidth has been realized in actual design to avoid high

frequency jitter noise. The loop parameters are chosen to ensure zero of a loop transfer function

located at lower frequency than the closed loop pole. These values are Cp=12nF, Cz=150nF,

117

Rz=55Ω which are externally added to the test board. In this condition, the closed loop bandwidth

is approximately 200KHz and the phase margin is more than 60°.

Figure 4-24 shows the simulated phase error signal, e(t), which illustrates the locking

behavior of the CDR. As discussed in behavioral simulation, if the I/Q channels are perfectly

matched, there is no charge injection from the PD output when it is locking. However, a circuit’s

non-linearities make the I/Q mismatch which introduces high-frequency noises, that is, creates

the difference between behavioral and circuit simulations. From the result, the estimated locking

time of the CDR loop is approximately 500ns with the loop filter in Figure 4-24. The tuning

range of less than 50MHz is achieved by the simulation.

0.0 0.5 1 .0 1.5 2.0 2 .50 .70

0 .75

0 .80

0 .85

0 .90

0 .95

1 .00

Vcn

t (V

)

Time (us)

Figure 4-24 Simulation of the locking behavior of the proposed CDR loop.

Since the receiver has no multiplexer for serializing retimed I/Q data, the I/Q data of the

transmitter and the receiver are compared, as shown in Figure 4-25. The transmitted data are

118

recovered by a demodulation process in the receiver after approximate 1ns delay. Figure 4-25

shows the simulation results where the CDR locked the loop with the proper phase.

Transmitter Data Q Channel

Transmitter Data I Channel

Receiver Out Q Channel

Receiver Out I Channel

Figure 4-25 I/Q data of the transmitter and the receiver in the proper phase locked state.

As mentioned in the previous section, the HSRC-OQPSK CDR has a four-fold phase

ambiguity as is the case of the conventional OQPSK modulation. Therefore, the CDR loop may

be locked in four possible phase points. Since only one phase has the proper phase relationship,

the phase ambiguity should be resolved for the proper data acquisition. As discussed before, a

differential coding with loss of 3dB power efficiency or other methods can fix this problem [10],

[46]. Intuitively, it might be also resolved with training bits at the initializing steps of the

communication. However, this fourfold phase ambiguity issue is for future work which is

discussed in Chapter 5.

119

4.4.3 Layout

The prototype receiver is designed and fabricated in UMC 0.18μm CMOS technology.

Figure 4-26 show the layout structure of the HSRC-OQPSK receiver.

Varactors

Quadrature VCO

gndvbrev outQ+ outQ-

Din-

Din+

outI- outI+vbivbcp

osc+

osc-

vbvco lpfout vcnt

gnd

gnd

gnd

gnd

gnd

vdd

vdd

vdd

vboc

vbo

Demodulation with CDR

vdd

Varactors

Quadrature VCO

gndvbrev outQ+ outQ-

Din-

Din+

outI- outI+vbivbcp

osc+

osc-

vbvco lpfout vcnt

gnd

gnd

gnd

gnd

gnd

vdd

vdd

vdd

vboc

vbo

Demodulation with CDR

vdd

Figure 4-26 HSRC-OQPSK receiver chip fabricated with UMC 0.18μm CMOS technology.

Four inductors for the quadrature VCO (QVCO) occupy the most part of chip, varactors

are place in the center and the demodulation logics with CDR are placed on right side of the

chip. The power and ground ring is placed around the chip, which are not shown in the die photo

because the power and ground ring is metal5 layers. Input signal pads are placed close to the

demodulation logic. The chip has 26 pads including 8 differential signal inputs and outputs

which are operated up to 10Gbps, 11 power and ground, 5 biases, and VCO control outputs for

the external loop filter with the total chip size of 1185μm X 1260μm. The bypass capacitor for

the power and ground has been placed at the unused chip area with the value of more than 60pF.

120

4.5 Measurement (Rev.2)

The same test of the transmitter’s has been used for the receiver. Figure 4-27 shows the test

board which is the same board as that of the transmitter’s. Bypass capacitor of approximately

100μF are attached between the power and the ground. The I and Q channel outputs are to be

connected to the external 2:1 mux for the BER test. VCO outputs are for monitoring the locked-

state of the CDR loop and the recovered clock signal.

Figure 4-27 Receiver test board.

Figure 4-28(a), (b) show the simplified 2.5Gbps and 10Gbps receiver measurement setups,

respectively. The signals are depicted with a single signal line for the simplification while the

actual signals are differential. Since it is not easy to estimate the receiver’s performance with a

conventional signal generator because the proposed HSRC-OQPSK signal is different from the

conventional NRZ signal, the HSRC-QOPSK receiver incorporated with the proposed CDR

requires HSRC-QOPSK modulated signal to evaluate the receiver’s performance. Therefore, the

121

HSRC-OQPSK transmitter output is connected to the receiver input via a channel as shown in

Figure 4-28.

A 2.5GHz VCO signal output of the transmitter is fed into the external clock input of the

Agilent 4093A BERT for the 2.5Gbps BER test. An external 2:1 mux (Inphi 20709SE) operating

up to 20Gbps is located at the receiver output for serializing the I and Q channel data. The

serialized data is fed back into the BERT for the BER performance test of the receiver.

CK

Data In

Transmitter

Signal Out

2.5GHz VCO Signal

Signal Generator

VCO OutExternalClock In

5GHz Clock

2.5Gbps

Data In

BERT

D

Receiver

Channel 50ohm IData

QData

2:1 MUX (external)

Phase Shifter

(a)

BERT

CK

Data In

Transmitter

Signal Out1/2 sub clock

10Gbps

Data In

D

Receiver

Channel 50ohm IData

QData

2:1 MUX (external)

Phase Shifter

Phase Shifter

(b)

Figure 4-28 Simplified receiver (transceiver) measurement setups (a) for 2.5Gbps input (equivalent data-rate of 5Gbps), (b) for 10Gbps data-rate.

Figure 4-29 shows the phase noise performance of the VCO. The red line in Figure 4-24

represents the phase noise of the receiver’s VCO signal in locked-state. The phase noise of the

122

recovered clock is approximately -87dBc/Hz at 50KHz offset. The 2.43Gbps random data has

been fed into the transmitter to generate the HSRC-OQPSK signal which is the source signal of

the receiver. The measured tracking range of the CDR is approximately 30MHz.

10K 100K 1M-120

-110

-100

-90

-80

-70

-60

-50

-40

Offset Frequency (Hz)

Pha

se N

oise

(dB

c)free-runninglocking

Figure 4-29 Phase noise performance of the receiver’s VCO in free-running and locking states.

Figure 4-30 illustrates the measured jitter of the recovered clock in response to 2.43Gbps

PRBS sequence of 231-1. Since the modulated signal cannot be generated by BERT, the

transmitter outputs which generate the HSRC-OQPSK signal are connected to the receiver’s

inputs for the measurement. The measured rms jitter and the peak-to-peak jitter of the recovered

clock by the CDR loop are equivalent to 6.5ps and 39.9ps respectively.

Figure 4-31 shows the recovered I or Q channel data output when the 231-1 PRBS input has

been used. As shown in Figure 4-23, the BER test has been performed using 2.5Gbps and

10Gbps test setups, respectively with three different ( 5”, 10”, 20” ) FR-4 PCB channels and a

19” SATA cable. From the measurement results, BER of < 10-9 have been achieved up to 10”

PCB channel and a SATA cable where 2.43Gbps 231-1 PRBS is used while BER of 10-4 is

123

achieved where a 20” PCB channel is used. However, the BER of <10-9 has been achieved where

2.43Gbps 27-1 PRBS is used. The power consumption of the receiver core is approximately

130mW from a 1.8V supply excluding power consumptions of the output buffers for the I and Q

channel data, which is driving the input of the external 2:1 mux for serializing the data and

output buffer of the VCO for the purpose of monitoring.

50mV/div

10ps/div

p-p jitter

50mV/div

10ps/div

p-p jitter

Figure 4-30 Measured jitter of recovered clock in response to 4.86Gbps (both I and Q channel input with 2.43Gbps PRBS sequence of 231-1).

100mV/div

100ps/div

100mV/div

100ps/div

Figure 4-30 Recovered I (or Q) channel eye-diagrams in response to 4.86Gbps (both I and Q channel input with 2.43Gbps PRBS sequence of 231-1).

124

Unfortunately, it has been failed to evaluate the performance of the receiver at 9.72Gbps

because there are no reasonable BER performance and recovered eye-diagrams from the

measurement. One of the main reason of the failure in recovering the 9.72Gbps data is that the

transmitted signal generated from the transmitter does not have enough energy in a peak signal

which has the signal energy of Es,p analyzed in Chapter 2. Eye-opening mismatches in the

transmitted data-rate of 9.72Gbps eye-diagram shown in Figure 3-27(a) caused a slight

synchronization mismatch between the clock and the data and it might affect the performance of

the receiver. Another reason for failure in recovering data might be from the CDR performance

of the receiver. However, it is hard to evaluate the CDR performance itself without the HSRC-

QOPSK modulated signal because there is no equipment that generates the ideal 10Gbps HSRC-

QOPSK signal. Although it has failed to recover 9.72Gbps transmitted data at the receiver which

is originally designed for recovering up to 10Gbps transmitted data, we could get a much

improved system performance if the low-noise and the broadband circuit design are considered

to design the transceiver. And frequency compensation techniques, such as a pre-emphasis of the

signal at the transmitter or the equalizing signal at the receiver-end using filter will also greatly

help in increasing the performance of the transceiver. However, these are put to future work.

Table 4-1 summarizes the performance of the receiver.

Table 4-1 Transceiver (Rev.2) performance summary Data Rate 4.86Gbps (2.43Gb/s I/Q input)

@ sequence of 231-1 < 10-9 @ up to 10” PCB channel and 19” SATA cable < 10-4 @ 20” PCB channel

BER Performance @ 4.86Gb/s PRBS input (2.43Gb/s PRBS I/Q input) @27-1

< 10-9 @ 20” PCB channel Phase Noise (recovered clock) -87dBc @ 50KHz offset

125

Table 4-1 (continued)

Recovered clock jitter 6.5ps (rms), 39.9ps (p -p) @ 2.43Gbps PRBS of 231-1

CDR tracking range 30MHz (Loop Bandwidth ≈ 200KHz)

Power consumption Tx + Rx : 200mW @ 1.8V

Die Size Tx: 1130 x 1240μm2 Rx: 1185 x 1260μm2

Technology UMC 0.18μm CMOS

4.6 Summary

This chapter proposed a HSRC-OQPSK receiver incorporated with a new CDR loop, and

demonstrated its viability of increasing the data-rate in high-speed serial link system by using the

HSRC-OQPSK modulation. The proposed receiver has been designed with UMC 0.18μm CMOS

technology and the analysis has been compared with the simulation results. From the simulation

results, the time domain signals and the spectrum of the HSRC-OQPSK modulation fit well with

theoretical analyses.

This paper also proposed a CDR based on the Costas loop for the HSRC-OQPSK

modulation. The proposed CDR is comparable to a quarter-rate CDR of NRZ modulation

because it uses quarter data-rate frequency clock. Therefore, the proposed CDR can improve the

timing constraints of the receiver’s clock and data recovery. The CDR incorporated with the

receiver is simulated and compared to the analytical results. Moreover, the CDR uses a QVCO

instead of multi-phase VCO which is applied to a conventional quarter-rate CDR hence it offers

a simple receiver structure. The circuit simulation results of the phase error and the locking

behavior relatively fit well with the behavioral model simulation. In addition, the proposed CDR,

characterized as a linear PD, can lower the jitter noise [44-45].

The HSRC-OQPSK transceiver can easily be implemented with a low-voltage technology

due to the reason that it uses two level data decision while a multi-PAM (e.g., 4-PAM) system [3]

126

needs reference voltages and the level spacing which is difficult to maintain linear levels in a

low-voltage system for data decisions. Moreover, this allows a simple transceiver architecture.

The measurement results show the feasibility of using the HSRC-OQPSK as a modulation

technique offering a simple transceiver architecture for the high-speed wire-line

communications, such as a serial link. Moreover, the HSRC-OQPSK modulation and the

receiver can enable a serial link system to achieve higher data-rate without aiding a reference

clock in low-voltage wire-line communication systems.

127

CHAPTER 5 SUMMARY AND SUGGESTIONS FOR FUTURE WORK

5.1 Summary

The HSRC-PSK modulations are proposed to optimize spectral efficiency for high data-

rate transmission over band-limited channels. The analysis and simulation results show that the

proposed modulations can be used in high-speed data communications, such as a backplane

serial link.

In the past, a multi-PAM signal (e.g., 4-PAM) has been demonstrated to increase the data-

rate in band-limited channels [3]. However in 4-PAM modulation, it is difficult to maintain the

linear spacing between levels in low-voltage and low-power application and as a result the

system’s performances are degraded. The level spacing also causes complexity in the transceiver

design not only because the received signal needs to be linearly amplified but also because 4-

PAM signaling requires accurate reference voltages.

The proposed HSRC modulations not only reduced the bandwidth requirement but also can

be easily implemented in deep submicron integrated circuit technologies with low supply

voltages. Three HSRC-PSK modulations named HSRC-QPSK, HSRC-OQPSK, and HSRC-

MSK are introduced and analyzed in terms of their spectrums and BER performances.

The HSRC-QPSK modulation can reduce the required bandwidth without any BER

performance degradation compared to the NRZ modulation. However, it is hard to implement a

HSRC-QPSK demodulator with a conventional circuit technique due to the difficulty in realizing

a matched filter in GHz range. The demodulation method for HSRC-QPSK is still under

investigation.

The HSRC-MSK modulation can minimize crosstalk noise compared to the conventional

ones, since the high frequency components are maximally suppressed [14] among the introduced

128

three quadrature HSRC-PSK modulations. However, it does not help to reduce the required

bandwidth of the transmitted signal because the first null bandwidth is the same as that of the

NRZ modulation.

The HSRC-OQPSK modulation has been chosen as a feasible modulation technique which

can be used in high-speed wire-line data communications. A prototype HSRC-OQPSK

transmitter has been designed with discrete components to verify the theory. Measurement

results confirmed that the proposed modulations can be effectively used in band-limited wire-line

applications requiring spectral efficiency, such as the backplane serial link. Moreover, because it

requires only a two-level decision, the HSRC-OQPSK transmitter can be implemented in a

simpler architecture than 4-PAM and is suitable for low-voltage systems. The HSRC-OQPSK

spectrum, which is the same as the MSK spectrum, contains 99% of the total signal power within

the bandwidth of B ≈ (1.2/Tb). In comparison, 4-PAM which has the same signal spectrum as

that of QPSK has a much larger 99% bandwidth of B ≈ (8/Tb) [13]. Therefore, it is expected that

the proposed HSRC-OQPSK modulation should have an efficient signal spectrum in band-

limited channels and also this spectrum efficiency will reduce the high frequency crosstalk noise

between the signal lines which may improve the performance of the multi-port serial

communication links.

Moreover, a fully reference-less serial link transceiver using the HSRC-OQPSK

modulation has been proposed and demonstrated its viability of increasing data-rate in high-

speed serial link system. The proposed transceiver has been designed with UMC 0.18μm CMOS

technology and the analysis has been compared with the simulation results. From the simulation

results, the time domain signals and the spectrum of the HSRC-OQPSK modulation fit well with

the theoretical analyses.

129

A QPSK carrier recovery loop can be used for the propoed HSRC modulations which

enables flexible design of the receiver. A new CDR loop for the HSRC-OQPSK based on the

polarity-type Costas loop has been proposed and implemented by the UMC 0.18μm CMOS

technology. This paper also proposed a CDR based on the Costas loop for the HSRC-OQPSK

modulation. The proposed CDR is comparable to a 1quarter-rate CDR of the NRZ modulation

because it uses a quarter data-rate frequency clock. Therefore, the proposed CDR can improve

timing constraints of the receiver’s clock and data recovery. Moreover, the HSRC-OQPSK

allows a simple transceiver architecture which can easily be implemented with a low-voltage

technology due to the reason that it uses two-level data decision while a multi-PAM (e.g., PAM-

4) system [3] needs reference voltages and the level spacing which is difficult to maintain linear

levels in a low-voltage system for data decisions. Table 5-1 summarizes and compares the

characteristics of the conventional modulations and the proposed HSRC-OQPSK modulation.

Table 5-1 Performance comparison of the different modulations.

PAM-2 (NRZ) PAM-4 Duobinary HSRC-

OQPSK

Main lobe bandwidth (first null) 1/Tb 1/2Tb 1/2Tb 3/4Tb Transmitted

Signal Spectrum Difference between

main and second lobe 13dB 13dB 13dB 23dB

1CDR Clock Frequency Full Rate Half Rate Full Rate Quarter Rate

# of Level 2 4 3 2

Decision Threshold 1 3 2 1 Data Decision

Decision Interval Tb 2Tb Tb 2Tb

1 Half and quarter data rate CDR has been developed for the PAM signaling [32], [51].

130

The results show that the HSRC-OQPSK modulation and the transceiver can enable a

serial link system to achieve a higher data-rate without aiding a reference clock in low-voltage

wire-line communication systems.

5.2 Four-fold Ambiguity Issue

As mentioned in Chapter 4, the proposed CDR based on the Costas loop has a fourfold

ambiguity problem. Only one out of the four stable locking points has proper phase information.

The differentially encoded data can resolve this four-fold ambiguity problem with 3dB signal to

noise ratio (SNR) degradation [10]. A transmitter with differentially encoded data input can be

implemented using a XOR gate and a F/F, as shown in Figure 5-1. It is essential for the

architecture of Figure 5-1(a) to have a parallel-to-serial logic which is implemented with a 2:1

multiplexer in the receiver. The alternative architecture decoding the data in the receiver without

using a 2:1 multiplexer is shown in Figure 5-1(b). The I and Q channels of the receiver generate

the demodulated data which are offset with one bit-time, Tb, offset, hence, the architecture shown

in Figure 5-1(b) can also be used to decode the modulated data.

HSRC-OQPSK TransmitterD Q

CK

HSRC-OQPSK Receiver

(include 2:1 multiplxer with I/Q inputs)

Data In

Data Out

CK

Q D

Transceiver 1

Channel

Transceiver 2

(a)

Figure 5-1 A differentially coded HSRC-OQPSK transceiver architecture to resolve the fourfold ambiguity issue (a) the receiver includes a 2:1 multiplexer for serializing I/Q channel data (b) alternative architecture without using a multiplexer.

131

HSRC-OQPSK TransmitterD Q

CK

HSRC-OQPSK Receiver

Data In

Data OutCK

Q D

Transceiver 1

Channel

Transceiver 2

I

Q

(b)


Another approach to resolve the fourfold ambiguity issue is synchronization at the protocol

layer. Every serial data link communications – almost all data communications – needs

synchronization, that is initialization of the data communications. Bits are sent out to the channel

from the least-significant bit (LSB) to the most-significant bit (MSB). All packets start with a

synchronization (SYNC) field, which is coded for the maximum edge transition rate. It is used

by CDR to recover the clock and data synchronization. A SYNC filed is defined to be eight bits

in length for full/low speed and 32 bits for high speed in the USB specification 2.0 [52] as an

example. Packet identifier (PID) follows the SYNC field. The USB specification 2.0 protocol

detail is presented in [52].

The protocol detail is not discussed in this chapter; however, a conceptual resolution of

the fourfold ambiguity by using a SYNC field. As described before, the SYNC field is coded to

make maximum edge transition in order to give more gain to the circuitry which aligns the

incoming data. However, the HSRC-OQPSK should have different SYNC to maximize edge

transition because it uses a carrier for the modulation. The carrier signal transforms the original

132

transmitted data sequences. To make it simple, assume the SYNC field is eight bit as is the case

for full/low speed of USB specification 2.0 and the SYNC field of the initial transmitter of

‘10010110’ has the maximum edge transition.

The data is split into I and Q channel ‘1001’ and ‘0110’ for the transmitter. After

demodulated at the receiver, four possible data sequences exist on the I and Q channel. When the

CDR locks the loop with a proper locking phase, the data sequences are the same as the

transmitted ones which are ‘1001’ and ‘0110’ for the I and Q channels of the receiver,

respectively. If the loop is locked with ±90° phase offset, one of the I/Q channels would recover

the transmitted data reversely from the original data which are ‘0110’, ‘0110’ or ‘1001’, ‘1001’.

Both the I/Q data are inverted –‘0110’ and ‘1001’– on condition that the CDR loop is locked

with the 180° phase offset. With these four possible SYNC fields at the receiver, we can estimate

at which phase the CDR loop is locked. Figure 5-2 shows the conceptual four-fold ambiguity

resolution method using a SYNC field.

D Q

CKB

HSRC-OQPSK Receiver

I

Q

D Q

CK

D Q

D Q

D Q

D Q

D Q

D Q

4

4

I Data Out

Q Data Out

Logical Operation(Control signals generate

logical 1 if the SYNC of I/Q is inverted from expected value, otherwise logical 0)

I Control

Q Control

Figure 5-2 An example of a conceptual architecture for resolving four-fold ambiguity issue using eight bits SYNC field.

133

LIST OF REFERENCES

[1] A. B. Carson, P. B. Crilly, and Janet C. Rutledge, Communication Systems, New York: McGraw-Hill, 2001, ch.11.

[2] G. Lawday, “On the buses” IEE Review, pp. 44-47, Jan. 2004.

[3] R. Farjad-Rad, C.-K. Ken Yang, M. A. Horowitz, and T. H. Lee, “A 0.3-µm CMOS 8-Gb/s 4-PAM serial link transceiver,” IEEE Journal of Solid-State Circuits, vol. 35, no.5 pp. 757-764, May 2000.

[4] J. T. Stonick, G.-Y. Wei Yang, J. L. Sonntag, and D. K. Weinlader, “An adaptive PAM-4 5Gb/s backplane transceiver in 0.25μm CMOS,” IEEE Journal of Solid-State Circuits, vol. 38, no.3 pp. 436-443, Mar. 2003.

[5] R. Farjad-Rad, “A CMOS 4-PAM Multi-Gbps Serial Link Transceiver,” Ph.D. dissertation, Dept. Elct. Eng., Stanford Univ., Stanford, CA, 2000.

[6] B. Razavi, Design of Integrated Circuits for Optical Communications, McGraw-Hill, 2003.

[7] P. Kabal, S. Pasupathy “Partial-response signaling,” IEEE Transactions on Communications, vol. 23, no. 9, pp. 921-934, Sep. 1975.

[8] M. Horowitz, C.-K. K. Yang, and S. Sidiropoulos, “High-speed electrical signaling: overview and limitations,” IEEE Micro, pp. 12-24, Jan. 1998.

[9] J. H. Sinsky, M. Duelk, and A. Adamiecki, “High-speed electrical backplane transmission using duobinary signaling,” IEEE Transaction on Microwave Theory and Techniques, vol. 53, no.1, pp. 152-160, Jan. 2005.

[10] J. G. Proakis, Digital Communications, New York: McGraw-Hill, 1995.

[11] D. R. Smith, Digital Transmission Systems, New York: Van Nostrand Reinhold, 1985, ch. 6.

[12] T. S. Rappaport, Wireless Communications, NJ: Upper Saddle River, Prentice Hall, 1996, ch. 5.

[13] S. Pasupathy, “Minimum shift keying: a spectrally efficient modulation,” IEEE Communications Magazine, pp. 14-22, July 1979.

[14] R. Bashirullah, W. Liu, and R. Cavin, “Cross-talk reduction for interconnect-limited bus based on raised cosine signaling,” Proceeding of 2002 CICC, pp. 513-516

[15] S. A. Gronemeyer, and A. L. McBride, “MSK and offset QPSK modulation,” IEEE Transactions on Communications, vol. 24, no. 8, pp. 809-820, Jul. 1976.

134

[16] T. J. Gabara and W. C. Fischer, “Capacitive coupling and quantized feedback applied to conventional CMOS technology,” IEEE Journal of Solid-State Circuits, vol. 32, no. 3, pp. 419-427, Mar. 1997.

[17] L. Luo, J. M. Wilson, S. E. Mick, J. Xu, L. Zhang, and P. D. Franzon, “3Gb/s AC coupled chip-to-chip communication using a low swing pulse receiver,” IEEE Journal of Solid-State Circuits, vol. 41, no. 1, pp. 287-296, Jan. 2006.

[18] Part3: Carrier sense multiple access with collision detection (CSMA/CD) access method and physical layer specification, IEEE Standards 802.3-2002.

[19] J. Yoon, and W. Eisenstadt, “Lumped passive circuit for 5GHz embedded test of RF SoCs,” IEEE ISCAS, pp. 1241-1244, May 2004.

[20] T. H. Lee, The Design of CMOS Radio-Frequency Integrated Circuits, 2nd, MA: Cambridge University Press, 1998, ch. 18.

[21] B. Razavi, RF Microelctronics, NJ: Upper Saddle River, Prentice Hall PTR, 1997, ch. 5.

[22] P. Heydari, “Design and analysis low-voltage current-mode logic buffers,” Proceedings of 4th ISQED03, pp. 293-298, 2003.

[23] B. Razavi, Design of Analog CMOS Integrated Circuits, New York: McGraw-Hill, 2003.

[24] P. Heydari, and R. Mohanavelu, “Design of ultrahigh-speed low-voltage CMOS CML buffres and latches,” IEEE Transactions on VLSI Systems, vol. 12, No.10, pp. 1081-1093, Oct. 2004.

[25] M. G. Johnson, and E. L. Hudson, “A variable delay line PLL for CPU-coprocessor synchronization,” IEEE Journal of Solid-State Circuits, vol. 23, no.5 pp. 1218-1223, Mar. 1988.

[26] A. Mazzanti, P. Uggetti, and F. Svelto, “Analysis and design of LC dividers for quadrature generation,” IEEE Journal of Solid-State Circuits. vol.39, no.9, pp.1425-1433, Sep. 2004

[27] H. Johnson, and M. Graham, High-Speed Digital Design, 1st, Prentice Hall PTR, 1993.

[28] S. Galal, and B. Razavi, “10-Gb/s limiting amplifier and laser/modulator driver in 0.18-μm CMOS technology,” IEEE Journal of Solid-State Circuits, vol. 38, no.12, pp. 2138-2146, Dec. 2003.

[29] Affirma Spectre Circuit Simulator User Guide, Cadence Design Systems Inc., San Jose, CA, 2000.

[30] A. Adamiecki, M. Duelk, and J. H. Sinsky, “25Gbit/s electrical duobinary transmission over FR-4 backplanes,” IEE Electronics Letters, vol. 41, no.14, pp. 826-827, Jul. 2005.

135

[31] K. Yamaguchi, K. Sunaga, S. Kaerlyama, T. Nedachi, M. Takamiya, K. Nose, Y. Nakagawa, M. Sugawara, and M. Fukaishi, “12Gb/s Duobinary signaling with x2 oversampled edge equalization,” ISSCC 2005 IEEE International, pp. 584-585

[32] J. Lee and B. Razavi, “A 40-Gb/s clock and data recovery circuit in 0.18um CMOS technology,” IEEE Journal of Solid-State Circuits, vol. 38, no.12, pp. 2181~2190, Dec. 2003

[33] H. C. Osborne, “A generalized “polity-type” Costas loop for tracking MPSK signals,” IEEE Transactions on Communications, vol. 30, no. 10, pp. 2289-2296, Oct. 1982.

[34] M. K. Simon, “Tracking performance of Costas loop with hard-limited in-phase channel,” IEEE Transactions on Communications, vol. 26, no.4, pp. 420-432, Apr. 1978.

[35] J. P. Costas, “Synchronous communications,” Proceedings of the IRE, vol. 44, no.12, pp. 1713-1718, Dec. 1956.

[36] W. C. Lindsey and M. K. Simon, “Carrier synchronization and detection of polyphase signals,” IEEE Transactions on Communications, vol. 20, no.3, pp. 441-454, Jun. 1972.

[37] A. Leclert, and P. Vandamme, “Universal carrier recovery loop for QASK and PSK signal sets,” IEEE Transactions on Communications, vol. 31, no.1, pp. 130-136, Jan. 1983.

[38] M. K. Simon and J. G. Smith, “Carrier synchronization and detection of QASK signal sets,” IEEE Transactions on Communications, vol. 20, no.2, pp. 98-106, Feb. 1974.

[39] M. K. Simon, “On the optimality of the MAP estimation loop for carrier phase tracking BPSK and QPSK signals,” IEEE Transactions on Communications, vol. 27, no. 1, pp. 158-165, Jan. 1979.

[40] C. L. Weber, and W. K. Alem, “Demod-remod coherent tracking receiver for QPSK and SQPSK,” IEEE Transactions on Communications, vol. 28, no. 12, pp. 1945-1954, Dec. 1980.

[41] C. L. Weber, and W. K. Alem, “Performance analysis of demod-remod coherent receiver for QPSK and SQPSK input,” IEEE Transactions on Communications, vol. 28, no. 12, pp. 1954-1968, Dec. 1980.

[42] R. E. Best, Phase-Locked Loops: Design, Simulation, and Applications, New York: McGraw-Hill, 2003, ch. 9.

[43] H. Yeo, Y. Lee, J. Chen, and J. Lin, “Half-symbol-rate-carrier offset QPSK transmitter for bandwidth-efficient high-speed data communications,” IEEE Microwave and Wireless Components Letters, vol. 17, no. 6, pp. 466-468, June 2007.

[44] S. B. Anand, and B. Razavi, “A CMOS clock recovery circuit for 2.5Gb/s NRZ data,” IEEE Journal of Solid-State Circuits, vol. 36, no.3, pp. 432-439, Mar. 2001.

136

[45] C. R. Hoggie, “A self correcting clock recovery circuit,” IEEE Journal of Lightwave Technology, vol. 3, no. 6, pp. 1312-1314, Dec. 1985

[46] E. R. Cacciamani. and C. J. Wolejsza, Jr. “Phase ambiguity resolution in a four-phase PSK communications system,” IEEE Transactions on Communication Technology, vol. 19, no.6, pp. 1200-1210, Dec. 1971.

[47] B. Razavi, Monolithic Phase-Locked Loops and Clock Recovery Circuits, NJ: Piscataway, IEEE Press, 1996.

[48] F. M. Gardner, “Charge-pump phase locked loops,” IEEE Transactions on Communications, vol. 28, no.11, pp. 1849-1858, Nov. 1980.

[49] C.-K. Ken Yang, V. Stojanovic, S. Modjtahedi, M. A. Horowitz, and W. F. Ellersick, “A serial-link transceiver based on 8-GSamples/s A/D and D/A converters in 0.25-μm CMOS,” IEEE Journal of Solid-State Circuits, vol. 36, no.11, pp. 1684-1691, Dec. 2001.

[50] M. H. Shakiba, D. A. Johns, and K. W. Martin, “BiCMOS circuits for analog Viterbi decoders,” IEEE Transactions on Circuits and Systems, vol. 45, no.12, pp. 1527-1537, Dec. 1998.

[51] J. Savoj, and B. Razavi, “10-Gb/s CMOS clock and data recovery circuit with a half-rate binary phase/frequency detector,” IEEE Journal of Solid-State Circuits, vol. 38, no.1, pp. 13-21, Jan. 2003.

[52] Universal Serial Bus Specifications, revision 2.0, Apr.. 2000.

137

BIOGRAPHICAL SKETCH

Hyeopgoo Yeo was born in Seoul, Korea, in 1968. He received his B.S. and M.S. degrees

in electronic engineering from Yonsei University, Seoul, Korea, in 1991 and 1993, respectively.

He also received his M.S. degree in electrical and computer engineering from the University of

Florida, Gainesville, USA, in 2003.

From 1993 to 1999, he worked as a design engineer at Samsung Electronic Co. Ltd.,

Kihung, Kyounggi-do, Korea, where he performed CMOS ASIC cell library design and

development.

He is currently pursuing his Ph.D. degree as a graduate research assistant at the University

of Florida. His research interests involve RF/analog circuit design, high-speed digital systems.

He is particularly interested in high-speed serial links.

DESIGN OF MULTI-GIGABIT SERIAL LINK TRANSCEIVER...

Documents

Transcript of DESIGN OF MULTI-GIGABIT SERIAL LINK TRANSCEIVER...