Efficient adaptive receivers for joint equalization and interference

14
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003 2849 Efficient Adaptive Receivers for Joint Equalization and Interference Cancellation in Multiuser Space-Time Block-Coded Systems Waleed M. Younis, Student Member, IEEE, Ali H. Sayed, Fellow, IEEE, and Naofal Al-Dhahir, Senior Member, IEEE Abstract—This paper develops low-complexity adaptive re- ceivers for space-time block-coded (STBC) transmissions over frequency-selective fading channels. The receivers are useful for equalization purposes for single user transmissions and for joint equalization and interference cancellation for multiuser transmissions. The receivers exploit the rich code structure of STBC codes in order to deliver recursive-least-squares (RLS) performance at least-mean-squares (LMS) complexity. Besides reduced complexity, the proposed adaptive receivers also lower system overhead requirements. Index Terms—Adaptive receiver, equalization, interference cancellation, multiantenna systems, multiuser systems, space-time block-code. I. INTRODUCTION I NCREASING system capacity without requiring additional bandwidth is of major significance for spectrally-efficient high-rate wireless communication systems. Space-time block- codes (STBCs) help increase reliability over wireless networks [1], and they can achieve full diversity gains with simple linear processing at the receiver. It was shown in [2] that co-channel users, each equipped with antennas and transmitting uncorrelated signals, can be detected with -order diversity gains if the receiver is equipped with antennas. However, the structure of STBC can be exploited to reduce the number of receive antennas. It was shown in [3] that only receive antennas are needed to provide -order diversity gains and suppress co-channel users. A simple interference cancellation scheme for two co-channel users employing the Alamouti STBC scheme [4] was already developed in [3]. It was shown that by using two transmit antennas for each user and two receive antennas at the base station, it is possible to double the system capacity by applying only linear processing at the receiver. This scheme has Manuscript received December 16, 2002; revised May 13, 2002. This work was supported in part by the National Science foundation under Grants ECS- 9820765 and CCR-0208573 and by a gift from AT&T Shannon Laboratory. Parts of the results presented in this work were previously reported in [12]–[14]. The associate editor coordinating the review of this paper and approving it for publication was Dr. Helmut Bölcskei. W. M. Younis and A. H. Sayed are with the Department of Electrical Engineering, University of California, Los Angeles, CA 90095 USA (e-mail: [email protected]). N. Al-Dhahir is with the Department of Electrical Engineering, University of Texas at Dallas, Richardson, TX 75083 USA. Digital Object Identifier 10.1109/TSP.2003.818209 been extended to the multiuser case in [5], where it was shown that the system capacity can be further increased by allowing more co-channel users. This is a useful observation, especially given that transmit diversity signaling using the Alamouti scheme [4] has been adopted in several wireless standards such as WCDMA and CDMA2000. Some of the attractive features of this diversity scheme are the following. • It achieves full spatial diversity at full transmission rate for any (real or complex) signal constellation. • It does not require channel state information (CSI) at the transmitter (i.e., it is open loop). • Maximum likelihood decoding involves only linear pro- cessing at the receiver (due to the orthogonal code struc- ture), thus keeping user terminals simple. However, when implemented over frequency-selective chan- nels, which is the focus of this paper, the Alamouti scheme should be implemented at a block not symbol level, and it should be combined with an effective equalization scheme to realize additional multipath diversity gains without sacrificing full spa- tial diversity. Several block level STBC structures for frequency selective fading channels, including time-reversal space-time block codes (TR-STBC) [6] and orthogonal frequency division multiplexing STBC (OFDM-STBC) [7], have been proposed in the literature. Another low-complexity scheme that achieves a similar goal is the single-carrier frequency-domain-equalized (SC FDE) STBC described in [8]. This scheme combines the advantages of the Alamouti scheme with those of SC FDE [9], namely, low complexity [due to use of the fast Fourier tarnsform (FFT)] and reduced sensitivity, compared with OFDM, to carrier frequency offsets and nonlinear distortion (due to reduced peak to average ratio). Now, coherent joint interference cancellation, equalization, and decoding of SC FDE-STBC transmissions require channel state information (CSI) at the receiver, which can be estimated using training sequences embedded in each block. Then, the op- timum equalizer/decoder settings can be computed from the es- timated CSI. An alternative to this channel-estimate-based ap- proach is adaptive equalization/decoding that does not require CSI estimation and helps reduce system overhead. It also pro- vides a tracking mechanism for time-variant channels. The purpose of this paper is to develop efficient low-com- plexity adaptive receivers for joint interference cancellation, equalization, and decoding in a multiuser environment with STBC transmissions. The operation of the receivers will be described for both training and tracking modes, and they will 1053-587X/03$17.00 © 2003 IEEE

Transcript of Efficient adaptive receivers for joint equalization and interference

Page 1: Efficient adaptive receivers for joint equalization and interference

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003 2849

Efficient Adaptive Receivers for Joint Equalizationand Interference Cancellation in Multiuser

Space-Time Block-Coded SystemsWaleed M. Younis, Student Member, IEEE, Ali H. Sayed, Fellow, IEEE, and Naofal Al-Dhahir, Senior Member, IEEE

Abstract—This paper develops low-complexity adaptive re-ceivers for space-time block-coded (STBC) transmissions overfrequency-selective fading channels. The receivers are usefulfor equalization purposes for single user transmissions and forjoint equalization and interference cancellation for multiusertransmissions. The receivers exploit the rich code structure ofSTBC codes in order to deliver recursive-least-squares (RLS)performance at least-mean-squares (LMS) complexity. Besidesreduced complexity, the proposed adaptive receivers also lowersystem overhead requirements.

Index Terms—Adaptive receiver, equalization, interferencecancellation, multiantenna systems, multiuser systems, space-timeblock-code.

I. INTRODUCTION

I NCREASING system capacity without requiring additionalbandwidth is of major significance for spectrally-efficient

high-rate wireless communication systems. Space-time block-codes (STBCs) help increase reliability over wireless networks[1], and they can achieve full diversity gains with simple linearprocessing at the receiver.

It was shown in [2] that co-channel users, each equippedwith antennas and transmitting uncorrelated signals, can bedetected with -order diversity gains if the receiver is equippedwith antennas. However, the structure of STBCcan be exploited to reduce the number of receive antennas. Itwas shown in [3] that only receive antennas are needed toprovide -order diversity gains and suppress co-channelusers.

A simple interference cancellation scheme for twoco-channel users employing the Alamouti STBC scheme[4] was already developed in [3]. It was shown that by usingtwo transmit antennas for each user and two receive antennas atthe base station, it is possible to double the system capacity byapplying only linear processing at the receiver. This scheme has

Manuscript received December 16, 2002; revised May 13, 2002. This workwas supported in part by the National Science foundation under Grants ECS-9820765 and CCR-0208573 and by a gift from AT&T Shannon Laboratory.Parts of the results presented in this work were previously reported in [12]–[14].The associate editor coordinating the review of this paper and approving it forpublication was Dr. Helmut Bölcskei.

W. M. Younis and A. H. Sayed are with the Department of ElectricalEngineering, University of California, Los Angeles, CA 90095 USA (e-mail:[email protected]).

N. Al-Dhahir is with the Department of Electrical Engineering, University ofTexas at Dallas, Richardson, TX 75083 USA.

Digital Object Identifier 10.1109/TSP.2003.818209

been extended to the multiuser case in [5], where it was shownthat the system capacity can be further increased by allowingmore co-channel users. This is a useful observation, especiallygiven that transmit diversity signaling using the Alamoutischeme [4] has been adopted in several wireless standards suchas WCDMA and CDMA2000. Some of the attractive featuresof this diversity scheme are the following.

• It achieves full spatial diversity at full transmission rate forany (real or complex) signal constellation.

• It does not require channel state information (CSI) at thetransmitter (i.e., it is open loop).

• Maximum likelihood decoding involves onlylinear pro-cessing at the receiver (due to the orthogonal code struc-ture), thus keeping user terminals simple.

However, when implemented over frequency-selective chan-nels, which is the focus of this paper, the Alamouti schemeshould be implemented at ablocknotsymbollevel, and it shouldbe combined with an effective equalization scheme to realizeadditional multipath diversity gains without sacrificing full spa-tial diversity. Several block level STBC structures for frequencyselective fading channels, including time-reversal space-timeblock codes (TR-STBC) [6] and orthogonal frequency divisionmultiplexing STBC (OFDM-STBC) [7], have been proposedin the literature. Another low-complexity scheme that achievesa similar goal is the single-carrier frequency-domain-equalized(SC FDE) STBC described in [8]. This scheme combines theadvantages of the Alamouti scheme with those of SC FDE[9], namely, low complexity [due to use of the fast Fouriertarnsform (FFT)] and reduced sensitivity, compared withOFDM, to carrier frequency offsets and nonlinear distortion(due to reduced peak to average ratio).

Now, coherent joint interference cancellation, equalization,and decoding of SC FDE-STBC transmissions require channelstate information (CSI) at the receiver, which can be estimatedusing training sequences embedded in each block. Then, the op-timum equalizer/decoder settings can be computed from the es-timated CSI. An alternative to this channel-estimate-based ap-proach isadaptiveequalization/decoding that does not requireCSI estimation and helps reduce system overhead. It also pro-vides a tracking mechanism for time-variant channels.

The purpose of this paper is to develop efficient low-com-plexity adaptive receivers for joint interference cancellation,equalization, and decoding in a multiuser environment withSTBC transmissions. The operation of the receivers will bedescribed for both training and tracking modes, and they will

1053-587X/03$17.00 © 2003 IEEE

Page 2: Efficient adaptive receivers for joint equalization and interference

2850 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

be shown to deliver RLS performance at LMS complexity.We will consider single-user, two-user, and multiuser STBCtransmissions. In all cases, frequency-selective channels areassumed with two antennas per user for transmission and oneantenna per user for reception.

The paper is organized as follows. In Section II, we describethe structure of the space-time block code and review a schemefor nonadaptive equalization for the single-user scenario andother schemes for joint interference cancellation and equaliza-tion with two or more co-channel users. Adaptive receivers arethen developed in Section III for both training and trackingmodes. Simulation results for the EDGE cellular1 environmentare presented in Section IV, and the paper is concluded in Sec-tion V.

II. SPACE-TIME BLOCK CODES FORBROADBAND CHANNELS

For STBCs to achieve multipath and spatial diversity gainson frequency-selective channels, the Alamouti scheme shouldbe implemented at ablock not symbollevel. In this section, wedescribe the relevant encoding rule and the corresponding linearminimum-mean-square-error (MMSE) receivers for both casesof single-user and two-users transmissions.

A. Single-User Transmissions

Consider the scheme shown in Fig. 1, where a single userequipped with two antennas transmits data over a wirelesschannel, and the receiver has a single antenna. Data are trans-mitted from the antennas in blocks of length, each accordingto the space-time coding scheme indicated in Fig. 2. Denotethe th symbol of the th transmitted block from antennaby

. At times , pairs of blocks andare generated by an information

source according to the rule [8]:

(1)

where the date vectorhas a covariance matrix equal to ,and where and denote complex conjugation andmodulo- operations, respectively. In addition, a cyclic prefixof length is added to each transmitted block to eliminateinter-block interference (IBI) and to make all channel matricescirculant. Here, denotes the longest channel memory betweenthe transmit antennas and the receive antennas. With two

1EDGE stands for enhanced data rates for global evolution.

transmit and one receive antenna, the received blocksand, in the presence of additive white noise, are described by

for (2)

where is the noise vector with covariance matrix ,and and are thecirculant channel matrices from thefirst and second transmit antennas, respectively, over block,to the receive antenna. Specifically, each has the formshown at the bottom of the page, in terms of the impulse re-sponse sequence . Ap-plying the DFT matrix to , we get a relation in terms offrequency-transformed variables:

(3)

where , , and and are di-

agonal matrices given by . Using the encodingrule (1) and properties of the DFT [10] and assuming the twochannels are fixed over two consecutive blocks, we have that

(4)

for and . Combining (3)and (4), we arrive at the linear relation

(5)

where denotes complex conjugation of the entries of thevector. Relation (5) tells us how the transformed received vec-tors are related to the transformed transmittedblocks from the two antennas. It turns out that thematrix in (5) has a very useful structure, viz., is diagonal.This fact can be exploited to recover the from thenoisy observations via a simplelinear receiverstructure. The matrix structure of in (5), with diagonal sub-matrices , will appear frequently throughout the paper,and we will use the terminologyAlamouti-like matrixto refer toit. Indeed, if we multiply both sides of (5) by , we can de-couple the signals and since this step gives

(6)

......

. . .. . .

. . ....

.... ..

. . .. . .

. . ....

Page 3: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2851

where is an diagonal matrix with (, ) element equalto , which is also equal to the sum of thesquaredth DFT coefficients of first and second channel impulseresponses. The filtered noise vectorhas a diagonal covariancematrix equal to diag . The minimum mean square error(MMSE) estimator of given is now seen to be [11]

SNR(7)

where is diagonal, and SNR is the signal-to-noise ratio atthe receiver SNR . The resulting MMSE frequency-domain equalizer is shown in Fig. 3 [12].

B. Two-User Transmissions

By using a second receive antenna, we can double the numberof users [13]. Consider the block diagram shown in Fig. 4. Withtwo receive antennas and two users (each equipped with twoantennas), (5) generalizes to

(8)

where and are the received signals at the secondreceive antenna at blocksand , respectively, and

and are the transmitted blocks from the first and secondantennas of the second user, respectively. In more compact form,(8) can be written as

(9)

where is the processed signal from theth receive antenna,whereas is the corresponding noise vector. Moreover,consists of the two subvectors representing the size-DFTs ofthe information blocks transmitted from theth user’s first andsecond transmit antennas at time, i.e.,

(10)

and each is the Alamouti-like overall frequency domainchannel matrix from theth user transmit antennas to thethreceive antenna. The two users can be decoupled as follows:

(11)

Fig. 1. Single-user system with two-transmit one-receive antenna.

Fig. 2. Block format for SC FDE-STBC transmission scheme.

Fig. 3. Receiver structure for a two-transmit one-receive antenna system.

where , and . Itcan be verified that both and have an Alamouti-like struc-ture, i.e.,

and

where are all diagonal. Once the two usershave been decoupled, the equalization procedure described by(7) can be applied to each user in order to recover theoriginal data (see [13]).

C. Multiuser Case

With users (each equipped with two antennas), we can usereceive antennas to decouple all users and, hence, increase

the system capacity. Equation (5) generalizes to

......

.... . .

......

...(12)

where , , and are given by (10), and each is theAlamouti-like frequency domain channel matrix from thethuser transmit antennas to theth receive antenna. We can it-eratively recover the symbols of each user by successive Schurcomplementation, starting from theth user, proceeding to the

Page 4: Efficient adaptive receivers for joint equalization and interference

2852 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

Fig. 4. Two-user system with two-transmit antennas per user and two-receiveantennas.

th user, and down to the first user. For example, to re-cover the symbols of the th user, we partition the channel ma-trix in (12) as

......

.. ....

(13)with , , , and denoting theupper left, upper right,lower left, and lower right matrices, respectively. Alinear interference canceller similar to the one designed for thetwo-user case in (11) can then be used to suppress the interfer-ence from the first users on the th user as follows:

...

......

(14)

where , and . Itcould be easily verified that has an Alamouti-like structureand that is an block matrixwith each block being an Alamouti-like matrix. Thisresult follows from the following properties of Alamouti-likematrices:

• The sum or difference of two Alamouti-like matrices is anAlamouti-like matrix.

• The inverse of an Alamouti-like matrix is Alamouti-like.• The inverse of a block matrix with Alamouti-like sub-

blocks is a block matrix with Alamouti-like subblocks.The next step is to apply the single-user SC MMSE-FDE of (7)to in order to get an estimate for the DFT of the symbols

of the th user . Then, we apply IDFT to to get anestimate for the corresponding symbols .

The symbols of the th user can be recovered by re-peating the procedure on the reduced system:

......

Again, we partition in a way similar to (13) with , ,, and now being the upper left,

upper right, lower left,and lower right matrices, respectively. We then applyan interference canceller scheme similar to (14), followed by asingle-user SC MMSE-FDE to get an estimate for the symbolsof the th user . We then proceed until we recoverthe symbols of all users. The block diagram of the multiuserreceiver is shown in Fig. 5 [14].

III. A DAPTIVE SCHEME

The joint interference cancellation and equalization tech-niques described in the previous sections require the channelsto be known at the receiver in order to know the . Usually,channel estimation is done by adding a training sequence toeach transmitted block and using it to estimate the channel,which tends to increase the system overhead. Reduction of thesystem overhead requires using longer blocks, which may notbe viable for channels with fast variations. These observationsmotivate us to develop adaptive receivers for joint interferencecancellation and equalization purposes. Our interest is todevelop adaptive structures with fast tracking/convergence abil-ities (like recursive-least-squares structures) and then to showhow the special structure of the space-time block code can beexploited to obtain RLS performance at LMS complexity. Oneof the advantages of adaptive receivers is that they eliminatethe need for adding a training sequence to each data block.They use a few training blocks during initialization, and thenthey can track channel variations in a decision-directed mode.In this way, the system overhead can be reduced. We start withthe single-user transmissions case.

A. Single-User Transmissions

From Fig. 3, we note that the MMSE equalizer consists of alinear combiner followed by scaling by a diagonal matrix.We denote the combined matrix by , which is seen tohave the structure

where and are diagonal matrices given by

diag

diag

Page 5: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2853

Fig. 5. Receiver structure for multiuser STBC transmissions.

Therefore, from Fig. 3, we have that

(15)

which can be rewritten alternatively as

diag diagdiag diag

(16)

where and are the vectors containing the diagonal el-ements of and , respectively. Moreover, is avector containing the elements of , and is anAlamouti-like matrix of size containing the receivedsymbols from blocks and . Equation (16) reveals thespecial structure of the STBC problem. In the nonadaptive sce-nario, the coefficients of are calculated from a channel esti-mate at every block. Equation (16), on the other hand, suggeststhat can be computed adaptively by using a block version ofthe RLS algorithm. Specifically, the equalizer coefficients areupdated every two blocks, according to the following recursion:

(17)

where

(18)

where is a forgetting factor that is usually close to 1. The initialconditions are and , is a large number, and

is the identity matrix. Moreover, is thedesired response vector given by

for training

for decision-directed tracking.

It might seem that the computational complexity of the algo-rithm is high since matrix inversion is needed in (18). However,due to the special structure of the space-time block-code, no ma-trix inversion is needed and the complexity of the algorithm canbe reduced to that of an LMS implementation. In this way, weare able to obtain RLS performance at LMS cost. The reasoningis as follows.

It follows by induction that has a diagonal structure ofthe form:

(19)

where is diagonal as well. This statement holds at timesince, by assumption, (so that ).

Now, assume the statement holds at time. Then, it is easy tosee that

where is diagonal and given by

diag diag

diag diag (20)

Since diagonal matrices commute, (20) simplifies to

diag(21)

Page 6: Efficient adaptive receivers for joint equalization and interference

2854 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

TABLE IADAPTATION ALGORITHM FOR SINGLE-USERTRANSMISSIONS

Fig. 6. Proposed receiver block diagram for single user with two-transmit one-receive antennas.

Then, has the desired structure with

(22)

where

diag (23)

Substituting (21) into (23), we get

diag

(24)Therefore, the algorithm collapses to

(25)where is diagonal computed via (22) and (24). Theprocedure for updating the entries of is shown in Table I.The block diagram of the resulting adaptive receiver is depictedin Fig. 6. The received signal is transformed to the frequency do-main using FFT; then, the data matrix in (16) is formed. Thefilter output is the product of the data matrix and the filtercoefficients . The filter output is transformed

back to the time domain using IFFT, and a decision device isused to generate the receiver output. The output of the equal-izer is compared with the desired response to generate an errorvector. The error vector is used to update the equalizer coeffi-cients according to the RLS algorithm. The equalizer operates ina training mode until it converges, and then, it switches to a deci-sion-directed mode, where previous decisions are used to updatethe equalizer coefficients for tracking. When tracking channelswith fast variations, retraining blocks might be needed to pre-vent divergence of the adaptive algorithm.

B. Two-User Transmissions

Starting with (11), since and are Alamouti-like, wecan easily verify that both and are alsoAlamouti-like. Combining the interference canceller and theMMSE equalizer, the equalizer output is then given by

(26)

Page 7: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2855

where SNR andSNR with SNR and SNR being the signal-to-

noise ratios for the first user and the second user, respectively,and each , , , 2 has the following Alamouti-likestructure:

(27)

Equation (26) can be rewritten alternatively as

diag diag

diag diag

diag diag

diag diag

(28)

and

diag diag

diag diag

diag diag

diag diag

(29)

where and are the vectors containing the diagonalelements of and , respectively. Moreover, is a

vector containing the elements of , andis an Alamouti-like matrix of size containing the

received symbols at antennafrom blocks and . Equa-tions (28) and (29) again reveal the special structure of the in-terference canceller for the STBC problem. In the nonadaptivescenario, the coefficients of and are calculated from achannel estimate at every block. Equations (28) and (29), on theother hand, indicate that and can be computed adap-tively, e.g., by using RLS (for faster convergence). In this case,the receiver coefficients are updated every two blocks accordingto the recursions:

(30)

where

(31)

and

(32)

The initial conditions are and , where is alarge number, and is the identity matrix. ,

is the desired response vector given by

for training

for decision-directed tracking.

(33)Now, although matrix inversion is apparently needed in (32)

for operation of the RLS algorithm, the computational com-plexity can be significantly reduced, and matrix inversion canbe avoided altogether by exploiting the special STBC structure.Actually, the complexity of the algorithm can again be shownto be similar to that of an LMS implementation. The rationalebehind complexity reduction is as follows. Starting with (31) at

, we get

(34)

However, evaluates to

(35)where is , and is diagonal and givenby

diag

diag (36)

Substituting (35) into (34), we find that is given by

(37)

where and are diagonal andand have an Alamouti-like structure. Proceeding for, we get

(38)

After some simple algebra, it can be verified that has theform

(39)

Page 8: Efficient adaptive receivers for joint equalization and interference

2856 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

Fig. 7. Matrix structures of���(k) andP for k � 2.

where , , , 2 are diagonal ma-trices. Using block matrix inversion [15], evalu-ates to (40), shown at the bottom of the page, where

is the Schurcomplement of . Since , , , 2 are alldiagonal matrices, no matrix inversion is needed to calculatethe matrix inverse in (40). Using (40), is then found to be

(41)

It follows that is a block matrix that consists of16 diagonal matrices. This means that the number ofentries to be calculated is much lower than for a full matrix. If weproceed further beyond , we will find that the structuresfor and stay the same. The matrix structures areshown in Fig. 7. Table II shows how we can use the structure of

to update its entries. It is worth mentioning that allmatrices in Table II are diagonal. This means that any matrixmultiplication is simply evaluated by scalar multiplications.The structure of the receiver can bee seen in Fig. 8 for the caseof .

C. Multiuser Transmissions

By inspecting the structure of the interference canceller de-scribed in Section II, we find that it successively multiplies thereceived symbols of each user with Alamouti-like matrices inorder to decouple them. We can verify that the overall responseof the interference canceller and the MMSE equalizers, i.e., themapping from the to the has the following form:

......

.... . .

......

(42)

where each , , has an Alamouti-like structuregiven by (27). For the two-user case, the entries of, , ,

2 are given explicitly in (26). However, the explicit knowledgeof the entries of these matrices is not needed for the developmentof the adaptive solution. The adaptive solution proposed hereworks for any , regardless of their entries, as long as theyhave an Alamouti-like structure. Equation (42) can be rewrittenas

diag diag

diag diag

... (43)

where . Equation (43) also indicates that the re-ceiver coefficients can be computed adaptively. We use the RLSalgorithm to update the receiver coefficients every two blocksaccording to the recursions

(44)where

(45)

and

(46)

The initial conditions are and , is a largenumber, and is the identity matrix.is the desired response vector given by (33). The block diagramof the adaptive receiver is shown in Fig. 8. The received signalsfrom both antennas are transformed to the frequency domainusing FFT, and then, the data matrices in (43) areformed and passed through the filters to form the frequency do-main estimates for the two users’ transmitted data .These outputs are transformed back to the time domain usingIFFT, and decision devices are used to generate the receiveroutputs. The receiver operates in a training mode where knowntraining data are used to generate the error vectors and updatethe receiver coefficients until they converge; then, it switchesto a decision-directed mode, where previous decisions are usedto update the receiver coefficients for tracking. For decision-di-rected operation, the reconstructed data are transformed backto frequency domain and compared with the corresponding re-ceiver outputs to generate error vectors. The error vectors areused to update the coefficients according to the RLS algorithm.

(40)

Page 9: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2857

TABLE IIADAPTATION ALGORITHM FOR TWO-USERTRANSMISSIONS

Again, the computational complexity can be significantly re-duced, and matrix inversion can be avoided by exploiting thespecial STBC structure, in a manner similar to what we did inthe two-user case in the previous section. We briefly explainthe rationale behind complexity reduction. Starting with (45) at

, we get

(47)

However, evaluates to

(48)

where is diagonal and given by

diag

Page 10: Efficient adaptive receivers for joint equalization and interference

2858 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

Fig. 8. Adaptive receiver structure for anM -user system with 2M -transmit andM -receive antennas.

Substituting (48) into (47), we find that is given by

......

... (49)

where , , are diagonal. Proceedingfor , it can be verified that has the same form as (39);then, its entries can be computed using (40).is then found tobe

(50)

It follows that is a block matrix that consistsof diagonal matrices. If we proceed further beyond

, we will find that the structures for and staythe same. Table III shows how we can use the structure ofto update its entries. Again, all matrices in Table III arediagonal. This means that any matrix multiplication is simplyevaluated by scalar multiplications.

D. Adaptation for Slowly Varying Channels

The adaptive algorithm in the multiuser case can be furthersimplified without much loss in performance for slowly varyingchannels by forcing to be a diagonal matrix. In this way, theRLS algorithm update (44) can be written as

...

(51)

for . In the light of (43), we can split (51) intoindependent equations given by

(52)

where , . Now, we have filters , ,adapted individually by means of the RLS equations

(53)

where now

for . In this case, RLS uses a modified error signal.The modified error signal for each filter depends on the outputsof other filters. Due to the fact that the filter outputsare far from the correct values at the beginning of the receiveroperation, the algorithm is expected to have slower convergencethan the one given by (44). However, when the channel is slowlyvarying, the algorithm can track channel variations effectively.For cases when the channel changes rapidly, the first algorithmgiven by (44) has superior performance. By comparing this casewith the single-user case, we notice that ,has a structure similar to that of in (19). Then, it can becomputed as follows.

(54)

Page 11: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2859

TABLE IIIADAPTATION ALGORITHM FORM USERS

where

(55)

and

diag (56)

Page 12: Efficient adaptive receivers for joint equalization and interference

2860 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

Fig. 9. Simplified adaptive receiver block diagram for multi-user transmissions.

Therefore, is diagonal as computed via (55) and (56). Theblock diagram of the adaptive receiver with simplified RLS isshown in Fig. 9.

E. Effect of Parameters Selection

Different parameters affect the performance of the adaptivereceivers. A judicious selection of these parameters is neededto achieve best performance. The main factor affecting theparameter selection is how fast the channel changes, which ismeasured by the Doppler frequency. As the Doppler frequencyincreases, the channel changes significantly even within oneblock, which violates the assumption that the channel is fixedover two consecutive blocks. In this case, shorter data blockshave to be used for better algorithm tracking. However, usingsmaller data blocks result in increased system overhead dueto the cyclic prefix attached to each block. It is also useful touse more frequent retraining to prevent divergence although itincreases the system overhead as well. Another parameter is theRLS forgetting factor . By using smaller , we decrease thealgorithm memory and allow faster tracking. Careful selectionof must be used since RLS has numerical problems when

is reduced. To reduce system overhead, we can use smallerblocks for training than the actual data blocks. However, thenumber of equalizer coefficients depend on the block size.We need to calculate the remaining equalizer coefficients (thedifference between date block size and training block size) byinterpolation. It was shown in [12] that smaller training blockscan reduce the system overhead at the expense of a minorloss of performance. The effect of different parameters on thereceiver performance is investigated in Section IV.

IV. SIMULATION RESULTS

In this section, we provide simulation results for the per-formance of the adaptive receivers for STBC. We simulated

three different scenarios including one, two, and three users. Inour simulations, we assumed a symbol rate of 271 kSymbols/s.A typical urban (TU) channel is considered with a linearizedGMSK transmit pulse shape. Furthermore, all channels are as-sumed to be independent. The overall channel impulse responsememory of the channel is . We start with the single-usercase. Two transmit antennas with 8-PSK signal constellation areused. Data blocks of 32 symbols plus three symbols for thecyclic prefix are used. In the following figures, we focus onthe performance of the RLS algorithm and how it is affectedby different parameters. Fig. 10 shows the performance of theRLS algorithm at different Doppler frequencies. One block oftraining data is used every 50 blocks to prevent the divergenceof the algorithm at high Doppler frequencies. It is obvious thatthe BER increases with Doppler frequency as a result of the in-ability of the algorithm to track the faster channel variations.Since the equalizer coefficients are updated on block-by-blockbasis, it is useful to use smaller data blocks. Retraining moreoften can also be a good solution. The effect of the block sizeand the number of blocks before retraining on the BER perfor-mance is shown in Fig. 11. It shows that retraining more oftenand using smaller data blocks can help improve the BER perfor-mance as the Doppler frequency increases. Despite the increaseof the system overhead, dramatic BER reduction was observedas we used 16-symbol data blocks and as we used one retrainingblock every 25 data blocks.

In the two-user and three-user scenarios, the receiver isequipped with two and three receive antennas, respectively.Each user is equipped with two transmit antennas, and 8-PSKsignal constellation is used. Signal-to-interference noise ratio(SINR) is set to 0 dB, i.e., both users are transmitting at thesame power. Data blocks of 32 symbols plus three symbolsfor the cyclic prefix are used. Fig. 12 shows the bit-error-rateperformance of the two-user and three-user systems compared

Page 13: Efficient adaptive receivers for joint equalization and interference

YOUNIS et al.: EFFICIENT ADAPTIVE RECEIVERS FOR JOINT EQUALIZATION AND INTERFERENCE CANCELLATION 2861

Fig. 10. Effect of Doppler frequency on the performance of RLS algorithm.

Fig. 11. Effect of block size and retraining frequency on tracking performance.

Fig. 12. BER performance of the adaptive receiver for two and three users.

with the single-user case at two different Doppler frequencies.From this figure, it is clear that the adaptive interferencecancellation technique can separate co-channel users withoutsacrificing performance. However, at higher Doppler frequen-cies, the scheme of Fig. 8 has a better performance than the oneof Fig. 9. At very high Doppler frequencies, the RLS algorithmmight not be able to track the channel variations. In this case,training more often can improve the system performance at theexpense of increasing system overhead.

V. CONCLUSIONS

Adaptive receivers for space-time block-coded transmissionsare developed for single-user, two-user, and multiuser transmis-sions. These receivers do not require explicit channel knowl-edge or channel estimation at the receiver. The receivers arebased on a modified low-complexity RLS algorithm that ex-ploits the rich structure of STBC to reduce the computationalcomplexity to that of LMS algorithm. Moreover, they are ca-pable of separating co-channel users without sacrificing per-formance or bandwidth. Both training and tracking performanceresults in a time-varying frequency-selective fading environ-ment are presented. The effect of different design parameterson the performance of the receivers is discussed. Different tech-niques for performance improvement and computational com-plexity and system overhead reduction by a better utilizationof the receivers’parameters are also suggested. In a related re-cent work [16], it is shown that the complexity of the receiverscan be further simplified by performing the equalization at aper-frequency-bin level rather than at a block level. in this way,the complexity of the implementation is reduced to ,where is the number of users, and is the block size.

REFERENCES

[1] A. F. Naguib, N. Seshadri, and A. R. Calderbank, “Increasing data rateover wireless channels: space-time coding and signal processing for highdata rate wireless communications,”IEEE Commun. Mag., vol. 17, pp.76–92, May 2000.

[2] J. H. Winters, J. Salz, and R. D. Gitlin, “The impact of antenna diver-sity on the capacity of wireless communication systems,”IEEE Trans.Commun., vol. 42, pp. 1740–1751, Feb. 1994.

[3] A. F. Naguib, N. Seshadri, and A. R. Calderbank, “Applications ofspace-time block codes and interference suppression for high capacityand high data rate wireless systems,” inProc. 32nd Asilomar Conf.Signals, Syst., Comput., Pacific Grove, CA, Nov. 1998, pp. 1803–1810.

[4] S. Alamouti, “A simple transmit diversity technique for wireless com-munications,”IEEE J. Select. Areas Commun., vol. 16, pp. 1451–1458,Oct. 1998.

[5] A. Stamoulis, N. Al-Dhahir, and A. R. Calderbank, “Further results oninterference cancellation and space-time block codes,” inProc. 35thAsilomar Conf. Signals, Syst., Comput., Pacific Grove, CA, Nov. 2001,pp. 257–261.

[6] E. Lindskog and A. Paulraj, “A transmit diversity scheme for delayspread channels,” inProc. Int. Contr. Conf., New Orleans, LA, June2000, pp. 307–311.

[7] Z. Liu, G. Giannakis, A. Scaglione, and S. Barbarossa, “Decoding andequalization of unknown multipath channels based on block precodingand transmit antenna diversity,” inProc. 33rd Asilomar Conf. Signals,Syst., Comput., Pacific Grove, CA, Nov. 1999, pp. 1557–1561.

[8] N. Al-Dhahir, “Single-carrier frequency-domain equalization forspace-time block-coded transmissions over frequency-selective fadingchannels,”IEEE Commun. Lett., vol. 5, pp. 304–306, July 2001.

[9] A. Czylwik, “Comparison between adaptive OFDM and single carriermodulation with frequency domain equalization,” inProc. Veh. Technol.Conf., New York, May 1997, pp. 865–869.

Page 14: Efficient adaptive receivers for joint equalization and interference

2862 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003

[10] A. Oppenheim and R. Schafer,Discrete Time Signal Processing.Englewood Cliffs, NJ: Prentice-Hall, 1989.

[11] A. H. Sayed,Fundamentals of Adaptive Filtering. New York: Wiley,2003.

[12] W. M. Younis, N. Al-Dhahir, and A. H. Sayed, “Adaptive frequency-domain equalization of space-time block-coded transmissions,” inProc.ICASSP, Orlando, FL, May 2002, pp. 2353–2356.

[13] W. M. Younis, A. H. Sayed, and N. Al-Dhahir, “Joint frequency-do-main adaptive equalization and interference cancellation for multi-userspace-time block-coded systems,” inProc. ICASSP, Hong Kong, Apr.2003.

[14] , “Adaptive frequency-domain joint equalization and interferencecancellation for multi-user space-time block-coded systems,” inProc.Int. Contr. Conf., Anchorage, AK, May 2003.

[15] T. Kailath,Linear Systems. Englewood Cliffs, NJ: Prentice-Hall, 1980.[16] W. M. Younis and A. H. Sayed, “Further complexity reduction of

space-time block-coded receivers,” , submitted for publication.

Waleed M. Younis (S’96) was born in Cairo, Egypt,in 1973. He received the B.S. and M.S. degrees inelectrical engineering from Ain Shams University,Cairo, in 1995 and 1998, respectively. SinceSeptember 1999, he has been pursuing the Ph.D.degree in electrical engineering at the University ofCalifornia, Los Angeles.

His research interests include adaptive filtering,equalization for wireless communications, space-time coding, and MIMO communication systems.

Ali. H. Sayed (F’01) received the Ph.D. degree inelectrical engineering in 1992 from Stanford Univer-sity, Stanford, CA.

He is currently Professor and Vice-Chairman ofElectrical Engineering at the University of California,Los Angeles (UCLA). He is also the Principal In-vestigator of the UCLA Adaptive Systems Labora-tory (www.ee.ucla.edu/asl). He has over 190 journaland conference publications, is the author of the text-bookFundamentals of Adaptive Filtering(New York:Wiley, 2003), and is coauthor of the research mono-

graphIndefinite Quadratic Estimation and Control(Philadelphia, PA: SIAM,1999) and of the graduate-level textbookLinear Estimation(Englewood Cliffs,NJ: Prentice-Hall, 2000). He is also co-editor of the volumeFast Reliable Al-gorithms for Matrices with Structure(Philadelphia, PA: SIAM, 1999). He is amember of the editorial boards of theSIAM Journal on Matrix Analysis and ItsApplicationsand of theInternational Journal of Adaptive Control and SignalProcessing. He has served as coeditor of special issues of the journalLinearAlgebra and Its Applications. He has contributed several articles to engineeringand mathematical encyclopedias and handbooks and has served on the programcommittees of several international meetings. He has also consulted with in-dustry in the areas of adaptive filtering, adaptive equalization, and echo cancel-lation. His research interests span several areas including adaptive and statisticalsignal processing, filtering and estimation theories, signal processing for com-munications, interplays between signal processing and control methodologies,system theory, and fast algorithms for large-scale problems.

Dr. Sayed received the 1996 IEEE Donald G. Fink Award, a 2002 Best PaperAward from the IEEE Signal Processing Society in the area of Signal ProcessingTheory and Methods, and co-author of two Best Student Paper awards at inter-national meetings. He is also a member of the technical committees on SignalProcessing Theory and Methods (SPTM) and on Signal Processing for Com-munications (SPCOM), both of the IEEE Signal Processing Society. He is amember of the editorial board of the IEEE SIGNAL PROCESSINGMAGAZINE. Hehas served twice as Associate Editor of the IEEE TRANSACTIONS ONSIGNAL

PROCESSING, of which he is now serving as Editor-in-Chief.

Naofal Al-Dhahir (SM’99) received the M.S. andPh.D. degrees from Stanford University, Stanford,CA, in 1990 and 1994, respectively, both in electricalengineering.

He was an Instructor at Stanford University duringthe Winter of 1993. From August 1994 to July 1999,he was at the General Electric R and D Center, Sch-enectady, NY, where he worked on various aspects ofsatellite communication systems design and anti-jamGPS receivers. From August 1999 to July 2003, hewas a principal member of technical staff at AT&T

Shannon Laboratory, Florham Park, NJ. Since August 2003, he has been an As-sociate Professor with the University of Texas at Dallas, Richardson. His currentresearch interests include equalization schemes, space-time coding and signalprocessing, OFDM, wireless networks, and digital subscriber line technology.He has authored over 45 journal papers and holds ten U.S. patents in the areasof satellite communications, digital television, and space-time processing.

Dr. Al-Dhahir is a a member of the IEEE SP4COM technical com-mittee. He is Editor for IEEE TRANSACTION ON SIGNAL PROCESSING, IEEECOMMUNICATIONS LETTERS, and IEEE TRANSACTIONS ONCOMMUNICATIONS

and will serve as co-chair of the Communication Theory Symposium atGlobecom’04. He is co-author of the bookDoppler Applications for LEOSatellite Systems(Boston, MA: Kluwer, 2001).