Ergodic Capacity Analysis of AF DH MIMO Relay Systems ......dual-hop (DH) amplify-and-forward (AF)...

16
1 Ergodic Capacity Analysis of AF DH MIMO Relay Systems with Residual Transceiver Hardware Impairments: Conventional and Large System Limits Anastasios K. Papazafeiropoulos * , Shree Krishna Sharma , Symeon Chatzinotas , and Björn Ottersten * Institute for Digital Communications, University of Edinburgh, Edinburgh, U.K. SnT - securityandtrust.lu, University of Luxembourg, Luxembourg Email: [email protected], {shree.sharma, symeon.chatzinotas,bjorn.ottersten}@uni.lu Abstract—Despite the inevitable presence of transceiver im- pairments, most prior work on multiple-input multiple-output (MIMO) wireless systems assumes perfect transceiver hardware which is unrealistic in practice. In this direction, motivated by the increasing interest in MIMO relay systems due to their improved spectral efficiency and coverage, this paper investigates the im- pact of residual hardware impairments on the ergodic capacity of dual-hop (DH) amplify-and-forward (AF) MIMO relay systems. Specifically, a thorough characterization of the ergodic channel capacity of DH AF relay systems in the presence of hardware impairments is presented herein for both the finite and large antenna regimes by employing results from finite-dimensional and large random matrix theory, respectively. Regarding the former setting, we derive the exact ergodic capacity as well as closed-form expressions for tight upper and lower bounds. Furthermore, we provide an insightful study for the low signal- to-noise ratio (SNR) regimes. Next, the application of the free probability (FP) theory allows us to study the effects of the hardware impairments in future 5G deployments including a large number of antennas. While these results are obtained for the large system limit, simulations show that the asymptotic results are quite precise even for conventional system dimensions. I. I NTRODUCTION The continuous evolution of cellular networks is led by the rapidly increasing demand for ubiquitous wireless connectivity and spectral efficiency [2]. Initially, multiple-input multiple- output (MIMO) technology emerged by means of the pioneering works of Telatar and Foschini [3], [4] as an enabling technique to implement high data-rate systems based on their impressive capacity scaling in the high signal-to-noise ratio (SNR) regime with the minimum of transmit and receive antennas. In order to characterize such a MIMO wireless link, an information- theoretic capacity has been commonly used as an important figure of merit. Copyright (c) 2015 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending a request to [email protected]. The work of S. Sharma, S. Chatzinotas, and B. Ottersten was partially supported by the projects H2020 SANSA and FNR SATSENT, SEMIGOD, INWIPNET, PROSAT. Also, this work was partially supported by a Marie Curie Intra European Fellowship within the 7th European Community Framework Programme for Research of the European Commission under grant agreement no. [330806], IAWICOM. Parts of this work were presented at the IEEE Global Communications Conference (GLOBECOM 2015) [1]. Following this direction, the massive MIMO paradigm, originating from [5], is a new network architecture, and is also known as large-scale antenna systems or very large MIMO. In this paradigm, the number of antennas of each base station (BS) and the number of users per BS are unconventionally large but they differ by a factor of two or four or even an order of the magnitude. Interestingly, this technology brings numerous advantages such as unprecedented spatial degrees- of-freedom enabling large capacity gains from coherent recep- tion/transmit beamforming, resilience to intra-user interference and thermal noise, and the ease of implementation because of low-complexity signal processing algorithms [6], [7]. The massive MIMO wireless systems are attractive (cost- efficient) for a network deployment, but only if the antenna elements consist of inexpensive hardware components. How- ever, most of the research contributions are based on the strong assumption of the perfect hardware and this assumption is quite idealistic in practice. In reality, besides the effect of wireless fading channel, both the transmitted and received baseband signals are also affected by the unavoidable imperfections of the transceiver hardware components. In fact, the lower the quality of the transceiver hardware, the higher the impact of occurring impairments on the performance of the system. Specifically, sev- eral phenomena exist that constitute additive or multiplicative hardware impairments e.g., in-phase/quadrature-phase (I/Q)- imbalance [8], high power amplifier non-linearities [9], and oscillator phase noise (PN) [10]. Even though the performance degradation caused due to such hardware impairments can be partially mitigated by means of suitable calibration schemes at the transmitter or compensation algorithms at the receiver, there still remains a certain amount of unaccounted distortion caused due to residual hardware impairments. The main reasons for these residual impairments are imperfect parameters estimation due to the randomness and the time variation of the hardware characteristics, inaccurate models because of the limited precision, unsophisticated compensation algorithms, etc [11]. The system model, including the additive hardware impairments, has been proposed in [11]–[15], in order to describe the aggregate effect from many impairments. These impairments are modeled as independent additive distortion noises at the BS as well as at the user. The adoption of this

Transcript of Ergodic Capacity Analysis of AF DH MIMO Relay Systems ......dual-hop (DH) amplify-and-forward (AF)...

  • 1

    Ergodic Capacity Analysis of AF DH MIMO RelaySystems with Residual Transceiver HardwareImpairments: Conventional and Large System

    LimitsAnastasios K. Papazafeiropoulos∗, Shree Krishna Sharma†, Symeon Chatzinotas†, and Björn Ottersten†

    ∗Institute for Digital Communications, University of Edinburgh, Edinburgh, U.K.†SnT - securityandtrust.lu, University of Luxembourg, Luxembourg

    Email: [email protected], {shree.sharma, symeon.chatzinotas,bjorn.ottersten}@uni.lu

    Abstract—Despite the inevitable presence of transceiver im-pairments, most prior work on multiple-input multiple-output(MIMO) wireless systems assumes perfect transceiver hardwarewhich is unrealistic in practice. In this direction, motivated by theincreasing interest in MIMO relay systems due to their improvedspectral efficiency and coverage, this paper investigates the im-pact of residual hardware impairments on the ergodic capacity ofdual-hop (DH) amplify-and-forward (AF) MIMO relay systems.Specifically, a thorough characterization of the ergodic channelcapacity of DH AF relay systems in the presence of hardwareimpairments is presented herein for both the finite and largeantenna regimes by employing results from finite-dimensionaland large random matrix theory, respectively. Regarding theformer setting, we derive the exact ergodic capacity as wellas closed-form expressions for tight upper and lower bounds.Furthermore, we provide an insightful study for the low signal-to-noise ratio (SNR) regimes. Next, the application of the freeprobability (FP) theory allows us to study the effects of thehardware impairments in future 5G deployments including alarge number of antennas. While these results are obtained for thelarge system limit, simulations show that the asymptotic resultsare quite precise even for conventional system dimensions.

    I. INTRODUCTIONThe continuous evolution of cellular networks is led by the

    rapidly increasing demand for ubiquitous wireless connectivityand spectral efficiency [2]. Initially, multiple-input multiple-output (MIMO) technology emerged by means of the pioneeringworks of Telatar and Foschini [3], [4] as an enabling techniqueto implement high data-rate systems based on their impressivecapacity scaling in the high signal-to-noise ratio (SNR) regimewith the minimum of transmit and receive antennas. In orderto characterize such a MIMO wireless link, an information-theoretic capacity has been commonly used as an importantfigure of merit.

    Copyright (c) 2015 IEEE. Personal use of this material is permitted. However,permission to use this material for any other purposes must be obtained fromthe IEEE by sending a request to [email protected].

    The work of S. Sharma, S. Chatzinotas, and B. Ottersten was partiallysupported by the projects H2020 SANSA and FNR SATSENT, SEMIGOD,INWIPNET, PROSAT. Also, this work was partially supported by a Marie CurieIntra European Fellowship within the 7th European Community FrameworkProgramme for Research of the European Commission under grant agreementno. [330806], IAWICOM.

    Parts of this work were presented at the IEEE Global CommunicationsConference (GLOBECOM 2015) [1].

    Following this direction, the massive MIMO paradigm,originating from [5], is a new network architecture, and is alsoknown as large-scale antenna systems or very large MIMO. Inthis paradigm, the number of antennas of each base station(BS) and the number of users per BS are unconventionallylarge but they differ by a factor of two or four or even anorder of the magnitude. Interestingly, this technology bringsnumerous advantages such as unprecedented spatial degrees-of-freedom enabling large capacity gains from coherent recep-tion/transmit beamforming, resilience to intra-user interferenceand thermal noise, and the ease of implementation because oflow-complexity signal processing algorithms [6], [7].

    The massive MIMO wireless systems are attractive (cost-efficient) for a network deployment, but only if the antennaelements consist of inexpensive hardware components. How-ever, most of the research contributions are based on the strongassumption of the perfect hardware and this assumption is quiteidealistic in practice. In reality, besides the effect of wirelessfading channel, both the transmitted and received basebandsignals are also affected by the unavoidable imperfections of thetransceiver hardware components. In fact, the lower the qualityof the transceiver hardware, the higher the impact of occurringimpairments on the performance of the system. Specifically, sev-eral phenomena exist that constitute additive or multiplicativehardware impairments e.g., in-phase/quadrature-phase (I/Q)-imbalance [8], high power amplifier non-linearities [9], andoscillator phase noise (PN) [10]. Even though the performancedegradation caused due to such hardware impairments can bepartially mitigated by means of suitable calibration schemesat the transmitter or compensation algorithms at the receiver,there still remains a certain amount of unaccounted distortioncaused due to residual hardware impairments. The mainreasons for these residual impairments are imperfect parametersestimation due to the randomness and the time variation ofthe hardware characteristics, inaccurate models because of thelimited precision, unsophisticated compensation algorithms,etc [11]. The system model, including the additive hardwareimpairments, has been proposed in [11]–[15], in order todescribe the aggregate effect from many impairments. Theseimpairments are modeled as independent additive distortionnoises at the BS as well as at the user. The adoption of this

  • 2

    model herein is based on its analytical tractability and theexperimental verifications in [15]. On the other hand, thereare hardware impairments multiplied with the channel vector,which might cause channel attenuations and phase shifts. Inthe case of slow variation of these impairments, they can becharacterized as sufficiently static, and thus, can be included inthe channel vector by an appropriate scaling of its covariancematrix or due to the property of circular symmetry of thechannel distribution. However, if the impairment such as thephase noise accumulates within the channel coherence period,it causes a multiplicative distortion that cannot be incorporatedby the channel vector [10], [12], [13].

    Recently, the topic concerning the investigation of theimpact of radio-frequency (RF) impairments on wirelesscommunication systems has attracted a tremendous amount ofattention with a growing interest in the direction of the studyof their effects on conventional MIMO systems (see [8]–[10],[14]–[18] and the references therein) and, more lately, on largeMIMO systems [11]–[13], [19]–[21]. For instance, experimentalresults modeling the residual hardware impairments only atthe transmitter and the study of their impacts on certainMIMO detectors such as zero-forcing took place in [15]. Moreinterestingly, regarding the channel capacity, [18] elaboratedon the derivation of high signal-to-noise ratio (SNR) bounds byconsidering only transmitter impairments for MIMO systemswith a finite number of antennas, while in [11], the authorsextended the analysis to arbitrary SNR values, but mostimportantly, by including receiver impairments. Unfortunately,the imposed fundamental capacity ceiling resulted due to thepresence of the impairments becomes more restrictive in higherrate systems such as large MIMO systems in which increasingthe transmit power cannot be any more beneficial. In addition,the authors in [11]–[13], [18], [22] showed that the transceiverhardware impairments result in a channel estimation error anda capacity ceiling even for the case of massive MIMO systems.

    Despite that relay systems have received important researchattention since they realize the performance gains of wirelesssystems cost-efficiently by means of coverage extension anduniform quality of service, only a few works have tackled theeffect of hardware impairments [22]–[25]. In particular, dualhop (DH) amplify-and-forward (AF) relay systems have beenextensively studied for the cases of both conventional and largeMIMO systems [26]–[28], but the relevant studies of the relaysystems considering the effects of hardware impairments arequite limited in the literature. Actually, the authors in [22],[24] provided the most noteworthy works by considering thesekinds of impairments in one-way and two-way DH AF relayingsystems, respectively. In particular, they considered only theoutage probability and simple capacity upper bounds for thesimplistic case of single antenna systems. However, a thoroughanalysis of the capacity of the relay systems with hardwareimpairments for the case of multiple antennas is still lackingfrom the literature. Given the importance of relay systems, theirimportance in future 5G (fifth generation) deployments is un-deniable. Specifically, relay systems are going to be employedin 5G systems by considering a large number of antennas ateach node . Furthermore, their application in heterogeneounsnetworks (HetNets) is unquestionable. Please note also that

    massive MIMO and HetNets can be combined, which makesthe study of large MIMO relay systems very contributory [29],[30]. To this end, this work focuses on the effect of additivetransceiver impairments on the ergodic capacity of the DH AFMIMO relay systems and the performance characterization dueto multiplicative impairments is the subject of the future work.

    In this paper, we acknowledge that the manifestation ofRF impairments becomes more significant in high data-ratessystems such as MIMO or more strikingly, in the future largeMIMO systems which offer a higher capacity. Nevertheless,taking into account that the deployments of both relays anda large number of antennas in a massive MIMO system aredesirable to be cost-efficient, the transceiver components shouldbe inexpensive (low quality), which makes the overall systemperformance more prone to impairments. In this context, theauthors in [25] recently studied the impact of transceiverimpairments on massive MIMO relaying systems but thework focused only on the large-antenna regime consideringdecode and forward (DF) relaying policy. To the best of ourknowledge, there appear to be no analytical ergodic capacityresults applied to DH AF MIMO systems with an arbitrarynumber of antennas and the relaying configurations in thepresence of hardware impairments. Only SISO works had beenstudied. Note that AF systems are preferable with comparisonto DF from the aspect of the ease of implementation, whichis crucial in the design of 5G systems demanding denseHetNets and low cost manufacture. As a result, it is of greatinterest and necessity to investigate the impact of residualadditive hardware impairments on DH AF relay systems withboth finite and infinite number of antennas by providing adetailed performance analysis. Especially, unlike [22] whichonly deals with single antenna relay systems, this work providesa complete performance characterization by investigating theimpact of transceiver impairments on the ergodic capacity of DFAF MIMO relay systems for both conventional (finite) and largesystem (infinite number of antennas) regimes. Especially, itmight be the first work presenting together finite and asymptoticresults bridging the gap between the two analyses and offeringand extra validation. None work till now has considered thehardware impairments in DH AF relay channels with largeMIMO or even conventional MIMO systems. Moreover, weprovide an exact theoretical expression for the ergodic capacityof the considered system instead of only the bounds. In thisdirection, the contributions of this paper can be summarizedas follows

    • Given an ideal AF MIMO dual-hop system model, wheremultiple single-antenna users (source) communicate witha BS (destination) through a relay without the sourceto destination link, we introduce a realistic model forMIMO relay systems incorporating the inevitable residualadditive hardware transceiver impairments by taking intoaccount the generalized model of [14], [15], [22]. Infact, contrary to the existing works which studied theperformance degradation due to separate single sources [8],[14], we follow an overall approach of examining theaccumulated impact of the additive hardware impairments.

    • Based on the aforementioned system model and employing

  • 3

    results from [26], we investigate the impact of RFhardware impairments on the ergodic channel capacitywhen both the relay and the BS are deployed with a finitenumber of antennas. Notably, we show that a saturationof the ergodic capacity, appearing at high SNR due toadditive hardware impairments, takes also place in DH-AFrelay MIMO channels. In particular, the study takes placeby means of derivation of a new exact analytical result,simple closed-form low-SNR expressions, and the closed-form tight upper and lower bounds on the ergodic capacityin terms of easily computed standard functions, whichprovide insightful outcomes regarding the characterizationof the degrading effects due to the residual hardwareimperfections.

    • Using tools from large random matrix theory (RMT) andcontrary to the existing literature that usually employs adeterministic equivalent analysis, we follow a differentline of realizing mathematical derivations by pursuinga free probability (FP) analysis [31]. Advantageously,the FP requires just a polynomial solution instead offixed-point equations and allows us to provide a thoroughcharacterization of the impact of residual transceiverimpairments on the capacity of DH AF systems in the largesystem limit. One of the most interesting outcomes of thepaper is the demonstration that the results coming fromboth conventional and large-antenna analyses coincide forthe conventional number of antennas.

    The remainder of this paper is structured as follows:Section II presents the signal and system models for relaysystems with multiple antennas for the cases of both idealand imperfect hardware. In Section III, we pursue a completeconventional random matrix theory analysis for DH AF MIMOchannels by means of a new analytical result for the ergodiccapacity, simple closed-form low-SNR expressions, and theclosed-form tight upper and lower bounds in the presence ofhardware impairments. Covering the need for the investigationof the ergodic capacity in the case of large MIMO systems, weemploy a FP analysis in Section IV. The numerical results areplaced in Section V, while Section VI summarizes the paper.

    Notation: Vectors and matrices are denoted by boldfacelower and upper case symbols. (·)T, (·)H, and tr(·) represent thetranspose, Hermitian transpose, and trace operators, respectively.The expectation operator, as well as the adjugate and thedeterminant of a matrix are denoted by E [·], as well asadj(·) and det(·), respectively. The diag{·} operator generatesa diagonal matrix from a given vector, and the symbol, declares definition. The notations CM and CM×N referto complex M -dimensional vectors and M × N matrices,respectively. Finally, b ∼ CN (0,Σ) denotes a circularlysymmetric complex Gaussian with zero-mean and covariancematrix Σ, and (·)+ signifies the positive part of its argument.Note that El (·) is the exponential integral function of orderl [32, Eq. 8.211.1], Kv (·) is the modified Bessel function ofthe second kind [32, Eq. 8.432.6], U(a, b, z) is the Tricomiconfluent hypergeometric function [33, Eq.07.33.02.0001.01]],and Wλ,µ(·) is the Whittaker function [34, Eq. 13.1.33].

    II. SYSTEM MODELWe consider a DH AF relay system with ideal transceiver

    hardware as illustrated in Fig. 1(a). We assign the subscript1 for the parameters between the source and the relay, andthe subscript 2 for the parameters describing the second hop,i.e., the link between the relay and the destination. Whileour analysis concerning a DH AF system comprised by threenodes (the source, a relay, and the destination) with multiplenumber of antennas is rather general, we exemplify a scenarioof practical interest without loss of any generality. Specifically,we assume that both the relay and the BS (destination) arecompact infrastructures with multiple antennas, but regardingthe source node, we assume that it consists of a number ofsingle-antenna users playing the same role as a single compactnode with an equivalent number of antennas1. Hence, K singleantenna non-cooperative users, trying to reach a distant N -antennas BS, communicate first with an intermediate relayincluding an array of M antennas (first hop). In other words,a single-input multiple-output (SIMO) multiple access channel(MAC) (users-relay) is followed by a point to point MIMOchannel (relay-BS). The BS is assumed to be aware of the totalsystem channel state information (CSI)2, while both the usersand the relay have no CSI knowledge during their transmissions.

    The mathematical representation of this model is expressedas

    y1 = H1x1 + z1, (1)

    y2 = H2√νy1 + z2

    =√νH2H1x1 +

    √νH2z1 + z2, (2)

    where the first and second equations correspond to the first(users-relay) and second (relay-BS) input-output signal models,respectively. In particular, x1 ∈ CK×1 is the Gaussian vectorof symbols simultaneously transmitted by the K users withE [x1xH1] = Q1 = µIK with µ =

    ρK , where ρ is the system

    power. H1 ∈ CM×K ∼ CN (0, IM ⊗ IK) is the concatenatedchannel matrix between the K users and the relay exhibitingflat-fading, while H2 ∈ CN×M ∼ CN (0, IN ⊗ IM ) describesthe channel matrix between the relay and the BS. In other words,both the first and second hops exhibit Rayleigh fast-fadingand are expressed by Gaussian matrices with independentidentically distributed (i.i.d.) complex circularly symmetricelements. In addition, y1 and y2 as well as z1 ∼ CN (0, IM )and z2 ∼ CN (0, IN ) denote the received signals as well asthe additive white Gaussian noise (AWGN) vectors at therelay and BS, respectively, i.e., the user signal-to-noise-ratio(SNR) equals to µ. Note that before forwarding the receivedsignal y1 at the relay, we consider that it is amplified byν = αM(1+ρ) , where we have placed a per relay-antenna fixedpower constraint αM with α being the total gain of the relay,i.e., E

    [‖√νy1‖2

    ]≤ α, with the expectation taken over signal,

    noise, and channel fading realizations. For the performancecharacterization in this paper, we assume a fixed relay gain.

    In practice, the transmitter, the relay, and the BS areaffected by certain inevitable additive impairments such as

    1This example assumes that the users are in close proximity, in order torealize the same channel as a multi-antenna source.

    2The effect of imperfect CSI will be taken into account for future work.

  • 4

    1

    User K

    K users

    x1

    xK

    x2

    AF Relay(M antennas)Noise

    z1

    y1

    x2

    BS (N antennas)Noise

    z2

    y2

    Channel 1 Channel 2

    (a)

    1

    User K

    K users Transmitterdistrortion

    t1x1

    xK

    x2

    AF Relay(M antennas)

    Noisez

    1

    y1

    x2

    Transmitterdistrortion

    t2

    BS (N antennas)Noise

    z2

    y2

    Receiverdistrortion

    r1

    Receiverdistrortion

    r2

    Channel 1 Channel 2

    (b)Fig. 1. (a) Conventional AF DH relay MIMO system with ideal transceiverhardware. (b) Generalized AF DH relay MIMO system with residual additivetransmitter and receiver hardware impairments.

    I/Q imbalance [14]. Although mitigation schemes can beincorporated in both the transmitter and receiver, residualimpairments still emerge by means of additive distortionnoises [14], [15]. In our system model, an important attentionshould be given to the relay which plays two distinctive roles.In the first hop, it operates as a receiver, while it becomes thetransmitter of the second hop. Taking this into consideration, ineach node of the system, a transmit and/or receive impairmentexists which may cause a mismatch between the intended signaland what is actually transmitted during the transmit processingand/or a distortion of the received signal at the destination.

    Introduction of the residual additive transceiver impairmentsto (1) and (2) provides the following general channel modelsfor the respective links3

    y1 =H1(x1+ηt1)+ηr1 +z1, (3)

    y2 =H2(√νy1+ηt2

    )+ηr2 + z2

    =√νH2H1(x1+ηt1)+H2

    (√ν(ηr1+z1)+ηt2

    )+ηr2 +z2, (4)

    where the additive terms ηti and ηri for i = 1, 2 are thedistortion noises coming from the residual impairments in thetransmitter and receiver of the link i, respectively. Interestingly,this model (depicted in Fig. 1(b)) allows us to investigatethe impact of the additive residual transceiver impairments,described in [14], [15], on a DH AF system. Generally, thetransmitter and the receiver distortion noises for the ith link aremodeled as Gaussian distributed, where their average poweris proportional to the average signal power, as shown bymeasurement results [15]. Mathematically speaking, we have

    ηti ∼ CN (0, δ2tidiag (qi1 , . . . , qTi)), (5)ηri ∼ CN (0, δ2ri tr(Qi) IRi), (6)

    with Ti and Ri being the numbers of transmit and receiveantennas of link i, i.e., T1 = K, T2 = M and R1 =

    3The same model has been used in [35] for the investigation of the residualadditive transceiver hardware impairments on the minimum mean square error(MMSE) filtering performance of a DH-AF (MIMO) wireless system in thelarge number of antennas regime.

    M, R2 = N , while Qi is the transmit covariance matrixof the corresponding link with diagonal elements qi1 , . . . , qTi .Note that the circularly-symmetric complex Gaussianity can bejustified by the aggregate contribution of many impairments.Moreover, δ2ti and δ

    2ri are proportionality parameters describing

    the severity of the residual impairments in the transmitter andthe receiver of link i. In practical applications, these parametersappear as the error vector magnitudes (EVM) at each transceiverside [2]. Obviously, as far as the first hop is concerned, theadditive transceiver impairments are expressed as

    ηt1 ∼ CN (0, δ2t1µIK), (7)ηr1 ∼ CN (0, δ2r1ρIM ). (8)

    Given that the input signal for the second hop is√νyi, the

    corresponding input covariance matrix is

    Q2 = νE [y1yH1] = νK(µ+ δ2t1µ+ δ

    2r1µ+

    1

    K

    )IM

    = µ̃νKIM , (9)

    where µ̃ =(µ+ δ2t1µ+ δ

    2r1µ+

    1K

    ). Note that now, ν = αKMµ̃ ,

    after accounting for fixed gain relaying. Thus, the additivetransceiver impairments for the second hop take the form4

    ηt2 ∼ CN (0, δ2t2µ̃νKIM ), (10)ηr2 ∼ CN (0, δ2r2 µ̃νKMIN ). (11)

    In particular, taking (4) into consideration, the capacity ofthe considered channel model is described by the followingproposition.

    Proposition 1: The ergodic capacity of a DH AF systemin the presence of i.i.d. Rayleigh fading with residual addi-tive transceiver hardware impairments under per user powerconstraints [Q1]k,k ≤ µ,∀k = 1 . . .K and (9) is given by

    C = E[ln det

    (IN +

    µν

    BH2H1H

    H

    1HH

    2Φ−1)]

    (12)

    = E[ln det

    (Φ +

    µν

    BH2H1H

    H

    1H̄H

    2

    )]︸ ︷︷ ︸

    C1

    −E [ln det(Φ)]︸ ︷︷ ︸C2

    , (13)

    where Φ = f2H2H1HH1HH2 + f3H2H

    H2 + IN with B =

    δ2r2 µ̃νKM+1, f1 = f̃1f3, f2 =f4δ2t1 , f3 =

    ν(δ2t2 µ̃K+δ2r1µK+1)

    B ,f4 =

    µνB , and f̃1 =

    f2+f4f3

    .Proof: Given any channel realizations H1,H2 and transmit

    signal covariance matrices Q1 and Q2 at the user and relaysides, a close observation of (4) shows that it is an instance ofthe standard DH AF system model described by (2), but witha different noise covariance given by

    Φ = νδ2t1H2H1diag (q11 , . . . , qK) HH

    1HH

    2

    + H2((νδ2r1 trQ1 + ν

    )IN + νδ

    2t2diag (q11 , . . . , qM )

    )HH2

    +(δ2r2 trQ2 + 1

    )IN . (14)

    Taking into account for the optimality of the input signal x1since it is Gaussian distributed with the covariance matrixQ1 = µIK , the proof is conluded.

    4The presence of impairments at the relay node signifies two differentdistortion noises ηr1 and ηt2 , where the latter one together with the distortionnoise ηr2 , occuring at the BS, have been amplified during the second hop.

  • 5

    The above proposition (Proposition 1) allows us to investigatethe impact of the additive transceiver impairments in DHAF systems for the cases of both finite and infinite systemdimensions.

    Remark 1: Interestingly, C1 represents the mutual informa-tion due to relaying and additive transceiver impairments, whilethe physical meaning of C2 describes the loss due to noiseamplification.

    Remark 2: Despite the resemblance of the ergodic capacitywith transceiver impairments, given by (12), with the con-ventional ergodic capacity of a DH AF system with idealhardware [26, Eq. 2], this paper shows the fundamentaldifferences that arise because the noise covariance matrix nowdepends on the product of the channel matrices H1 and H2,as shown by (14). As a result, it is non-trivial to provide thegeneralizations of the previous works on AF systems with idealhardware to the case of transceiver impairments.

    Subsequently, employing the property det(I + AB) =det(I + BA), the expressions for C1 and C2, which denotethe ergodic capacity per receive antenna with Ci = 1NCi fori = 1, 2, can be alternatively written as

    C1 =1

    NE[ln det

    (IM+f3H

    H

    2H2

    (IM+f̃1H1H

    H

    1

    ))](15)

    C2 =1

    NE[ln det

    (IM+f3H

    H

    2H2

    (IM+

    f2f3

    H1HH

    1

    ))]. (16)

    III. ERGODIC CAPACITY ANALYSIS-FINITE NUMBER OFANTENNAS

    This section presents analytical results regarding the ergodiccapacity of AF MIMO dual-hop systems under additivetransceiver impairments. Well known results under the idealassumption of perfect hardware, given in [26], are generalizedby including the practical consideration of the imperfecttransceiver hardware. It should be stressed that these resultscannot be elicited from the prior works for several reasons suchas that the noise covariance matrix depends on the product termH2H1. Specifically, the following theorem is the key resultof this section. We define p , max (M,N), q , min (M,N),s , min (K, q), u = p− q− 1, v = p+ q− 1, and q̃ = K + q,since these will be often used in our analysis.

    A. Exact Expression for Ergodic Capacity

    Theorem 1: The per receive antenna capacity of a DH AFsystem in the presence of i.i.d. Rayleigh fading channels withadditive transceiver impairments in the case of a finite numberof transmit users K as well as relay and BS antennas (M andN ) is given by

    C =2KN

    q∑l=1

    q∑k=q−s+1

    q+K−l∑i=0

    (q̃−li

    )f q̃−l−i3

    Γ(K − q + k)Gl,kIi,k, (17)

    where K and Gl,k are defined in Lemma 1 (Appendix A) and

    Ii,k =∫ ∞

    0

    ln

    (1 + f1λ

    1 + f2λ

    )e−λf3λ(2K+2k+u−i−2)/2

    ×Kv−i(

    2√λ)

    dλ, (18)

    and λ denotes the unordered eigenvalue of Z = H̃1LH̃H1.Proof: See Appendix B.

    Interestingly, Theorem 1 extends the result in [26, Eq. 39]that did not consider any aggregate hardware impairments.

    Remark 3: Given that in the second part, presenting theasymptotic behavior with respect to the number of antennas,we have derived the per-antenna capacity because our desireis to illustrate a consistency between the two regimes. As aresult, by providing the per-antenna capacity we can validatethe theoretical expressions corresponding to different numbersof antennas regimes. Actually, this is one of the contributionsof our manuscript; the bridging between the two regimes andthe depiction that the asymptotic results are tight even for theconventional number of antennas. It is worthwhile to mentionthat the per receive antenna capacity (1/N ) does not restrictus to evaluate the dependence on N .

    B. Low SNR Regime

    Despite the exact expression for the capacity of a DH AFchannel with additive transceiver impairments, obtained byTheorem 1, its lack to reveal insightful conclusions by meansof the dependencies with the various system parameters such asthe number of BS and relay antennas as well as the inevitablehardware impairments leads to the need for investigation ofthe low power regime5. Thus, in the case of low SNR, weconsider a meaningful scenario. Specifically, we let the userpower tend to zero (µ = 0), while the relay transmit power iskept constant. Note that if we let the relay power tend to zero,no transmission is possible. For the sake of simplification ofvarious expressions, we denote δ̃2t1 = δ

    2t1 + 1, δ̃

    2t2 = δ

    2t2 + 1,

    δ̃2r2 = αδ2r2 + 1, δ̃

    2tr1 = δ

    2r1 + δ

    2t1 + 1, and δ̃

    2tr1,ρ = ρδ̃

    2tr1 + 1.

    In the following, depending on the case and based on Jensen’sinequality, we are able to change the order of expectation andthe limit or derivative by applying the dominated convergencetheorem [36], since the term inside the expectation is upperbounded by an integrable function.

    It is known that the capacity in this region is well approxi-mated as [37]

    C

    (EbN0

    )≈ S0 ln

    (EbN0Eb

    N0min

    ), (19)

    where the two key element parameters EbN0min and S0 representthe minimum transmit energy per information bit and thewideband slope, respectively. In particular, we can expressthem in terms of the first and second derivatives of C (ρ) as

    EbN0min

    = limρ→0

    ρ

    C (ρ)=

    1

    Ċ (0), (20)

    S0 = −2[Ċ (0)

    ]2C̈ (0)

    ln2. (21)

    Theorem 2: In the low-SNR regime, the minimum transmitenergy per information bit, EbN0min , and the wideband slopeS0, of a DH AF channel in the presence of i.i.d. Rayleighfading channels subject to additive transceiver impairments, are

    5The high SNR case is not included in this paper due to space limitations.

  • 6

    given by (22) and (23), where E [λ]∣∣∣∣ρ=0

    and (E [λ])′∣∣∣∣ρ=0

    are

    given in Appendix C with λ being the unordered eigenvalueof Z = H̃1LH̃H1.

    Proof: See Appendix C.

    C. Tight Lower and Upper Bounds of the Ergodic Capacity

    Having in mind that the ergodic capacity, obtained in (17),can be calculated only numerically, in this section, we provideupper and lower tight bounds of the ergodic capacity intractable closed forms that can describe the easier behavior ofC. Obviously, their importance is indisputable because theyshed light on interesting properties regarding the impact ofresidual impairments on the ergodic capacity.

    Theorem 3: The ergodic capacity of AF MIMO dual-hopsystems with i.i.d. Rayleigh fading channels under additivetransceiver impairments can be upper and lower bounded by

    CU = C̃U,1 − C̃L,2 and CL = C̃L,1 − C̃U,2, (24)

    where C̃U,i, C̃L,i for i = 1, 2 are given by (49), (51).Proof: See Appendix D.

    IV. ASYMPTOTIC PERFORMANCE ANALYSIS

    Since the cost-efficiency of the massive MIMO technologydepends on the application of inexpensive hardware and suchinexpensive hardware components will make the deleteriouseffect of the residual impairments more pronounced, it is ofpivotal importance to study the ergodic capacity in the largesystem regime, i.e., users, relays, and the BS equipped with alarge number of antennas. Hence, this section presents the mainresults regarding the system performance in the large-antennaregime. Moreover, we account also for the scenario where thenumber of users increases infinitely. However, note that boththe system power ρ and the total gain of the relay α are keptfinite 6.

    A. Main Results

    Given that our interest is focused on channel matrices withdimensions tending to infinity, we employ tools from thelarge RMT. Among the advantages of the ensuing analysis,we mention the achievement of deterministic results thatmake Monte Carlo simulations unnecessary. Moreover, theasymptotic analysis can be quite accurate even for realisticsystem dimensions, while its convergence is rather fast as thechannel matrices grow larger. Under ideal hardware, the ergodiccapacity was obtained in [39], however, the deduction to thecase with RF impairments is not trivial, as stated in Section-III.Thus, after defining β , KM and γ ,

    NM , the channel capacity

    of the system under study is given by the following theorem.

    6These power constraints justify the power normalizations during thedescription of the system model, in order to keep the system power andthe total gain of the relay finite, while the dimensions grow infinite. Especially,this is requisite because the capacity would grow to infinity if the system powerwas infinite. Furthermore, it should be noted that this is a constraint which isapplied to the per-user power, but it can be straightforwardly expressed as anormalization on the system power. Finally, it should be noted that this is awell-accepted model already employed in a plethora of published papers suchas [26], [27], [38].

    Theorem 4: The capacity of a DH AF MIMO system inthe presence of i.i.d. Rayleigh fading channels with additivetransceiver impairments, when the numbers of transmit usersK as well as relay and BS antennas (M and N ) tend to infinitywith a given ratio, is given by

    C→ 1γ

    ∫ ∞0

    ln(

    1+f̃3x)(f∞Kf1/M

    (x)−f∞Kf2f3

    /M (x)

    )dx, (25)

    where f̃3 =α(1+δ̃2r1ρ+δ̃

    2t2δ̃2tr1)ρ

    δ̃2r2(1+δ̃2tr1

    )ρ, Kf1 = H

    H2H2 (IM + f1

    ×H1HH1 ) and K f2f3

    = HH2H2(IM + f2/f3H1H

    H1

    ), while

    the asymptotic eigenvalue probability density functions(a.e.p.d.f.) of Kf1/M and Kf2/f3/M , f

    ∞Kf1/M

    and f∞Kf2f3

    /M ,

    respectively, are obtained by the imaginary part of theircorresponding Stieltjes transforms S for real arguments.

    Proof: See Appendix E.

    B. Special Cases

    Herein, we investigate the ergodic capacity of the DH AFsystem with additive impairments, when its dimensions growin turn larger without a bound.

    Proposition 2: The ergodic capacity of the DH AF systemwith additive impairments, when the number of users K tendsto infinity, reduces to

    limK→∞

    C=E

    ln det Iq+

    αδ̃2t2Mδ̃2r2

    H2HH2

    Iq+α(δ2t2

    δ̃2tr1,ρ+δ2r1

    ρ+δ2t1ρ+1)

    Mδ̃2r2δ̃2tr1,ρ

    H2HH2

    .(26)

    Proof: See Appendix F-A.Proposition 3: The ergodic capacity of the DH AF system

    subject to additive impairments, when the number of relayantennas M tends to infinity, reduces to (27), where H̃1 ∼CN (0, IK ⊗ Iq).

    Proof: See Appendix F-B.Proposition 4: The ergodic capacity of the DH AF system

    subject to additive impairments, when the number of BSantennas N tends to infinity, is given by

    limN→∞

    C=E

    ln detIK+

    δ̃2t1ρ

    K(δ2t2δ̃2tr1,ρ

    +δ2r1ρ+1)

    H̃1H̃H1

    IK+δ2t1

    ρ

    K(δ2t2δ̃2tr1,ρ

    +δ2r1ρ+1)

    H̃1H̃H1

    , (28)

    where H̃1 ∼ CN (0, IK ⊗ Iq).Proof: See Appendix F-C.

    These propositions generalize the results presented in [26,Eqs. 42–44] for arbitrary K, M , and N configurations underthe unavoidable presence of additive transceiver impairments.Notably, these impairments affect the asymptotic limits, whileour proposed expressions reduced to (42)− (44) in [26] forthe ideal case where perfect transceiver hardware is assumed.Especially, in the high-SNR regime, we witness a hardeningeffect due to the various imperfections affecting differentlythe ergodic capacity. For example, when K → ∞, and thesecond hop impairments are negligible, i.e., (δ2t2 = δ

    2r2 = 0),

    the nominator of (26) does not depend on any impairments, and

  • 7

    EbN0min

    =KMδ̃2r2

    αsE [λ]∣∣∣∣ρ=0

    ln 2 (22)

    S0 =2K4M4δ̃4r2

    α3s3E2 [λ]∣∣∣∣ρ=0

    (2KMδ̃2r2

    (E [λ]

    ∣∣∣∣ρ=0

    δ̃2tr1 − (E [λ])′∣∣∣∣ρ=0

    )− α(2δ2t1 + 1)E [λ2]

    ∣∣∣∣ρ=0

    )

    (23)

    limM→∞

    C = E

    ln detIK+

    αδ̃2t1ρ

    K(δ̃2r2δ̃2tr1,ρ

    +α(δ̃2t2δ̃2tr1,ρ

    +δ2r1ρ+1))

    H̃1H̃H1

    IK+αδ2t1

    ρ

    K(δ̃2r2δ̃2tr1,ρ

    +α(δ̃2t2δ̃2tr1,ρ

    +δ2r1ρ+1))

    H̃1H̃H1

    . (27)

    it reduces to the ergodic capacity of a conventional single-hopi.i.d. Rayleigh fading MIMO channel with the transmit power α,and M transmit antennas. Another interesting intuitive outcomecomes from Proposition 3, when the relay transmit power αdiminishes to zero and the user transmit power ρ is fixed orvice-versa. Specifically, the ergodic capacity reduces to zero,which means no communication between the users and the BScan occur. Nevertheless, Proposition 4 reveals that the ergodiccapacity of this DH AF system with transceiver imperfectionsdoes not depend on the relay power α, when the number ofBS antennas N grows larger. It is also worthwhile to mentionthat the ergodic capacity is characterized by the fading of thechannel of the second hop when the number of users growslarge, while this disappears when M or N go to infinity. Insuch a case, C depends only on the channel of the first hop.

    V. NUMERICAL RESULTS

    In this section, we verify the theoretical analysis carried outin previous sections, and subsequently illustrate the impact ofimpairments on the ergodic capacity of dual-hop AF MIMOrelay systems. Obviously, a saturation of the ergodic capacityappears at high SNR, which agrees with similar results in theliterature [11], [18], [22]. The reason behind the saturation canbe easily extracted by a close observation of the system model.In other words, there is a term in the denominator of the SNRdue to hardware impairments that scales with the power.

    A. Finite Results

    Figure 2(a) provides the comparison of per-antenna ergodiccapacity versus ρ with and without imperfections consideringK = 2, M = 4, N = 3. The theoretical curve for the casewithout impairments was obtained by evaluating (39) from[26], while for the case with impairments, it was obtained byevaluating (17). Further, the simulated curves were obtainedby averaging the corresponding capacities over 103 randominstances of H1 and H2. From the figure, it can be shownthat the exact finite analysis matches well with the MonteCarlo (MC) simulation for the arbitrary values of K, M andN . The per-antenna ergodic capacity monotonically increaseswith the increase in the value of ρ in the absence of hardwareimpairments, but in the presence of impairments, the ergodic

    capacity first increases with ρ and then gets saturated after acertain value of SNR. Also, the per-antenna ergodic capacity inthe presence of impairments is almost the same as the capacityin the absence of impairments at the lower values of ρ, i.e.,ρ < 5 dB, but the gap between these two capacities increaseswith the value of ρ.

    Transmit SNR, ρ (dB)0 5 10 15 20 25 30 35 40

    Ergodic

    capacityC,nats/s/Hz

    0

    1

    2

    3

    4

    5

    6

    Finite theory: W/o impairmentsSimulated: W/o impairmentsFinite theory: W/ impairmentsSimulated: W/ impairmentsUpper bound: W/o impairmentsLower bound: W/o impairmentsUpper bound: W/ impairmentsLower bound: W/ impairments

    (a)

    Relay gain, α (dB)0 5 10 15 20 25

    Ergodic

    capacityC,nats/s/Hz

    0.2

    0.3

    0.4

    0.5

    0.6

    0.7

    0.8

    0.9

    Finite theory: W/o impairmentsSimulated: W/o impairmentsFinite theory: W/ impairmentsSimulated: W/ impairmentsUpper bound: W/o impairmentsLower bound: W/o impairmentsUpper bound: W/ impairmentsLower bound: W/ impairments

    (b)Fig. 2. Per-antenna ergodic capacity versus (a) ρ with and without im-perfections (δt1 = δt2 = δr1 = δr2 = 0.08, K = 2, M = 4,N = 3, α = 2ρ), (b) α with and without imperfections (µ = 15 dB,δt1 = δt2 = δr1 = δr2 = 0.08, K = 2, M = 1, N = 4, α = 2ρ)

  • 8

    Besides, Fig. 2(a) also depicts the theoretical upper and lowerbounds of the per-antenna ergodic capacity. The presentedtheoretical bounds for the case without impairments wereobtained by using (67) and (75) from [26]. Similarly, the boundsfor the case with impairments were obtained by evaluating (24)using the involved equations. The capacity results as well as thebounds in the absence of impairments are in close agreementwith the results presented in Fig. 6 of [26]. It can be observedthat both bounds are tight over the considered range of SNRvalues. Moreover, the closed-form upper bound coincides withthe exact capacity curve at the lower SNR regime (ρ < 5 dBin Fig. 2(a)), and the lower bound coincides with the exactcapacity curve at the higher SNR regime. Clearly, similarobservations can be noticed for the case with impairments.

    Figure 2(b) presents the comparison of the per-antennacapacity versus the relay gain α in the presence and the absenceof impairments. It can be observed that the per-antenna ergodiccapacity first increases with the increase in the value of α andsaturates after a certain value of α. Also, it can be noted thatthe upper and lower bounds are tight over the considered rangeof α for both cases. Another important observation is that thecapacity loss due to the presence of impairments is quite smallat the lower values of α and increases with the increase in thevalue of α until both capacity curves reach the saturation.

    109

    87

    65

    N

    43

    22

    4

    K

    6

    8

    0.5

    1

    1.5

    2

    2.5

    3

    3.5

    4

    10

    C,nats/s/Hz

    (a)

    109

    87

    65

    N

    43

    22

    4

    K

    6

    8

    2.5

    1

    2

    1.5

    0.5

    3

    10

    C,nats/s/Hz

    (b)Fig. 3. Per-antenna ergodic capacity versus K and N (a) without imperfections,(b) with imperfections (µ = 20 dB, δt1 = δt2 = δr1 = δr2 = 0.08, M = 4,α = 2ρ)

    Figures 3(a) and 3(b) illustrate the per-antenna ergodiccapacity versus K and N in the absence and the presence oftransceiver imperfections, respectively. From the figure, it canbe noted that the per-antenna ergodic capacity monotonicallyincreases with K, whereas it monotonically decreases withthe increase in the value of N . It should be noted that thedecreasing trend of the capacity with respect to N in Fig. 3(a)and Fig. 3(b) is due to the fact we plot the per-antenna capacity(normalized with respect to N ) instead of the capacity itself.However, the total ergodic capacity monotically increases withN and the rate of this increase is observed to be significantlyhigher than the rate of increase with respect to K.

    Transmit SNR, ρ (dB)0 5 10 15 20 25 30 35 40

    Ergodic

    capacityC,nats/s/Hz

    0

    1

    2

    3

    4

    5

    6

    7

    8

    9

    Finite theory: W/o impairmentsSimulated: W/o impairmentsFinite theory: W/ impairmentsSimulated: W/ impairmentsAsymptotic theory: W/o impairmentsAsymptotic theory: W/ impairments

    Fig. 4. Comparison of per-antenna ergodic capacity with finite and asymptoticanalyses (δt1 = δt2 = δr1 = δr2 = 0.08, K = 1, M = 2, N = 1,α = 2ρ)

    Figure 4 presents the comparison of the per-antenna ergodiccapacity versus ρ for finite and asymptotic cases by consideringthe cases of the presence and the absence of impairments (withparameters δt1 = δt2 = δr1 = δr2 = 0.08, K = 1, M = 2,N = 1, α = 2ρ). From the figure, it can be observed thatfinite results exactly match with that of the simulated capacityresults but the asymptotic results show a slight deviation withrespect to the simulated results. However, as noted in the nextsubsection, the asymptotic results match quite well with thesimulated ones even for the moderate values of M , N , and K.

    Figure 5 presents the per-antenna ergodic capacity with lowSNR approximation in the presence and the absence of impair-ments. In the presented results, EbN0min depicts the intersectionof the capacity curves with the horizontal axis. From Fig. 5, itcan is shown that not only the slope of the ergodic capacitycurve decreases when impairments are considered, but also theminimum transmit energy per information bit is affected. Inother words, the transceiver impairments generally have notonly a second-order impact on the capacity in the low-SNRregime as in [11], but in the case of the relay channel, there isalso a first-order effect. Further, another important observationis that the value of EbN0min increases with the increase in thevalue of impairments. This is due to the dependence of EbN0minon the values of δt2 and δr2 .

  • 9

    Eb/N0 (dB)-6 -5 -4 -3 -2 -1 0 1 2 3 4

    Ergodic

    capacityC,nats/s/Hz

    0

    1

    2

    3

    4

    5

    6

    7

    8

    9

    10

    W/o impairmentsW/ impairments, δ = 0.08W/ impairments, δ = 0.15W/ impairments, δ = 0.25

    Fig. 5. Per-antenna ergodic capacity with low SNR approximation (δt1 =δt2 = δr1 = δr2 = δ, K = 2, M = 3, N = 2, α = 2 dB)

    B. Asymptotic Results

    In Fig. 6, we illustrate the theoretical and simulated per-antenna ergodic capacities versus ρ for the following twocases: (i) without impairments, and (ii) with impairments ontransmitter and receiver of both links. From the figure, it can benoted that the theoretical and the simulated capacity curves forboth the considered cases match perfectly. Furthermore, the per-antenna capacity increases with ρ in the absence of impairments,i.e., δt1 = δt2 = δr1 = δr2 = 0 as expected. Moreover, anotherimportant observation is that the per-antenna capacity saturatesafter a certain value of ρ in the presence of impairments. Thetrend of the per-antenna capacity saturation with respect toρ in Fig. 6 is well aligned with the result obtained in [11]for the case of MIMO systems. However, for the consideredscenario in this paper, an early saturation of the capacity inthe presence of impairments is noted due to the introductionof the relay node impairments. Nevertheless, in Fig. 6, we alsoillustrate the effect of different values of impairments on thecapacity considering the values of δt1 = δt2 = δr1 = δr2 = δas 0.05, 0.08 and 0.15. Specifically, it can be observed thatwith the increase in the value of impairments, the saturationpoint appears earlier, i.e., at the lower values of ρ.

    Figures 7(a) and Figure 7(b) present the per-antenna capacityversus ρ and α in the presence and the absence of impairments,respectively. It can be observed that in the absence of impair-ments, the capacity increases monotonically with the increaseof both ρ and α with the slope being more steeper for the caseof ρ. In addition, the slope for both capacity curves decreasesat their higher values. Notably, in the presence of impairments,a clear saturation can be observed with the increase of ρ and αafter their certain values. Moreover, in Figs. 8(a) and 8(b), weplot the per-antenna capacity versus the channel dimensions γand β in the absence and the presence of channel impairments,respectively. In both cases, the capacity increases monotonicallywith both β and 1γ , however, the slope with respect to

    1γ is

    steeper as compared to the slope with β. This trend remainsalmost the same in the presence of impairments, but the slopeof the capacity curve with respect to 1γ in Fig. 8(b) is observedto be less steeper than in Fig. 8(a) at the higher values of 1γ .

    Figure 9(a) depicts the per-antenna ergodic capacity versus

    0 5 10 15 20 25 30 35 400

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    0.7

    0.8

    0.9

    1

    Transmit SNR, ρ (dB)

    Ergodic

    capacity,C,nats/s/Hz

    W/o impairments, simulatedW/o impairments, theoryW/ impairments, simulated, δ = 0.05W/ impairments, theory, δ = 0.05W/ impairments, δ = 0.08W/ impairments, δ = 0.15

    Fig. 6. Per-antenna capacity versus ρ (δt1 = δt2 = δr1 = δr2 = δ, K = 50,M = 10, N = 100, β = 5, γ = 10, α = 2ρ)

    the number of source antennas, i.e., K by considering thecases with and without impairments. From the figure, it can benoted that the per-antenna ergodic capacity initially increaseswith the increase in the value of K, and it saturates at thehigher values of K for both cases. In addition, in Fig. 9(a), weplot the capacity results from the infinite K bound for bothcases. For the case without impairments, the infinite K boundcapacity curve is obtained by using (43) from [26], while forthe case with impairments, it is obtained by evaluating (26).Clearly, it is noted that the infinite K bound approximates theexact result in the infinite K regime for both cases.

    Figure 9(b) presents the variation of the per-antenna ergodiccapacity with the number of antennas at the relay, i.e., M , byconsidering parameters (ρ = 20 dB, δt1 = δt2 = δr1 = δr2 =0.08, K = 20, N = 20). From the figure, it can deduced thatthe capacity initially increases with the value of M and getsalmost saturated at the higher values of M for both cases. InFig. 9(b), we also plot the per-antenna ergodic capacity obtainedfrom the infinite M bound for the case without impairmentsusing (42) from [26]. It can be noted that the capacity resultsconsidering the infinite M bound almost approximate the exactresults in the infinite M regime. Moreover, in Fig. 9(c), weplot the per antenna capacity with the value of N for the caseswith impairments and without impairments. Further, we alsoplot the per-antenna ergodic capacity obtained from the infiniteN bound for the case without impairments using (44) from[26]. It can be noted that the capacity results considering theinfinite N bound almost approximate the exact result in theinfinite N regime.

    Till now, we have considered that the node components havethe same quality, i.e., they present the same distortion7. Inorder to illustrate the effect of different impairments on theper-antenna ergodic capacity, we plot the capacity versus δ in

    7Note that under this system model, in a practical large MIMO scenario,both the relay and the BS are going to have larger number of antennas thanthe source. Hence, they will be more prone (larger distortions) to hardwareimpairments. The reason is that large MIMO are attractive, when the cost ofthe antennas is low, which results to lower quality elements with more inducedhardware impairments.

  • 10

    0

    10

    20

    30

    40

    0

    10

    20

    30

    400

    0.1

    0.2

    0.3

    0.4

    0.5

    0.6

    0.7

    0.8

    0.9

    αρ

    C,nats/s/Hz

    (a)

    0

    10

    20

    30

    40

    0

    10

    20

    30

    400.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4

    αρ

    C,nats/s/Hz

    (b)Fig. 7. Per-antenna ergodic capacity versus ρ, α (β = 5, γ = 10, K = 50,M = 10, N = 100), (a) δt1 = δt2 = δr1 = δr2 = 0, (b) δt1 = δt2 =δr1 = δr2 = 0.08

    Fig. 10, by considering parameters (ρ = 5 dB, β = 5, γ = 10,N = 100, M = 10, K = 50, α = 2ρ). For this evaluation,all other impairments values are considered to be zero whileanalyzing the effect of a particular impairment. From the figure,it can be noted that the per-antenna ergodic capacity decreaseswith the increase in the value of impairment for all cases.Further, the effect of δt2 on the per-antenna ergodic capacityis found to be the most severe as compared to the effects ofother impairments. Other observations from Fig. 10 are thatthe effects of δt1 and δr1 on the per-antenna ergodic capacityare almost the same, and the impairment δr2 has significantlyless effect than the rest of impairments. However, note that theordering of the capacity curves with respect to the variationsin the considered impairments depends on the value of theparameters ρ, α, M , N , and K. In our results, the trend ofthe capacity curves with respect to different impairments wasfound to be quite stable for large values of M , N , and Ki.e., N = 100, M = 10, and K = 50 and lower values of ρ.This can be observed by (4), the definition ν = αKMµ̃ , and theconsidered set of parameters. However, for small values of M ,N , and K, the ordering of the performance curves was foundto vary significantly even with a small variation in the valuesof ρ and α.

    00.2

    0.40.6

    0.81

    0

    2

    4

    6

    8

    100

    0.5

    1

    1.5

    2

    2.5

    3

    3.5

    4

    1/γβ

    C,nats/s/Hz

    (a)

    00.2

    0.40.6

    0.81

    0

    2

    4

    6

    8

    100.2

    0.4

    0.6

    0.8

    1

    1.2

    1.4

    1.6

    1.8

    2

    1/γβ

    C,nats/s/Hz

    (b)Fig. 8. footnotesizePer-antenna ergodic capacity versus the channel dimensionsβ, γ (ρ = 20 dB, α = 2ρ) (a) δt1 = δt2 = δr1 = δr2 = 0, (b) δt1 =δt2 = δr1 = δr2 = 0.15

    VI. CONCLUSIONS

    While the transceiver hardware impairments are inherentin any communication system, their impact on DH AF relaysystems with multiple antennas was not taken into consideration,since prior work assumed the idealistic scenario of perfecthardware or single-antenna nodes. Hence, we introduced theaggregate hardware impairments on a DH AF relay system,and investigated their impact for the case of the system nodeshaving multiple number of antennas. In particular, initially, weelaborated on the MIMO ergodic capacity, when the numberof antennas is finite as in contemporary (conventional) systems.Specifically, building on some existing results, we derived anexact expression for the ergodic capacity, simplified tight lowerand upper bounds, and interestingly, low-SNR approximations.The need to provide a complete study concerning the next gen-eration systems (massive MIMO) led us to derive the ergodiccapacity in the setups with a very large number of antennas bypursuing a free probability analysis. Furthermore, the validationof the analytical results was shown by reducing to specialcases and by means of simulations. In particular, simulationsdepicted that the asymptotic results can be applicable even for

  • 11

    K

    10 20 30 40 50 60 70 80 90 100

    Ergodic

    capacityC,nats/s/Hz

    0.5

    1

    1.5

    2

    2.5

    3

    3.5

    4

    W/o impairmentsW/ impairmentsHigh K approx., w/o impairmentsHigh K approx., w/ impairments

    (a)

    M

    10 20 30 40 50 60 70 80 90 100

    Ergodic

    capacityC,nats/s/Hz

    0.5

    1

    1.5

    2

    2.5

    3

    3.5

    4

    W/o impairmentsW/ impairmentsHigh M approx., w/o impairmentsHigh M approx., w/ impairments

    (b)

    N

    10 20 30 40 50 60 70 80 90 100

    Ergodic

    capacityC,nats/s/Hz

    0.5

    1

    1.5

    2

    2.5

    3

    3.5

    4

    W/o impairmentsW/ impairmentsHigh N approx., w/o impairmentsHigh N approx., w/ impairments

    (c)Fig. 9. Per-antenna ergodic capacity versus (a) K (ρ = 20 dB, δt1 = δt2 =δr1 = δr2 = 0.08, M = 20, N = 20, α = 2ρ), (b) M (ρ = 20 dB,δt1 = δt2 = δr1 = δr2 = 0.08, K = 20, N = 20, α = 2ρ), (c) N(ρ = 20 dB, δt1 = δt2 = δr1 = δr2 = 0.08, K = 20, M = 20, α = 2ρ)

    contemporary system dimensions. Moreover, it has been shownthat the ergodic capacity with transceiver impairments saturatesafter a certain SNR and a large number of antennas. Notably,while the transceiver impairments have only a second-orderimpact on the capacity in the low-SNR regime of point-to-pointMIMO systems, there also exists a first-order effect for thecase of the relay channel. Among the future research targets,the inclusion of channel correlation, training, and more realistichardware impairments model will be considered.

    Value of impairment, δ0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

    C,nats/s/Hz

    0.04

    0.05

    0.06

    0.07

    0.08

    0.09

    0.1

    0.11

    0.12

    0.13

    0.14

    δ = 0δt1

    δt2

    δr1

    δr2

    Fig. 10. Per-antenna ergodic capacity versus δ (ρ = 5 dB, β = 5, γ = 10,N = 100, M = 10, K = 50, α = 2ρ)

    APPENDIX AUSEFUL LEMMAS

    Herein, given the eigenvalue probability distribution functionfX(x) of a matrix X, we provide useful definitions and lemmasthat are considered during our analysis.

    Lemma 1 (The pdf of the unordered eigenvalue of HLHH [26,Theorem 1]): The marginal pdf pλ (λ) of the unorderedeigenvalue λ of HLHH, where L = diag{ λ

    2i

    1+aλ2i}qi=1 and H ∼

    CN (0, IK ⊗ Iq) with p = max (M,N), q = min (M,N),and s = min (K, q), is given by

    pλ (λ) = Ae−λaλ(2K+2k+u−i−2)/2Kv−i(

    2√λ)Gl,k, (29)

    where A= 2Ks∑ql=1

    ∑qk=q−s+1

    ∑q̃−li=0

    (q̃−li )aq̃−l−i

    Γ(K−q+k) Gl,k with

    K =

    (q∏i=1

    Γ(q − i+ 1) Γ(p− i+ 1)

    )−1, (30)

    and Gl,k is the (l, k)th cofactor of a q × q matrix G whose(m,n)th entry is

    [G]m,n= a−u−m−nΓ(u+m+n) U

    (u+m+n, v+1,

    1

    a

    ). (31)

    APPENDIX BPROOF OF THEOREM 1

    Proof: Given a conventional system with finite dimensions,i.e., finite number of users and antennas, we start from (13),and we consider each term separately. Thus, for C1, we have

    C1 =E[log2 det(Φ + H2H1HH1HH2)]

    =E[log2 det

    (IN+f1H2H1H

    H

    1HH

    2 (IN+f3H2HH

    2)−1)]

    +E [log2 det(IN+f3H2HH2)], (32)

    while the other term of (13) can be written as

    C2 =E[log2 det(Φ)]

    =E[log2 det

    (IN+f2H2H1H

    H

    1HH

    2 (IN+f3H2HH

    2)−1)]

    +E [log2 det(IN+BH2HH2)]. (33)

  • 12

    Subtraction of (33) from (32) gives the ergodic capacity as

    C = E[log2 det

    (IN+f1H2H1H

    H

    1HH

    2 (IN+f3H2HH

    2)−1)]

    − E[log2 det

    (IN+f2H2H1H

    H

    1HH

    2 (IN+f3H2HH

    2)−1)]

    = C̃1 − C̃2, (34)

    where we have defined

    C̃i,E[log2det

    (IN+fiH2H1H

    H

    1HH

    2(IN+f3H2HH

    2)−1)]

    (35)

    because it will be used often throughout the proofs. Fortunately,each term of (34) is similar to [26, Eq. 2], i.e., it can beexpressed by a very concise form as

    C̃i = E[log2 det

    (IK+fiH̃1LH̃

    H

    1

    )], (36)

    where L = diag{ λ2i

    1+f3λ2i}qi=1 and H̃1 ∼ CN (0, IK ⊗ Iq). To

    this end, C can be finally derived by expressing C̃i in (34) interms of the real non-negative eigenvalues of Z = H̃1LH̃H1,since it is a K × q random non-negative definite matrixfollowing the pdf given by Lemma 1 with α = f3.

    APPENDIX CPROOF OF THEOREM 2

    The main targets are to derive the first and second derivativesof (12), or equivalently of C̃i, as can be seen by (34). Whenρ → 0, and α is fixed, we find that f1 (0) = f2 (0) = 0,while their first and second derivatives at ρ = 0 equal to

    f′

    1 (0) =αδ̃2t1KMδ̃2r2

    , f′

    2 (0) =αδ2t1KMδ̃2r2

    , and f′′

    1 (0) = −2αδ̃2t1

    δ̃2tr1KMδ̃2r2

    ,

    f′′

    2 (0) = −2αδ2t1

    δ̃2tr1KMδ̃2r2

    . Also, L (0) = diag{ λ2i

    1+f3(0)(ρ)λ2i}qi=1

    with f3 (0) =δ̃2t2

    α

    Mδ̃2r2. Thus, we have G (0) = IN . Given that

    C̃i can be written as in (36), we take the first derivative withrespect to ρ, and we have

    ˙̃Ci (0) = s

    ∫ ∞0

    (log2 (1 + fi (ρ)) pλ (λ))′∣∣∣∣ρ=0

    =s

    ln 2f′

    i (0)

    ∫ ∞0

    λpλ (λ)

    ∣∣∣∣ρ=0

    dλ (37)

    =s

    ln 2f′

    i (0)E [λ]∣∣∣∣ρ=0

    = f′

    i (0)E[tr(H̃1L (0) H̃

    H1

    )]ln 2

    , (38)

    where (37) is obtained by making several algebraic manipu-lations after taking into account that fi (0) = 0. Note that

    E [λ]∣∣∣∣ρ=0

    , given by Lemma 2, is the mean eigenvalue of

    H̃1L (ρ) H̃H1 at ρ = 0.

    Lemma 2: The nth moment of the unordered eigenvalue ofH̃1L (ρ) H̃1

    H

    is given by

    E (λn)=Ãa−2k+2l+2n+u+i−2q−1

    2 e12aW− 2t+2n+u−i+−12 ,

    v−i2

    (1

    a

    ),

    (39)

    where à = 2Ks∑ql=1

    ∑qk=q−s+1

    ∑q̃−li=0

    (q̃−li )Γ(t+p+n−i−1)Γ(t−q+n)Γ(K−q+k)

    Gl,k. Note that we have defined t = K + k.Proof: The calculation is straightforward after apply-

    ing [32, eq. 9.220.3].As far as the second derivative of C̃i is concerned, we have

    ¨̃Ci (0) = s

    ∫ ∞0

    (log2 (1 + fi (ρ)) pλ (λ))′′∣∣∣∣ρ=0

    =s

    ln 2(

    ∫ ∞0

    (f′′

    i (ρ)λ−(f′

    i (ρ))2λ2)pλ (λ)

    ∣∣∣∣ρ=0

    + 2

    ∫ ∞0

    f′

    i (ρ) (λpλ (λ))′∣∣∣∣ρ=0

    dλ ) (40)

    =s

    ln 2

    ((f′′

    i (0)E [λ]∣∣∣∣ρ=0

    −(f′

    i(0))2

    E[λ2]∣∣∣∣ρ=0

    )

    +2f′

    i(0)E′[λ]

    ∣∣∣∣ρ=0

    ), (41)

    where (40) is obtained after certain manipulations as in (37).

    Special focus must be given in the derivation of E′ [λ]∣∣∣∣ρ=0

    ,

    which is basically the derivative of the mean eigenvalueat ρ = 0. Taking a closer look at (39), it consists offour terms depending on ρ. Thus, we can rewrite (39) asE (λ) = I1I2I3I4 with I1 = Ã, I2 = a−

    u+2k+2l+i−2q+12 ,

    I3 = e12a , and I4 = W− 2t+u−i+12 , v−i2

    (1a

    ). Calculation of

    the derivatives of I2 and I3 are straightforward. Similarly,the derivative of I4 demands indirectly the derivative of theWhittaker function given by [33, Eq. (07.45.20.0005.01)].Taking this into account, I ′4 is obtained as

    I′

    4 =1

    f23 (0)

    [(v − i

    2f3(0)−

    1

    2

    )W− 2t+u−i+12 ,

    v−i2

    (1

    f3(0)

    )+ f3(0) W− 2t+u−i−12 ,

    v−i2

    (1

    f3(0)

    )](42)

    The difficulty arises during the calculation of the derivativeof I1 because the cofactor Gl,k depends on ρ. Hence, thecalculation of the derivative of the cofactor of a non-singularmatrix B, presented by the following lemma, is the first step.

    Lemma 3: Let a nonsingular matrix B with adj(B) anddet(B) denoting its adjugate and determinant. Given that thecofactor C of B is related to its adjugate according to CT =adj(B), its derivative of the cofactor C is given by

    ∂CT

    ∂u=

    tr(adj(B) ∂B∂u

    )adj(B)− adj(B) ∂B∂u adj(B)

    det(B). (43)

    Proof: Having in mind the basic property between theadjugate of a matrix and its inverse (B−1 = adj(B)det(B) ), we have

    CT = adj(B) = det(B) B−1. (44)

    Taking the differential of (44), we get

    dCT = det(B) B−1 + det (B) d(B−1

    )(45)

    = tr(adj (B) dB)adj (B)

    det(B)− adj (B) dBadj (B)

    det(B)(46)

  • 13

    where in (45) we have used the Jacobi’s formula expression8

    as well as the differential of the inverse of a matrix B, whichcan be written as [40, Eq. 8]

    d(B−1

    )= −B−1dBB−1. (47)

    Thus, the proof is concluded.Use of Lemma 3 allows to obtain

    ∂GTl,k∂ρ . However, we still need

    the derivative of the matrix G whose elements are given by (31).Specifically, we have that if ∂G

    T

    ∂ρ expresses the derivative of amatrix G, its elements are given by[∂GT

    ∂ρ

    ]m,n

    = ḟ3 (0) (u−m− n) f3(0)−u−m−n=1

    ×Γ(u+m+ n)(

    U

    (u+m+ n, v + 1,

    1

    f3(0)

    )− 1f3(0)

    U

    (u+m+ n+ 1, v + 2,

    1

    f3(0)

    )). (48)

    where we have used [33, Eq. (07.33.20.0005.01)] that providesthe derivative of the Tricomi confluent hypergeometric functionas ∂U(a,b,z)∂ρ = −aU(a+ 1, b+ 1, z). Having now obtainedeverything necessary to calculate I ′1,

    ¨̃Ci (0) is derived since

    E′ [λ]∣∣∣∣ρ=0

    can be calculated my means of straightforward

    substitutions. The proof is concluded after substitutions ofthe derived ˙̃Ci (0) and

    ¨̃Ci (0) to (20) and (21), and makingnecessary algebraic manipulations.

    APPENDIX DPROOF OF THEOREM 3

    Taking the general form of C̃i from (34), we present itsupper and lower bounds, C̃U,i and C̃L,i, respectively. Bothbounds are obtained by applying the necessary changes to [26,Theorems 5 and 6]. More concretely, the upper bound is

    C̃i (ρ) ≤ C̃U,i (ρ) = log2 (Kdet(Ξi)) , (49)

    where Ξi is a q × q matrix with elements given by (50). withτ = u+m+ n− 1, and ϑτ (B) = Γ(τ) U

    (τ, v + 1, 1B

    ).

    Regarding the lower bound, [26, Theorems 6] enables us towrite

    C̃i (ρ) ≥ C̃L,i (ρ)

    =s log2

    1+fi(ρ)exp1s

    s∑k=1

    ψ(N−s+k)+Kq∑

    k=q−s+1

    det(Wk)

    ,(51)

    where Wk is a q× q matrix whose (m,n)-th element is givenby

    {Wk}m,n =

    {B1−τϑτ−1 (B) , n 6= kζm+n (B) , n = k

    (52)

    and the fact that ζt (B) =∑2q−ti=0 B

    2q−t−iΓ(v − i)(2q−ti

    ) (ψ (v − i)−

    ∑v−i−1l=0 gl

    (1B

    ))with τ and ϑτ (·) given

    8Jacobi’s formula expresses the differential of the determinant of amatrix B in terms of the adjugate of B and the differential of B asdB = tr(adj (B) dB).

    as above, while gl (x) = exEl+1 (x). Finally, the upper andlower bounds of C are obtained by means of a combinationof (49) and (51).

    APPENDIX EPROOF OF THEOREM 4

    The asymptotic limits of the capacity terms (15) and (16),when the channel dimensions tend to infinity while keepingtheir finite ratios β = KM , γ =

    NM fixed, are expressed in terms

    of a generic expression as

    Ci=1

    Nlim

    K,M,N→∞E[ln det(IM+f3HH2H2(IM+αH1HH1))]

    =M

    Nlim

    K,M,N→∞E

    [1

    M

    M∑i=1

    ln

    (1+f̃3λi

    (1

    MKα

    ))]

    → 1γ

    ∫ ∞0

    ln(

    1+f̃3x)f∞Kα/M (x) dx, (53)

    where Ci corresponds to C1 or C2 depending on the value of i,i.e., if α = f1 or if α = f2/f3, respectively. Note that f̃3 =

    f3M =α(1+δ̃2r1ρ+δ̃

    2t2

    (δ̃2tr1)ρ)δ̃2r2(1+δ̃

    2tr1

    )ρ. In addition, λi (X) is the ith

    ordered eigenvalue of matrix X, and f∞X denotes the a.e.p.d.f.of X. Moreover, for the sake of simplification of our analysis,we have made use of certain variable definitions similar to [38].In particular, we have M̃α = IM + αH1HH1 , Ñ1 = H1H

    H1,

    Ñ2 = HH2H2, and Kα = H

    H2H2

    (IM + αH1H

    H1

    )= Ñ2M̃α.

    The analysis behind the derivation of the a.e.p.d.f. of Kα/M ,which leads asymptotically to a deterministic function byapplying principles of free probability theory [31], is providedin [1]. Especially, we use the free multiplicative convolution,in order to obtain the a.e.p.d.f. of the product of two matricesas in [38], [39].

    APPENDIX FPROOF OF ASYMPTOTIC BOUNDS

    Let us define F = δ̃2r2(

    1 + δ̃2tr1ρ)

    , then f1 =αρδ̃2t1KMF =

    f̄1KMF , f2 =

    αρδ2t1KMF =

    f̄2KMF , and f3 =

    BMF with B =

    α(

    1 + δ2r1ρ+ δ2t2

    (1 + δ̃2tr1ρ

    )).

    A. Proof of Proposition 2When the number of users increases infinitely (K → ∞),

    C̃i, given by (36), can be expressed as

    C̃i = E[log2 det

    (Iq+fiH̃

    H

    1H̃1L)]. (54)

    The Law of Large Numbers (LLN) provides thatlimK→∞

    H̃1H̃H1

    K → Iq . Hence, (54) becomes

    limK→∞

    C̃i = E[log2 det(Iq+fiL)] , (55)

    which after substituting L and making certain algebraicmanipulations leads to

    limK→∞

    C̃i = E[log2 det

    (Iq+

    f̄i + BMF

    H2HH

    2

    )]− E

    [log2 det

    (Iq+

    BMF

    H2HH

    2

    )]. (56)

    Substitution to (34) gives the desired result.

  • 14

    {Ξi}m,n =

    {B1−τϑτ−1 (B) , n ≤ q −NB1−τϑτ−1 (B) + fi (ρ) (N − q + n)B−τϑτ (B) . n > q −N

    (50)

    B. Proof of Proposition 3

    In the case of infinite number of relay antennas (M →∞),we recall the LLN, i.e., we have limM→∞

    H̃2H̃H2

    M → IN , orequivalenely, limM→∞

    λ2iM → 1, for i = 1, . . . , N since q = N .

    Thus, initially C̃i can be written as

    C̃i = E[log2 det

    (IK+

    f̄iKF

    H̃1L̃H̃H

    1

    )], (57)

    where L̃ = diag{ λ2i

    M(1+Aλ2i )}Ni=1. As a result, we have

    limM→∞

    C̃i = E[log2 det

    (IK+

    f̄iK (F + B)

    H̃1H̃H

    1

    )]. (58)

    The desired result is obtained by means of (34).

    C. Proof of Proposition 4

    By increasing the number of BS antennas N infinitely, weobtain that λ2i → ∞. Consequently, the diagonal matrix Lbecomes a scaled identity matrix equal to f3IM , since q = M .Thus, C̃i reads as

    C̃i = E[log2 det

    (IK+

    fif3

    H̃H1H̃1

    )]. (59)

    The desired result is yielded by a straightforward substitutionto (34).

    REFERENCES

    [1] A. K. Papazafeiropoulos, S. K. Sharma, and S. Chatzinotas, “Impactof transceiver impairments on the capacity of dual-hop relay massiveMIMO systems,” in Proc. of IEEE Global Communications Conference(GLOBECOM 2015) - Workshop on Massive MIMO: From theory topractice, 2015.

    [2] H. Holma and A. Toskala, LTE for UMTS: Evolution to LTE-Advanced,Wiley, Ed., 2011.

    [3] E. Telatar, “Capacity of multi-antenna gaussian channels,” Europ. Trans.on Telecom., vol. 10, no. 6, pp. 585–595, 1999.

    [4] G. J. Foschini and M. J. Gans, “On limits of wireless communicationsin a fading environment when using multiple antennas,” Wireless Pers.Commun., vol. 6, no. 3, pp. 311–335, 1998.

    [5] T. Marzetta, “Noncooperative cellular wireless with unlimited numbersof base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11,pp. 3590–3600, November 2010.

    [6] F. Rusek, D. Persson, B. K. Lau, E. Larsson, T. Marzetta, O. Edfors, andF. Tufvesson, “Scaling up MIMO: Opportunities and challenges withvery large arrays,” IEEE Signal Processing Mag., vol. 30, no. 1, pp.40–60, Jan 2013.

    [7] H. Q. Ngo, E. Larsson, and T. Marzetta, “Energy and spectral efficiencyof very large multiuser MIMO systems,” IEEE Trans. Commun., vol. 61,no. 4, pp. 1436–1449, April 2013.

    [8] J. Qi and S. Aïssa, “Analysis and compensation of I/Q imbalancein MIMO transmit-receive diversity systems,” IEEE Trans. Commun.,vol. 58, no. 5, pp. 1546–1556, 2010.

    [9] ——, “On the power amplifier nonlinearity in MIMO transmit beam-forming systems,” IEEE Trans. Commun., vol. 60, no. 3, pp. 876–887,2012.

    [10] H. Mehrpouyan, A. Nasir, S. Blostein, T. Eriksson, G. Karagiannidis,and T. Svensson, “Joint estimation of channel and oscillator phase noisein MIMO systems,” IEEE Trans. Signal Processing, vol. 60, no. 9, pp.4790–4807, Sept 2012.

    [11] X. Zhang, M. Matthaiou, E. Björnson, M. Coldrey, and M. Debbah, “Onthe MIMO capacity with residual transceiver hardware impairments,” inProc. IEEE Int. Conf. Commun. IEEE, 2014, pp. 5299–5305.

    [12] E. Björnson, J. Hoydis, M. Kountouris, and M. Debbah, “Massive MIMOsystems with non-ideal hardware: Energy efficiency, estimation, andcapacity limits,” IEEE Trans. Inform. Theory, vol. 60, no. 11, pp. 7112–7139, Nov 2014.

    [13] E. Björnson, M. Matthaiou, and M. Debbah, “Massive MIMO with non-ideal arbitrary arrays: Hardware scaling laws and circuit-aware design,”IEEE Trans. Wireless Commun., vol. 14, no. no.8, pp. 4353–4368, Aug.2015.

    [14] T. Schenk, RF imperfections in high-rate wireless systems: impact anddigital compensation. Springer Science & Business Media, 2008.

    [15] C. Studer, M. Wenk, and A. Burg, “MIMO transmission with residualtransmit-RF impairments,” in ITG/IEEE Work. Smart Ant. (WSA). IEEE,2010, pp. 189–196.

    [16] B. Goransson, S. Grant, E. Larsson, and Z. Feng, “Effect of transmitterand receiver impairments on the performance of MIMO in HSDPA,” inIEEE 9th Int. Workshop Signal Process. Adv. Wireless Commun. (SPAWC).IEEE, 2008, pp. 496–500.

    [17] E. Björnson, P. Zetterberg, and M. Bengtsson, “Optimal coordinatedbeamforming in the multicell downlink with transceiver impairments,”in IEEE Global Commun.Conf. (GLOBECOM), 2012, Dec 2012, pp.4775–4780.

    [18] E. Björnson, P. Zetterberg, M. Bengtsson, and B. Ottersten, “Capacitylimits and multiplexing gains of MIMO channels with transceiverimpairments,” IEEE Commun. Lett., vol. 17, no. 1, pp. 91–94, 2013.

    [19] F. Athley, G. Durisi, and U. Gustavsson, “Analysis of massive MIMOwith hardware impairments and different channel models,” arXiv preprintarXiv:1501.04200, 2015.

    [20] M. Vehkapera, T. Riihonen, M. Girnyk, E. Bjö, M. Debbah, R. Kildehøj,and R. Wichman, “Asymptotic analysis of SU-MIMO channels withtransmitter noise and mismatched joint decoding,” IEEE Trans. Commun.,vol. 63, no. 3, pp. 749–765, 2015.

    [21] J. Choi, “Downlink multiuser beamforming with compensation of channelreciprocity from RF impairments,” IEEE Trans. Commun., vol. 63, no. 6,pp. 2158–2169, June 2015.

    [22] E. Björnson, M. Matthaiou, and M. Debbah, “A new look at dual-hoprelaying: Performance limits with hardware impairments,” IEEE Trans.Commun., vol. 61, no. 11, pp. 4512–4525, 2013.

    [23] Y. Yang, H. Hu, J. Xu, and G. Mao, “Relay technologies for WiMAX andLTE-advanced mobile systems,” IEEE Commun. Mag., vol. 47, no. 10,pp. 100–105, 2009.

    [24] M. Matthaiou, A. Papadogiannis, E. Björnson, and M. Debbah, “Two-wayrelaying under the presence of relay transceiver hardware impairments,”IEEE Commun. Lett., vol. 17, no. 6, pp. 1136–1139, June 2013.

    [25] X. Xia, D. Zhang, K. Xu, W. Ma, and Y. Xu, “Hardware impairmentsaware transceiver for full-duplex massive MIMO relaying,” IEEE Trans.on Signal Proc., vol. 63, no. 24, pp. 6565–6580, 2015.

    [26] S. Jin, M. R. McKay, C. Zhong, and K. K. Wong, “Ergodic capacityanalysis of amplify-and-forward MIMO dual-hop systems,” IEEE Trans.on Inform. Theory, vol. 56, no. 5, pp. 2204–2224, 2010.

    [27] V. Morgenshtern and H. Bolcskei, “Crystallization in large wirelessnetworks,” IEEE Trans. Inform. Theory, vol. 53, no. 10, pp. 3319–3349,Oct 2007.

    [28] J. Wagner, B. Rankov, and A. Wittneben, “Large n analysis of amplify-and-forward MIMO relay channels with correlated rayleigh fading,” IEEETrans. Inform. Theory, vol. 54, no. 12, pp. 5735–5746, Dec 2008.

    [29] C.-X. Wang, F. Haider, X. Gao, X.-H. You, Y. Yang, D. Yuan, H. Aggoune,H. Haas, S. Fletcher, and E. Hepsaydir, “Cellular architecture and keytechnologies for 5G wireless communication networks,” IEEE Commun.Mag., vol. 52, no. 2, pp. 122–130, 2014.

    [30] A. Osseiran, F. Boccardi, V. Braun, K. Kusume, P. Marsch, M. Maternia,O. Queseth, M. Schellmann, H. Schotten, H. Taoka, H. Tullberg,M. Uusitalo, B. Timus, and M. Fallgren, “Scenarios for 5G mobileand wireless communications: The vision of the METIS project,” IEEECommun. Mag., vol. 52, no. 5, pp. 26–35, May 2014.

    [31] R. Couillet and M. Debbah, Random matrix methods for wirelesscommunications. Cambridge University Press, 2011.

  • 15

    [32] I. S. Gradshteyn and I. M. Ryzhik, “Table of integrals, series, andproducts,” Alan Jeffrey and Daniel Zwillinger (eds.), Seventh edition(Feb 2007), vol. 885, 2007.

    [33] The Wolfram functions site. [Online]. Available: http://functions.wolfram.com

    [34] M. Abramowitz and I. A. Stegun, Handbook of mathematical functions:with formulas, graphs, and mathematical tables. Courier Corporation,1964, no. 55.

    [35] A. K. Papazafeiropoulos, S. K. Sharma, and S. Chatzinotas, “MMSEfiltering performance of DH-AF massive MIMO relay systems withresidual transceiver impairments,” in Proc. of IEEE InternationalConference on Communications (ICC 2016), Kuala Lumpur, Malaysia,May 2016, 2016.

    [36] R. G. Bartle, The elements of integration and Lebesgue measure. JohnWiley & Sons, 2014.

    [37] A. Lozano, A. M. Tulino, and S. Verdú, “Multiple-antenna capacity inthe low-power regime,” IEEE Trans. Inf. Theory, vol. 49, no. 10, pp.2527–2544, 2003.

    [38] S. Chatzinotas, “MMSE filtering performance of dual-hop amplify-and-forward multiple-access channels,” IEEE Wireless Commun. Lett., vol. 2,no. 1, pp. 122–125, 2013.

    [39] S. Chatzinotas and B. Ottersten, “Capacity analysis of dual-hop amplify-and-forward MIMO multiple-access channels,” in International Con-ference on Wireless Communications & Signal Processing (WCSP),Hunagshan, China, Oct 2012.

    [40] K. Petersen and M. Pedersen, “The matrix cookbook,” URL http://www2.imm. dtu. dk/pubdb/p. php, vol. 3274, Nov. 2012.

    Anastasios Papazafeiropoulos [S’06ŰM’10]is cur-rently a Research Fellow in IDCOM at the Universityof Edinburgh, U.K. He obtained the B.Sc in Physicsand the M.Sc. in Electronics and Computers scienceboth with distinction from the University of Patras,Greece in 2003 and 2005, respectively. He thenreceived the Ph.D. degree from the same university in2010. From November 2011 through December 2012he was with the Institute for Digital Communications(IDCOM) at the University of Edinburgh, U.K.working as a postdoctoral Research Fellow, while

    during 2012-2014 he was a Marie Curie Fellow at Imperial College London,U.K. Dr. Papazafeiropoulos has been involved in several EPSCRC and EUFP7 HIATUS and HARP projects. His research interests span massive MIMO,5G wireless networks, full-duplex radio, mmWave communications, randommatrices theory, signal processing for wireless communications, hardware-constrained communications, and performance analysis of fading channels.

    Shree Krishna Sharma (S’12-M’15) received theM.Sc. degree in information and communicationengineering from the Institute of Engineering, Pul-chowk, Nepal; the M.A. degree in economics fromTribhuvan University, Nepal; the M.Res. degree incomputing science from Staffordshire University,Staffordshire, U.K.; and the Ph.D. degree in WirelessCommunications from University of Luxembourg,Luxembourg in 2014. Dr. Sharma worked as aResearch Associate at Interdisciplinary Centre forSecurity, Reliability and Trust (SnT), University of

    Luxembourg for two years, where he was involved in EU FP7 CoRaSat project,EU H2020 SANSA, ESA project ASPIM, as well as Luxembourgish nationalprojects Co2Sat, and SeMIGod. He is currently working as a PostdoctoralFellow at Western University, Canada. His research interests include Internet ofThings (IoT), cognitive wireless communications, Massive MIMO, Intelligentsmall cells, and 5G and beyond wireless systems.

    In the past, Dr. Sharma was involved with Kathmandu University, Dhulikhel,Nepal, as a Teaching Assistant, and he also worked as a Part-Time Lecturerfor eight engineering colleges in Nepal. He worked in Nepal Telecom formore than four years as a Telecom Engineer in the field of informationtechnology and telecommunication. He is the author of more than 70 technicalpapers in refereed international journals, scientific books, and conferences.He received an Indian Embassy Scholarship for his B.E. study, an ErasmusMundus Scholarship for his M. Res. study, and an AFR Ph.D. grant fromthe National Research Fund (FNR) of Luxembourg. He received Best PaperAward in CROWNCOM 2015 conference, and for his Ph.D. thesis, he received“FNR award for outstanding PhD Thesis 2015” from FNR, Luxembourg.He is a member of IEEE and has been serving as a reviewer for severalinternational journals and conferences; and also as a TPC member for anumber of international conferences including IEEE ICC, IEEE PIMRC, IEEEGlobecom, IEEE ISWCS and CROWNCOM.

    Dr. Symeon Chatzinotas (S’06ŰM’09ŰSM’13) iscurrently the Deputy Head of the SIGCOM Re-search Group, Interdisciplinary Centre for Security,Reliability, and Trust, University of Luxembourg,Luxembourg. In the past, he has worked on numer-ous R&D projects for the Institute of InformaticsTelecommunications, National Center for ScientificResearch Demokritos, Institute of Telematics and In-formatics, Center of Research and Technology Hellas,and Mobile Communications Research Group, Centerof Communication Systems Research, University of

    Surrey, Surrey, U.K. He has received the M.Eng. degree in telecommunicationsfrom Aristotle University of Thessaloniki, Thessaloniki, Greece, and the M.Sc.and Ph.D. degrees in electronic engineering from the University of Surrey,Surrey, U.K., in 2003, 2006, and 2009, respectively. Dr. Chatzinotas has morethan 200 publications, 1600 citations and an H-Index of 22 according toGoogle Scholar. He is the co-recipient of the 2014 Distinguished Contributionsto Satellite Communications Award, and Satellite and Space CommunicationsTechnical Committee, IEEE Communications Society, and CROWNCOM 2015Best Paper Award. His research interests include multiuser information theory,co-operative/cognitive communications and wireless networks optimization.

    http://functions.wolfram.comhttp://functions.wolfram.com

  • 16

    Björn Ottesten [S’87, M’89, SM’99, F’04] receivedhis M.S. degree in electrical engineering and appliedphysics from Linköping University, Sweden, in1986, and his Ph.D. degree in electrical engineeringfrom Stanford University, California, in 1989. Hehas held research positions at the Department ofElectrical Engineering, Linköping University; theInformation Systems Laboratory, Stanford University;the Katholieke Universiteit Leuven, Belgium; and theUniversity of Luxembourg. From 1996 to 1997, hewas the director of research at ArrayComm Inc, a

    start-up in San Jose, California, based on his patented technology. In 1991,he was appointed a professor of signal processing with the Royal Institute ofTechnology (KTH), Stockholm, Sweden. From 1992 to 2004, he was the headof the Department for Signals, Sensors, and Systems, KTH, and from 2004 to2008, he was the Dean of the School of Electrical Engineering, KTH. Currently,he is the director of the Interdisciplinary Centre for Security, Reliability andTrust, University of Luxembourg. As Digital Champion of Luxembourg, heacts as an adviser to the European Commission. His research interests includesecurity and trust, reliable wireless communications, and statistical signalprocessing. He is a Fellow of the EURASIP and has served as a memberof the IEEE Signal Processing Society Board of Governors. He has servedas an Associate Editor for IEEE Transactions on Signal Processing and onthe Editorial Board of IEEE Signal Processing Magazine. He is currentlyEditor-in-Chief of the EURASIP Signal Processing Journal and a member ofthe Editorial Boards of the EURASIP Journal of Applied Signal Processing andFoundations and Trends in Signal Processing. He coauthored journal papersthat received the IEEE Signal Processing Society Best Paper Award in 1993,2001, 2006, and 2013, and three IEEE conference papers receiving Best PaperAwards. He was the recipient of the IEEE Signal Processing Society TechnicalAchievement Award in 2011. He was a first recipient of the European ResearchCouncil Advanced Research Grant.

    IntroductionSystem ModelErgodic Capacity Analysis-Finite Number of AntennasExact Expression for Ergodic CapacityLow SNR RegimeTight Lower and Upper Bounds of the Ergodic Capacity

    Asymptotic Performance AnalysisMain ResultsSpecial Cases

    Numerical ResultsFinite ResultsAsymptotic Results

    ConclusionsAppendix A: Useful LemmasAppendix B: Proof of Theorem 1Appendix C: Proof of Theorem 2Appendix D: Proof of Theorem 3Appendix E: Proof of Theorem 4Appendix F: Proof of Asymptotic BoundsProof of Proposition 2Proof of Proposition 3Proof of Proposition 4

    ReferencesBiographiesAnastasios PapazafeiropoulosShree Krishna Sharma (S'12-M'15)Dr. Symeon Chatzinotas (S'06ŒM'09ŒSM'13)Björn Ottesten