Abstract

Orthogonal frequency division multiplexing (OFDM) is a powerful modulation choice for wideband wireless communication systems. However, its high peak-to-average power ratio greatly limits the high power amplifier (HPA) power efficiency. Here, we present the design of an adaptive predistorter to compensate the distortion caused by the HPA. Specifically, we deal with the implementation issue of the proposed predistorter in Lee and de Figueiredo's work (2006). The performance improvement by predistorter is verified by both floating-point simulation and fixed-point simulation, where the latter includes the distortion effects from the hardware. The bit widths for OFDM signals, ADC, and DAC are evaluated, and the bit width of 10 is shown to be sufficient for the hardware design.

1. Introduction

Othogonal frequency division multiplexing (OFDM) has attracted a lot of attention from the modern wireless communication community, because of its several desirable features for high-speed data transmission. In OFDM, a broadband signal is broken down into multiple narrowband subcarriers and implemented efficiently by using the IFFT algorithm [1]. OFDM advantages include: lower intersymbol interference, efficient use of frequency/spectrum through the use of different modulation/coding across subcarriers, and superior narrowband interference suppression capabilities. However, we need to consider the practical hardware limitations of low-cost RF and mixed signal devices when designing OFDM systems for broadband wireless data transmission. One of them is the high power amplifier (HPA) linearity and dynamic range, since OFDM signals have higher peak to average power ratios (PAPRs) than other high-performance modulations, and thus extra care is required. One of the most promising approaches to the mitigation of the PAPR problem is to use a predistorter applied to the OFDM signal prior to its entry into HPA. Its purpose is to compensate the nonlinearity of the HPA and improve the system performance. Many researchers have been investigating OFDM predistorter schemes [25]. However, all of these techniques are based on a general approximation form for the nonlinear system, rather than exploiting specific forms gleaned from physical device considerations. Due to this reason, we proposed a closed-form predistorter represented by a few parameters [6].

In this paper, we will apply Rapp's SSPA model [7] for HPA devices and show the implementation plan of the predistorter which was introduced in [6]. Furthermore, this paper also provides the design of a tracking algorithm for the case in which the practical HPA is unknown and varying. Finally, the simulation results are presented to investigate the performance improvement from predistorter, and study the distortion effects caused by saturation, overflow, and quantization with different number of bit widths, since the bit width of OFDM baseband (OFDM BB) and DAC/ADC is limited by cost and design constraints in real systems.

2. System Description

As shown in the block diagram of Figure 1, the proposed OFDM predistorter is placed after the OFDM baseband (BB) block to compensate for the degradation function in HPA.

In Figure 1, 𝑟, 𝑞, and 𝑢 are amplitudes of output of the BB, predistorter, and HPA block, respectively. From the normalized Rapp's SSPA model [7], we have 𝑢[𝑟]=𝑟(1+(𝑟/𝐴0)2𝑝)1/2𝑝,(1) where 𝑢 is the HPA output amplitude, 𝐴0>0 is the maximum output amplitude from HPA, and 𝑝>0 is the parameter which controls the smoothness of the transition from the linear to the saturating region. Please note that the phase distortion for HPA is very small and hence can be neglected [7], and thus, the predistorter mainly focuses on amplitude compensation. In order to compensate the nonlinearity of the HPA, using the predistorter, the HPA output 𝑢 is targeted to be linear to 𝑟. Thus, 𝑞(𝑟)(1+(𝑞(𝑟)/𝐴0)2𝑝)1/2𝑝=𝑟.(2) Thus, we can derive the following equation [6]: 𝑞𝑟(𝑟)=(1(𝑟/𝐴0)2𝑝)1/2𝑝,𝑟<𝐴0.(3)

When 𝑟𝐴0, (3) has no solution, and 𝑞 has to be clipped which will be explained in the following section. Figure 2 shows a compensation example of the predistorter [6]. The upper and lower lines pertain to the PD and SSPA model, respectively, and the solid line represents the compensated effect. It shows similar effect with soft envelop limiter.

3. Design Architecture

The closed form expression of predistorter output 𝑞 is shown in the previous section. However, the information of HPA parameters 𝐴0 and 𝑝 are unknown and time varying in practice. Figure 3 shows the detailed design of the architecture of our predistorter for a real time-varying environment.

The major block in the OFDM predistorter is an LMS block to update 𝐴0 and 𝑝 estimations and 𝑞 estimation block which will be explained next in detail. As for other supporting blocks, R/P (rectangular coordinates transfer to polar coordinates) and P/R (polar coordinates transfer to rectangular coordinates) are employed with a 12-stage cordic algorithm. A look-up table is used to store the precalculated values for the calculation of LMS update and 𝑞 estimation.

3.1. LMS Update

The total period is divided into two stages: the training stage covers the start time and periodic pilot time, and the remaining time is the operation stage (see Figure 4).

During the training stage, 𝑞 estimation block is off, that is, 𝑞 is set to be equivalent to the input signal magnitude of 𝑟, and correspondent HPA output 𝑢 is known. The goal during this stage is to track the solution of time-varying 𝐴0 and 𝑝 by LMS algorithm. We define mean square error as follows:

𝐽𝐴0𝑞,𝑝=𝐸1+𝑞/𝐴02𝑝1/2𝑝𝑢2.(4)

From the appendix, we have the expressions of 𝜕𝐽(𝐴0,𝑝)/𝜕𝐴0 and 𝜕𝐽(𝐴0,𝑝)/𝜕𝑝. For LMS algorithm, expected value is replaced by instantaneous value. Therefore, 𝐴0=𝐴(𝑛+1)0(𝑛)𝜇1𝑞(𝑛)𝐴1+𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)1/2̂𝑝(𝑛)𝑢(𝑛)𝑞(𝑛)𝐴0(𝑛)2̂𝑝(𝑛)+11𝐴1+(𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)1/2̂𝑝(𝑛)+1,̂𝑝(𝑛+1)=̂𝑝(𝑛)𝜇2𝑞(𝑛)𝐴1+𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)1/2̂𝑝(𝑛)𝑢(𝑛)𝑞(𝑛)𝐴1+𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)1/2̂𝑝(𝑛)1𝐴̂𝑝(𝑛)1+𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)ln𝑞(𝑛)𝐴0(𝑛)𝑞(𝑛)𝐴0(𝑛)2̂𝑝(𝑛)+𝐴ln1+𝑞(𝑛)/0(𝑛)2̂𝑝(𝑛)2̂𝑝(𝑛)2,(5) where 𝜇1 and 𝜇2 are step factors which will be defined in the following section, and the initial settings are 𝐴0(0)=1 and 𝑝(0)=1.

3.2. 𝑞 Estimation

Figure 4 shows that after 𝑁1 samples in training stage, LMS is turned on once per 𝑁2 samples to update the estimates of 𝐴0 and 𝑝, and turned off during the remaining time (operation stage). Within the operation stage, predistorter is on, and 𝑞 estimation is calculated based on the LMS estimations of 𝐴0 and 𝑝. That is, 𝑞(𝑛)=𝑟(𝑛)𝐴1𝑟(𝑛)/0(𝑛)2̂𝑝(𝑛)1/2̂𝑝(𝑛)𝐴,𝑟(𝑛)<0𝑞(𝑛),max𝐴,𝑟(𝑛)0(𝑛).(6) Please note that when 𝐴𝑟(𝑛)0(𝑛), 𝑞(𝑛) is clipped to 𝑞max. In this paper, we set 𝑞max=8, which will be shown to be suitable in the numerical result session.

3.3. Look-up Table

Five sets of function results are required to be stored in the look-up table to calculate the following functions: 𝐹1(𝑥,𝑦)=[𝑥2𝑦/(1𝑥2𝑦)]1/2𝑦, 𝐹2(𝑥)=ln(𝑥), 𝐹3(𝑥,𝑦)=ln[1+𝑥2𝑦], 𝐹4(𝑥,𝑦)=[𝑥2𝑦/1+𝑥2𝑦]1/2𝑦, and 𝐹5(𝑥,𝑦)=𝑥2𝑦/1+𝑥2𝑦.

Nonlinear quantization is applied to save table space. As for 𝐹1(𝑥,𝑦), 𝑦 is quantized to 𝑚1/32 when 33𝑚196; 𝑥 is quantized to 𝑚2/32 when 1𝑚224, 𝑚2/64 when 49𝑚254, 𝑚2/128 when 109𝑚2123, or 𝑚2/256 when 247𝑚2255. Thus, table I requires the size of [64(24+6+15+9)]×16=3456×16. As for 𝐹2(𝑥), 𝑥 is quantized to 𝑚/512 when 1𝑚60, 𝑚/256 when 61𝑚120, 𝑚/64 when 31𝑚512 (𝑥 will be clipped as eight if it exceeds eight). Thus, table II requires the size of (60+60+482)×16=602×16. As for 𝐹3(𝑥,𝑦), 𝑦 is quantized to 𝑚1/32 when 33𝑚196; 𝑥 is quantized to 𝑚2/32 when 1𝑚231, 𝑚2/16 when 16𝑚231, 𝑚2/8 when 16𝑚264. Thus, table III requires the size of [64(31+16+49)]×16=6144×16. As for 𝐹4(𝑥,𝑦), 𝑦 is quantized to 𝑚1/32 when 33𝑚196; 𝑥 is quantized to 𝑚2/64 when 1𝑚263, 𝑚2/32 when 32𝑚263, 𝑚2/16 when 32𝑚247, 𝑚2/8 when 24𝑚264. Thus, table IV requires the size of [64(63+32+16+41)]×16=9728×16. As for 𝐹5(𝑥,𝑦), 𝑦 is quantized to 𝑚1/32 when 33𝑚196; 𝑥 is quantized to 𝑚2/32 when 1𝑚263, 𝑚2/8 when 16𝑚264. Thus, table V requires the size of [64(63+49)]×16=6944×16. The total size for all of these five look-up tables is 26874×16 whose area is less than 0.5mm2 for CMOS18.

3.4. Complexity Evaluation

Based on the precalculated parameters, (5) can be expressed as 𝐴0𝐴(𝑛+1)=0(𝑛)𝜇1𝐴0(𝑛)𝐹4𝑞(𝑛)𝐴0(𝑛),̂𝑝(𝑛)𝑢(𝑛)𝐹4𝑞(𝑛)𝐴0(𝑛),̂𝑝(𝑛)𝐹4𝑞(𝑛)𝐴0,(𝑛),̂𝑝(𝑛)(7)̂𝑝(𝑛+1)=̂𝑝(𝑛)𝜇2𝐴0(𝑛)𝐹4𝑞(𝑛)𝐴0𝐴(𝑛),̂𝑝(𝑛)𝑢(𝑛)0(𝑛)𝐹4𝑞(𝑛)𝐴0𝐹(𝑛),̂𝑝(𝑛)4𝐴𝑞(𝑛)/0(𝑛),̂𝑝(𝑛)̂𝑝(𝑛)𝐹2𝑞(𝑛)𝐴0(+𝐹𝑛)3𝐴𝑞(𝑛)/0(𝑛),̂𝑝(𝑛)2̂𝑝2.(𝑛)(8) After HPA parameters 𝐴0 and 𝑝 estimation, the 𝑞 is estimated by 𝐴̂𝑞(𝑛)=0(𝑛)𝐹1𝑞(𝑛)𝐴0(𝑛),̂𝑝(𝑛).(9) Therefore, the complexity in total includes 5 addition/subtracitons and 15 multiplications which are relative low.

4. Numerical Results

From (1), HPA results in a highly nonlinear situation with high input amplitude, and small distortions vice versa. Therefore, a relative level of power back off is required to reduce HPA distortion. Here, we define input back-off (IBO) as IBO=10log10𝐴20𝑃in,(10) where 𝑃in is input average power of OFDM signal. Next, we will perform the algorithmic level and hardware level (fixed-point) simulations, while the latter include all the distortion effects in hardware such as round-off error and coefficient quantization.

We set the simulation parameters as follows.

(i)16QAM modulated OFDM signal with 64-point IFFT are studied. (ii)The average input back-off power is 6.375 dB, if not being mentioned. (iii)The start training sequence is employed with a length of 𝑁1=160 training samples, and every training sample per 𝑁2=16 OFDM symbols is applied to the following sequence. (iv)𝐴0 and 𝑝 are both assumed to be Gaussian random numbers with mean of 1 and variance of 0.0025. (v)Step factor 𝜇1=1.5 in training stage and 0.5 in the operation stage, while 𝜇2 is set to be as much as six times of 𝜇1. (vi)The bit width of OFDM output 𝐼𝑥,𝑄𝑥, the bit width of DAC input 𝐼𝑦,𝑄𝑦, and ADC output are evaluated, since the former is limited by the area cost, and the latter is limited by DAC/ADC design. (vii)The channel is assumed as AWGN with variance of 𝜂0/2.

From Figure 5, it shows that the bit width of 10 is recommended for OFDM BB output, DAC input, and ADC output, since there is not much improvement to increase bit width beyond 10. The proposed implementation plan of predistorter is shown to improve system performance even including degradation effect from hardware.

5. Conclusion

In this paper, we have provided an implementation plan of the proposed predistorter in [6] to compensate the nonlinear distortion of SSPA. We used an LMS algorithm for time-varying environment, which we have shown to be capable of tracking SSPA parameters. Finally, a fixed-point simulation including hardware degradation factor was performed to verify the superior performance of the proposed implementation scheme.

Appendix

A. Derivative of 𝐽(𝐴0,𝑝) w.r.t 𝐴0

𝐴𝜕𝐽0,𝑝𝜕𝐴0=𝜕𝐸𝑞/(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑢2𝜕𝐴0𝑞=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢2𝑝1+(𝑞/𝐴0)2𝑝1/2𝑝1𝑞2𝑝𝐴(2𝑝)02𝑝1𝑞=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢𝐴02𝑝+11(1+(𝑞/𝐴0)2𝑝)1/2𝑝+1.(A.1)

B. Derivative of 𝐽(𝐴0,𝑝) w.r.t 𝑝

𝐴𝜕𝐽0,𝑝=𝜕𝑝𝜕𝐸[𝑞/(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑢]2𝑞𝜕𝑝=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑢𝜕[𝑞/(1+(𝑞/𝐴0)2𝑝)1/2𝑝]𝑞𝜕𝑝=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝1𝑢𝑞exp𝑞2𝑝ln1+𝐴02𝑝𝜕[(1/2𝑝)ln(1+(𝑞/𝐴0)2𝑝)]𝑞𝜕𝑝=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢𝑞1+𝐴02𝑝1/2𝑝12𝑝𝜕[ln(1+(𝑞/𝐴0)2𝑝)]𝑞𝜕𝑝+ln1+𝐴02𝑝𝜕(1/2𝑝)𝑞𝜕𝑝=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢𝑞1+𝐴02𝑝1/2𝑝112𝑝(1+(𝑞/𝐴0)2𝑝)𝜕[𝑞/𝐴02𝑝]𝑞𝜕𝑝+ln1+𝐴02𝑝12𝑝2𝑞=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢𝑞1+𝐴02𝑝1/2𝑝112𝑝(1+(𝑞/𝐴0)2𝑝)𝜕exp2𝑝ln𝑞/𝐴0𝑞𝜕𝑝+ln1+𝐴02𝑝12𝑝2𝑞=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢𝑞1+𝐴02𝑝1/2𝑝1𝑞2𝑝(1+(𝑞/𝐴0)2𝑝)𝑞exp2𝑝ln𝐴0𝜕2𝑝ln𝑞/𝐴0𝜕𝑝+ln1+𝑞/𝐴02𝑝12𝑝2𝑞=2𝐸(1+(𝑞/𝐴0)2𝑝)1/2𝑝𝑞𝑢(1+(𝑞/𝐴0)2𝑝)1/2𝑝1𝑝(1+(𝑞/𝐴0)2𝑝)𝑞ln𝐴0𝑞𝐴02𝑝+ln(1+(𝑞/𝐴0)2𝑝)2𝑝2.(B.1)