This article describes the digital predistortion (DPD) function for the ADRV9002. Some of the debugging techniques used can also be applied to general DPD systems. First, background information on DPD is outlined along with some of the typical issues users may encounter when experimenting with their system. Lastly, it describes tuning strategies that can be applied to the DPD algorithm with the help of a DPD software tool to analyze performance.
Digital predistortion (commonly known as DPD) is an algorithm widely used in wireless communication systems. DPD’s purpose is to suppress the spectral regrowth on the wideband signal that is passed through the radio frequency power amplifier (PA) [1], thereby improving the PA’s overall efficiency. In general, PAs become nonlinear and inefficient when handling high power input signals. The nonlinearity causes spectral regrowth, which spills into the neighboring bands as interference. Figure 1 shows the spectrum before and after DPD correction using the TETRA1 standard on the ADRV9002 platform.
The ADRV9002 offers an internal, programmable, and power optimized DPD algorithm that can be customized to correct the nonlinear effect of the PA, thus improving the overall adjacent channel power ratios (ACPR). Despite the desired benefits that DPD brings to communication systems, it is often very difficult for an inexperienced person to start working with DPD, not to mention getting it set up properly. This is largely due to numerous factors that could contribute to errors and thus poor DPD performance. Even after hardware is set up properly, it may still be challenging to pinpoint the correct parameters to fine tune DPD and obtain the optimal solution. This article aims to help engineers who use the DPD option in the ADRV9002. We also include some typical issues that a user often encounters, and we provide some general strategies on fine tuning a DPD model with the available parameters in order to obtain optimal DPD performance. The device also includes a MATLAB® tool to help users analyze DPD. This should help eliminate many common mistakes and provide some insights on the internal DPD operations. This article will help users get started with DPD and provide useful information on both theoretical concepts and resolving practical issues.
The ADRV9002 offers up to 20 MHz signal bandwidth when the DPD option is enabled. This is because the receive bandwidth is limited to 100 MHz and DPD typically operates with a receive bandwidth 5× the transmit bandwidth, so that the third- and fifth-order intermodulation products can be observed and corrected. The highest PA peak power the ADRV9002 supports is around the 1 dB compression point (commonly known as P1dB), a metric that indicates the severity of PA compression. If the PA is compressed beyond P1dB, DPD is not guaranteed to work properly. This is not a strict limit: in many cases DPD works beyond P1dB and still provides very good ACPR, but this must be investigated case by case. In general, if the compression is too severe, DPD can become unstable or crash. Later sections discuss the compression region in more detail, including how to observe the current PA compression status using the MATLAB tool.
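The bandwidth limit follows from simple arithmetic. The sketch below assumes the 5× rule of thumb stated above; the function names are illustrative and are not part of any ADRV9002 API.

```python
def max_dpd_signal_bandwidth_hz(observation_bandwidth_hz, expansion_factor=5):
    """Widest transmit signal whose regrowth still fits in the observation path."""
    return observation_bandwidth_hz / expansion_factor


def intermod_bandwidth_hz(signal_bandwidth_hz, order):
    """Approximate span occupied by the intermodulation product of a given odd order."""
    return order * signal_bandwidth_hz


print(max_dpd_signal_bandwidth_hz(100e6))  # 20000000.0
print(intermod_bandwidth_hz(20e6, 5))      # 100000000.0
```

With a 100 MHz observation bandwidth, a 20 MHz signal is the widest one whose fifth-order products are still fully visible to the feedback receiver.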
More details on DPD can be found in UG-1828, in the “Digital Predistortion” chapter.
There are two basic approaches to DPD. In indirect DPD, the signal is captured before and after the PA; the algorithm learns the PA’s nonlinear behavior and applies the inverse in the DPD block. In direct DPD, the signal is captured before the DPD block and after the PA, and the error between the two is eliminated by applying predistortion in the DPD block. The advantages and disadvantages of each are beyond the scope of this article. Users should know that the ADRV9002 uses the indirect approach and be aware of its implications. In particular, the data captured for the MATLAB tool follows the indirect approach as well.
Figure 2 shows a high level DPD operation block diagram for the ADRV9002. Input signal u(n) goes into the DPD block. DPD will predistort the signal and generate x(n). Here we call this transmit capture, although it’s really the predistorted version of the transmit signal. The signal then goes through the PA to become y(n), which eventually gets sent out into the air. We call y(n) the receive capture, although it’s really the transmit signal after the PA. y(n) then feeds back to the receiver port, used as an observation receiver. Essentially, the DPD engine will take captures of x(n) and y(n), then generate the coefficients, which will be applied in the next iteration of DPD.
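To make the indirect approach concrete, here is a toy sketch of one learning iteration. It is not the ADRV9002 algorithm: it assumes a real-valued, memoryless PA with unity small-signal gain and a made-up cubic compression, and it fits a two-term post-inverse by least squares from the x(n) and y(n) captures. All names are hypothetical.

```python
def pa(x):
    """Toy PA model (assumed): unity gain with cubic compression."""
    return x - 0.2 * x ** 3


def fit_post_inverse(x_cap, y_cap):
    """Least-squares fit of x ~ c1*y + c3*y**3 from captured (x, y) pairs."""
    s11 = sum(y ** 2 for y in y_cap)
    s13 = sum(y ** 4 for y in y_cap)
    s33 = sum(y ** 6 for y in y_cap)
    b1 = sum(x * y for x, y in zip(x_cap, y_cap))
    b3 = sum(x * y ** 3 for x, y in zip(x_cap, y_cap))
    det = s11 * s33 - s13 * s13
    # Solve the 2x2 normal equations with Cramer's rule.
    c1 = (b1 * s33 - b3 * s13) / det
    c3 = (s11 * b3 - s13 * b1) / det
    return c1, c3


def rms(values):
    return (sum(v * v for v in values) / len(values)) ** 0.5


# "Transmit capture" x(n) and "receive capture" y(n) for the first iteration.
x_cap = [0.8 * n / 199 for n in range(1, 200)]
y_cap = [pa(x) for x in x_cap]
c1, c3 = fit_post_inverse(x_cap, y_cap)


def predistort(u):
    """Use the fitted post-inverse as the predistorter."""
    return c1 * u + c3 * u ** 3


err_without_dpd = rms([pa(u) - u for u in y_cap])
err_with_dpd = rms([pa(predistort(u)) - u for u in y_cap])
print(err_with_dpd < err_without_dpd)  # True
```

The coefficients learned from the two captures are copied into the predistorter for the next iteration, which mirrors the loop in the block diagram.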
Mode of Operation
The ADRV9002 supports DPD in both TDD and FDD operation. In TDD mode, DPD is updated on every transmit frame, which means the receiver acts as the observation path during the transmit frame. In FDD mode, the transmitter and receiver run at the same time, so a dedicated receiver channel is needed for observation. Because the ADRV9002 is a 2T2R device, it can support DPD in 2T2R/1T1R TDD and 1T1R FDD modes.
The following equations show the DPD model implemented in the transmit path.
u(n) is the input signal to DPD
x(n) is the output signal of DPD
T is the total number of taps of the DPD model
ψ_t is the polynomial function to implement the lookup table (LUT) for tap t
l_t is the amplitude delay
k_t is the data delay
a_{t,l_t,i} is the coefficient calculated by the DPD engine
b_{t,l_t,i} is the switch to enable or disable the term
i is the index and power of the polynomial term
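The equations themselves are not reproduced in this text. A form consistent with the symbol definitions above, offered as a reconstruction rather than a quotation from the user guide, would be:

```latex
x(n) = \sum_{t=0}^{T-1} \psi_t\bigl(|u(n-l_t)|\bigr)\, u(n-k_t)

\psi_t\bigl(|u(n-l_t)|\bigr) = \sum_{i} b_{t,l_t,i}\, a_{t,l_t,i}\, |u(n-l_t)|^{i}
```

Here each tap t multiplies a delayed copy of the input by a polynomial in a delayed copy of the input magnitude, and the polynomial ψ_t is what the LUT for that tap stores.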
Users can configure the number of polynomial terms for each tap. ADRV9002 provides three memory term taps and one cross term tap, each with an order from 0 to 7.
Users may select a default model option provided by ADRV9002 (shown in Figure 3), which should work for most common cases. Alternatively, users can choose their own model by enabling and disabling terms. The first three taps (0 to 2) indicate the memory terms, where Tap 1 is the center tap. Tap 3 is the cross term tap.
Note Tap 3 (or the cross term tap) should not have the zeroth-order term enabled, to differentiate from the memory term taps.
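The tap structure can be illustrated with a short sketch. It assumes each tap contributes ψ_t(|u(n − l_t)|)·u(n − k_t), with ψ_t a polynomial whose terms are gated by the enable switches, and that a cross term tap is one where l_t differs from k_t. The dictionary layout and function names are hypothetical, and the on-chip implementation uses LUTs rather than direct polynomial evaluation.

```python
def sample(u, idx):
    """Return u[idx], treating samples outside the capture as zero."""
    return u[idx] if 0 <= idx < len(u) else 0j


def dpd_output(u, taps):
    """Evaluate the polynomial DPD model sample by sample.

    Each tap is a dict with 'l' (amplitude delay), 'k' (data delay),
    'a' (coefficients, index = power) and 'b' (0/1 enable per term).
    """
    x = []
    for n in range(len(u)):
        acc = 0j
        for tap in taps:
            mag = abs(sample(u, n - tap["l"]))
            psi = sum(b * a * mag ** i
                      for i, (a, b) in enumerate(zip(tap["a"], tap["b"])))
            acc += psi * sample(u, n - tap["k"])
        x.append(acc)
    return x


# Center tap only, with a zeroth-order and a first-order term enabled.
center = {"l": 0, "k": 0, "a": [1.0, 0.5], "b": [1, 1]}
print(dpd_output([2 + 0j], [center]))  # [(4+0j)]
```

Setting an entry of 'b' to 0 removes that term from the model without touching its coefficient, which is how enabling and disabling terms is represented here.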
- LUT Size: Users can set the LUT size. The ADRV9002 provides two options, 256 and 512. With the 512 size, users will have a better quantization noise level, and thus better ACPR, as a larger size will generally provide a better resolution of the signal. For narrow-band applications, we recommend using 512 as the default option. 256 could be used for wideband as the noise level is not as stringent, and the computation and power can be improved.
- Pre-LUT Scale: Users can set the pre-LUT scaler to scale the input data to fit better on the compander. The compander takes the signal from the transmitter and compresses it to fit in the 8-bit LUT address. Depending on their input signal level, users can adjust this value to optimize the LUT utilization. The values can be set in range (0, 4) with a step of 0.25. There is more on the compander in the last section of this article.
To perform DPD, users will have to enable an external loopback path on the PA and then set the feedback power to make sure it’s not out of range. Note it’s the peak power, not the average power. Power that is too strong or too weak will impact DPD performance. Users also need to set the external path delay, which can be obtained using External_Delay_Measurement.py. This script can be found in the ADRV9002 evaluation software installation path under the IronPython folder.
Note that the external delay only needs to be set for high sample rate profiles (for example, LTE 10 MHz). For low sample rate profiles (TETRA1 25 kHz), the user can set it to 0. Later in this article, we will use the software tool to observe the capture data to see the external delay effect.
Users can configure the number of samples used for DPD. The default is 4096 samples, which is the recommended setting and provides optimal DPD results in most cases.
- Additional Power Scale is a more advanced parameter. For the most part, it’s recommended to use the default value of 4 for the ADRV9002. This parameter relates to the internal correlation matrix. In our experiments, the default value gave the best performance for the waveforms and PAs we tested. In rare cases, where the input signal amplitude is extremely small or large, users can try a smaller or larger value so that the correlation matrix keeps a proper condition number and the solution is therefore more stable.
- Rx/Tx Normalization: Users should set the receiver/transmitter normalization to the region where the data is linear. In Figure 6, the linear region is shown in red. In this region the power of data has not reached the compression region and is high enough for gain calculation. Once the region is selected, DPD can make an estimate on the gain of the transmitter and receiver, and proceed with further processing on the algorithm. For most cases, –25 dBFS to –15 dBFS should accommodate most standard PAs. However, users should still pay attention as special PAs could have very different shapes of AM/AM curves, in which case a proper modification will be needed. This will be described in more detail in later sections of this article.
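The role of the normalization window can be sketched as follows. This is an illustration only, assuming capture samples normalized so that full scale is 1.0; the on-chip gain estimator is not documented here, and the function name is hypothetical.

```python
def linear_region_gain(tx, rx, low_dbfs=-25.0, high_dbfs=-15.0):
    """Estimate the Rx/Tx magnitude gain using only samples in the linear window."""
    low = 10 ** (low_dbfs / 20)
    high = 10 ** (high_dbfs / 20)
    pairs = [(abs(t), abs(r)) for t, r in zip(tx, rx) if low <= abs(t) <= high]
    if not pairs:
        raise ValueError("no samples fall inside the normalization window")
    return sum(r for _, r in pairs) / sum(t for t, _ in pairs)


tx = [0.01 + 0j, 0.1 + 0j, 0.15 + 0j, 0.9 + 0j]
rx = [0.02 + 0j, 0.2 + 0j, 0.3 + 0j, 1.2 + 0j]
gain = linear_region_gain(tx, rx)
print(round(gain, 6))  # 2.0
```

The compressed sample at 0.9 and the very small sample at 0.01 are both excluded, so neither compression nor noise biases the gain estimate.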
A typical setup is shown in Figure 7. A low-pass filter is needed before the signal goes into the PA, to prevent LO signal harmonics. In certain cases, where internal LO phase noise performance does not satisfy the application, external LO may be needed. In such case, the external LO source needs to be synchronized with DEV_CLK. This is typically needed for narrow-band DPD, where the close-band noise requirement is more stringent. It is generally recommended to have a variable attenuator before the PA to prevent potential damage to the PA. The feedback signal should have the proper attenuation to have peak power set as discussed in the previous section.
Download the IronPython library in order to execute the IronPython code on the GUI.
Users can run dpd_capture.py, which is provided along with the MATLAB tool, in the IronPython window of the GUI (as shown in Figure 8) to capture transmitter and receiver data. The DPD sample rate is also included in the capture file.
Note this script should be run either in a primed or calibrated state.
The MATLAB tool analyzes the captured data from dpd_capture.py. This tool helps check signal integrity, signal alignment, and PA compression level and, finally, helps fine tune DPD.
The MATLAB tool requires MATLAB Runtime. A first-time install will take some time to download. Once installed, users can load the data that is captured by the IronPython script, and then observe the plots, as shown in Figure 9.
Users can also set the high/low threshold on the normalization of the data and hit Reload to see the changes.
First, we have the normalized transmitter and receiver data plotted in the time domain. Users can zoom in to observe the status of the alignment of the transmitter and receiver. We only show the real part of the data, but users can easily plot the imaginary part as well. Normally, real and imaginary parts should both be either aligned or unaligned.
Then we have the transmitter and receiver spectra—the blue is transmitter and red is receiver. Note this is indirect DPD—the transmitter data will be the predistorted data, not the transmitter datapath over the SSI port.
Next we have two AM/AM curves, both in linear and dB scales. These are important metrics on DPD performance and PA compression status.
The AM/PM curve and receiver/transmitter phase difference are also provided.
Additionally, we also have the high and low threshold numbers. These numbers should match what’s set in the ADRV9002 TES evaluation software.
Note since we have provided APIs to capture data, users can develop their own plots and analysis models if needed. The tool provides some of the common checks for analyzing DPD. The APIs are:
adi_ADRV9002_dpd_CaptureData_Read, which reads the DPD capture data and must be run in a calibrated or primed state.
adi_ADRV9002_DpdCfg_t → dpdSamplingRate_Hz, which is the DPD sample rate (a read-only parameter).
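As a starting point for custom analysis, the sketch below derives AM/AM and AM/PM points from capture data. It assumes the transmitter and receiver captures have already been loaded into two equal-length lists of complex samples; the capture file format is not covered here, and the function name is hypothetical.

```python
import cmath
import math


def am_am_am_pm(tx, rx):
    """Return (input magnitude, gain in dB, phase shift in degrees) per sample pair."""
    points = []
    for t, r in zip(tx, rx):
        if abs(t) == 0 or abs(r) == 0:
            continue  # gain and phase are undefined for zero samples
        gain_db = 20 * math.log10(abs(r) / abs(t))
        phase_deg = math.degrees(cmath.phase(r * t.conjugate()))
        points.append((abs(t), gain_db, phase_deg))
    return points


for point in am_am_am_pm([1 + 0j, 0.5j], [2j, -1 + 0j]):
    print(point)
```

Plotting gain against input magnitude gives the AM/AM curve, and plotting the phase shift against input magnitude gives the AM/PM curve.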
DPD can be affected by many different factors. Therefore, it’s worthwhile to make sure all the potential issues listed are considered and examined by the user. Before considering all issues, users should make sure hardware is connected correctly.
Transmit Data Overload
Figure 10 shows a high level block diagram of DPD implementation by ADRV9002. Transmitter data coming from the interface can overload the DAC. If the DAC is overloaded, the RF signal of the transmitter will be distorted even before involvement of the PA. Therefore, it is critical to make sure transmitter data does not overload the DAC.
To see if the transmitter DAC is overloading, users can just observe it from the GUI. Figure 11 shows a TETRA1 25 kHz waveform. The peak is still far away from the digital full scale. For the ADRV9002, it’s recommended to be at least a few dB from full scale, to avoid potential overload of the DAC. It’s difficult to quantify how much users should back off—this is because DPD will try to perform predistortion, and the predistorted signal will be “peak expanded,” therefore potentially overloading the DAC. This depends on how DPD is reacting to a particular PA—generally, the more compressed the PA, the more room it needs for peak expansion.
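The back-off can also be checked numerically on the waveform file. This is a minimal sketch, assuming I/Q samples normalized so that digital full scale is 1.0 on each of I and Q; the function name is hypothetical.

```python
import math


def peak_dbfs(samples, full_scale=1.0):
    """Largest I or Q excursion, in dB relative to digital full scale."""
    peak = max(max(abs(s.real), abs(s.imag)) for s in samples)
    return 20 * math.log10(peak / full_scale)


waveform = [0.5 + 0.1j, -0.25j, 0.3 - 0.4j]
headroom_db = -peak_dbfs(waveform)
print(round(headroom_db, 2))  # 6.02
```

The headroom reported here applies to the waveform before predistortion, so it has to cover whatever peak expansion DPD adds for the PA in use.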
Receiver Data Overload
Another common error is the receiver data overloading the feedback ADC. This is caused by not having enough attenuation on the path back to the receiver port. The effect, which can be observed in the debugging tool, is that the receiver data is clipped. Because of the clipping, the transmitter and receiver data cannot be aligned effectively, which causes calculation errors in DPD. DPD typically behaves extremely poorly in this case, resulting in increased noise across the whole spectrum.
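Clipping in the receiver capture can be flagged with a simple count. The sketch assumes samples normalized to a full scale of 1.0 and treats any I or Q value within 0.1% of full scale as clipped; both the threshold and the function name are assumptions for illustration.

```python
def clipped_fraction(samples, full_scale=1.0, margin=0.999):
    """Fraction of samples whose I or Q component sits at (or next to) full scale."""
    limit = margin * full_scale
    hits = sum(1 for s in samples
               if abs(s.real) >= limit or abs(s.imag) >= limit)
    return hits / len(samples)


rx_capture = [1 + 0j, 0.5 + 0.5j, -1j, 0.2 + 0j]
print(clipped_fraction(rx_capture))  # 0.5
```

A healthy capture should report a fraction at or very near zero; anything more suggests the feedback attenuation is too low.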
Receiver Data Underload
Compared to receiver overload, this issue is easily overlooked. It is caused by incorrect feedback attenuation: with too much attenuation in the feedback path, the receiver data becomes too small. By default, –18 dBm peak is recommended for the ADRV9002 because it brings the data from analog to digital at a known good power level for DPD. However, users can tune this number to fit their needs. Users should know that the DPD feedback receiver does not use the same attenuator that regular receivers use, and its step size is much larger. The level of attenuation is adjusted according to the peak power level set by the user. –23 dBm is the lowest power level (with 0 attenuation). Below that, the power is too low and DPD performance will suffer. As a rule of thumb, users should make sure the feedback power is always measured and set correctly. Users often try different power levels and forget to update the feedback power setting, which causes this issue.
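The feedback level is a matter of dB bookkeeping. The sketch below uses the –18 dBm default and the –23 dBm floor from the text; the 40 dBm PA peak output and the function names are hypothetical, and coupler or cable losses would simply be added to the external attenuation.

```python
TARGET_PEAK_DBM = -18.0  # default recommended peak level at the feedback port
MIN_PEAK_DBM = -23.0     # lowest usable peak level (0 internal attenuation)


def required_external_attenuation_db(pa_peak_dbm, target_dbm=TARGET_PEAK_DBM):
    """Total attenuation needed between the PA output and the feedback port."""
    return pa_peak_dbm - target_dbm


def feedback_peak_dbm(pa_peak_dbm, external_attenuation_db):
    """Peak power that actually reaches the feedback port."""
    return pa_peak_dbm - external_attenuation_db


def is_underloaded(peak_dbm):
    """True when the feedback peak is below the lowest usable level."""
    return peak_dbm < MIN_PEAK_DBM


print(required_external_attenuation_db(40.0))        # 58.0
print(is_underloaded(feedback_peak_dbm(40.0, 70.0)))  # True
```

Rerunning this calculation whenever the PA output power changes is a cheap way to avoid the stale feedback setting described above.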
TDD vs. FDD
DPD in TDD mode must be run with the automated state machine. When evaluating with TES in manual TDD mode, users can still enable DPD, but performance will be poor. This is because DPD operates on a per-frame basis, and in manual TDD mode the length of a frame is determined by the toggling of the transmit/receive enable signal. In other words, each play and stop is a frame. In the time it takes a human being to toggle the signal, the PA has already moved to a different state in terms of temperature. Therefore, it’s impossible to maintain the DPD state without using the automated TDD mode, where the transmit enable signal can be toggled frequently. In FDD mode, however, DPD should perform normally.
For example, a user may want to use TETRA1, which follows a TDD-like frame scheme (it’s actually TDM-FDD). Directly selecting TDD mode and checking DPD manually is not recommended in this case, because DPD tends to perform poorly. Instead, users can either use the “Custom FDD” profile and pick the same sample rate and bandwidth as TETRA1, or users can set TETRA1 TDD frame timing and use automated TDD mode. Both methods can provide much better performance than manual TDD.
ADRV9002 will try to time align transmitter and receiver data. When data is captured by a user, they are expected to be aligned. The delay measurement is done in the initial calibration time. However, for high sample rate profiles, more precise subsample alignment needs to be done separately.
DPD is an adaptive algorithm that works on the error between two signals, the transmitter data and the receiver data. Before the error is taken, the two signals need to be properly aligned, especially if a high sample rate profile is used (for example, LTE10), because the intervals between samples are small. Therefore, users will need to run the script External_Delay_Measurement.py to extract the external path delay. This number can be entered under Board Configuration → Path Delay.
The effect of not having transmitter and receiver data aligned is that users will observe a much noisier AM/AM curve.
After setting the path delay number, we can observe the AM/AM and AM/PM curves to be much cleaner and less noisy. Phase difference is also much smaller.
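Alignment of a capture can be sanity-checked offline with a cross-correlation. The sketch below is not how External_Delay_Measurement.py works (its internals are not described here) and it only resolves whole-sample delays, not the subsample alignment mentioned above. The test signal is synthetic and the function name is hypothetical.

```python
import random


def estimate_integer_delay(tx, rx, max_lag):
    """Return the lag, in samples, at which rx best matches tx."""
    best_lag, best_peak = 0, -1.0
    for lag in range(max_lag + 1):
        # Correlate tx[n] against rx[n + lag]; zip stops at the shorter sequence.
        corr = sum(r * t.conjugate() for t, r in zip(tx, rx[lag:]))
        if abs(corr) > best_peak:
            best_lag, best_peak = lag, abs(corr)
    return best_lag


rng = random.Random(0)
tx = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(256)]
rx = [0j] * 3 + tx[:-3]  # receiver capture delayed by three samples
print(estimate_integer_delay(tx, rx, max_lag=10))  # 3
```

A nonzero result on real capture data suggests the path delay setting needs attention before the AM/AM curve is interpreted.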
Each PA has its own specification in terms of how much compression it can handle. Although the P1dB figure is typically given in the data sheet, in practice it is still recommended to measure precisely with DPD to make sure that the compression point is at P1dB. The DPD software lets the user view the AM/AM curve of the captured data to observe how close the compression point is to P1dB.
If, however, the signal is beyond P1dB, DPD can become unstable or even break, with the spectrum jumping to a very high level and never coming back down. In Figure 19, compression is well beyond the 1 dB region on the peaks, and the curve also starts to flatten. This is a sign that the PA is overdriven: to obtain more output power, the input must be pushed disproportionately harder. If the user continues to increase the input power at this point, DPD performance will degrade.
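The compression at the signal peak can be read off the capture data numerically as well as from the plot. This sketch assumes aligned captures normalized to a full scale of 1.0 and takes everything below –15 dBFS as the linear region; the threshold and the function name are assumptions.

```python
import math


def compression_at_peak_db(tx, rx, linear_max_dbfs=-15.0):
    """Gain drop at the largest input sample relative to the small-signal gain."""
    linear_max = 10 ** (linear_max_dbfs / 20)
    pairs = [(abs(t), abs(r)) for t, r in zip(tx, rx) if abs(t) > 0]
    linear = [(t, r) for t, r in pairs if t <= linear_max]
    if not linear:
        raise ValueError("no samples in the linear region")
    small_signal_gain = sum(r for _, r in linear) / sum(t for t, _ in linear)
    t_peak, r_peak = max(pairs)  # pair with the largest input magnitude
    return 20 * math.log10(small_signal_gain / (r_peak / t_peak))


tx = [0.05 + 0j, 0.1 + 0j, 1 + 0j]
rx = [0.1 + 0j, 0.2 + 0j, complex(2 / 10 ** (1 / 20), 0)]
print(round(compression_at_peak_db(tx, rx), 3))  # 1.0
```

A result near 1 dB means the peaks sit at roughly P1dB; noticeably larger values indicate the PA is being driven into the region where DPD may become unstable.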
General Strategy for Model Picking and Tuning
The idea of indirect DPD is to have data captured before and after the PA, while the DPD engine will try to mimic the opposite effect of the PA. The LUTs are used to apply this effect using the coefficients, and the model is polynomial based. This means DPD is more like a curve-fitting problem, and users will try to use the terms to “curve fit” the nonlinearity effect. The difference is the curve-fitting problem fits a single curve, while DPD also must account for the memory effect. ADRV9002 has three memory taps and one cross tap for modeling DPD LUTs.
Figure 20 shows the three memory taps and the one cross tap that ADRV9002 provides. The general strategy is similar to a curve-fitting problem. Users can start with a baseline and then add and remove terms. In general, the center tap (Tap 1) must exist. Users can add and remove its terms one by one to test the effect on DPD. Then users can add the two other memory taps (taps 0 and 2) to correct for the memory effect. Note since the ADRV9002 has two side taps, these taps should be the same, that is, symmetrical. Terms on the side taps should also be added and removed one by one. Lastly, users can experiment with the cross term tap. Cross terms complete the curve-fitting problem from a mathematics point of view, thus providing better performance from DPD.
Note users should not skip terms by leaving them blank, as this will cause undesired DPD behavior. Note also that users should not set the zeroth term on the cross term tap, as this is not valid from a mathematics point of view.
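These rules can be captured in a small pre-flight check. The sketch represents a model as four taps of eight enable flags (orders 0 to 7) and reads "do not skip terms" as "the enabled terms of a tap form one contiguous run"; that reading, the data layout, and the function name are assumptions, not part of the ADRV9002 API.

```python
def validate_model(taps):
    """Return a list of rule violations for a 4-tap by 8-order enable matrix."""
    if len(taps) != 4 or any(len(tap) != 8 for tap in taps):
        raise ValueError("expected 4 taps with 8 enable flags each")
    problems = []
    if not any(taps[1]):
        problems.append("center tap (Tap 1) has no terms enabled")
    if list(taps[0]) != list(taps[2]):
        problems.append("side taps (Tap 0 and Tap 2) are not symmetrical")
    if taps[3][0]:
        problems.append("cross term tap (Tap 3) has the zeroth-order term enabled")
    for index, tap in enumerate(taps):
        enabled = [order for order, on in enumerate(tap) if on]
        if enabled and enabled[-1] - enabled[0] + 1 != len(enabled):
            problems.append(f"Tap {index} skips a term")
    return problems


side = [True] * 3 + [False] * 5
center = [True] * 5 + [False] * 3
cross = [False, True, True] + [False] * 5
print(validate_model([side, center, side, cross]))  # []
```

Running a check like this before each experiment keeps the one-term-at-a-time tuning loop from accidentally producing an invalid model.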
Compander and Pre-LUT Scaler
In a previous section, we mentioned the compander. When first reading the user’s guide, this concept may create some confusion on what it means or what to choose (256 or 512). The purpose of the compander is to compress the input data and fit it in the LUT.
The general shape of the compander is a square root. I/Q data comes in, and before it is put in the LUTs, the expression √(i(n)² + q(n)²) is used to get the signal magnitude required by the previous equations. However, a square root is an expensive operation in terms of speed, and the result also needs to be mapped to a LUT address (8 bits or 9 bits). The compander handles both. Figure 22 is the ideal square root curve. The actual implementation is not shown here, but in short it is an approximation of the square root curve.
Once we understand how data is fit into the LUTs we can start tuning data more intelligently. ADRV9002 has the option of choosing 8 bits (256) or 9 bits (512) for the LUT size. A bigger LUT means doubling the address locations for data. This means a finer resolution of the data and, in general, a better quantization noise level. For narrow-band applications, since the noise is so important, we recommend always using the 512 option. For wideband applications, since noise level is not so crucial, either option can be used. However, if the 512 option is used, slightly more power will be consumed and computation will be slower.
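The mapping from an I/Q sample to a LUT address can be sketched with the ideal square root. The real compander is an approximation whose details are not given here, and this sketch further assumes a full-scale magnitude of 1.0 and that the pre-LUT scale multiplies the magnitude before addressing. The function name is hypothetical.

```python
import math


def lut_address(i, q, lut_size=256, pre_lut_scale=1.0):
    """Ideal compander: magnitude of (i, q), scaled and quantized to a LUT index."""
    magnitude = pre_lut_scale * math.sqrt(i * i + q * q)
    return min(int(magnitude * lut_size), lut_size - 1)  # saturate at the top entry


print(lut_address(0.5, 0.0, lut_size=256))                      # 128
print(lut_address(0.5, 0.0, lut_size=512))                      # 256
print(lut_address(0.25, 0.0, lut_size=256, pre_lut_scale=2.0))  # 128
```

The same input lands on twice as fine a grid with the 512-entry LUT, and a pre-LUT scale of 2 lets a small signal reach the same addresses that a full-level signal would use.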
Histogram and CFR
We briefly mentioned pre-scale in DPD configuration. This parameter is used to provide a boost of the input data to LUTs. The reason this boost is needed is because in some cases, the data is not utilized properly by DPD. For a PA compression problem like this, it’s the high amplitude samples that really get compressed and cause problems. Therefore, we must not treat all samples equally; instead we want to focus our attention on the high amplitude samples.
Take a look at the TETRA1 standard waveform histogram (see Figure 23 and Figure 24). We can observe that the majority of values occur in the medium to high amplitude regions. This is because the TETRA1 standard uses a D-QPSK modulation scheme, which results in a signal with a nearly constant envelope: the peak power does not differ much from the average power.
This is desired for DPD. As mentioned previously, DPD will catch more high amplitude samples and therefore better characterize the behavior of the PA.
Now we look at the LTE10 standard in a similar way. LTE uses an OFDM modulation scheme, which combines hundreds or thousands of subcarriers. Here we have the magnitude and power again for LTE10. In contrast to TETRA1, the peaks are clearly very far away from the average.
In the power histogram (see Figure 26), if we zoom in on the far end, we can observe that there are still very high peaks occurring but with very low probability. This is very undesirable for DPD. There are two reasons.
First, the rare high peaks (high amplitude samples) make the PA extremely inefficient. For example, LTE PAPR is about 11 dB, which is a big difference between peak and average. To avoid damaging the PA, the input level must be backed off by a very large margin, so the PA does not use most of its gain capability to boost power.
Second, the high peaks also waste LUT capacity. Because of these peaks, the LUTs allocate many of their entries to them, and only a small portion of the LUTs is left for the majority of the data. This degrades DPD performance.
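Both effects can be quantified from a waveform file. The sketch below computes the PAPR and a coarse amplitude histogram, assuming the waveform is a list of complex samples with at least one nonzero value; the function names are hypothetical.

```python
import math


def papr_db(samples):
    """Peak-to-average power ratio in dB."""
    powers = [abs(s) ** 2 for s in samples]
    return 10 * math.log10(max(powers) / (sum(powers) / len(powers)))


def amplitude_histogram(samples, bins=8):
    """Count samples per amplitude bin, with the peak amplitude as the top edge."""
    peak = max(abs(s) for s in samples)
    counts = [0] * bins
    for s in samples:
        counts[min(int(abs(s) / peak * bins), bins - 1)] += 1
    return counts


spiky = [1 + 0j, 0.1 + 0j, 0.1 + 0j, 0.1 + 0j]
print(round(papr_db(spiky), 2))
print(amplitude_histogram(spiky, bins=4))  # [3, 0, 0, 1]
```

In the example, one rare peak pushes three quarters of the samples into the lowest bin, which is the same pattern that leaves most LUT entries underused for a high-PAPR signal.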
Crest factor reduction (CFR) is a technique that moves the signal peaks down to a more acceptable level. It is typically used with OFDM type signals. ADRV9002 does not include on-chip CFR, so this function needs to be implemented externally. The ADRV9002 TES evaluation software also includes CFR versions of the LTE waveforms for this purpose. CFR_sample_rate_15p36M_bw_10M.csv is shown in Figure 28. We can observe that at high power the signal is peak limited to a certain level (the tilt at the end) due to CFR. This effectively brings the PAPR down to about 6.7 dB, a reduction of more than 4 dB. The operation of CFR will “hurt” the data, in the sense that EVM will degrade. However, compared to the whole waveform, the high amplitude peaks have a very small probability of occurrence, and the benefits are tremendous.
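The simplest possible form of peak limiting is a polar hard clip, sketched below for illustration. It is not the method used to generate the CFR waveforms shipped with TES, which is not described here. Practical CFR normally adds filtering or peak cancellation, because hard clipping on its own creates out-of-band distortion. The function name is hypothetical.

```python
def hard_clip(samples, limit):
    """Limit each sample's magnitude to `limit` while preserving its phase."""
    clipped = []
    for s in samples:
        magnitude = abs(s)
        clipped.append(s * (limit / magnitude) if magnitude > limit else s)
    return clipped


out = hard_clip([3 + 4j, 0.3 + 0.4j], limit=1.0)
print([round(abs(s), 6) for s in out])  # [1.0, 0.5]
```

Only the rare samples above the limit are altered, which is why the EVM cost stays small relative to the PAPR that is gained back.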
DPD is a complex algorithm that many people find difficult to work with. It takes a lot of effort and care to set up both the hardware and software to obtain optimal results. ADRV9002 offers integrated on-chip DPD, which dramatically decreases the complexity. ADRV9002 also comes with a DPD software tool to help users analyze their DPD performance.