# Analog Processing to Enable Scalable High-Throughput mm-Wave Wireless Fiber Systems

(Invited Paper)

Mahmoud Sawaby<sup>(1)</sup>, Babak Mamandipoor<sup>(2)</sup>, Upamanyu Madhow<sup>(2)</sup>, Amin Arbabian<sup>(1)</sup>
(1) EE Department, Stanford University, Stanford, CA, USA
(2) ECE Department, University of California, Santa Barbara, CA, USA
Emails: {msawaby, arbabian}@stanford.edu, {bmamandi, madhow}@ece.ucsb.edu

*Abstract*—This work explores challenges in silicon integration of scalable high-throughput "Wireless Fiber" links that exploit the increase in spatial and spectral degrees of freedom at higher carrier frequencies due to LOS MIMO spatial multiplexing and higher bandwidths. In order to utilize these increased degrees of freedom, however, hardware must scale in dynamic range, speed and number of antenna elements. To this end, we examine tradeoffs in the partitioning of functionality between the transmitter and receiver, as well as between the analog and digital domains, and investigate a new scalable analog processing architecture for the receiver.

*Index Terms*—Line of Sight, MIMO, antenna array, spatial multiplexing, broadband communication systems, mm-wave, analog, digital.

# I. INTRODUCTION

Today's demand for mobile multimedia and connectivity presents unprecedented data rate requirements for wireless networks [1]. Multi-Gigabit per second (Gbps) links are needed for many applications, including but not limited to streaming high-fidelity multimedia to mobile users, implementing over-the-air backhaul links for 5G wireless networks, providing high-speed interconnects inside data centers [2], and supplying internet access through the sky [3]. Moreover, high-throughput, low-cost wireless links can serve moderate-range applications, filling the gap between current technologies such as copper cable (inexpensive moderate-speed links at limited range) and optical fiber (high speeds and large range, but with high integration and deployment complexity and power consumption overhead).

Recent improvements in silicon implementation open up the possibility of exploiting spectral and spatial Degrees of Freedom (DoF) available in mm-wave frequencies [4]. In addition to capacity gains due to large swaths of bandwidth available at mm-wave, we can achieve high spatial multiplexing gains, even in pure line-of-sight (LoS) environments with reasonable array sizes: the spatial DoF for a transmit array of aperture  $A_{TX}$ , and a receiver array of aperture  $A_{RX}$ , in a LoS environment, with link range of D and carrier wavelength  $\lambda$ , is given by  $\frac{A_{TX}A_{RX}}{(D\lambda)^2}$ [5]. That is, the spatial DoF is inversely proportional to  $\lambda^2$ . Putting this together with the roughly linear scaling of bandwidth with carrier frequency, we can hope for a *cubic* increase in capacity with frequency.
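The scaling argument above can be illustrated numerically. The following is a minimal sketch, assuming planar apertures expressed in square meters; the aperture sizes and link range are hypothetical values chosen only for illustration, and `spatial_dof` is a helper name introduced here.

```python
C = 299_792_458.0  # speed of light [m/s]

def spatial_dof(a_tx_m2, a_rx_m2, distance_m, freq_hz):
    """Spatial DoF A_TX * A_RX / (D * lambda)^2 for a LoS link."""
    wavelength = C / freq_hz
    return a_tx_m2 * a_rx_m2 / (distance_m * wavelength) ** 2

# Hypothetical link: 0.25 m^2 apertures at a 50 m range (illustrative only).
dof_65 = spatial_dof(0.25, 0.25, 50.0, 65e9)
dof_130 = spatial_dof(0.25, 0.25, 50.0, 130e9)

dof_gain = dof_130 / dof_65          # doubling the carrier: 1/lambda^2 scaling
bandwidth_gain = 130e9 / 65e9        # assumes bandwidth scales linearly with carrier
capacity_gain = dof_gain * bandwidth_gain   # the "cubic" trend from the text
```

Under these assumptions, doubling the carrier frequency quadruples the spatial DoF and, combined with a doubled bandwidth, gives an eightfold upper-bound trend in capacity.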

Many authors have studied optimal geometrical array placement in order to achieve maximum spatial multiplexing gains using parallel linear and planar arrays in LoS environments [6], [7]. Moreover, the feasibility of transmitting multiple independent data streams through a LoS MIMO link in the mm-wave band has been demonstrated in hardware for  $2 \times 2$  [8] and  $4 \times 4$  [9] links. Our focus in this paper is on designing *scalable* and *high-throughput* LoS MIMO links, taking into account the constraints and trade-offs in hardware implementation as well as non-idealities in channel realization.

Fig. 1. (a) A  $2 \times 2$  MIMO array showing different spatial signatures at each receive element, (b) conceptual drawing of mm-wave antenna array with signal routing, (c) effect of spatial multiplexing on constellation dynamic range.

In previous work [10], we showed that conventional linear space-time techniques for equalization in fully digital transceiver architectures lead to performance floors, and proposed a method to mitigate those limitations using small analog delays. However, that work did not account for the actual hardware limitations of implementing individual components, such as the ADCs or the equalizers. In fact, for such high-performance systems, the complexity and feasibility of implementing individual operations strongly depend on the processing domain (analog or digital) [11] and on the domain in which the data to be processed originates. In this paper, we focus on the actual analog processing implementation, and investigate the challenges and scaling limitations for high-speed LoS MIMO.

# II. HARDWARE IMPLEMENTATION CHALLENGES

Due to the super-linear increase of the DoF with frequency, the capacity of a mm-wave link can be orders of magnitude higher than a conventional low frequency link. In this section, we briefly describe some challenges in high-speed mm-wave circuits, and provide insight on why replicating the conventional low frequency architectures can significantly limit utilization of the DoF available on mm-wave channels.

*1) Limited Transmit Power:* In a LoS wireless link, the signal-to-noise ratio (SNR) at the input of a receiver is given by:

$$SNR \propto G_{TX}G_{RX} \frac{P_{TX}}{\sigma^2 f_{BW}} \left(\frac{\lambda}{\pi D}\right)^2, \tag{1}$$

where  $G_{TX}$  and  $G_{RX}$  are the transmit and receive antenna gains,  $P_{TX}$  is the transmit power,  $\sigma^2$  is the noise power density at the receiver input, and  $f_{BW}$  is the noise bandwidth.

As we scale up the frequency into the mm-wave regime while fixing the range (D) and the transmitter and receiver apertures, we notice the following:

- $G_{TX}$  and  $G_{RX}$  increase with frequency  $(\propto 1/\lambda^2)$  [12].
- $f_{BW}$  increases linearly with the signal bandwidth ( $\propto 1/\lambda$ ).
- $P_{TX}$  decreases due to circuit limitations ( $\propto \lambda$ ) [13].

Substituting these scaling trends into (1), it is clear that the received SNR does not benefit from the high antenna gain that comes with scaling the carrier frequency into the mm-wave regime. To increase the communication range, one of the few remaining options is to operate the transmitter as close as possible to its peak power. This means transmitting signals with the smallest possible peak-to-average power ratio (PAPR), which restricts the amount of precoding that can be done at the transmitter.
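As a worked check of this claim, the three scaling trends listed above can be substituted into (1) with  $D$ ,  $\sigma^2$  and the apertures held fixed. The grouping below is only a bookkeeping sketch of the exponents of  $\lambda$ ; it assumes that each antenna gain scales as  $1/\lambda^2$ , so that their product scales as  $1/\lambda^4$ :

$$SNR \propto \underbrace{\lambda^{-4}}_{G_{TX}G_{RX}} \cdot \underbrace{\lambda}_{P_{TX}} \cdot \underbrace{\lambda}_{1/f_{BW}} \cdot \underbrace{\lambda^{2}}_{(\lambda/\pi D)^2} = \lambda^{0}.$$

The exponents cancel, so under these idealized trends the received SNR is independent of the carrier frequency.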

*2) Analog-to-Digital Conversion Complexity:* It is important to note that using spatial multiplexing over a LoS MIMO channel has a similar effect on the received signal as having a more complicated multipath channel. To illustrate the consequences for implementation complexity, consider the LoS MIMO system shown in Figure 1(c) with QPSK constellations used at the transmitter side. The independent streams arrive at each receiver element with random phases and relative delays, causing the received signal to have a more complex structure and requiring ADCs with larger dynamic range to limit information loss due to digitization.

With today's technologies and circuit techniques, the resolution of high-speed ADCs running at tens of Gigasymbols per second (GSps) is limited to only a few bits [14]. Moreover, the power consumption of such high-speed ADCs scales with speed and resolution as follows [15]:

$$P_{ADC} \propto f_s^2 \cdot 2^{\#Bits}$$

The strong dependency of ADC power consumption on sampling frequency limits practical implementations to close-to-Nyquist-rate sampling frequencies. However, not employing fractional sampling can cause performance floors in fully digital architectures [10]. In addition, the complexity and power consumption of the ADC scale exponentially with its resolution, doubling for each additional bit.
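The proportionality above can be turned into a quick relative comparison. The sketch below assumes only the stated trend  $P_{ADC} \propto f_s^2 \cdot 2^{\#Bits}$ ; the function name and the reference operating point are illustrative, and absolute power values are deliberately not modeled.

```python
def relative_adc_power(sample_rate_hz, bits, ref_rate_hz, ref_bits):
    """ADC power relative to a reference ADC, assuming P ~ f_s^2 * 2^bits."""
    return (sample_rate_hz / ref_rate_hz) ** 2 * 2.0 ** (bits - ref_bits)

# Reference point (illustrative): a 5-bit ADC sampling at 20 GSps.
one_more_bit = relative_adc_power(20e9, 6, 20e9, 5)    # adds one bit of resolution
oversample_2x = relative_adc_power(40e9, 5, 20e9, 5)   # doubles the sampling rate
```

Under this trend, one extra bit doubles the power, while 2x oversampling quadruples it, which is why fractional sampling is costly in a fully digital receiver.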



Fig. 2. (a) TX and RX arrays, (b) effects of misalignments on the channel.

# III. REALIZATION OF MOSTLY-ANALOG ARCHITECTURES

In a fully digital architecture, the higher dynamic range requirements in a LoS MIMO receiver do not affect the ADC complexity alone, but are also reflected in the complexity of the digital processing after the ADC. In conventional systems, digital processing is traditionally more appealing due to many benefits, such as (1) the higher noise margins to crosstalk and other forms of noise coupling from the environment, (2) the continued benefits from device size scaling, (3) the advances in automated digital implementation tools and methodologies, and (4) the flexibility of programming and adapting the hardware to different functionalities. However, from a Shannon capacity point of view, this form of signal representation wastes most of the capacity of circuits and wires by loading them with only binary voltages, trading off throughput per transistor count for reliability and modularity.

As we scale the speed requirements, the benefits that digital implementations enjoy start to fade in comparison to their analog counterparts. In a sense, we can view analog processing as a way to compress digital signals into fewer nets, potentially leading to simpler routing between system blocks and fewer transistors at the inputs and outputs of each block, which can be very beneficial in high-throughput MIMO detection, as we show later in this paper.

## A. Channel Decomposition

For the purpose of this study, we consider a uniform square antenna array with four antennas at each of the TX and RX sides. The antenna configuration is shown in Figure 2(a). The array spacing is given by [7]:

$$d_{H,V}^{opt} = \sqrt{\frac{D\lambda}{N_{H,V}}},\tag{2}$$

where  $d_H^{opt}$  and  $d_V^{opt}$  are the optimum horizontal and vertical spacing,  $N_H$  and  $N_V$  are the number of array elements in the horizontal and vertical dimensions, respectively,  $D$  denotes the link distance, and  $\lambda$  is the carrier wavelength. The values used in this study are annotated in Figure 2(a). The carrier frequency is chosen to be 130 GHz. In theory, a  $4 \times 4$  MIMO system can provide up to four times the capacity of the corresponding SISO system. Equation (2) also shows that the antenna spacing required for LoS MIMO is practical only for mm-wave links, where the wavelength is small enough for the required spacing to be feasible. For a two-dimensional  $4 \times 4$  MIMO array with the geometry defined by Equation (2), the channel matrix is given by:

$$H_{LoS} = \begin{bmatrix} 1\angle 0^{\circ} & 1\angle 90^{\circ} & 1\angle 90^{\circ} & 1\angle 180^{\circ} \\ 1\angle 90^{\circ} & 1\angle 0^{\circ} & 1\angle 180^{\circ} & 1\angle 90^{\circ} \\ 1\angle 90^{\circ} & 1\angle 180^{\circ} & 1\angle 0^{\circ} & 1\angle 90^{\circ} \\ 1\angle 180^{\circ} & 1\angle 90^{\circ} & 1\angle 90^{\circ} & 1\angle 0^{\circ} \end{bmatrix}, \tag{3}$$

which represents a full-rank channel; hence it can be inverted using zero-forcing (ZF) detection. The ZF detector used is a memory-less (single-tap) spatial equalizer. The memory-less assumption is not valid if the TX or RX array is slightly tilted. We can assume a tilt as shown in Figure 2(b), where the TX and RX elements have extra delays  $z^{-\tau_i}$  and  $z^{-\mu_i}$ , and random complex phase shifts  $\alpha_i$  and  $\beta_i$ . The channel matrix accounting for a tilt at the TX and RX sides is given by

$$\begin{aligned} H_{tilted} &\approx \mathrm{diag} \left[ \begin{array}{cccc} \alpha_1 z^{-\tau_1} & \alpha_2 z^{-\tau_2} & \alpha_3 z^{-\tau_3} & \alpha_4 z^{-\tau_4} \end{array} \right] \\ &\times H_{LoS} \times \mathrm{diag} \left[ \begin{array}{cccc} \beta_1 z^{-\mu_1} & \beta_2 z^{-\mu_2} & \beta_3 z^{-\mu_3} & \beta_4 z^{-\mu_4} \end{array} \right] \\ &= T_{RX} \times H_{LoS} \times T_{TX} \\ &= T_{RX, \, \varepsilon T_s} \times T_{RX, \, nT_s} \times H_{LoS} \times T_{TX, \, \varepsilon T_s} \times T_{TX, \, nT_s}, \end{aligned} \tag{4}$$

where  $T_{\dots, \varepsilon T_s}$  and  $T_{\dots, nT_s}$  are the diagonal delay matrices representing sub-symbol-period and full-symbol-period delays, respectively, at each of the TX and RX sides. The decomposition used here assumes that small tilting perturbations do not cause rank reduction (and thus the matrix  $H_{LoS}$  is unchanged), which is a good approximation for angles around 10°, especially since the number of array elements is small [7]. At such tilting angles, the terms  $z^{-\tau_i}$  and  $z^{-\mu_i}$  can be as large as four symbol-period delays for symbol rates around 20 GSps.
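As a numerical check of the untilted geometry, the sketch below builds the LoS channel directly from path lengths for two aligned  $2 \times 2$  square arrays spaced according to Equation (2), and then applies a memory-less ZF detector. It makes several assumptions that are not taken from the paper: a hypothetical 10 m link range, row-major element ordering, unit path gains, and the  $e^{-j2\pi r/\lambda}$  phase convention, so the placement and sign of individual phase entries may differ from Equation (3) without affecting orthogonality.

```python
import numpy as np

C = 299_792_458.0          # speed of light [m/s]
FREQ_HZ = 130e9            # carrier frequency used in the text
LINK_DISTANCE_M = 10.0     # hypothetical link range (not from the paper)
N_PER_DIM = 2              # 2 x 2 square array, four elements per side

wavelength = C / FREQ_HZ
spacing = np.sqrt(LINK_DISTANCE_M * wavelength / N_PER_DIM)   # Equation (2)

# Element positions in the array plane, row-major ordering (an assumption).
offsets = np.array([(x, y) for y in (0, 1) for x in (0, 1)], dtype=float) * spacing

def los_channel(tx_xy, rx_xy, distance, lam):
    """Unit-gain LoS channel; entry (m, n) carries the phase of path n -> m."""
    diff = rx_xy[:, None, :] - tx_xy[None, :, :]
    path = np.sqrt(distance**2 + np.sum(diff**2, axis=-1))
    # Subtracting `distance` only removes a phase common to all entries.
    return np.exp(-2j * np.pi * (path - distance) / lam)

H = los_channel(offsets, offsets, LINK_DISTANCE_M, wavelength)
gram = H.conj().T @ H          # close to 4 * I when the columns are orthogonal
W_zf = np.linalg.inv(H)        # memory-less (single-tap) zero-forcing matrix

# Four QPSK symbols sent in parallel and separated by the ZF detector.
s = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
s_hat = W_zf @ (H @ s)
```

In this idealized setting `gram` is close to four times the identity, so the four streams are spatially orthogonal and `s_hat` reproduces `s`; noise, tilt, and the delay terms of Equation (4) are not modeled.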

## B. Analog Components

Using the channel decomposition described in Section III-A, we can see that MIMO detection needs two main operations: (1) complex multiply-and-add to invert the LoS part of the channel ( $H_{LoS}$ ), and (2) true-time delay to compensate for the excess memory due to misalignments. In this subsection we investigate the performance trade-offs of analog components that can be used to implement these functionalities.

*1) Analog Multiply and Add:* Mathematically, the product of a complex row vector  $V$  and a complex column vector  $S$  can be written as  $\sum_{\forall k} [(v_{k, real} \cdot s_{k, real} - v_{k, imag} \cdot s_{k, imag}) + j(v_{k, imag} \cdot s_{k, real} + v_{k, real} \cdot s_{k, imag})]$ , where  $j$  is the imaginary unit ( $j^2 = -1$ ). If we denote the  $k$ -th term of the sum by  $i_k = i_{k, real} + ji_{k, imag}$ , then one possible implementation of this product in analog hardware is shown in Figure 3(a). The output current  $i_k$  of such a circuit is proportional to the product of  $s_k$  and  $v_k$ .
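The real-valued structure of this sum, four real multiplications per term, can be sketched in software as follows. This is only a behavioral illustration of the arithmetic, not a model of the circuit in Figure 3(a); the function name and test values are introduced here for illustration.

```python
def complex_multiply_add(v, s):
    """Accumulate sum_k v_k * s_k using only real multiplies and adds.

    v and s are equal-length sequences of (real, imag) pairs.
    """
    acc_real, acc_imag = 0.0, 0.0
    for (v_re, v_im), (s_re, s_im) in zip(v, s):
        acc_real += v_re * s_re - v_im * s_im   # contribution i_{k,real}
        acc_imag += v_im * s_re + v_re * s_im   # contribution i_{k,imag}
    return acc_real, acc_imag

v = [(1.0, 0.0), (0.0, 1.0)]      # detector coefficients 1 and j
s = [(0.5, 0.5), (0.5, -0.5)]     # received samples 0.5+0.5j and 0.5-0.5j
result = complex_multiply_add(v, s)
```

Each term needs two products for the real part and two for the imaginary part, which maps naturally onto four current-mode multiplier branches whose outputs are summed on shared nodes.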

The two main performance metrics of such a multiplier are the signal dynamic range and the operating speed, which are given by:

$$\text{Dynamic Range} \propto \frac{I_{ds}}{f_{BW}},\tag{5}$$

and

$$\text{Speed} \propto \frac{I_{ds}}{C_{load}},\tag{6}$$



Fig. 3. (a) Analog multiply and add cell, and (b) its power consumption trade-off with SNR.

where  $I_{ds}$  is the DC current through each device,  $f_{BW}$  is the noise bandwidth, and  $C_{load}$  is the effective load at the output node. Equations (5) and (6) indicate that the current consumption of the circuit has to be large enough to satisfy the dynamic range and speed requirements simultaneously. Using the process parameters of an advanced CMOS technology, Figure 3(b) compares the current consumption derived from Equations (5) and (6) as we sweep the signal dynamic range. The speed-limited part of the plot (for small to moderate dynamic range) is of particular interest for the following reasons:

- The power consumption of the analog circuit is independent of dynamic range scaling (for example, when using a larger constellation size). This is not true for digital circuits, where every added bit increases the complexity in virtually all operating regions.
- Optimizing an analog circuit for speed with fewer dynamic range considerations usually yields small device sizes and shorter wires, which results in a compact silicon area that can be competitive with high-speed digital implementations.
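The two operating regions can be illustrated with a small sketch in which the required bias current is the larger of the two currents implied by Equations (5) and (6). The proportionality constants `k_dr` and `k_speed`, as well as the bandwidth and load values, are hypothetical placeholders standing in for process parameters; they are not taken from Figure 3(b).

```python
def required_bias_current(dynamic_range, f_bw_hz, speed_hz, c_load_f,
                          k_dr=5e-18, k_speed=0.5):
    """Smallest bias current meeting both Equation (5) and Equation (6).

    k_dr and k_speed are hypothetical proportionality constants.
    """
    i_dynamic_range = k_dr * dynamic_range * f_bw_hz   # from Equation (5)
    i_speed = k_speed * speed_hz * c_load_f            # from Equation (6)
    limit = "speed" if i_speed >= i_dynamic_range else "dynamic range"
    return max(i_dynamic_range, i_speed), limit

# Sweep the dynamic range (linear units) at a fixed 20 GHz bandwidth and 10 fF load.
sweep = {dr: required_bias_current(dr, 20e9, 20e9, 10e-15)
         for dr in (10.0, 100.0, 1e4)}
```

For the two smaller dynamic range values the current is set by the speed requirement alone and does not change, which is the flat, dynamic-range-insensitive region discussed above; only the largest value pushes the cell into the dynamic-range-limited region.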

*2) Analog True-Time Delay:* Analog true-time delay (TTD) [16] implementations generally approximate the exponential  $e^{-sT_d}$ , where  $s$  is the Laplace variable and  $T_d$  is the amount of delay in seconds, usually programmable in discrete steps. Two approximations to this exponential are common in the literature: Taylor and Padé approximations [17]. These approximations are usually implemented in the form of a filter [18], or by tuning the electrical length of the delay line that the signal passes through. Although passive implementations of TTD are possible [19], [20], active implementations are usually preferred since they provide better gain stability for the same delay tuning range.

Table I summarizes some recent implementations of active programmable delay lines. Notably, the ratio of the delay-bandwidth product (DBW) to the power consumption of the delay line is of the same order (roughly 15 to 36 W<sup>-1</sup>) regardless of the technology and the architecture used in the implementation. Using this observation, we can conclude that the power consumption of an analog delay line increases approximately linearly with the required delay range for a fixed operating bandwidth.

| Ref  | Delay range [ps] | BW [GHz] | Power consumption [mW] | DBW / power consumption [W<sup>-1</sup>] |
|------|------------------|----------|------------------------|------------------------------------------|
| [18] | 140              | 7        | 53                     | 18                                       |
| [21] | 75               | 12       | 25                     | 36                                       |
| [22] | 550              | 2.5      | 90                     | 15                                       |
| [23] | 87.5             | >20      | 65                     | >27                                      |

Table I. TRUE-TIME DELAY IMPLEMENTATION SURVEY

Fig. 4. The proposed mostly-analog architecture.

Comparing this finding to digital implementations requires us first to distinguish between two types of digital delays:

1) Digital symbol-period delays, which are relatively simple to implement using low-cost digital flip-flops.
2) Digital sub-symbol-period delays, which would generally require oversampling at the ADC, leading to increased complexity and a super-linear increase in power consumption [15].
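For reference, the figure of merit in the last column of Table I can be recomputed from the other three columns. The sketch below uses the tabulated values directly; for [23], whose bandwidth is reported only as ">20 GHz", 20 GHz is used as a lower bound, so the result for that entry is itself a lower bound.

```python
# (reference, delay range [ps], bandwidth [GHz], power [mW]) from Table I.
TABLE_I = [
    ("[18]", 140.0, 7.0, 53.0),
    ("[21]", 75.0, 12.0, 25.0),
    ("[22]", 550.0, 2.5, 90.0),
    ("[23]", 87.5, 20.0, 65.0),   # bandwidth is a lower bound (>20 GHz)
]

def dbw_per_watt(delay_ps, bw_ghz, power_mw):
    """Delay-bandwidth product divided by power consumption, in 1/W."""
    delay_bandwidth = (delay_ps * 1e-12) * (bw_ghz * 1e9)   # dimensionless
    return delay_bandwidth / (power_mw * 1e-3)

figures_of_merit = {ref: dbw_per_watt(d, bw, p) for ref, d, bw, p in TABLE_I}
```

The recomputed values agree with the tabulated ones after rounding, and their limited spread is what motivates the linear power-versus-delay-range trend used in the text.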

With this in mind, if we consider the channel decomposition described in Section III-A, it is clear from Equation (4) that the maximum amount of delay required to invert the channel is determined by the sum of the two longest delays in the diagonal matrices  $T_{TX}$  and  $T_{RX}$ . Approximately half of this delay can be compensated for in the digital domain (for example, at the transmitter side by pre-aligning the bit streams and sending  $X = T_{TX, nT_s}^{-1} \times U$ , where  $U$  is the transmitter message), reducing the complexity of the detector. On the other hand, the sub-symbol-period delays in the matrices  $T_{TX, \varepsilon T_s}$  and  $T_{RX, \varepsilon T_s}$  are small and can be implemented more efficiently in the analog domain.
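The partitioning of each delay into a full-symbol-period part and a sub-symbol-period part, as in Equation (4), can be sketched as follows. The per-element delay values are hypothetical and chosen only to stay within the four-symbol range mentioned in Section III-A; `split_delay` is a helper name introduced here.

```python
import math

def split_delay(delay_s, symbol_period_s):
    """Split a delay into whole symbol periods n and a sub-symbol remainder.

    Returns (n, eps) with delay = (n + eps) * T_s and 0 <= eps < 1.
    """
    normalized = delay_s / symbol_period_s
    n = math.floor(normalized)
    return n, normalized - n

T_S = 1 / 20e9                                   # 50 ps symbol period at 20 GSps
delays_s = [0.0, 62e-12, 135e-12, 198e-12]       # hypothetical element delays

parts = [split_delay(d, T_S) for d in delays_s]
digital_taps = [n for n, _ in parts]                 # flip-flop (symbol-period) delays
analog_delays_s = [eps * T_S for _, eps in parts]    # sub-symbol analog TTD settings
```

With this split, the analog delay line only ever needs to cover less than one symbol period, which keeps its delay range, and hence its power consumption under the trend of Table I, small.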

## C. Performance Comparison

Using the ideas described in the previous sections, we now describe our proposed analog LoS MIMO detector and evaluate its performance. Figure 4 shows the proposed analog channel separation network (CSN). The complex multiply-and-add block is composed of an array of the analog multiply-and-add circuits shown in Figure 3(a), and is used to invert the  $H_{LoS}$  part of the channel and separate the individual streams before feeding them to the input of the slicer (or, more generally, the ADC). This aligns well with the desire to reduce the dynamic range of the signal at the ADC input, thus reducing its complexity.



Excess channel delays due to misalignments at the receiver are corrected using digitally controlled analog delays. Figure 5 compares the performance of our proposed architecture with the optimized fully digital implementation described in [10]. For both systems, we assume the same LoS MIMO channel described in Section III-A, with 20 GSps QPSK bit loading on each stream. The symbol error rate (SER) for each of the two systems is shown in Figure 5 along with the ideal ISI-free QPSK performance. The hardware components for both systems are designed to minimize the SNR penalty in the SER performance. The mostly-analog architecture uses simple single-bit slicers instead of the power-hungry 5-bit ADCs required in the fully digital architecture, cutting the power consumption of the full receiver to almost one third of that of the fully digital architecture. The benefits in terms of reduced power consumption become more significant as we increase the modulation order to medium-sized constellations, since ADC requirements are tightly coupled to the modulation size in a conventional architecture. This is not the case for analog processing as long as it operates in the dynamic-range-insensitive regime described in Section III-B.

## D. Analog Channel Identification

We now show that analog techniques can also be extended to MIMO channel identification. Traditionally, adaptive filtering techniques, such as least-mean-square algorithms [24], are used to iteratively tune the detector and learn the channel coefficients. Analog channel adaptation techniques are usually based on sending orthogonal low-frequency pilots at the transmitter and recording their magnitudes and phases at the receiver to learn the channels [9]. Both techniques can be efficient in estimating the memory-less part of the channel  $(H_{LoS})$ , but require high-resolution ADCs running at high speeds to be able to program the analog delay lines.

In order to avoid the complexity of using high-speed ADCs, we first need to identify the information that can be collected using simple analog operations. Let us introduce one extra analog operation to obtain the peak values of each of the received signals, which can be easily achieved using simple analog peak detectors, as shown in Figure 6. If we turn on one transmitter at a time, and assuming we first run the transmitters at low speeds (so that the memory part of the channel is irrelevant), we can record all the individual quadrature channel gains  $|h_{i,I}|$  and  $|h_{i,Q}|$ . The relative sign of the coefficients can be found by mixing two channels, once by addition and once by subtraction; the larger of the two results indicates the relative sign between the streams. For example, if  $|h_{i,I} + h_{i,Q}|$  is greater than  $|h_{i,I} - h_{i,Q}|$ , then  $h_{i,I}$  and  $h_{i,Q}$  have the same sign. This idea of maximizing the sum can be extended to learn the relative delays between the quadrature channels, this time with the transmitters running at a high rate. The flow chart shown in Figure 7 describes the proper sequence to apply this method for channel identification.

Fig. 6. Analog channel identification hardware.

Fig. 7. Flow chart for channel identification.
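The sign test described above can be sketched behaviorally as follows. The gains are hypothetical real-valued numbers used only to exercise the comparison, and `relative_sign` is an illustrative name; the sketch models ideal peak-detector readings and ignores noise and detector offsets.

```python
def relative_sign(peak_of_sum, peak_of_difference):
    """Return +1 if the two gains share a sign, -1 otherwise.

    The inputs model peak-detector readings of |h_I + h_Q| and |h_I - h_Q|.
    A tie (one gain equal to zero) is reported as -1 by convention here.
    """
    return 1 if peak_of_sum > peak_of_difference else -1

# Hypothetical quadrature gains, unknown to the receiver.
h_i, h_q = 0.8, -0.3

# The receiver only observes the two peak magnitudes.
sign = relative_sign(abs(h_i + h_q), abs(h_i - h_q))
```

Here the subtraction gives the larger peak, so the two gains are reported as having opposite signs; only two magnitude readings and one comparison are needed, with no high-speed ADC in the loop.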

# IV. CONCLUSIONS

In this paper, we have presented new methodologies and architectures for implementing high-throughput wireless LoS MIMO transceivers. We propose efficient and scalable analog processing techniques that can dramatically reduce the complexity of such systems without sacrificing performance. The trade-offs of the analog components required to realize such architectures are studied, and an example architecture utilizing those ideas is introduced and compared to a conventional fully digital architecture. We extend analog processing ideas to channel identification, and show that channel state information can be obtained with minimal hardware overhead added to the proposed analog architecture.

# ACKNOWLEDGMENT

This work was supported in part by Systems on Nanoscale Information fabriCs (SONIC), one of the six SRC STARnet Centers, sponsored by MARCO and DARPA, and by the National Science Foundation under grants CNS-1518632 and CNS-1518812.

# REFERENCES

- [1] "Cisco Visual Networking Index: Forecast and Methodology, 2011–2016," Cisco White Paper, 2012.

- [2] C. F. Lam, H. Liu, and R. Urata, "What Devices Do Data Centers Need?" in *IEEE Proc. on Fiber Optics and Optical Communications*, 2014.
- [3] "Facebook Demonstrates Record-Breaking Data Rate Using Millimeter-Wave Technology," [Online]. Available: https://goo.gl/BrGVbH.
- [4] M. Tabesh, J. Chen, C. Marcu, L. Kong, S. Kang, A. M. Niknejad, and E. Alon, "A 65 nm CMOS 4-Element Sub-34 mW/Element 60 GHz Phased-Array Transceiver," *IEEE J. of Solid-State Circuits*, vol. 46, no. 12, pp. 3018–3032, 2011.
- [5] E. Torkildson, U. Madhow, and M. Rodwell, "Indoor Millimeter Wave MIMO: Feasibility and Performance," *IEEE Trans. on Wireless Communications*, vol. 10, no. 12, pp. 4150–4160, 2011.
- [6] F. Bohagen, P. Orten, and G. E. Oien, "Design of Optimal High-Rank Line-of-Sight MIMO Channels," *IEEE Trans. on Wireless Communications*, vol. 6, no. 4, pp. 1420–1425, 2007.
- [7] P. Larsson, "Lattice Array Receiver and Sender for Spatially Orthonormal MIMO Communication," in *IEEE 61st Vehicular Technology Conf.*, vol. 1, 2005, pp. 192–196.
- [8] C. Sheldon, E. Torkildson, M. Seo, C. P. Yue, U. Madhow, and M. Rodwell, "A 60GHz Line-of-Sight 2x2 MIMO Link Operating at 1.2 Gbps," in *IEEE Antennas and Propagation Society International Symp.*, 2008, pp. 1–4.
- [9] C. Sheldon, M. Seo, E. Torkildson, M. Rodwell, and U. Madhow, "Four-Channel Spatial Multiplexing Over a Millimeter-Wave Line-of-Sight Link," in *IEEE International Microwave Symp. Digest*, 2009, pp. 389–392.
- [10] B. Mamandipoor, M. Sawaby, A. Arbabian, and U. Madhow, "Hardware-Constrained Signal Processing for Mm-Wave LoS MIMO," in *IEEE 49th Asilomar Conf. on Signals, Systems and Computers*, 2015, pp. 1427–1431.
- [11] R. Sarpeshkar, "Analog Versus Digital: Extrapolating from Electronics to Neurobiology," *IEEE J. of Neural Computation*, vol. 10, no. 7, pp. 1601–1638, Oct 1998.
- [12] C. A. Balanis, Antenna Theory: Analysis and Design. John Wiley & Sons, 2016.
- [13] A. M. Niknejad, D. Chowdhury, and J. Chen, "Design of CMOS Power Amplifiers," *IEEE Trans. on Microwave Theory and Techniques*, vol. 60, no. 6, pp. 1784–1796, 2012.
- [14] B. Murmann, "ADC Performance Survey 1997-2016," [Online]. Available: http://web.stanford.edu/~murmann/adcsurvey.html.
- [15] B. Murmann, "Energy Limits in A/D Converters," in *IEEE Faible Tension Faible Consommation*, June 2013.
- [16] H. Hashemi, T.-S. Chu, and J. Roderick, "Integrated True-Time-Delay-Based Ultra-Wideband Array Processing," *IEEE Communications Magazine*, vol. 46, no. 9, pp. 162–172, September 2008.
- [17] S. M. Kashmiri, S. A. P. Haddad, and W. A. Serdijn, "High-Performance Analog Delays: Surpassing Bessel-Thomson by Pade-Approximated Gaussians," in *IEEE International Symp. on Circuits and Systems*, May 2006.
- [18] N. Rajesh and S. Pavan, "Design of Lumped-Component Programmable Delay Elements for Ultra-Wideband Beamforming," *IEEE J. of Solid-State Circuits*, vol. 49, no. 8, pp. 1800–1814, 2014.
- [19] H. Veenstra, M. Notten, D. Zhao, and J. R. Long, "A 3-Channel True-Time Delay Transmitter for 60GHz Radar-Beamforming Applications," in *IEEE European Solid-State Circuits Conf.*, Sept 2011, pp. 143–146.
- [20] Q. Ma, D. Leenaerts, and R. Mahmoudi, "A 10-50GHz True-Time-Delay Phase Shifter with Max 3.9 Delay Variation," in *IEEE Radio Frequency Integrated Circuits Symp.*, June 2014, pp. 83–86.
- [21] F. Loi, E. Mammei, F. Radice, M. Bruccoleri, S. Erba, M. Bassi, and A. Mazzanti, "A 25-Gb/s FIR Equalizer Based on Highly Linear All-Pass Delay-Line Stages in 28-nm LP CMOS," in *IEEE Radio Frequency Integrated Circuits Symp.*, 2015, pp. 303–306.
- [22] S. K. Garakoui, E. A. M. Klumperink, B. Nauta, and F. E. van Vliet, "Compact Cascadable $g_m$-C All-Pass True Time Delay Cell With Reduced Delay Variation Over Frequency," *IEEE J. of Solid-State Circuits*, vol. 50, no. 3, pp. 693–703, March 2015.
- [23] A. Momtaz and M. M. Green, "An 80 mW 40 Gb/s 7-Tap T/2-Spaced Feed-Forward Equalizer in 65 nm CMOS," *IEEE J. of Solid-State Circuits*, vol. 45, no. 3, pp. 629–639, 2010.
- [24] J. B. Foley and F. M. Boland, "Comparison Between Steepest Descent and LMS Algorithms in Adaptive Filters," *IEEE Proc. on Communications, Radar and Signal Processing*, vol. 134, no. 3, pp. 283–289, June 1987.