ABSTRACT

This paper presents the design of a low-cost 12-bit Analog-to-Digital Converter (ADC) for digital radio applications in a 0.13µm CMOS technology. It employs a 10-stage pipelined architecture preceded by a sample-and-hold (S/H) block that multiplexes two single-ended or differential input signals. The ADC delivers 12-bit data up to 44MSPS, which enables its use in other high frequency applications, such as video-rate data acquisition, high-speed data transmission, and wireline communications. The complete cell, including on-chip voltage and current reference generation, occupies 1mm$^2$ and consumes 200mW at full speed when supplied with 3.3V.

1. INTRODUCTION

Digital radio is receiving an ever increasing attention from radio broadcasters and users. Apart from the increased number of channels and interference-free reception quality, digital radio enables multimedia data reception by end users, including text, pictures, and even video.

To the already existing DAB (Digital Audio Broadcasting) [1], a new standard called DRM (Digital Radio Mondiale) [2] is being incorporated. Unlike digital systems that require a new frequency allocation, the DRM signal is designed to fit in with the existing AM broadcast band plan, based on signals of 9 kHz or 10kHz bandwidth. It has modes requiring as little as 4.5kHz or 5kHz bandwidth, plus modes that can take advantage of wider bandwidths, up to 20kHz.

New software/hardware developments are being carried out at the industry side to give support to these standards. From the hardware point of view, like in other application scenarios (video, wireless communications, etc.) multi-standard capability is added to a set of already challenging requirements, such as high-resolution, low power consumption (essential for mobile equipments) and compactness. The latter has prompted maximal integration, which ends up with the necessity of high-performance analog functions implemented in digital-oriented deep-submicron CMOS processes.

In this paper we present the design of a 12-bit pipeline ADC for digital radio in 0.13µm CMOS. In order to cope with present and forthcoming standards, the ADC can operate up to 44MSPS, including specific modes for DAB and DRM. In DAB applications, the ADC operates in undersampling mode, i.e. converting a narrow-band signal centered at a frequency much higher than the sampling rate. In DRM mode, classical Nyquist-rate conversion is allowed, but it requires only half of the maximum output rate. In order to optimize the ADC consumption with respect to the power budget in each operating mode, a programmable bias current mechanism is implemented.

The rest of this paper is organized as follows: Section 2 presents the ADC architecture. The main sub-blocks, the content of a pipeline stage and the S/H circuit, are described in Sections 3 and 4, respectively. Section 4 is devoted to circuit design aspects. Also in this section, the design of the building blocks is illustrated. Finally, Section 5 summarizes available results.

2. ADC ARCHITECTURE

Figure 1 shows the converter architecture. It consists of a 10-stage pipelined ADC using redundant multi-bit quantization and correction logic. In order to improve linearity [3], the first stage uses 3bit quantization whereas the rest use 2bits. The last stage is just a 2bit flash ADC. This architecture establishes a good trade-off between circuit complexity and performance, with reduced power consumption [4].

While clk is high, the analog input is being sampled differentially at the input sample-and-hold capacitors. The sampled input is held at the falling edge of clk. While clk is low, the held input voltage is buffered by the S/H amplifier which drives the first pipelined ADC stage.
For the same reason, the first stage is not 3bit but to the fact that only 3 levels need to be discerned in the only 2 in stages 2 to 9, which require a 3-level DAC. Due to the flash quantizer in the stage. In this converter, 4 unitary negative reference voltages, depending on the output of the reconstruction digital-to-analog converter (DAC) and an correction logic before being sent to the output buffers.

3. PIPELINE STAGES

Each pipelined stage contains a quantizer (flash ADC), a reconstruction digital-to-analog converter (DAC) and an inter-stage residue amplifier. The two latter functions are actually integrated in a single block, the so-called multiplying DAC (MDAC). In operation, the ADC quantizes the input to the stage and the quantized value, reconverted to the analog domain by the DAC, is subtracted from the input to produce a residue. The residue is amplified and output by the residue amplifier.

Figure 2 shows the schematic of the MDAC in an N-bit implemented using switched-capacitor (SC) circuitry. During the sampling phase, the input signal is stored in 2N-1 identical capacitors, while the OTA is reset. During the residue-amplification amplification, one of these capacitors is placed in a feedback configuration around the OTA, whereas the remaining ones sample either the positive or the negative reference voltages, depending on the output of the flash quantizer in the stage. In this converter, 4 unitary capacitors are used in the first stage (7-level DAC) and only 2 in stages 2 to 9, which require a 3-level DAC. Due to the fact that only 3 levels need to be discerned in the second and following stages, instead of the $2^3 = 4$ levels present in a regular 2-bit ADC justifies the appellative “1.5-bit per stage” normally accompanying such topologies [5]. For the same reason, the first stage is not 3bit but 2.8bit.

Figure 1 – 10-stage pipeline ADC architecture

When clk goes back high, the first stage produces its residue, which is acquired by the second stage. At the same time, the input S/H goes back to acquiring a new analog input sample. When clk goes low again, the second stage produces its residue, which is acquired by the third stage. An identical process is repeated for the remaining stages, resulting in a 9th stage residue that is sent to the 10th stage ADC for final evaluation. With this timing the ADC exhibits a latency of 6 clock cycles (including the S/H).

Results from all the stages are digitally synchronized such that the results can be properly combined in the correction logic before being sent to the output buffers.

Figure 2 – MDAC schematic

These comparison levels are determined in the in-stage quantizer. Since the required resolution is low, a flash ADC is used for such purpose. Figure 3 shows the schematic of the 7-level flash ADC in the first stage. It is composed of 6 fully-differential comparators that receive the thresholds from a resistor ladder. After generation of the thermometric code, it is encoded to control the MDAC switches. Also, a redundant 3-bit code is passed to the digital part. The use of edge-triggered D flip-flops prevents from metastability of the preceding comparators.

Figure 3 – 7-level flash ADC

The quantizers in the remaining stages use a similar topology but only two comparators are required in the 3-level flash ADC. The comparators, based on regenerative latches, are strobed slightly before the end of the sampling phase, while the input signal is being stored at the MDAC sampling capacitors. With this timing the decision made is available to control the MDAC switching at the beginning of residue-amplification phase.

Figure 4 shows the residue characteristics of each pipeline stage.
branch), so that 0 volt single-ended input is translated into
a negative differential voltage equal to \( \text{vrefn-vrefp} \). By
properly scaling the capacitors, the biggest single-ended
input of interest is translated into \( \text{vrefp-vrefn} \). This
operation does not affect the OTA common-mode input
and output voltages, which are dynamically kept at proper
values.

The input sample-and-hold is designed to
accommodate two different signal full-scale ranges, namely
1.25Vpp and 2.5Vpp. This is achieved by switching the
gain of the switched-capacitor amplifier between two
values, by adapting the value of the sampling capacitor \( C_{\text{in}} \),
while keeping fixed the feedback one. This is the reason
why two identical input branches are shown in Figure 5.
When low gain is selected only one of the branches is
actually switched.

\[ V_{\text{in}} = \frac{V_{\text{ref}}^2}{2(P_Q + P_N + P_{\text{HD}})} \]

where \( V_{\text{ref}} \) stands for the reference voltage, ultimately
defined by the supply voltage, and \( P_Q, P_N \), and \( P_{\text{HD}} \) are
the in-band powers of quantization error, circuit noise, and
distortion, respectively.

Scaling down the supply voltage by a factor \( \alpha \) directly
maps into a proportional decrease of the reference voltage
that can be used and, hence, imposes a reduction of the
signal power by a factor \( \alpha^2 \). Also, quantization noise power
\( P_Q \) scales down in the same proportion. However, circuit
noise \( P_N \) and distortion \( P_{\text{HD}} \) do not scale automatically, but
keep essentially constant. The latter may be even bigger
due to non-linearities of the building blocks (OTAs,
switches, etc.), which are more evident in a low-voltage context.

Thus, to keep a given SINAD, both circuit noise and harmonic distortion must be lowered by design in the same factor $\alpha^2$. Both aspects are carefully considered next.

### 4.1. Circuit noise

Like in most discrete-time data circuits, noise generated by the OTAs and switch on resistances are sampled by the switched capacitors at the clock frequency. In order to achieve good enough settling within the available time-slot (half a clock period), the time constant of the switch on resistance and the associated capacitor is chosen to be several times smaller than the clock period. With this, the equivalent bandwidth of the noise becomes larger than half the sampling frequency and aliasing of the sampled noise spectra takes place. This gives rise to the well-known $kT/C$ noise contributions.

In every ADC stage, the noise generated in the MDAC switches and OTA suffers from aliasing and so does the noise that the stage receives from the previous one and from the reference voltage generator.

Fortunately, given that each MDAC has a gain higher than unity, their contributions to the overall input-equivalent noise are progressively attenuated as we go from the first to the last pipelined stage. So, while the first stage noise directly adds to the input-equivalent noise, that of the backend stages are almost negligible. This interesting feature allows scaling down the backend stages with respect to noise, thus resulting in a more efficient design.

For this purpose, detailed theoretical noise models have been used to get initial estimates of noise-related circuit parameters (such as size of unitary capacitors, switch on resistance, OTA dynamics, etc.). In addition, behavioral simulation including circuit noise models has been employed to fine-tune these figures.

As a matter of example, Figure 6 shows the full-differential SC schematic of a MDAC. During phase phi1 (sampling), noise generated in the switch on-resistance shown is sampled at the input capacitors. In this process, since the equivalent bandwidth of this noise is higher than half the sampling frequency, the noise is undersampled and hence its power spectral density is folded back. During phase phi2 (residue generation and amplification) the noise of the switched on-resistance displayed and also that of the OTA is sampled at the following MDAC input stage and the process continues along the pipelined stages. Table 1 collects the noise contributors to the output-equivalent noise of a pipelined MDAC. The total noise power is obtained as the summation of all the contributions. Similar equations were obtained for the S/H amplifier.

### Table 1 – Summary of MDAC circuit noise contributors

<table>
<thead>
<tr>
<th>Contribution</th>
<th>Expression for MDAC output noise power</th>
</tr>
</thead>
<tbody>
<tr>
<td>Reference switches, broadband filtered by equivalent noise bandwidth of MDAC during residue amplification phase</td>
<td>$P_{n_{\text{RDAC}}} = 2 \cdot \frac{K}{M} \cdot \frac{C_{\text{pin}}}{C_{\text{pout}}} \cdot \frac{v_{\text{CM}}}{V_{\text{ref}}}^2$</td>
</tr>
<tr>
<td>Reference generation block, broadband filtered by equivalent noise bandwidth of MDAC during residue amplification phase</td>
<td>$P_{n_{\text{RDAC}}} = \frac{P_{n_{\text{RDAC, var}}} + 1}{4} \cdot \frac{C_{\text{pin}}}{C_{\text{pout}}} \cdot \frac{v_{\text{CM}}}{V_{\text{ref}}}^2$</td>
</tr>
<tr>
<td>MDAC OTA, broadband filtered by equivalent noise bandwidth of MDAC during residue amplification phase</td>
<td>$P_{n_{\text{OTA}}} + \frac{P_{n_{\text{OTA, var}}} + 1}{4} \cdot \frac{C_{\text{pin}}}{C_{\text{pout}}} \cdot \frac{v_{\text{CM}}}{V_{\text{ref}}}^2$</td>
</tr>
</tbody>
</table>

$K$ is the gain of the MDAC during the residue amplification phase.
$M$ is the on resistance of the switches connected to $\pm V_{\text{ref}}$.
$C_{\text{pin}}$ is the input capacitance of the MDAC OTA with feedback factor during the residue amplification phase.
$P_{n_{\text{RDAC, var}}}$ is the noise power spectral density of the resistor $R_1$ of the VREF stage.
$P_{n_{\text{OTA, var}}}$ is the noise power spectral density of the resistor $R_2$ of the VREF stage.
$P_{n_{\text{OTA}}}$ is the noise power spectral density of the OTA of the MDAC stage.
$P_{n_{\text{OTA, var}}}$ is the feedback factor of the VREF stage.

It is worth noticing that low-frequency noise...
components (such as flicker noise generated by the OTAs) are not undersampled and hence its contribution to the overall circuit noise power is certainly negligible.

Figure 7 shows a comparative of the S/H and MDAC noise powers when referred to the ADC input. Note that the S/H and the first MDAC are the blocks contributing most to the overall noise, while the relative importance decreases in the backend stages. In fact, the value of the unitary capacitor (for noise considerations set to 0.5pF, and 0.9pF at the S/H and first MDAC, respectively), is progressively scaled down to 80fF used in the two last MDAC stages.

Figure 7 – Comparative of noise contributions of the S/H and ADC stages

4.2. OTAs

Apart from electronic noise, the OTAs in the S/H and MDACs may be source of non-linearity. This can be either static, due to insufficient and/or non-linear open-loop DC gain, or dynamic, associated to limited gain-bandwidth product and, mostly, slew-rate.

In practice, both static and dynamic characteristics are intimately related to the signal swing required at the OTA’s output. This is because in order to get high DC gain, on the one hand MOS devices must be properly saturated, which requires a significant voltage drop, the higher the faster the dynamic requirements (i.e. the more biasing current); on the other, due to output conductance degradation, a single output device might not be sufficient for achieving high DC gain, thus requiring cascode structures and, hence, more voltage drops. Exemplified in Figure 8(a), this results in open loop DC gain that considerably varies along the signal swing (shadowed area), thus creating distortion and limiting the useful signal range.

This triple gain-speed-range trade-off can be alleviated by resorting to two-stage OTAs. These are in general more power-hungry than their single-stage counterparts but one can achieve almost rail-to-rail swing with high, constant open-loop gain; see example in Figure 8(b).

Figure 9 shows the schematic of the OTA in the S/H stage. This is the most demanding in terms of open-loop gain, noise and dynamics. As a result, a powerful topology has been selected. It consists of a pMOS-input folded-cascode first stage plus a nMOS-input differential-pair-based second stage. This topology allows us to get almost rail-to-rail output swing with a good control of the current in the second stage. The amplifier uses cascode compensation for better stability and lower power consumption. It also uses a regulated cascode current source at the differential input pair for improving common-mode rejection ratio (CMRR), especially important when the input is single-ended.

The rest of OTAs in the MDACs, use a similar topology but without regulated cascode current source because they actually work with differential signals. Also the folded-cascode input stage was replaced by telescope one. Moreover, thanks to the decreasing impact of the backend MDACs on the overall resolution, their OTAs can be progressively scaled down, thus lowering power and area.

Figure 8 - Effective open-loop DC-gain for (a) folded-cascode OTA; (b) two-stage OTA
4.3. Switches

When in on state, analog switches used in SC circuits must provide small enough resistance in order to allow good settling behavior. Moreover, this on resistance should be constant along the signal range in order not to provoke dynamic distortion. That is why CMOS switches are normally used. However, both objectives are difficult to fulfill in low-voltage implementations; basically because the threshold voltages of the MOS devices do not scale down in the same proportion as the supply voltage, thus decreasing the range in which the on-resistance keeps approximately constant, see Figure 10. It may even happen that no one of the complementary MOS is conducting in a given range, i.e. a “dead” zone would appear in the signal swing for which the switch is actually off. Fortunately, this is not the case, even in worst-case conditions, for usual 0.13um CMOS processes.

If thick-oxide MOS devices (capable of operating with around 3.3V nominal supply) are available, the use of such devices in critical switches can solve the problem because the constant-resistance range is much more noticeable in these switches, see Figure 10.

If not, or wherever the on-resistance is not constant enough over process corners, temperature and supply, the Vgs voltage applied to the switching device can be boosted to make it independent of the signal being transmitted. This can be done with the circuit shown in Figure 11 [6], where the supply voltage is dynamically added to the signal voltage, so that Vgs = Vdd, no matter the input level. Moreover, the circuit is conceived to keep voltages Vgs, Vgd, and Vds of the switching device within the rate operating voltage of the technology, so that device reliability can be assured.
4.4. Flash quantizers and comparators
The correction logic usually included in pipelined ADCs renders them very insensitive to circuit imperfections in the quantizers of each stage, so that no special techniques for offset, gain, and non-linearity correction are actually required in these blocks. For this reason, the simple differential-input flash quantizer shown in Figure 3 is adopted.

The most important specification of its main building block, the comparator, is that it must solve in short enough time. For this purpose, the regenerative latch shown in Figure 12 is used. It compares the differential input with a differential reference voltage obtained by resistive division from the reference voltage \((\text{vrefp} - \text{vrefn})\).

![Figure 12 - Low kick-back noise comparator with regenerative latch](image)

In this topology, resistors \(M_1\) to \(M_6\) are actually implemented with MOS transistors in ohmic region, with their gates connected to the supply. It uses a cascode differential input pair, which reduces kickback noise. For the same reason, the input pair is not switched off when the latch is released, but its current is derived to ground.

4.5. Reference voltage and current generation
The cell includes the on-chip reference generation circuitry shown in Figure 13. The seed for reference generation can be obtained from an on-chip bandgap voltage generator or from a dedicated external input \(\text{vrefin}\). The reference input (from bandgap or external) is properly buffered before entering a resistive amplifier that generates the differential reference voltage \((\text{vref} = \text{vrefp} - \text{vrefn})\) around the common-mode \(\text{vcm}\), obtained by resistive division from supply. The common-mode voltage is also buffered to be accessed externally.

![Figure 13 – On-chip reference generation circuitry](image)

The bandgap is also employed for generating a stable reference current for biasing the OTAs. The simplified circuitry is shown in Figure 14. For better control of the reference current, a low-spreading external resistor is used. The actual bias current can be programmed to be adapted to the ADC operation mode.

![Figure 14 – Schematic for bias current generation](image)

5. ILLUSTRATIVE RESULTS

The complete ADC has been exhaustively simulated both at behavioral level (using SIMSIDES [7]) and full-electrical level (using SPECTRE). These simulations included process, temperature (-40, 125degC), supply (3.3±10%) and clock duty-cycle (±20%) variations, as well as device mismatch, especially capacitors. Capacitors used are metal-insulator-metal capacitors (MIM) with 1fF/um² and good matching properties (1%/sqrt(um²) area law).

Figure 15 shows typical output spectra obtained from behavioral simulations at 44MSPS for 7.5MHz and 22MHz full-scale sinewaves (edge of the Nyquist band). Note that in-band error power is dominated by noise (quantization + circuit noise).
Figure 16 shows the 256-point FFT of output sampled at 44MSPS when the S/H-ADC is simulated at device level (using SPECTRE) with a full-scale, 2.58MHz sinewave. Total Harmonic Distortion (THD), dominated by the presence of a 3rd-order harmonic, is -75dB – low enough for the required effective resolution (min. 10bit ENOB).

The simulations above do not include capacitor mismatch. When it is included, non-linearity dominates the error power so that a SINAD distribution like that in Figure 17 is obtained. The worst-case value is almost 63dB, which correspond to 10.2bit ENOB.

In the same conditions the differential (DNL) and integral (INL) non-linearity curves shown in Figure 18 are obtained. INL is bounded by +1.38/-1.75LSB (referred to 12bit).

Table 2 summarizes the main performance parameters.

<table>
<thead>
<tr>
<th>Parameter</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Resolution</td>
<td>12bit</td>
</tr>
<tr>
<td>Conversion rate</td>
<td>44MSPS</td>
</tr>
<tr>
<td>Input range</td>
<td>1.25 or 2.5Vpp</td>
</tr>
<tr>
<td>ENOB at Nyquist</td>
<td>10bit (w.c.), 11bit (typ.)</td>
</tr>
<tr>
<td>DNL (12bit)</td>
<td>&lt;±0.8LSB</td>
</tr>
<tr>
<td>INL (12bit)</td>
<td>&lt;±1.8LSB</td>
</tr>
<tr>
<td>Power cons. (3.3V)</td>
<td>200mW @ 44MSPS, 100mW @ 24.6MSPS (DRM), 40mW @ 8.2MSPS (DAB)</td>
</tr>
<tr>
<td>Active area</td>
<td>1mm²</td>
</tr>
</tbody>
</table>
REFERENCES
