Front-End Electronics and IP Blocks in 65 nm CMOS Technology for Pixel Detectors in X-ray Imaging Applications

Luigi Gaioni for the BG/PV group



Future Detectors for the European XFEL

18-19 Sept 2023

#### Front-end Electronics for imaging applications

• PixFEL Project (INFN Pavia/Bergamo, Pisa, Trento): developing front-end electronics for diffraction imaging applications at FELs

2

• FALCON Project (UniBG, UniPV, ANL): development of a top tier detector for X-ray ptychography

#### IP Blocks in ASICs for High Energy Physics Applications

- CERN RD53 Collaboration
- Analog and M/S blocks

#### Research Groups

- University of Pavia: L. Ratti, G. Torilla, S. Giroletti, C. Vacchi, F. Shojaei
- University of Bergamo: V. Re, L. Gaioni, M. Manghisoni, G. Traversi, E. Riceputi, P. Lazzaroni, A. Galliani, L. Ghislotti

Develop a four-side buttable, multi-layer module for the assembly of large area detectors with minimum dead area



# **Front-end Channel**





110 um

- Charge sensitive amplifier dynamic signal compression and programmable gain
- Time variant filter two different versions, transconductor+FCF and DGI
- Analog-to-digital conversion 10/9 bit SAR ADC

- 65 nm CMOS technology (1 poly 9 metal stack)
- Power dissipation: 230 uW/350 uW (depending on the filter solution)
- Cell area: 110 um x 110 um

### Prototypes and test chips





mini@sic chips, ~2 mm x 2 mm, TSMC 65 nm technology PFM2



32x32, 110 um pitch channels multi-project wafer (MPW) run half array DGI-based, half FCF-based 10 bit with MIM capacitors in PFM2





32x32, 110 um pitch channels engineering run DGI-based shaper only 1, 2, 3 keV gain configurations 9 bit SAR ADC with MOM capacitors Designed for peripheral TSV implementation

## CSA with dynamic signal compression





7



Larger ENC in the case of NMOS feedback capacitor

- larger stray capacitance at the preamplifier input
- the PMOS capacitance, being integrated in an Nwell, is less sensitive to noise propagating through the substrate

# FALCON Project for Ptychography applications

- International collaboration between ANL (Chicago, USA), UniBG and UniPV (Italy).
- Development of a **top tier detector for X-ray ptychography**.
- The **pixelated** detector will operate at frequency up to **1 MHz** and a **128-by-128 matrix** is envisioned.
- Moderate dynamic range at the input: 256 photons @ 5 keV, 9 keV, 25 keV.





Hemperek T. (2021). Advances in Pixel Detectors.

9



- Prototype Fast Readout for ptYchography Applications w/ 16 pixels.
- Commercial 65 nm CMOS technology.
- Single photon detection, with an ENC of ~250 e<sup>-</sup> rms @ C<sub>D</sub> = 100 fF.
- Adapts to **3 input photon energies**: 5 keV, 9 keV and 25 keV.
- Input dynamic range up to 256 photons in each mode.
- **RC-CR shaper** with 4 selectable peaking times (230 ns ÷ 530 ns).

150 um DIC COMP

10

• **SOT comparator chain** to reject <1 input photon signals.

150 um

- 10-bit SAR ADC.
- Power consumption: **~220 μW/px.**
- Area: **150 μm x 150 μm.**

# Charge Sensitive Amplifier



- Detector output modelled as in figure  $(T_{\delta} = 11.4 \text{ ns})$ , with  $C_{D} = 100 \text{ fF}$ .
- Too short of a time for a complete readout (CSA integration, shaping and A/D conversion).
- CSA integration is performed in a tunable exposure time (t<sub>exp</sub>) and for t > t<sub>exp</sub> the readout is forced to idle state by a discharging switch on the CSA.
- Signal at the switching is proportional to integrated charge.



Shaper





- Noise in terms of equivalent charge at the input (e<sup>-</sup> rms).
- Only schematic-implementation of CSA was considered (should be the main contributor to noise).
- The rest of the circuit is ideal.
- 500 transient noise simulations for each point.



- Shorter t<sub>exp</sub> shows better noise performance.
- Longer t<sub>p</sub> enable lower noise.
- Combinations of (t<sub>exp</sub>, t<sub>p</sub>) for which ENC < 200 e<sup>-</sup> rms exist.

# pY16 and pYTS ASICs



- Area of 1.7 mm x 2 mm.
- CLCC68 package.
- Elementary cells are mirrored about the vertical axis to isolate analog from digital.
- On the left, pFREYA16 (pY16), composed by a 8by-2 matrix of elementary cells and peripheral circuitry.
- On the right, pYTS, a series of test structures arranged in the same 8-by-2 matrix fashion:
  - CSA only.
  - SHAP only.
  - ANALOG only.
  - ADC only (different versions)
- Prototype chips have been received last week, test setup is being developed

### RD53 IP blocks in 65 nm CMOS

# **RD53** collaboration

- RD53 collaboration was established to design and develop pixel chips for ATLAS/CMS phase 2 upgrades
- The RD53 project includes 24 institutes, ~20 designers
- Extremely challenging requirements for HL-LHC
  - Hit rates: 3 GHz/cm², small pixels: 50 x 50  $\mu m^2$
  - Radiation: **500 Mrad** 10<sup>16</sup> neq/cm<sup>2</sup> over 5 years
- Technology: 65nm CMOS
- Characterization of the 65 nm CMOS technology in harsh radiation environment
- Design of a rad-hard IP library (Analog front-ends, DACs, ADCs, CDR/PLL, high-speed serializers, RX/TX, ShuntLDO, ...) qualified through a series of test chips



**RD53A** Size: 20 x 11.5 mm<sup>2</sup> (Aug 2017)



**RD53B-ATLAS (ITkPix-V1)** size: 20 x 21 mm<sup>2</sup> (Mar 2020)



**RD53B-CMS (CROC-V1)** size: 21.6 x 18.6 mm<sup>2</sup> (May 2021)

#### **Production chips**

- RD53C-ATLAS (ITkPix-V2) submitted in March 2023
- RD53C-CMS (CROC-V2) to be submitted in Sept 2023



# RD53 chip floorplan

• Analog Chip Bottom (ACB)

includes building block for Calibration, Bias, Monitoring and Clock/Data recovery

• Digital Chip Bottom (DCB)

synthesized logic for communication to/from chip, readout and configuration

• Padframe (common to ATLAS/CMS)

includes I/O blocks with ESD protections and distributed ShuntLDO regulator for serial powering



| Block                                | Description                                                                                                                                                     |
|--------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Analogue front end                   | The ATLAS chip versions use a differential front end.<br>The CMS chip versions use a linear front end.                                                          |
| Shunt LDO                            | Enables start-up and serial powering. Constant input current shared between chips, modules on serial chains. 1 LDO for digital power, 1 LDO for analogue power. |
| Clock & Data Recovery<br>(CDR)/PLL   | Recovers a 160MHz clock and command/trigger stream. The PLL generates internal clocks: 160 MHz, 64 MHz, 640 MHz and 1.28 GHz.                                   |
| Bias circuit                         | Provides biases to the pixel array. Based on bandgap references.                                                                                                |
| Calibration circuit                  | Injects hits into the pixel array, to calibrate its response.                                                                                                   |
| Monitoring block                     | Digitises analogue quantities using a voltage mux, current mux and 12-bit ADC                                                                                   |
| Temperature and<br>Radiation sensors | Temperature sensors: polysilicon resistors. Radiation sensors: based on PMOS devices with a linear variation in voltage in the dose range 10 - 1000 Mrad.       |
| LVDS pads/drivers                    | Pads and drivers for differential inputs/outputs                                                                                                                |

- Based on Bandgap reference circuits (low sensitivity for V/I to temperature variations)
- Tuning by means of 4 wire-bond trimming pads (no risk of SEU bit flips) → optimal value is found during wafer probing
- Tuned current Iref is mirrored and used by DACS for the bias of analog front-ends, CDR and other IPs



4% difference over 120°C temp. range

# Bandgap reference





- Current in R0  $\propto$  difference of two VGS  $\rightarrow$  **PTAT**, current in R1  $\propto$  to VGS  $\rightarrow$  **CTAT**
- Current sum is mirrored onto R2, output is T-independent
- **5 bit** trimming
- Integrated in a **mini@sic prototype** before pre-production RD53 submission

#### Bandgap reference – test results



**10 BGRs** characterized (5 ASICs) Vref  $\approx$  440mV ± 10mV (conf. 16) 2 ASICs have been characterized in the climatic chamber at the INFN Pavia between -40°C and +100°C One chip irradiated (room T) up to **1 Grad(SiO<sub>2</sub>) TID** of 10-keV X-rays.

21

Dose rate of about 1 krad(SiO<sub>2</sub>)/s.

During irradiation the bandgaps were biased as in the real application

Radiation induced  $\Delta V_{REF} \approx 5\% @ 1 Grad$ 



- Each pixel in the chip matrix is equipped with an injection circuit for test/calibration
- Local generation of the analog test pulse starting from 2 defined DC voltages (CAL\_HI and CAL\_ME) distributed to all
  pixels and a 3rd level (local GND)
- Two operation modes which allow to generate **two consecutive signals** of the same polarity or to inject **different charges in neighboring pixels** at the same time

### Injection cap measure



- Possibility to measure the value of injection capacitor using a dedicated circuit to define accurately the injected charge at the preamplifier input
- Integrated in the Analog Chip Bottom
- Two sections: the cap measure and parasitic cap measure. The first one consists of an array of 100 capacitors, each identical to the injection cap → routing metal is needed, which adds parasitic capacitance. An identical array with the capacitors removed is therefore integrated to evaluate the parasitic capacitance.
- The circuit is based on a charge pump with NMOS and PMOS transistors controlled by non-overlapping clocks → current in R<sub>IMUX</sub> proportional to the capacitance

# General purpose ADC



- Based on a Successive-Approximation Register (SAR) architecture
- In RD53 chips, the ADC is fed with the bunch crossing clock (40MHz). A 1024:1 frequency divider generates the 39 kHz internal clock driving the ADC
- The SAR ADC consists of three main circuits:
  - a **12-bit DAC** based on a capacitance network supplied through the reference voltage (VREF)
  - a high sensitivity **comparator**
  - a SAR logic block including the frequency divider

24



- Capacitive DAC based on a **bridge structure**  $\rightarrow$  larger unit capacitance  $\rightarrow$  better element matching
- Nonlinearity due by mismatch of the bridge capacitance and by parasitic capacitance in the DAC array
- 6 trimming bits allows to adjust Cadj to compensate the non-linearity

25

### ADC comparator



- **3-stage comparator** with two differential operational transconductance amplifiers with diode and current source loads followed by a dynamic latch comparator
- The first stage input transistors are critical for linearity and accuracy
- Large transistors → improved offset and less sensitive to radiation but large G-S capacitance (dependent on the input voltage!) →increased non-linearity
- A compromise has been found to keep low non-linearity with good tolerance to radiation

- Different temperature sensors are implemented in the RD53B chip :
  - Three active sensors are implemented in the chip bottom close to the shunt LDO circuit, considered the hottest part of the chip
    - Based on NMOS diode-connected transistor
    - Large area devices -> very tolerant to the TID effect
  - Two sensors based on r-poly resistances are placed on the top and the bottom of the chip
    - Monitor the **temperature difference** between the top and the bottom of the pixel array
  - NTC device implemented outside the chip with bias current provided from the chip



#### Basic sensor structure



- I<sub>BIAS</sub> and R×I<sub>BIAS</sub> are applied consecutively to the sensor
- The temperature is calculated off-line based on the difference  $\Delta VD$

$$T_{ab} = \frac{q}{n_f \times k_B \times \ln R} \times \Delta V_D$$

Potential sources of temperature measurement error include the ratio R of the two sensor bias currents →
 Dynamic Element Matching is used to mitigate this effect



# **C-SLVS** Driver

- The developed C-SLVS features a differential current-steering architecture with a voltage swing of 200 mV (programmable) on a 100 Ω termination resistance and a common mode of 600 mV.
- The driver architecture is based on a Bridged-Switch Current Source scheme. The 2 mA biasing current is switched through a 100 Ω termination resistance, according to the input data stream. The output current of the transmitter can be trimmed, by means of three configuration bits, in a range from 500 µA to 2.5 mA.
- To achieve insensitivity to PVT variations, a simple **low power common-mode feedback** has also been included. The common mode voltage is sensed by two resistors, which are connected to the output node and compared with a reference voltage.



#### C-SLVS Driver – test results



| Eye Amplitude | $376.7~\pm11.7~\mathrm{mV}$ |
|---------------|-----------------------------|
| Eye Height    | <b>365</b> mV               |
| Eye Width     | 752 ps (0.9 UI)             |
| rms Jitter    | <b>11</b> ps                |
| Eye Rise Time | <b>309</b> ps               |
| Eye Fall Time | <b>220</b> ps               |

F. De Canio – TWEPP 2017, Sept 11-14, 2017

- Two ASICs are bonded directly on the PCB
- The termination resistance is connected to the driver through a 5.5 cm microstrip differential pair
- 1.2 Gbit/s CMOS PRBS signal applied to the driver input
- Transmitter output signal via Differential Probe on termination resistance

# **C-SLVS Receiver**

- The **receiver** is a rail-to-rail stage, able to detect differential signals with a common mode from 100 mV to 1 V.
- Based on three stages
  - a **fully differential amplifier** with a cross-coupled load and with a bandwidth close to 1.2 GHz
  - a differential-to-single ended amplifier with a full swing CMOS output chip voltage
  - a chain of **inverters**



VDD

Power dissipation: 2.5 mW Area: 90  $\mu m$  x 115  $\mu m$ 

#### C-SLVS Receiver – test results



F. De Canio – TWEPP 2017, Sept 11-14, 2017

- The receiver has been stimulated with a 1.2 Gbit/s differential PRBS signal
- 100 Ω internal termination resistance
- The receiver is followed by CML driver provided by the microelectronics group of CERN
- The minimum detectable signal at 1.2 Gbit/s is an input differential voltage of 150 mV
- The eye diagram is measured at CML output at 1.2 Gbit/s, when at the input VID = 200 mV VCM = 200 mV is applied

# Conclusions

- In the framework of the PixFEL project, funded by INFN, a front-end chip has been designed in a commercial 65 nm CMOS technology
  - The front-end circuit includes a charge sensitive amplifier with dynamic compression of the signal, able to cope with a dynamic range from 1 to 10000 photons
- The FALCON project aims at the development of a top tier detector for X-ray ptychography
  - a front-end able to cope with moderate dynamic range is under development. The front-end is designed to be low power ( $\leq 150 \ \mu$ W), while an ENC  $\leq 200 \ e^{-1}$  rms can be achieved in a small pixel area(150  $\mu$ m x 150  $\mu$ m).
- A number of analog and M/S IP blocks in 65 nm CMOS has been designed by the RD53 Collaboration, aiming at the development of pixel readout chips for ATLAS/CMS phase 2 upgrades

#### **PixFEL Project**

- G. Rizzo et al., "The PixFEL project: development of advanced X- ray pixel detectors for application at future FEL facilities", Journal of Instrumentation, 2015 JINST 10 C02024. doi: 10.1088/1748-0221/10/02/C02024
- M. Manghisoni, et al., "Dynamic Compression of the Signal in a Charge Sensitive Amplifier: Experimental Results", IEEE Trans. Nucl. Sci., vol. 65, no. 1, 2018. doi: 10.1109/TNS.2017.2784095
- L. Lodola, "Interleaved SAR ADC for in-pixel conversion in future X-ray FEL applications", 2015 PRIME Conference, DOI: 10.1109/PRIME.2015.7251339
- M. Pezzoli et al "Characterization of PFM3, a 32x32 readout chip for PixFEL X-ray imagers " 2019 IEEE NSS, doi: 10.1109/NSS/MIC42101.2019.9059651

#### **FALCON Project**

- P. Lazzaroni, et al., "FALCON readout channel for X-ray ptychography applications", 17th Conference on Ph.D Research in Microelectronics and Electronics (PRIME), 2022. doi: 10.1109/PRIME55000.2022.9816837
- P. Lazzaroni, et al., "A low-noise readout channel for X-ray ptychography applications", presented at 2022 IEEE NSS

#### RD53

• M. Garcia-Sciveres, F. Loddo, J. Christiansen, "RD53B Manual", CERN-RD53-PUB-19-002

# Backup slides

## Time variant filter: transconductor + FCF



Flip capacitor filter (\*)

- events with a known repetition rate  $\rightarrow$  time-variant shaping
- trapezoidal weighting function by feedback capacitor flipping
- performs correlated double sampling (CDS)

Charge sensitivity



(\*) L. Bombelli et al., "A fast current readout strategy for the XFEL DePFET detector", Nucl. Instr. and Methods, vol. A624, pp. 360-366, 2010



36

# Differential gated integrator (DGI)





**Baseline** integration

$$V_{o}(t) = -\frac{t}{RC} V_{A} \Longrightarrow V_{o}(\tau) = -\frac{\tau}{RC} V_{A}$$

Signal integration

$$\mathsf{V}_{\mathsf{o}}(\mathsf{t})=\mathsf{V}_{\mathsf{o}}(2\tau)+\frac{\mathsf{t}}{\mathsf{R}C}\mathsf{V}_{\mathsf{A}}^{*}\Rightarrow\mathsf{V}_{\mathsf{o}}(3\tau)=\frac{\tau}{\mathsf{R}C}(\mathsf{V}_{\mathsf{A}}^{*}-\mathsf{V}_{\mathsf{A}}^{*})$$

#### DGI transient response



Response to a signal from 10 to 10000 photons at a 100 ns integration time

Response to a 100 photon input signal for different gain configurations at a 250 ns integration time

### Time interleaved SAR ADC

Two split capacitive DACs in a time-interleaved structure; for each DAC

- pre-charge during one sampling period
- conversion during the subsequent period

#### No need for a dedicated stage for fast DAC charging



### Time interleaved SAR ADC



- $|DNL| < 1 LSB \rightarrow$  no missing codes
- SNR = 55.84 dB compatible with 300 uV rms noise referred to the ADC input
- ENOB = 9



V<sub>IN</sub>

Ms

#### Time interleaved SAR ADC



 $f_s = 4.5 \text{ MHz}$ 

### FALCON - Full channel transient



42

#### Injection circuit & monitor





A DAC sets the amplitude of the pulses.

The **shape of the pulse train is set by the strobe** (STB) signal, which steers the injected charge from a **monitor (MON) branch** to the CSA and back.

The switch on the CSA input branch is set to a **fixed voltage** to avoid clock feedthrough on the CSA.

- A constant bias branch is added to improve linearity for low DAC currents in continuous train mode.
- The circuit is split between periphery and pixel to minimise power consumption on pixel and maximise DAC and monitor performance, but only a single pixel of those connected can be injected at a time.
- The **MON** is realized through a **TIA** and keeps the steering circuit balanced.
- The MON has multiple modes to adapt to the input charge injected, accordingly to the CSA mode.

# FALCON Project for Ptychography applications

- Micro-Computed Tomography (μCT) is limited to micrometre-resolution.
- New techniques, such as X-ray ptychography, can achieve nanometreresolution:
  - A coherent X-ray beam is scanned across a specimen of interest, point by point:
    - Step-scan ptychography.
    - Fly-scan ptychography.
  - At each position a diffraction pattern is generated and a modestsized frame is acquired at very high rate.
  - Iterative phasing techniques are applied to the obtained diffraction patterns to achieve a super-resolution beyond lens limits.



44

# RD53 chip floorplan





Quad

- Isolation strategy: two different DNWs for analog and digital
- DNW-isolated analog 'islands':
  - Occasional PFETs using body NW for sub isolation
  - DNW shorted to VDDA
- DNW-isolated digital 'sea':
  - DNW biased at VDDD
- Global substrate not used by supply or device bodies
- Digital logic synthesized for 8 x 8 pixels to form a pixel Core



Core



# Monitoring block



This blocks is exploited for the digitization of several sensitive parameters in the chip:

- Voltages or currents in different sections of the chip
- Temperature
- Total ionizing dose

The monitoring block includes:

- an analog current multiplexer (IMUX) followed by an analog voltage multiplexer (VMUX)
- a 12-bit ADC