## A driven pose estimation for spacecraft in-orbit servicing using FPGA acceleration

Silvia Romero Azpitarte<sup>1,2</sup>, Javier Ferre Martín, Agustín Fernández León<sup>3</sup>

<sup>1</sup> Universidad Carlos III de Madrid, <sup>2</sup> GMV Aerospace and Defence S.A.U., <sup>3</sup> European Space Agency (ESA), ESTEC





DEPLOYING ONTO THE FPGA Monocular In this poster the proposed **DL method** doesn't apply to classic camera for the close-range phase of the pose estimation. 3D fiducial rendezvous is presented. markers



| DPU            | С                 | ZD                                  | X                                                            | 8                              | G                  | ISA1                                       | B512                           |
|----------------|-------------------|-------------------------------------|--------------------------------------------------------------|--------------------------------|--------------------|--------------------------------------------|--------------------------------|
| DPU IP<br>core | CNN<br>(ResNet50) | Hardware<br>platform is<br>Zynq DDR | Quantization method<br>is DECENT™ (Deep<br>Compression Tool) | Quantization bitwidth is 8-bit | General<br>purpose | Instruction Set<br>Architecture<br>version | 512 DSP<br>blocks<br>allocated |
|                |                   |                                     |                                                              |                                |                    |                                            |                                |

| Resource | Utilization | Percentage in KRIA |
|----------|-------------|--------------------|
| LUT      | 30030       | 25.64%             |
| LUTRAM   | 3602        | 6.25%              |
| FF       | 41171       | 17.58%             |
| URAM     | 18          | 28.13%             |
| DSP      | 134         | 10.74%             |
| BUFG     | 4           | 1.14%              |
| PLL      | 1           | 12.50%             |

| Platfor<br>mLatencyGPU*126,000<br>msFPGA30,100 ms |                                                                                  | N° instances<br>evaluated                      | Avg time per<br>instance                                        |  |
|---------------------------------------------------|----------------------------------------------------------------------------------|------------------------------------------------|-----------------------------------------------------------------|--|
|                                                   |                                                                                  | 16,456                                         | 7.657 ms                                                        |  |
|                                                   |                                                                                  | 100                                            | 301 ms                                                          |  |
| inferenc                                          | <b>e</b> time given                                                              | the resource                                   | utilization.                                                    |  |
| inferenc                                          | <b>e</b> time given                                                              | the resource                                   | utilization.                                                    |  |
| inferenc                                          | e time given<br>State                                                            | the resource Avg Powe                          | utilization.<br>r Consumption                                   |  |
| KRIA                                              | e time given<br>State<br>KR260 Idle                                              | the resource Avg Powe 3.                       | utilization.<br>r Consumption<br>820 W                          |  |
| KRIA<br>KRIA k<br>post-im                         | e time given<br>State<br>KR260 Idle<br>KR260 Vivado<br>plementation<br>rediction | the resource<br>Avg Powe<br>3.<br>3.           | utilization.<br>r Consumption<br>820 W<br>886 W                 |  |
| KRIA<br>KRIA<br>KRIA<br>post-im<br>pr<br>KR260 F  | e time given<br>State<br>KR260 Idle<br>R260 Vivado<br>plementation<br>rediction  | the resource<br>Avg Powe<br>3.<br>1 3.<br>2 4. | utilization.<br><b>r Consumption</b><br>820 W<br>886 W<br>030 W |  |

shows a performance degradation of 4.22% for position and 10.96% for orientation of the quantized model on the FPGA compared to the original model on the





After quantization, fewer than 30 of 16,456 test images failed, mainly in extreme illumination cases. The model **remains robust** despite quantization and hardware transition. The model outperformed the classic CAT method in handling occlusions, lighting changes, and sudden shifts. Future work could merge both methods or apply filtering for added reliability.

| Funan    | Total avg. ab | Allow. Error |             |  |
|----------|---------------|--------------|-------------|--|
| Error    | GPU*          | FPGA         | (red lines) |  |
| Position | 2.011 mm      | 2.096 mm     | 20 mm       |  |
| Rotation | 0.073 deg     | 0.081 deg    | 3 deg       |  |

\*NVIDIA GeForce RTX 3070 (5888 CUDA cores). 8GB memory. Driver version 560.35.03. CUDA version 12.6

Future migration to a space-certified platform could be achieved using either a NanoXplore FPGA, which would require a complete redesign due to its unique architecture and lack of compatibility with the Vitis Al workflow, or a radiation-tolerant FPGA from AMD Xilinx. The latter includes options such as the XQR Versal, which offers built-in Al acceleration, improved fault tolerance, and a more familiar development environment, potentially allowing for partial reuse of the existing workflow.