High-Throughput, Sorted QR Accelerator for Non-linear Processing in Open-RAN Systems

Thomas James Thomas; Marcin Filo; Konstantinos Nikitopoulos

doi:10.1109/ACCESS.2024.3381499

Back

High-Throughput, Sorted QR Accelerator for Non-linear Processing in Open-RAN Systems

Journal article

Open access

Peer reviewed

High-Throughput, Sorted QR Accelerator for Non-linear Processing in Open-RAN Systems

Thomas James Thomas, Marcin Filo and Konstantinos Nikitopoulos

IEEE access, Vol.12, pp.44564 - 44572

25/03/2024

DOI: https://doi.org/10.1109/ACCESS.2024.3381499

Abstract

field programmable gate array

Hardware acceleration

multiple-input multiple-output

non-linear detection

Open radio access network

sorted QR decomposition

Open Radio Access Networks (Open-RAN) require cost- and energy-efficient solutions to facilitate their deployment at scale. A significant concern in multiple-input multiple-output (MIMO) systems employing traditional linear processing is the substantial number of radio frequency (RF) chains at the base station (BS), which is required to ensure the accurate decoding of spatially multiplexed streams. Recently, however, practical non-linear approaches, which facilitate near-optimal parallelizable tree searches, have been successfully implemented on actual systems and demonstrated the capability to considerably reduce the required RF chains without affecting user performance. Like QR decomposition (QRD) being used to perform channel inversion in linear systems, these non-linear approaches employ a sorted QRD (SQRD) to curtail the search complexity. However, this can be a significant bottleneck for general software-based non-linear solutions, preventing them from fully exploiting the gains. To address the latency limitations with SQRD, this work presents a high throughput hardware accelerator based on reformulating the underlying Modified Gram Schmidt process (MGS) to extract further parallelism than previous designs. Implementations of the proposed architecture demonstrate at least 2-fold improvements in the achievable throughput and processing latency over existing 4×4 and 8×8 field programmable gate array (FPGA) implementations and can be scaled up to 16×16 MIMO systems. Further, the proposed accelerator is integrated with the software framework that can considerably offload the processing burden for higher number of streams under strict latency conditions.

Files and links (2)

pdf

High_Throughput_Sorted_QR_Accelerator3.38 MBDownload View

Author's Accepted Manuscript Open Access

url

https://doi.org/10.1109/ACCESS.2024.3381499View

Published (Version of record)CC BY-NC-ND V4.0, Open

Metrics

1 Record Views

Details

Title: High-Throughput, Sorted QR Accelerator for Non-linear Processing in Open-RAN Systems
Creators: Thomas James Thomas (Author)
Marcin Filo (Author) - University of Surrey, School of Computer Science and Electronic Engineering
Konstantinos Nikitopoulos (Corresponding Author) - University of Surrey, School of Computer Science and Electronic Engineering
Publication Details: IEEE access, Vol.12, pp.44564 - 44572
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date published: 25/03/2024
Grants: Innovate UK (United Kingdom, Swindon)
Grant note: This work was supported by the HiPer-RAN Project, a winner of the UK’s DSIT Open Networks Ecosystem Competition and the NL-COMM Project, a winner of Innovate UK and UK’s DSIT Future Telecommunications Challenge Competition.
Identifiers: 99925366002346
Copyright: © 2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/
Academic Unit: School of Computer Science and Electronic Engineering
Language: English
Resource Type: Journal article

High-Throughput, Sorted QR Accelerator for Non-linear Processing in Open-RAN Systems

Abstract

Files and links (2)

Metrics

Details

Usage Policy