Abstract
The diversity of network forms and services makes it difficult for the TCP protocol to achieve good performance. The current XQUIC implementation of the QUIC protocol still adopts TCP’s heuristic congestion control mechanisms, resulting in limited performance gains. In recent years, reinforcement learning-based congestion control has emerged as an effective alternative to traditional strategies, but existing algorithms are not optimized for dynamic network characteristics. In this paper, we propose a deep reinforcement learning-based congestion control algorithm, Dynamic Network Congestion Control for QUIC Based on PPO (DNCCQ-PPO). To address the heterogeneity of dynamic network training environments, we introduce a novel sampling interaction mechanism, action space, and reward function, and propose an asynchronous distributed training scheme. Additionally, we develop a generalized reinforcement learning framework for developing congestion control algorithms on XQUIC, and verify the performance of DNCCQ-PPO within this framework. Experimental results demonstrate that the algorithm converges quickly and trains stably. In performance tests, DNCCQ-PPO achieves throughput comparable to that of CUBIC while reducing latency by 54.78%. In multi-stream fairness tests, it outperforms several mainstream algorithms. In satellite network simulations, DNCCQ-PPO maintains high throughput while reducing latency by 69.58% and 72.77% compared to CUBIC and PCC, respectively.