Abstract
Integrated sensing and communication (ISAC) technology endows users with environmental awareness capabilities, which will play a crucial role in future mobile edge computing (MEC) systems. In this paper, we consider the design of sensingassisted beam alignment and investigate resource management for a task-oriented mmWave integrated sensing, communication, and computing (ISCC) system, which can be formulated as a preference-agnostic multi-objective optimization problem. To solve this problem, we first introduce a multi-objective Markov decision process (MOMDP) to reformulate the original problem and innovatively propose a preference-agnostic multi-objective soft actor-critic (PA-MOSAC) algorithm. To demonstrate the effectiveness of our proposed system architecture and resource management algorithm, we also introduce a traditional mmWave MEC (T-MEC) system based on the same set of system parameters as a benchmark. The proximal policy optimization (PPO) algorithm, known for its robustness, is used to address resource management in the T-MEC system. Through a comprehensive comparative analysis of the two systems and algorithms, we discover that our proposed sensing-assisted beam alignment can reduce task execution delay by 25% with only 1% increase in energy consumption. We also verify the convergence of our proposed PA-MOSAC algorithm and demonstrate its superior performance over the benchmark scheme.