Highlights

What are the main findings?
A pre-trained transformer model, fine-tuned with transfer learning, significantly improves fault detection in cyber-physical systems (CPSs) despite limited fault-labeled data.
The proposed method achieves a high average F1-score of 93.38% on industrial CPS datasets, outperforming traditional CNN and LSTM models.

What is the implication of the main finding?
Transformer-based transfer learning enables more reliable fault diagnostics in industrial CPS environments where data scarcity and domain shifts are common.
The approach demonstrates practical scalability from controlled lab conditions to real-world industrial applications.

Abstract

As industries become increasingly dependent on cyber-physical systems (CPSs), failures within these systems can cause significant operational disruptions, underscoring the critical need for effective Prognostics and Health Management (PHM). The large volume of data generated by CPSs has made deep learning (DL) methods an attractive solution; however, imbalanced datasets and the limited availability of fault-labeled data continue to hinder their effective deployment in real-world applications.
To address these challenges, this paper proposes a transfer learning approach using a pre-trained transformer architecture to enhance fault detection performance in CPSs. A streamlined transformer model is first pre-trained on a large-scale source dataset and then fine-tuned end-to-end on a smaller dataset with a differing data distribution. This approach enables the transfer of diagnostic knowledge from controlled laboratory environments to real-world operational settings, effectively addressing the domain shift challenge commonly encountered in industrial CPSs. To evaluate the effectiveness of the proposed method, extensive experiments are conducted on publicly available datasets generated from a laboratory-scale replica of a modern industrial water purification facility. The results show that the model achieves an average F1-score of 93.38% under K-fold cross-validation, outperforming baseline CNN and LSTM architectures and demonstrating the practicality of transformer-based transfer learning in industrial settings with limited fault data. Finally, to enhance transparency and better understand the model's decision process, SHAP (SHapley Additive exPlanations) is applied for explainable AI (XAI).
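The pre-train-then-fine-tune workflow summarized above can be sketched in a few lines. The sketch below is illustrative only: it substitutes a tiny logistic classifier for the paper's transformer, and the toy "source" and "target" datasets (with an artificial mean shift standing in for the lab-to-plant domain shift) are invented for the example; none of the names correspond to the paper's actual datasets or model.

```python
# Illustrative transfer-learning workflow: pre-train on a large source
# dataset, then fine-tune the same weights on a small, shifted target set.
# A logistic classifier stands in for the paper's transformer model.
import math
import random

def train(weights, data, epochs, lr):
    """In-place gradient-descent training of a logistic classifier."""
    for _ in range(epochs):
        for x, y in data:
            z = sum(w * xi for w, xi in zip(weights, x))
            z = max(-30.0, min(30.0, z))        # clamp to avoid overflow
            p = 1.0 / (1.0 + math.exp(-z))      # sigmoid
            g = p - y                           # log-loss gradient w.r.t. z
            for i, xi in enumerate(x):
                weights[i] -= lr * g * xi
    return weights

def make_data(n, shift, seed):
    """Toy two-class data; `shift` mimics a source-to-target domain shift."""
    rng = random.Random(seed)
    out = []
    for _ in range(n):
        y = rng.randint(0, 1)
        x = [1.0,                               # bias feature
             rng.gauss(y + shift, 0.3),
             rng.gauss(y + shift, 0.3)]
        out.append((x, y))
    return out

# 1) Pre-train on a large labelled source dataset (lab-like conditions).
source = make_data(400, shift=0.0, seed=0)
w = train([0.0, 0.0, 0.0], source, epochs=200, lr=0.5)

# 2) Fine-tune end-to-end on a small target dataset whose distribution
#    differs from the source (the domain-shift scenario in the abstract).
target = make_data(40, shift=0.3, seed=1)
w = train(w, target, epochs=50, lr=0.1)

# 3) Evaluate on held-out target-domain data.
test = make_data(100, shift=0.3, seed=2)
accuracy = sum(
    (sum(wi * xi for wi, xi in zip(w, x)) > 0) == (y == 1)
    for x, y in test
) / len(test)
```

In the real method the transferred object is the full set of pre-trained transformer weights rather than three scalars, and fine-tuning is end-to-end; the structure of the pipeline (pre-train, fine-tune on scarce shifted data, evaluate on the target domain) is the same.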