Abstract
The traditional supervised learning paradigm relies on large volumes of annotated data, which is often costly and labor-intensive to obtain, creating a major bottleneck in developing deep learning solutions. To overcome this limitation, we propose a novel self-learning model for failure classification in multivariate time-series data using a semi-supervised approach that combines unsupervised and supervised learning. Initially, an unsupervised method identifies normal and faulty patterns to pseudo-label a small dataset. A deep supervised learning model is then trained with these pseudo-labels, incorporating a confidence layer to assign prediction confidence scores. This enables iterative refinement and progressive construction of a labeled dataset from unlabeled data. Furthermore, transfer learning is employed to support multiclass fault classification, allowing the model to generalize across evolving fault types. Our contribution lies in the unique orchestration of unsupervised preprocessing, confidence-guided supervision, and transfer learning to adaptively retain prior knowledge while minimizing human annotation. This makes the proposed framework particularly well-suited for dynamic environments where labeled failure data is scarce and incrementally available.
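The confidence-guided self-labeling loop described above can be sketched in simplified form. This is an illustrative mock-up, not the authors' implementation: a nearest-centroid classifier stands in for the deep supervised model, and a softmax over negative distances plays the role of the confidence layer; the function name, threshold, and round count are all assumptions for illustration.

```python
import numpy as np

def self_train(X_seed, y_seed, X_unlabeled, threshold=0.8, max_rounds=5):
    """Iteratively grow a labeled set from unlabeled data (simplified sketch).

    A nearest-centroid classifier stands in for the deep model; softmax over
    negative distances to the centroids acts as the 'confidence layer'. In each
    round, unlabeled samples whose top-class confidence exceeds `threshold` are
    pseudo-labeled and promoted into the training set.
    """
    X_lab, y_lab = X_seed.copy(), y_seed.copy()
    pool = X_unlabeled.copy()
    for _ in range(max_rounds):
        if len(pool) == 0:
            break
        # "Train": recompute class centroids from the current labeled set
        classes = np.unique(y_lab)
        centroids = np.stack([X_lab[y_lab == c].mean(axis=0) for c in classes])
        # Predict with confidence: softmax over negative Euclidean distances
        dists = np.linalg.norm(pool[:, None, :] - centroids[None, :, :], axis=2)
        logits = -dists
        probs = np.exp(logits - logits.max(axis=1, keepdims=True))
        probs /= probs.sum(axis=1, keepdims=True)
        conf = probs.max(axis=1)
        pred = classes[probs.argmax(axis=1)]
        # Promote only confident pseudo-labels into the labeled set
        accept = conf >= threshold
        if not accept.any():
            break
        X_lab = np.vstack([X_lab, pool[accept]])
        y_lab = np.concatenate([y_lab, pred[accept]])
        pool = pool[~accept]
    return X_lab, y_lab, pool
```

In the actual framework, the centroid step would be replaced by training the deep network on the current pseudo-labeled set, but the control flow, confident predictions feeding the next round's training data, is the same.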
• A novel self-learning model to classify unlabeled multivariate time-series.
• Combines unsupervised learning, supervised learning, and transfer learning.
• Reduces manual labeling needs in industrial fault diagnosis scenarios.
• Model validated on real-world multivariate time-series data with scarce labels.
• Achieves performance comparable to fully supervised DL models without labeled data.