Abstract
Crowd counting aims to estimate the number of individuals in images, and multimodal data has been shown to significantly improve counting accuracy. However, multimodal approaches are highly sensitive to the loss or corruption of any single modality, which can cause severe performance degradation. To address this limitation, a new problem setting, Modality-Reconfigurable Crowd Counting, is introduced, in which a model must maintain robust performance even when one input modality (e.g., RGB or thermal) is perturbed or entirely unavailable. Modality reconfigurability is achieved through effective cross-modal information transfer, enabled by a Feature Patches Generator that applies a Margin Ranking Loss across multiple network layers to align and transfer discriminative features between modalities. A Negative Knowledge Transfer Prevention module further suppresses misleading or detrimental cross-modal signals. Experiments on RGB-T crowd counting benchmarks demonstrate state-of-the-art performance, with accuracy maintained consistently under both complete and degraded modality conditions.
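The cross-modal alignment idea can be illustrated with a minimal sketch of a margin ranking objective over paired features. This is not the paper's implementation: the feature vectors, the cosine-similarity scoring, and the margin value below are all illustrative assumptions; only the standard Margin Ranking Loss form (with target y = +1) is taken as given.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two feature vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def margin_ranking_loss(pos_score, neg_score, margin=0.5):
    # Standard margin ranking loss with target y = +1:
    # zero once pos_score exceeds neg_score by at least `margin`.
    return max(0.0, -(pos_score - neg_score) + margin)

# Hypothetical per-layer features: an RGB patch, its matching thermal
# patch, and a mismatched thermal patch from a different image region.
rgb_feat    = [0.9, 0.1, 0.4]
thermal_pos = [0.8, 0.2, 0.5]   # same scene region (should align)
thermal_neg = [0.1, 0.9, 0.2]   # different region (should rank lower)

pos = cosine_similarity(rgb_feat, thermal_pos)
neg = cosine_similarity(rgb_feat, thermal_neg)
loss = margin_ranking_loss(pos, neg, margin=0.5)
```

In a setup like this, summing such a term over several network layers encourages matched RGB and thermal features to score higher than mismatched ones by a fixed margin, which is the ranking-based alignment the abstract attributes to the Feature Patches Generator.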