CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing
Preprint   Open access

Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li and Wenwu Wang
arXiv.org
Cornell University Library, arXiv.org
11 October 2023

Abstract

Keywords: Perception, Segments, Visual signals, Semantics
URL: https://arxiv.org/pdf/2310.07517.pdf
Preprint (Author's original) Open
