Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Other

Swapnil Bhosale, Haosen Yang, Diptesh Kanojia and Xiatian Zhu
arXiv.org
Cornell University Library, arXiv.org
13/09/2023

Abstract

Annotations, Audio data, Pixels, Segmentation, Supervised learning, Training
