YFCC-CelebA Face Attributes Datasets

Mehmet Can Yavuz; Sara Atito Ali Ahmed; Mehmet Efe Kisaaga; Hasan Ocak; Berrin Yanikoglu; Sara Ahmed

doi:10.1109/SIU53274.2021.9477959

Conference proceeding

2021 29th Signal Processing and Communications Applications Conference (SIU), pp.1-4

09/06/2021

DOI: https://doi.org/10.1109/SIU53274.2021.9477959

Keywords: Biometrics; Face Attributes; Face recognition; Faces; Internet; Multimedia Web sites; Nose; Semi Supervised Learning; Signal processing algorithms; Skin; Webly Supervised Learning

Abstract

The scale of the data accessible through internet search engines can reach hundreds of millions, or even billions. Such large, weakly labeled databases have gained importance in the training of face recognition algorithms. Starting with the publicly available YFCC100M, we propose a weakly labeled subset for multi-label face recognition with self-supervised methods. A subset of YFCC100M containing 392K images of size 128x128 was obtained by querying for the 40 facial attributes. We made this dataset publicly available for other face recognition studies by sharing the IDs, the links and the bounding boxes. To reduce outliers with respect to CelebA, we apply the Elliptic Envelope algorithm in the latent feature space learned over CelebA, obtaining 353K face images. The MixMatch algorithm is then applied to this filtered set to obtain pseudo-labels. Pretraining with these pseudo-labels and final fine-tuning with CelebA brings an improvement of 0.4 percentage points in the Area Under the ROC Curve (AUC) score over the system trained only with CelebA.
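The outlier-reduction step in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn's `EllipticEnvelope` as the implementation and uses synthetic Gaussian vectors as stand-ins for the latent features. The function name `filter_outliers`, the 8-dimensional feature size, and the `contamination` value are hypothetical choices for illustration and are not taken from the paper.

```python
import numpy as np
from sklearn.covariance import EllipticEnvelope


def filter_outliers(reference_features, candidate_features,
                    contamination=0.1, seed=0):
    """Return a boolean mask marking candidates that fall inside an
    elliptic envelope fitted on the reference features.

    contamination is an assumed value, not one reported in the paper.
    """
    envelope = EllipticEnvelope(contamination=contamination,
                                random_state=seed)
    # Fit a robust Gaussian estimate on the reference (CelebA-like) set.
    envelope.fit(reference_features)
    # predict() returns +1 for inliers and -1 for outliers.
    return envelope.predict(candidate_features) == 1


# Synthetic stand-ins for latent features (assumption: 8 dimensions).
rng = np.random.default_rng(0)
reference = rng.normal(0.0, 1.0, size=(2000, 8))      # plays the role of CelebA
similar = rng.normal(0.0, 1.0, size=(900, 8))         # web images resembling it
dissimilar = rng.normal(8.0, 1.0, size=(100, 8))      # clearly off-distribution
candidates = np.vstack([similar, dissimilar])

keep_mask = filter_outliers(reference, candidates)
filtered = candidates[keep_mask]
print(f"kept {keep_mask.sum()} of {len(candidates)} candidates")
```

In this sketch the envelope is fitted only on the reference set, so candidates are judged by how well they match the reference distribution rather than their own. The retained samples would then be passed on to the pseudo-labeling stage.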

Metrics

9 Record Views

3 Times Cited - Web of Science

Details

Title: YFCC-CelebA Face Attributes Datasets
Creators: Mehmet Can Yavuz - Sabancı Üniversitesi
Sara Atito Ali Ahmed - Sabancı Üniversitesi
Mehmet Efe Kisaaga - Sabancı Üniversitesi
Hasan Ocak - Sabancı Üniversitesi
Berrin Yanikoglu - Sabancı Üniversitesi
Sara Ahmed - Surrey Business School
Publication Details: 2021 29th Signal Processing and Communications Applications Conference (SIU), pp.1-4
Publisher: IEEE
Date published: 09/06/2021
Identifiers: 99818949302346
Academic Unit: Surrey Business School
Language: English
Resource Type: Conference proceeding