Abstract—Generative adversarial networks (GANs) are machine learning algorithms that can efficiently generate data such as images. Although GANs are very popular, their training is often unstable, with the generator and discriminator networks failing to converge. To address this problem and improve the stability of GANs, in this paper we automate the design of stable GAN architectures through a novel approach: differentiable architecture search with attention mechanisms for generative adversarial networks (DAMGAN). We construct a generator supernet and search for the optimal generator network within it. We propose incorporating two attention mechanisms between each pair of nodes in the supernet. The first, down attention, selects the optimal candidate operation for each edge of the supernet, while the second, up attention, improves the training stability of the supernet and limits the computational cost of the search by selecting the most important feature maps for the subsequent candidate operations. Experimental results show that the architectures searched by our method obtain a state-of-the-art inception score (IS) of 8.99 and a very competitive Fréchet inception distance (FID) of 10.27 on the CIFAR-10 dataset. Competitive results were also obtained on the STL-10 dataset (IS = 10.35, FID = 22.18). Notably, our search time was only 0.09 GPU days.

Index Terms—Generative adversarial networks, neural architecture search, attention mechanism, generative model.
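To make the two mechanisms concrete, the following is a minimal PyTorch sketch of a single supernet edge. It is an illustrative reading of the abstract, not the paper's implementation: the class name MixedEdge, the candidate operation list, and the squeeze-and-excitation-style channel gate (standing in for up attention's feature-map selection as a soft gate rather than a hard pruning) are all assumptions. Down attention is modeled as DARTS-style softmax weights over candidate operations.

```python
# Hypothetical sketch of one supernet edge with "down" and "up" attention.
# All names and design choices below are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F

# Assumed candidate operation set for an edge (the paper's actual search
# space may differ).
CANDIDATE_OPS = [
    lambda c: nn.Conv2d(c, c, kernel_size=3, padding=1),
    lambda c: nn.Conv2d(c, c, kernel_size=5, padding=2),
    lambda c: nn.Identity(),
]

class MixedEdge(nn.Module):
    """One edge between two supernet nodes (hypothetical)."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        self.ops = nn.ModuleList(make(channels) for make in CANDIDATE_OPS)
        # "Down attention": learnable logits over the candidate operations;
        # a softmax turns them into a differentiable operation selection.
        self.op_logits = nn.Parameter(torch.zeros(len(self.ops)))
        # "Up attention": a channel gate that scores feature maps so the
        # most important ones are emphasized before entering the candidate
        # operations (a soft stand-in for feature-map selection).
        self.channel_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x * self.channel_gate(x)           # re-weight feature maps (up attention)
        alphas = F.softmax(self.op_logits, 0)  # operation weights (down attention)
        return sum(a * op(x) for a, op in zip(alphas, self.ops))

# Usage: one edge applied to a small feature map.
edge = MixedEdge(channels=16)
out = edge(torch.randn(2, 16, 32, 32))
print(out.shape)  # torch.Size([2, 16, 32, 32])
```

Under this reading, discretizing the searched architecture would amount to keeping, for each edge, the operation with the largest down-attention weight.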