Academy & Industry Research Collaboration Center (AIRCC)

Volume 11, Number 23, December 2021

GAN-Based Data Augmentation and Anonymization for Mask Classification

  Authors

Mustafa Çelik1, 2, Ahmet HaydarÖrnek1, 3, 1Huawei Turkey R&D Center, Turkey, 2Istanbul Technical University, Turkey, 3Konya Technical University, Turkey

  Abstract

Deep learning methods, especially convolutional neural networks (CNNs), have made a major contribution to computer vision. However, deep learning classifiers need large-scale annotated datasets to be trained without over-fitting. Also, in high-data diversity, trained models generalize better. However, collecting such a large-scale dataset remains challenging. Furthermore, it is invaluable for researchers to protect the subjects' confidentiality when using their personal data such as face images. In this paper, we propose a deep learning Generative Adversarial Networks (GANs) which generates synthetic samples for our mask classification model. Our contributions in this work are two-fold that the synthetics images provide. First, GANs' models can be used as an anonymization tool when the subjects' confidentiality is matters. Second, the generated masked/unmasked face images boost the performance of the mask classification model by using the synthetic images as a form of data augmentation. In our work, the classification accuracy using only traditional data augmentations is 93.71 %. By using both synthetic data and original data with traditional data augmentations the result is 95.50 %. It is shown that the GAN-generated synthetic data boosts the performance of deep learning classifiers.

  Keywords

Convolutional Neural Network, Data Anonymization, Data Augmentation, Generative Adversarial Network, Mask classification.