A Realistic Image Generation of Face from Text Description Using the Fully Trained Generative Adversarial Networks

Muhammad Zeeshan Khan; Saira Jabeen; Muhammad Usman Ghani Khan; Tanzila Saba; Asim Rehmat; Amjad Rehman; Usman Tariq

doi:10.1109/ACCESS.2020.3015656

A Realistic Image Generation of Face from Text Description Using the Fully Trained Generative Adversarial Networks

Muhammad Zeeshan Khan, Saira Jabeen, Muhammad Usman Ghani Khan, Tanzila Saba, Asim Rehmat, Amjad Rehman, Usman Tariq

Management Information Systems

Research output: Contribution to journal › Article › peer-review

61 Scopus citations

Abstract

Text to face generation is a sub-domain of text to image synthesis. It has a huge impact on new research areas along with the wide range of applications in the public safety domain. Due to the lack of dataset, the research work focused on the text to face generation is very limited. Most of the work for text to face generation until now is based on the partially trained generative adversarial networks, in which the pre-trained text encoder has been used to extract the semantic features of the input sentence. Later, these semantic features have been utilized to train the image decoder. In this research work, we propose a fully trained generative adversarial network to generate realistic and natural images. The proposed work trained the text encoder as well as the image decoder at the same time to generate more accurate and efficient results. In addition to the proposed methodology, another contribution is to generate the dataset by the amalgamation of LFW, CelebA and locally prepared dataset. The dataset has also been labeled according to our defined classes. Through performing different kinds of experiments, it has been proved that our proposed fully trained GAN outperformed by generating good quality images by the input sentence. Moreover, the visual results have also strengthened our experiments by generating the face images according to the given query.

Original language	English
Article number	9163356
Pages (from-to)	1250-1260
Number of pages	11
Journal	IEEE Access
Volume	9
DOIs	https://doi.org/10.1109/ACCESS.2020.3015656
State	Published - 2021

Keywords

CNN
data augmentation
face synthesis
GAN
image generation
legal identity for all
text to face

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/ACCESS.2020.3015656

Cite this

@article{0e094202a69d422eb22070318e001d47,

title = "A Realistic Image Generation of Face from Text Description Using the Fully Trained Generative Adversarial Networks",

abstract = "Text to face generation is a sub-domain of text to image synthesis. It has a huge impact on new research areas along with the wide range of applications in the public safety domain. Due to the lack of dataset, the research work focused on the text to face generation is very limited. Most of the work for text to face generation until now is based on the partially trained generative adversarial networks, in which the pre-trained text encoder has been used to extract the semantic features of the input sentence. Later, these semantic features have been utilized to train the image decoder. In this research work, we propose a fully trained generative adversarial network to generate realistic and natural images. The proposed work trained the text encoder as well as the image decoder at the same time to generate more accurate and efficient results. In addition to the proposed methodology, another contribution is to generate the dataset by the amalgamation of LFW, CelebA and locally prepared dataset. The dataset has also been labeled according to our defined classes. Through performing different kinds of experiments, it has been proved that our proposed fully trained GAN outperformed by generating good quality images by the input sentence. Moreover, the visual results have also strengthened our experiments by generating the face images according to the given query.",

keywords = "CNN, data augmentation, face synthesis, GAN, image generation, legal identity for all, text to face",

author = "Khan, \{Muhammad Zeeshan\} and Saira Jabeen and Khan, \{Muhammad Usman Ghani\} and Tanzila Saba and Asim Rehmat and Amjad Rehman and Usman Tariq",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2021",

doi = "10.1109/ACCESS.2020.3015656",

language = "English",

volume = "9",

pages = "1250--1260",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - A Realistic Image Generation of Face from Text Description Using the Fully Trained Generative Adversarial Networks

AU - Khan, Muhammad Zeeshan

AU - Jabeen, Saira

AU - Khan, Muhammad Usman Ghani

AU - Saba, Tanzila

AU - Rehmat, Asim

AU - Rehman, Amjad

AU - Tariq, Usman

PY - 2021

Y1 - 2021

N2 - Text to face generation is a sub-domain of text to image synthesis. It has a huge impact on new research areas along with the wide range of applications in the public safety domain. Due to the lack of dataset, the research work focused on the text to face generation is very limited. Most of the work for text to face generation until now is based on the partially trained generative adversarial networks, in which the pre-trained text encoder has been used to extract the semantic features of the input sentence. Later, these semantic features have been utilized to train the image decoder. In this research work, we propose a fully trained generative adversarial network to generate realistic and natural images. The proposed work trained the text encoder as well as the image decoder at the same time to generate more accurate and efficient results. In addition to the proposed methodology, another contribution is to generate the dataset by the amalgamation of LFW, CelebA and locally prepared dataset. The dataset has also been labeled according to our defined classes. Through performing different kinds of experiments, it has been proved that our proposed fully trained GAN outperformed by generating good quality images by the input sentence. Moreover, the visual results have also strengthened our experiments by generating the face images according to the given query.

AB - Text to face generation is a sub-domain of text to image synthesis. It has a huge impact on new research areas along with the wide range of applications in the public safety domain. Due to the lack of dataset, the research work focused on the text to face generation is very limited. Most of the work for text to face generation until now is based on the partially trained generative adversarial networks, in which the pre-trained text encoder has been used to extract the semantic features of the input sentence. Later, these semantic features have been utilized to train the image decoder. In this research work, we propose a fully trained generative adversarial network to generate realistic and natural images. The proposed work trained the text encoder as well as the image decoder at the same time to generate more accurate and efficient results. In addition to the proposed methodology, another contribution is to generate the dataset by the amalgamation of LFW, CelebA and locally prepared dataset. The dataset has also been labeled according to our defined classes. Through performing different kinds of experiments, it has been proved that our proposed fully trained GAN outperformed by generating good quality images by the input sentence. Moreover, the visual results have also strengthened our experiments by generating the face images according to the given query.

KW - CNN

KW - data augmentation

KW - face synthesis

KW - GAN

KW - image generation

KW - legal identity for all

KW - text to face

UR - http://www.scopus.com/inward/record.url?scp=85099143366&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2020.3015656

DO - 10.1109/ACCESS.2020.3015656

M3 - Article

AN - SCOPUS:85099143366

SN - 2169-3536

VL - 9

SP - 1250

EP - 1260

JO - IEEE Access

JF - IEEE Access

M1 - 9163356

ER -

A Realistic Image Generation of Face from Text Description Using the Fully Trained Generative Adversarial Networks

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this