A deep learning approach for text generation

Ahmed Elmogy, Belal Mahmoud, Mohamed Saleh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

One of the most challenging language modeling problems is text generation. The importance of language modeling comes from its involvement in many language processing tasks such as conversational system, speech to text, and text summarization. The language models are typically trained to learn the occurrence of next word in a sequence based on previous words in the text. However, when it comes to testing, it is highly expected that the entire sequence will be generated from the scratch which is computationally not suitable to many applications. In this paper, one of the popular deep learning architectures called bidirectional recurrent neural network (BRNN) to develop a text generation approach. The recurrent neural network used in the developed approach uses long short-term memory (LSTM). By using LMST recurrent neural network, the proposed approach is well suited to making predictions based on time series data. Two representations are considered in this paper; word to vector (word2vec) and one-hot representations. The word2vec model is used with two datasets called Twilight and Alice in Wonderland stories while the one-hot model is used only with Alice in Wonderland story only. The experimental results show that the word2vec model outperforms the one-hot model when using to large data sets.

Original languageEnglish
Title of host publication29th International Conference on Computer Theory and Applications, ICCTA 2019 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages102-106
Number of pages5
ISBN (Electronic)9781728152752
DOIs
StatePublished - 29 Oct 2019
Event29th International Conference on Computer Theory and Applications, ICCTA 2019 - Alexandria, Egypt
Duration: 29 Oct 201931 Oct 2019

Publication series

Name29th International Conference on Computer Theory and Applications, ICCTA 2019 - Proceedings

Conference

Conference29th International Conference on Computer Theory and Applications, ICCTA 2019
Country/TerritoryEgypt
CityAlexandria
Period29/10/1931/10/19

Keywords

  • BRNN
  • Deep learning
  • LSTM
  • One-hot vector
  • Text generation
  • Word2vec

Fingerprint

Dive into the research topics of 'A deep learning approach for text generation'. Together they form a unique fingerprint.

Cite this