A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet

Anwer Mustafa Hilal; Fahd N. Al-Wesabi; Abdelzahir Abdelmaboud; Manar Ahmed Hamza; Mohammad Mahzari; Abdulkhaleq Q.A. Hassan

doi:10.1093/comjnl/bxab087

A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet

Anwer Mustafa Hilal, Fahd N. Al-Wesabi, Abdelzahir Abdelmaboud, Manar Ahmed Hamza, Mohammad Mahzari, Abdulkhaleq Q.A. Hassan

English Language and Literature

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.

Original language	English
Pages (from-to)	423-435
Number of pages	13
Journal	Computer Journal
Volume	65
Issue number	2
DOIs	https://doi.org/10.1093/comjnl/bxab087
State	Published - 1 Feb 2022

Keywords

content authentication
hidden Markov model
natural language processing
tampering detection
text analysis
zero-watermarking

Access to Document

10.1093/comjnl/bxab087

Cite this

@article{ae3d719ae72240218f61c1d333cf3cb7,

title = "A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet",

abstract = "Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.",

keywords = "content authentication, hidden Markov model, natural language processing, tampering detection, text analysis, zero-watermarking",

author = "Hilal, \{Anwer Mustafa\} and Al-Wesabi, \{Fahd N.\} and Abdelzahir Abdelmaboud and Hamza, \{Manar Ahmed\} and Mohammad Mahzari and Hassan, \{Abdulkhaleq Q.A.\}",

year = "2022",

month = feb,

day = "1",

doi = "10.1093/comjnl/bxab087",

language = "English",

volume = "65",

pages = "423--435",

journal = "Computer Journal",

issn = "0010-4620",

publisher = "Oxford University Press",

number = "2",

}

TY - JOUR

T1 - A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet

AU - Hilal, Anwer Mustafa

AU - Al-Wesabi, Fahd N.

AU - Abdelmaboud, Abdelzahir

AU - Hamza, Manar Ahmed

AU - Mahzari, Mohammad

AU - Hassan, Abdulkhaleq Q.A.

PY - 2022/2/1

Y1 - 2022/2/1

N2 - Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.

AB - Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.

KW - content authentication

KW - hidden Markov model

KW - natural language processing

KW - tampering detection

KW - text analysis

KW - zero-watermarking

UR - http://www.scopus.com/inward/record.url?scp=85125439382&partnerID=8YFLogxK

U2 - 10.1093/comjnl/bxab087

DO - 10.1093/comjnl/bxab087

M3 - Article

AN - SCOPUS:85125439382

SN - 0010-4620

VL - 65

SP - 423

EP - 435

JO - Computer Journal

JF - Computer Journal

IS - 2

ER -

A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this