TY - JOUR
T1 - A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet
AU - Hilal, Anwer Mustafa
AU - Al-Wesabi, Fahd N.
AU - Abdelmaboud, Abdelzahir
AU - Hamza, Manar Ahmed
AU - Mahzari, Mohammad
AU - Hassan, Abdulkhaleq Q.A.
N1 - Publisher Copyright:
© 2021 The British Computer Society 2021. All rights reserved. For permissions, please e-mail: [email protected].
PY - 2022/2/1
Y1 - 2022/2/1
N2 - Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.
AB - Due to the rapid increase in the exchange of text information via internet networks, the security and the reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of the digital contents. In this paper, a Robust English Text Watermarking and Natural Language Processing Approach (RETWNLPA) is proposed based on word mechanism and first level order of Markov model to improve the accuracy of tampering detection of sensitive English text. The RETWNLPA approach embeds and detects the watermark logically without altering the original text document. Based on the hidden Markov model (HMM), the first-level order of word mechanism is used to analyze the interrelationship between English text. The extracted features are used as watermark information and integrated with text zero-watermarking techniques. To detect eventual tampering, RETWNLPA has been implemented and validated with attacked English text. Experiments were performed on four datasets of varying sizes under random locations of common tampering attacks. The simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks. Comparison results show that RETWNLPA outperforms baseline approaches HNLPZWA (an intelligent hybrid of natural language processing and zero-watermarking approach) and ZWAFWMMM (Zero-Watermarking Approach based on Fourth level order of Word Mechanism of Markov Model) in terms of tampering detection accuracy.
KW - content authentication
KW - hidden Markov model
KW - natural language processing
KW - tampering detection
KW - text analysis
KW - zero-watermarking
UR - http://www.scopus.com/inward/record.url?scp=85125439382&partnerID=8YFLogxK
U2 - 10.1093/comjnl/bxab087
DO - 10.1093/comjnl/bxab087
M3 - Article
AN - SCOPUS:85125439382
SN - 0010-4620
VL - 65
SP - 423
EP - 435
JO - Computer Journal
JF - Computer Journal
IS - 2
ER -