Domain Ontology Learning using Link Grammar Parser and WordNet

Dmytro Dosyn; Yousef Ibrahim Daradkeh; Vira Kovalevych; Mykhailo Luchkevych; Yaroslav Kis

Domain Ontology Learning using Link Grammar Parser and WordNet

Dmytro Dosyn
, Yousef Ibrahim Daradkeh
, Vira Kovalevych
, Mykhailo Luchkevych
, Yaroslav Kis

Computer Engineering

Research output: Contribution to journal › Conference article › peer-review

6 Scopus citations

Abstract

The problem of knowledge discovery that comes down to information pertinence evaluation still stay unsolved because of absence of practical effective methods and means of ontology learning. Despite active development of natural language processing tools, they mostly reach the level of semantics, named entity recognition and sentiment analysis but not pragmatics. On the other hand, ontology is the only instrument, “measuring ruler” to compare and estimate the usefulness of information for some particular user, which knowledge could be represented by such ontology as his hierarchical task network (HTN). The need to build separate HTN ontology for each user puts on the agenda the task of design of the automated ontology learning from text. With aim to solve this task the system of automated and semi-automated ontology learning from text had been developed using Carnegie Mellon Link Grammar Parser and WordNet API. Two approaches to distinguish semantic relations in natural language text (NLT) sentences were adopted: analysis of the sentence constituent trees – for explicit relations recognition and Naïve Bayes supervised learning – for recognition implicit semantic relations which need not only verb phrase but other parts of sentence due to its ambiguity. Developed approach was implemented in the Java desktop application using OWL API and Protégé-OWL API. Experimental results were compared to expert analysis and had shown good recognition reliability.

Original language	English
Pages (from-to)	14-36
Number of pages	23
Journal	CEUR Workshop Proceedings
Volume	3312
State	Published - 2022
Event	4th International Workshop of Modern Machine Learning Technologies and Data Science, MoMLeT and DS 2022 - Leiden, Netherlands Duration: 25 Nov 2022 → 26 Nov 2022

Keywords

Knowledge Discovery
Link Grammar Parser
Natural Language Processing
Naïve Bayes Classifier
Ontology Learning
Pertinence Estimation
Protégé-OWL API
WordNet

Cite this

@article{ddde4511d47e43959ecba0f3b0df16ab,

title = "Domain Ontology Learning using Link Grammar Parser and WordNet",

abstract = "The problem of knowledge discovery that comes down to information pertinence evaluation still stay unsolved because of absence of practical effective methods and means of ontology learning. Despite active development of natural language processing tools, they mostly reach the level of semantics, named entity recognition and sentiment analysis but not pragmatics. On the other hand, ontology is the only instrument, “measuring ruler” to compare and estimate the usefulness of information for some particular user, which knowledge could be represented by such ontology as his hierarchical task network (HTN). The need to build separate HTN ontology for each user puts on the agenda the task of design of the automated ontology learning from text. With aim to solve this task the system of automated and semi-automated ontology learning from text had been developed using Carnegie Mellon Link Grammar Parser and WordNet API. Two approaches to distinguish semantic relations in natural language text (NLT) sentences were adopted: analysis of the sentence constituent trees – for explicit relations recognition and Na{\"i}ve Bayes supervised learning – for recognition implicit semantic relations which need not only verb phrase but other parts of sentence due to its ambiguity. Developed approach was implemented in the Java desktop application using OWL API and Prot{\'e}g{\'e}-OWL API. Experimental results were compared to expert analysis and had shown good recognition reliability.",

keywords = "Knowledge Discovery, Link Grammar Parser, Natural Language Processing, Na{\"i}ve Bayes Classifier, Ontology Learning, Pertinence Estimation, Prot{\'e}g{\'e}-OWL API, WordNet",

author = "Dmytro Dosyn and Daradkeh, \{Yousef Ibrahim\} and Vira Kovalevych and Mykhailo Luchkevych and Yaroslav Kis",

note = "Publisher Copyright: {\textcopyright} 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).; 4th International Workshop of Modern Machine Learning Technologies and Data Science, MoMLeT and DS 2022 ; Conference date: 25-11-2022 Through 26-11-2022",

year = "2022",

language = "English",

volume = "3312",

pages = "14--36",

journal = "CEUR Workshop Proceedings",

issn = "1613-0073",

publisher = "CEUR-WS",

}

TY - JOUR

T1 - Domain Ontology Learning using Link Grammar Parser and WordNet

AU - Dosyn, Dmytro

AU - Daradkeh, Yousef Ibrahim

AU - Kovalevych, Vira

AU - Luchkevych, Mykhailo

AU - Kis, Yaroslav

PY - 2022

Y1 - 2022

N2 - The problem of knowledge discovery that comes down to information pertinence evaluation still stay unsolved because of absence of practical effective methods and means of ontology learning. Despite active development of natural language processing tools, they mostly reach the level of semantics, named entity recognition and sentiment analysis but not pragmatics. On the other hand, ontology is the only instrument, “measuring ruler” to compare and estimate the usefulness of information for some particular user, which knowledge could be represented by such ontology as his hierarchical task network (HTN). The need to build separate HTN ontology for each user puts on the agenda the task of design of the automated ontology learning from text. With aim to solve this task the system of automated and semi-automated ontology learning from text had been developed using Carnegie Mellon Link Grammar Parser and WordNet API. Two approaches to distinguish semantic relations in natural language text (NLT) sentences were adopted: analysis of the sentence constituent trees – for explicit relations recognition and Naïve Bayes supervised learning – for recognition implicit semantic relations which need not only verb phrase but other parts of sentence due to its ambiguity. Developed approach was implemented in the Java desktop application using OWL API and Protégé-OWL API. Experimental results were compared to expert analysis and had shown good recognition reliability.

AB - The problem of knowledge discovery that comes down to information pertinence evaluation still stay unsolved because of absence of practical effective methods and means of ontology learning. Despite active development of natural language processing tools, they mostly reach the level of semantics, named entity recognition and sentiment analysis but not pragmatics. On the other hand, ontology is the only instrument, “measuring ruler” to compare and estimate the usefulness of information for some particular user, which knowledge could be represented by such ontology as his hierarchical task network (HTN). The need to build separate HTN ontology for each user puts on the agenda the task of design of the automated ontology learning from text. With aim to solve this task the system of automated and semi-automated ontology learning from text had been developed using Carnegie Mellon Link Grammar Parser and WordNet API. Two approaches to distinguish semantic relations in natural language text (NLT) sentences were adopted: analysis of the sentence constituent trees – for explicit relations recognition and Naïve Bayes supervised learning – for recognition implicit semantic relations which need not only verb phrase but other parts of sentence due to its ambiguity. Developed approach was implemented in the Java desktop application using OWL API and Protégé-OWL API. Experimental results were compared to expert analysis and had shown good recognition reliability.

KW - Knowledge Discovery

KW - Link Grammar Parser

KW - Natural Language Processing

KW - Naïve Bayes Classifier

KW - Ontology Learning

KW - Pertinence Estimation

KW - Protégé-OWL API

KW - WordNet

UR - https://www.scopus.com/pages/publications/85146115987

M3 - Conference article

AN - SCOPUS:85146115987

SN - 1613-0073

VL - 3312

SP - 14

EP - 36

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

T2 - 4th International Workshop of Modern Machine Learning Technologies and Data Science, MoMLeT and DS 2022

Y2 - 25 November 2022 through 26 November 2022

ER -

Domain Ontology Learning using Link Grammar Parser and WordNet

Abstract

Keywords

Other files and links

Fingerprint

Cite this