Abstract
Introduction: When interpreting visual input, intelligent systems rely on contextual scene learning, which significantly improves both resilience and context awareness. The need to manage enormous amounts of data drives growing interest in computational frameworks, particularly for autonomous cars. Method: This study introduces a novel approach, Deep Fused Networks (DFN), which improves contextual scene comprehension by merging multi-object detection and semantic analysis. Results: DFN combines deep learning and fusion techniques to enhance accuracy and comprehension in complex situations, achieving a minimum accuracy gain of 6.4% on the SUN-RGB-D dataset and 3.6% on the NYU-Dv2 dataset. Discussion: The findings demonstrate considerable improvements in object detection and semantic analysis compared with the methodologies currently in use.
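The abstract does not specify how DFN fuses the detection and semantic branches; a common choice for this kind of multi-branch network is feature-level fusion, where per-region feature vectors from each branch are concatenated and passed through a learned projection. The sketch below illustrates that general pattern only, with hypothetical names (`fuse_features`, `det_feats`, `sem_feats`) and a fixed weight matrix standing in for learned parameters; it is not the paper's actual architecture.

```python
import numpy as np

def fuse_features(det_feats, sem_feats, weights):
    """Concatenate detection and semantic features, then apply a
    linear projection with ReLU (a generic late-fusion sketch, not
    the DFN implementation, which the abstract does not detail)."""
    fused = np.concatenate([det_feats, sem_feats], axis=-1)
    return np.maximum(fused @ weights, 0.0)  # ReLU nonlinearity

# Example: 4 detected regions, 8-dim detection and 8-dim semantic
# features each, projected to a 5-dim fused representation.
rng = np.random.default_rng(0)
det_feats = rng.standard_normal((4, 8))
sem_feats = rng.standard_normal((4, 8))
weights = rng.standard_normal((16, 5))  # stand-in for learned weights

fused = fuse_features(det_feats, sem_feats, weights)
print(fused.shape)  # one fused vector per detected region
```

The concatenation-then-projection pattern lets gradients from a downstream scene-understanding loss reach both branches, which is one plausible reading of "merging multi-object detection and semantic analysis."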
| Original language | English |
| --- | --- |
| Article number | 1427786 |
| Journal | Frontiers in Neurorobotics |
| Volume | 18 |
| DOIs | |
| State | Published - 2024 |
Keywords
- multi-modal
- object recognition
- sensory data
- simulation environment
- visionary sensor