Evaluating the Performance of Large Language Models on Arabic Lexical Ambiguities: A Comparative Study with Traditional Machine Translation Systems

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The rapid advancement in natural language processing (NLP) has led to the development of large language models (LLMs) with impressive capabilities in various tasks, including machine translation. However, the effectiveness of these new systems in handling linguistic complexities, such as Arabic lexical ambiguity, remains underexplored. This study investigates whether LLMs can outperform traditional machine translation (MT) systems in translating Arabic lexical ambiguities, characterized by homonyms, heteronyms, and polysemes. The evaluation involves two prominent LLMs, OpenAI's GPT and Google's Gemini, and compares their performance with traditional MT systems, Google Translate and SYSTRAN. The results indicate that GPT and Gemini offer substantial improvements in translation accuracy and intelligibility over traditional MT systems and highlight the advanced capabilities of LLMs in handling the complexities of Arabic, suggesting a significant step forward in machine translation technologies. This study highlights the potential of LLMs to overcome the limitations of traditional MT systems and provides a foundation for future research. The results contribute to the ongoing development of more effective and accurate translation systems, emphasizing the importance of adopting advanced AI technologies in the field of machine translation.

Original languageEnglish
Pages (from-to)354-363
Number of pages10
JournalWorld Journal of English Language
Volume15
Issue number3
DOIs
StatePublished - May 2025

Keywords

  • Arabic lexical ambiguity
  • GPT
  • Gemini
  • Large Language Model systems
  • machine translation
  • translation studies

Fingerprint

Dive into the research topics of 'Evaluating the Performance of Large Language Models on Arabic Lexical Ambiguities: A Comparative Study with Traditional Machine Translation Systems'. Together they form a unique fingerprint.

Cite this