A Tapestry of Tongues: A Novel Provenance Approach for Arabic Linguistic Styles and Lineage Tracing

Authors

  • Adel Sabour University of Washington, Computer Science and Systems, Tacoma, USA
  • Mohamed Ali University of Washington
  • Abdeltawab Hendawi University of Rhode Island

Keywords:

Arabic Diacritization, Quranic Linguistic Styles, NLP, LSTM, GRU, Transformer-Based Model, Provenance Tracking, Arabic Dialect, Textual Data Processing

Abstract

This research introduces a novel framework for analyzing Arabic linguistic styles, focusing on the Quran as a case study. Unlike previous studies that often focus on a single style or regional variations, ours examines the unique features of multiple styles within the single text. Our approach addresses inconsistencies in Arabic character representation. This creates a standardized format for analyzing the text, ensuring consistent results. This research utilizes a Provenance Tracker to connect different linguistic styles and document their lineage through narrators By analyzing a single text that showcases multiple styles, this study paves the way for identifying the unique characteristics of each linguistic style. Additionally, the research develops algorithms to map individual letters, words, and diacritics between each style. Moreover, this allows for in-depth linguistic analysis, revealing the unique characteristics that define each style. This opens new avenues for research into the Arabic language’s rich stylistic diversity.

Author Biography

  • Adel Sabour, University of Washington, Computer Science and Systems, Tacoma, USA

    Adel Sabour is currently pursuing his PhD in Computer Science and Systems at the School of Engineering and Technology, University of Washington, Tacoma (UWT). He holds an MS degree from the Department of Computer Science and Information at the Institute of Statistical Studies and Research, Cairo University, Egypt. Adel’s research is multifaceted, encompassing data mining, machine learning, natural language processing (NLP), computer vision, speech recognition, and spatial database systems. He has a keen interest in leveraging big data for innovative solutions in these areas.

Downloads

Published

04-07-2024

How to Cite

Sabour, A., Ali, M., & Hendawi, A. (2024). A Tapestry of Tongues: A Novel Provenance Approach for Arabic Linguistic Styles and Lineage Tracing. Journal of Quranic Sciences and Research, 5(1), 46-60. https://publisher.uthm.edu.my/ojs/index.php/jqsr/article/view/16175