KAROMA: Karonese Morphologycal Analyzer Based on Graph Theory

Authors

  • Ichwanul Muslim Karo Karo Medan State University
  • Mohd Farhan Bin Md. Fudzee Universiti Tun Hussein Onn Malaysia
  • Shahreen Binti Kasim Universiti Tun Hussein Onn Malaysia
  • Azizul Azhar Ramli Universiti Tun Hussein Onn Malaysia

Keywords:

KAROMA, graph theory, member checking, text similarity-based

Abstract

Karonese is a local language of Karo ethnics from north Sumatra, Indonesia. Karonese terms have unique phonology, which exhibits variations in spellings and pronunciations while retaining the same meaning and in time. A morphological analyzer is a very critical issue for the enhancement of Natural language Processing (NLP) research on local languages, as well as in Karonese. This work proposed a morphology analyzer of Karonese based on graph theory (KAROMA). With its unique phonology, the formation of the Karonese morphology analyzer uses a word-based morphology approach.  Karonese terms that exhibit variations in spellings and pronunciations while retaining the same meaning and in time are expressed in a completed graph. Thus, the set of completed graphs form the Karonese WordNet. Furthermore, the stemming and lemmatization mechanism for Karonese is checked in the WordNet. This study also provides two KAROMA evaluators; member checking-based and text similarity-based by modified cosine similarity. The KAROMA evaluation process involves synthetic sentences of Karonese to calculate its text similarity. As a result, KAROMA detects the uniqueness of Karonese terms and normalizes them. The performance of KAROMA is 99% based on member checking and 97.16% of text similarity-based. Of course, this success is part of the development of NLP research for Karonese, such as sentiment analysis, text summarization, etc

Downloads

Published

21-06-2024

Issue

Section

Articles

How to Cite

Karo Karo, I. M., Bin Md. Fudzee , M. F. ., Binti Kasim, S. ., & Ramli , A. A. . (2024). KAROMA: Karonese Morphologycal Analyzer Based on Graph Theory. Journal of Soft Computing and Data Mining, 5(1), 91-103. https://publisher.uthm.edu.my/ojs/index.php/jscdm/article/view/16685