Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain

Suhaimi Ab Rahman and Nazlia Omar (2017) Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain. Asia-Pacific Journal of Information Technology and Multimedia, 6 (1). pp. 13-21. ISSN 2289-2192

Official URL: http://ejournal.ukm.my/apjitm/issue/view/899

Abstract

Detecting the head and modifier in Malay sentences from the cultural heritage domain is difficult because their positions vary with sentence structure. Language experts also hold differing views on the theory and concept of head and modifier detection in compound nouns, and existing research is limited, especially in computational linguistics. Research is therefore needed to identify appropriate methods for detecting the heads and modifiers that appear in Malay sentences from the cultural heritage domain. The aim of this study is to construct a list of heuristic rules for detecting the position of compound nouns in Malay sentences from the cultural heritage domain. Using these 15 heuristic rules, the positions of the head and modifier within a compound noun can also be detected. Accuracy is measured using precision, recall and F1-score. In the experiments, the Sentence Structure of Malay Cultural Heritage Domain (SADWBM) achieves an F1-score of 80.4%, compared with 56% for the Noun Phrase Structure (SFN). SADWBM therefore outperforms SFN, which indicates that the approach used in this study is effective in resolving the identified problems.
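
The evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes that each prediction is a (word, role) pair and that scoring is done by exact match against a gold annotation. The function name, the data layout, and the example words are all hypothetical and are not taken from the paper, whose 15 rules and test data are not reproduced here.

```python
def precision_recall_f1(predicted, gold):
    """Return (precision, recall, F1) for two collections of (word, role) pairs.

    A prediction counts as correct only if the same pair appears in the gold set.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical gold annotation for two compound nouns (illustrative only).
gold = [
    ("rumah", "head"), ("tradisional", "modifier"),
    ("baju", "head"), ("kurung", "modifier"),
]
# Hypothetical system output with one role assigned incorrectly ("baju").
predicted = [
    ("rumah", "head"), ("tradisional", "modifier"),
    ("baju", "modifier"), ("kurung", "modifier"),
]

p, r, f1 = precision_recall_f1(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f} F1={f1:.2f}")
```

F1 is the harmonic mean of precision and recall, so a method scores well only when it both finds most of the annotated heads and modifiers and avoids labelling words incorrectly.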

Item Type: Article
Keywords: Head and Modifier; Noun Phrase Structure (SFN); Sentence Structure of Malay Cultural Heritage Domain (SADWBM); Heuristic Rule
Journal: Asia-Pacific Journal of Information Technology and Multimedia (Formerly Jurnal Teknologi Maklumat dan Multimedia)
ID Code: 11840
Deposited By: ms aida
Deposited On: 03 Jul 2018 03:13
Last Modified: 09 Jul 2018 04:05
