Query translation for multilingual content with semantic technique

Norita Md Norwawi, Sundresan a/l Perumal, Emran Huda, and Waka Jeng (2020) Query translation for multilingual content with semantic technique. Sains Malaysiana, 49 (9). pp. 2113-2118. ISSN 0126-6039


Official URL: http://www.ukm.my/jsm/malay_journals/jilid49bil9_2...


Cross-lingual information retrieval (CLIR) allows users to submit queries in a language different from that of the target resources. Translation is therefore the key element of query processing. There are three translation approaches: query translation, document translation, and hybrid query-document translation. However, query translation is very challenging because of the polysemy problem. Differences in the linguistic nature of the languages lead to ambiguity of meaning, so the user's true intention may be misinterpreted. This paper presents a semantic technique for query translation in a multilingual knowledge repository to improve query processing. Offline translated documents, or parallel corpora, in English, Arabic, and Malay, including Jawi text, were used as the data. A set of keywords related to prophetic food was identified in advance by experts. These keywords were annotated with the relevant Quranic verses, Hadith texts, manuscript text images, and scientific articles determined by the experts. Synonyms and context-based translations were annotated together with each keyword. A query performs a three-way pattern match against the keyword indexing list, which links to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept, using sources from the al-Quran, Hadith, classical manuscripts, and scientific articles verified by experts to ensure the authenticity and integrity of the content.
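
The keyword-index lookup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the "three-way pattern match" means looking up the query against the English, Malay, and Arabic forms of each expert-defined keyword, and that each keyword entry links directly to document identifiers. The class `KeywordEntry`, the functions `build_index` and `search`, the sample keywords, and all document identifiers are hypothetical placeholders. Context-based disambiguation of polysemous terms is not modelled here.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class KeywordEntry:
    """One expert-defined keyword (hypothetical structure)."""
    # Language code -> surface forms (translations and synonyms).
    terms: dict[str, set[str]]
    # Identifiers of linked documents (placeholders, not real records).
    documents: list[str] = field(default_factory=list)


def normalize(text: str) -> str:
    """Case-fold and collapse whitespace so forms compare consistently."""
    return " ".join(text.casefold().split())


def build_index(entries: list[KeywordEntry]) -> dict[str, list[KeywordEntry]]:
    """Map every surface form, in every language, to its keyword entries."""
    index: dict[str, list[KeywordEntry]] = {}
    for entry in entries:
        for forms in entry.terms.values():
            for form in forms:
                index.setdefault(normalize(form), []).append(entry)
    return index


def search(query: str, index: dict[str, list[KeywordEntry]]) -> list[str]:
    """Return linked documents for any keyword form found in the query.

    Every contiguous word sequence of the query is looked up, so
    multi-word forms such as "date palm" are matched as a unit.
    """
    tokens = normalize(query).split()
    found: list[str] = []
    for start in range(len(tokens)):
        for end in range(start + 1, len(tokens) + 1):
            phrase = " ".join(tokens[start:end])
            for entry in index.get(phrase, []):
                for doc in entry.documents:
                    if doc not in found:
                        found.append(doc)
    return found


# Illustrative data only: keywords and document IDs are placeholders.
INDEX = build_index([
    KeywordEntry(
        terms={"en": {"honey"}, "ms": {"madu"}, "ar": {"عسل"}},
        documents=["doc-quran-01", "doc-article-07"],
    ),
    KeywordEntry(
        terms={"en": {"dates", "date palm"}, "ms": {"kurma"}, "ar": {"تمر"}},
        documents=["doc-hadith-03"],
    ),
])
```

With this sketch, `search("madu", INDEX)`, `search("عسل", INDEX)`, and `search("benefits of Honey", INDEX)` all resolve to the same linked documents, because the query is matched against the annotated forms rather than translated word by word.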

Item Type: Article
Keywords: Cross lingual information retrieval; One stop knowledge repository; Prophetic food; Query translation; Semantic technique
Journal: Sains Malaysiana
ID Code: 15907
Deposited By: ms aida -
Deposited On: 01 Dec 2020 05:11
Last Modified: 06 Dec 2020 09:12
