Improving the tool for analyzing Malaysia’s demographic change: data standardization analysis to form geo-demographics classification profiles using k-means algorithms

Kamarul Ismail, and Nasir Nayan, and Siti Naielah Ibrahim, (2016) Improving the tool for analyzing Malaysia’s demographic change: data standardization analysis to form geo-demographics classification profiles using k-means algorithms. Geografia : Malaysian Journal of Society and Space, 12 (6). pp. 34-42. ISSN 2180-2491

[img]
Preview
PDF
360kB

Official URL: http://www.ukm.my/geografia/v2/index.php?cont=a&it...

Abstract

Clustering is one of the important methods in data exploratory in this era because it is widely applied in data mining.Clustering of data is necessary to produce geo-demographic classification where k-means algorithm is used as cluster algorithm. K-means is one of the methods commonly used in cluster algorithm because it is more significant. However, before any data are executed on cluster analysis it is necessary to conduct some analysis to ensure the variable used in the cluster analysis is appropriate and does not have a recurring information. One analysis that needs to be done is the standardization data analysis. This study observed which standardization method was more effective in the analysis process of Malaysia’s population and housing census data for the Perak state. The rationale was that standardized data would simplify the execution of k-means algorithm. The standardized methods chosen to test the data accuracy were the z-score and range standardization method. From the analysis conducted it was found that the range standardization method was more suitable to be used for the data examined.

Item Type:Article
Keywords:Algorithm; Data mining; Geo-demographics; K-means; standardization; Z-score
Journal:Geografia ; Malaysian Journal of Society and Space
ID Code:10309
Deposited By: ms aida -
Deposited On:13 Apr 2017 23:54
Last Modified:18 Apr 2017 08:21

Repository Staff Only: item control page