An algorithm for sample and data dimensionality reduction using Fast Simulated Annealing
Autor
Łukasik, Szymon
Kulczycki, Piotr
Data wydania
2011
Miejsce wydania
Berlin ; Heidelberg
Wydawca
Springer
Opublikowane w
Advanced Data Mining and Applications : 7th International Conference, ADMA 2011, Beijing, China, December 17-19, 2011, Proceedings. Part 1 / ed. by J. Tang et al.
Strony
152-161
Język
angielski
ISBN
978-3-642-25853-4 (Online)
DOI
10.1007/978-3-642-25853-4_12
Uwagi
ADMA 2011 : 7th International Conference on Advanced Data Mining and Applications, Beijing, China, December 17-19, 2011, Proceedings, Part I
Abstrakt
This paper deals with dimensionality and sample length reduction applied to the tasks of exploratory data analysis. Proposed technique relies on distance preserving linear transformation of given dataset to the lower dimensionality feature space. Coefficients of feature transformation matrix are found using Fast Simulated Annealing - an algorithm inspired by physical annealing of solids. Furthermore the elimination or weighting of data elements which, as an effect of above mentioned transformation, were moved significantly from the rest of the dataset can be performed. Presented method was positively verified in routines of clustering, classification and outlier detection. It ensures proper efficiency of those procedures in compact feature space and with reduced data sample length at the same time.