Comparative Study of Clustering Algorithms in Text Mining Context

Abstract
The spectacular growth in data is due to the spread of networks and smartphones. About 42% of the world's population uses the internet [1], which has created a problem of processing the exchanged data: its volume is rising exponentially and it must be handled automatically. This paper presents a classical knowledge discovery in databases (KDD) process for handling textual data. The process is divided into three parts: preprocessing, processing and post-processing. In the processing step, we present a comparative study of several clustering algorithms such as KMeans, Global KMeans, Fast Global KMeans, Two Level KMeans and FWKmeans. The algorithms are compared on real textual data collected from the web using RSS feeds. The experimental results identify two problems: the first is the quality of the results, which remains an issue for the algorithms that converge rapidly; the second is the execution time, which needs to be reduced for some algorithms.
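The three-part process described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it covers only plain KMeans (none of the Global, Fast Global, Two Level or FWKmeans variants), the function names, the tiny stopword list and the deterministic seeding are assumptions made for illustration, and the sample headlines are invented rather than taken from the RSS data used in the paper.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption, not from the paper).
STOPWORDS = {"the", "a", "of", "in", "and", "to", "is", "for", "on"}

def preprocess(text):
    """Preprocessing: lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]

def vectorize(docs):
    """Turn token lists into L2-normalized term-frequency vectors."""
    vocab = sorted({t for d in docs for t in d})
    index = {t: i for i, t in enumerate(vocab)}
    vectors = []
    for d in docs:
        v = [0.0] * len(vocab)
        for term, count in Counter(d).items():
            v[index[term]] = float(count)
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vocab, vectors

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, max_iter=100):
    """Processing: plain k-means. Seeding with the first k vectors is an
    assumption chosen to keep the sketch deterministic."""
    centroids = [list(v) for v in vectors[:k]]
    labels = [-1] * len(vectors)
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # leave an empty cluster's centroid unchanged
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def top_terms(vocab, centroid, n=3):
    """Post-processing: describe a cluster by its highest-weighted terms."""
    ranked = sorted(range(len(vocab)), key=lambda i: -centroid[i])
    return [vocab[i] for i in ranked[:n]]

# Invented sample headlines standing in for RSS feed items.
feed_items = [
    "Football team wins the championship match",
    "Stock market falls as investors sell shares",
    "Championship match ends with football victory",
    "Investors buy shares as stock market recovers",
]
docs = [preprocess(t) for t in feed_items]
vocab, vectors = vectorize(docs)
labels, centroids = kmeans(vectors, k=2)
for j, centroid in enumerate(centroids):
    print(j, top_terms(vocab, centroid))
```

Because plain KMeans depends on its initial centroids, a different seeding can yield a different partition; that sensitivity is the usual motivation for variants such as Global KMeans.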
Year of Publication
2016
Journal
International Journal of Interactive Multimedia and Artificial Intelligence
Volume
3
Issue
Regular Issue
Number
7
Pages
42-45
Date Published
06/2016
ISSN Number
1989-1660