Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/5230
Title: PERFORMANCE EVALUATION OF SEMANTIC BASED AND ONTOLOGY BASED TEXT DOCUMENT CLUSTERING TECHNIQUES (Conference Paper)
Authors: Punitha, S.C
Punithavalli, M
Keywords: Dataming
Document clustering
HSTC
Feature Selection
TCFSmethod
Issue Date: 2012
Publisher: Elsevier
Abstract: The amount of digital information is created and used is steadily growing along with the development of sophisticated hardware and software. This has increased the need for powerful algorithms that can interpret and extract interesting knowledge from these data. Data mining is a technique that has been successfully exploited for this purpose. Text mining, a category of data mining, considers only digital documents or text. Text Clustering is the process of grouping text or documents such that the document in the same cluster are similar and are dissimilar from the one in other clusters. This paper studies the working of two sophisticated algorithms. The first work is a hybrid method that combines pattern recognition process with semantic driven methods for clustering documents, while the second uses an ontology-based approach to cluster documents. Through experiments, the performance of both the selected algorithms is analyzed in terms of clustering efficiency and speed of clustering.
URI: https://doi.org/10.1016/j.proeng.2012.01.839
ISSN: 18777058
Appears in Collections:m) 2012-Scopus Open Access (PDF)



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.