A  SURVEY ON VARIOUS APPROACHES IN  DOCUMENT CLUSTERING

Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/2332

Title:	A SURVEY ON VARIOUS APPROACHES IN DOCUMENT CLUSTERING
Authors:	K, Sathiyakumari G, Manimekalai V, Pream Sudha
Keywords:	text mining document clustering information extraction
Issue Date:	Sep-2011
Publisher:	International Journal of Computer Technology and Application
Abstract:	Document clustering is the process of segmenting a particular collection of texts into subgroups including content based similar ones. The purpose of document clustering is to meet human interests in information searching and understanding. Nowadays all paper documents are in electronic form, because of quick access and smaller storage. So, it is a major issue to retrieve relevant documents from the larger database. Text mining is not a standalone task that human analysts typically engage in. The goal is to transform text composed of everyday language in a structured, database format. In this way, heterogeneous documents are summarized and presented in a uniform manner. Among others, the challenging problems of document clustering are big volume, high dimensionality and complex semantics.
URI:	http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.208.8005&rep=rep1&type=pdf http://localhost:8080/xmlui/handle/123456789/2332
ISSN:	2229-6093
Appears in Collections:	International Journals

Files in This Item:

File	Description	Size	Format
A SURVEY ON VARIOUS APPROACHES IN DOCUMENT CLUSTERING.docx		10.42 kB	Microsoft Word XML	View/Open