Friday, January 30, 2009

More text mining reference

Categorization Identifies main themes in a document by placing the document into a pre-defined set of topics. Relies on a thesarus.

Clustering Groups documents on the fly instead of categories that are pre-defined

Concept Linking Links documents based on their common shared concepts. Helps find information they wouldnt normally find using traditional searching

Information Visualization A visual representation of documents or corpus

Information Retrieval Indexing and retrieval of textual documents, finding a set of ranked documents that are relevant to query

No comments: