Tuesday, December 16, 2008

Some basic text mining terminology

Corpus: A collection of texts or documents
Sentiment Analysis: Aims to determine the attitude of a speaker or writer with respect to some topic
Lexicon: A dictionary or encyclopedia; in text mining, typically a list of words with information attached to each entry
Taxonomy: An arrangement of items based on a hierarchical structure
Multi-word Term: A group of words represented by a single term
Entities: Names, addresses, Social Security numbers, company names, etc.
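
To make a few of these terms concrete, here is a minimal sketch in Python. Everything in it is invented for illustration: the two-sentence corpus, the three-word lexicon with its scores, and the helper names `tokenize` and `sentiment_score` are all assumptions, and summing word scores is only a crude stand-in for real sentiment analysis.

```python
import re

# Corpus: a collection of documents (hypothetical toy examples).
corpus = [
    "The customer service at Acme Corp was great.",
    "Text mining the reviews was slow and painful.",
]

# Lexicon: here, a tiny word list with assumed sentiment scores.
lexicon = {"great": 1, "slow": -1, "painful": -1}

# Multi-word terms: groups of words to be treated as a single term.
multi_word_terms = ["customer service", "text mining"]


def tokenize(text):
    """Lowercase the text, join multi-word terms with underscores, split into words."""
    text = text.lower()
    for term in multi_word_terms:
        text = text.replace(term, term.replace(" ", "_"))
    return re.findall(r"[a-z_]+", text)


def sentiment_score(tokens):
    """Sum the lexicon scores of the tokens; words not in the lexicon count as 0."""
    return sum(lexicon.get(tok, 0) for tok in tokens)


tokenized = [tokenize(doc) for doc in corpus]
scores = [sentiment_score(toks) for toks in tokenized]

for doc, score in zip(corpus, scores):
    print(score, doc)
```

With this toy lexicon, a positive total suggests a favorable attitude and a negative total an unfavorable one. Note that "customer service" and "text mining" each come out of `tokenize` as one token, which is what treating them as multi-word terms means in practice.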
