Thursday, December 4, 2008

Document and Text Summarization

Text summarization software processes a document and produces a summary in the time it would take the user to read the first paragraph. The goal is to reduce the length and detail of the document while retaining its main points and overall meaning.

There are three broad ways to do this. The software can attempt some natural language understanding, building a knowledge representation of the text and generating new sentences from it. It can simply extract a few key sentences from the document. Or it can just produce a keyword summary.
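To make the two simpler approaches concrete, here is a rough sketch in Python. It is not how any particular product works; it assumes that word frequency is a reasonable stand-in for importance, and the function names, the tiny stopword list, and the naive sentence splitter are all my own placeholders.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real tools use much larger ones).
STOPWORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "is", "it",
             "that", "this", "for", "on", "with", "as", "are", "was", "be", "by"}

def split_sentences(text):
    # Naive splitter: break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def content_words(text):
    # Lowercase the text, keep alphabetic tokens, drop stopwords.
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]

def keyword_summary(text, n=5):
    # Keyword summary: the n most frequent content words.
    return [w for w, _ in Counter(content_words(text)).most_common(n)]

def extract_summary(text, n=2):
    # Key-sentence extraction: score each sentence by the average
    # document-wide frequency of its content words, keep the top n,
    # and return them in their original order.
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        ws = content_words(sentence)
        return sum(freq[w] for w in ws) / len(ws) if ws else 0.0

    top = sorted(range(len(sentences)),
                 key=lambda i: score(sentences[i]), reverse=True)[:n]
    return [sentences[i] for i in sorted(top)]
```

Calling extract_summary(text, 2) returns the two highest-scoring sentences, and keyword_summary(text, 5) returns the five most frequent non-stopword terms. Nothing here understands the text; it only counts words, which is part of why the results tend to disappoint.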

None of these work.

Try MS Word. It has a summarization option. The only way to get it to work is to structure your document so that the summary is obvious. For example, everything under an "Executive Summary" heading will be tagged as part of the summary.
