Auto Summarization tool using java

Auto summarization provides a concise summary for a document.

108 Downloads (This Week)

Description

In this project I present a statistical approach to the text generation problem in domain-independent, single-document summarization.
My thesis includes Salton's vector space model, which divides the sentences into categories and can also be used to summarize the contents of web pages.
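
As a rough illustration of the vector space idea, each sentence can be treated as a vector of word counts, and two sentences compared by the cosine of the angle between their vectors. This is only a minimal sketch: the class and method names (VectorSpaceSketch, termVector, cosine) are hypothetical, raw counts are used as weights for simplicity, and how the tool actually groups sentences into categories is not described here.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class VectorSpaceSketch {

    // Build a sparse term-count vector for one sentence (assumed weighting: raw counts).
    static Map<String, Integer> termVector(List<String> words) {
        Map<String, Integer> vector = new HashMap<>();
        for (String w : words) {
            vector.merge(w, 1, Integer::sum);
        }
        return vector;
    }

    // Cosine similarity between two sparse vectors; returns 0.0 if either vector is empty.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            normA += e.getValue() * e.getValue();
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += e.getValue() * other;
            }
        }
        for (int x : b.values()) {
            normB += x * x;
        }
        if (normA == 0 || normB == 0) {
            return 0.0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        Map<String, Integer> s1 = termVector(Arrays.asList("document", "summary", "sentence"));
        Map<String, Integer> s2 = termVector(Arrays.asList("document", "sentence", "word"));
        System.out.println("similarity = " + cosine(s1, s2));
    }
}
```

Sentences with a high similarity could then be placed in the same category, with the threshold left as a tuning choice.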

The summarizer first breaks the entire document into sentences based on separators.
In the second step, unnecessary words (stop words) are removed from the document.
After the stop words are removed, the document is processed again to find its unique words. Words that have the same meaning, or that are redundant variants of one another, are merged by a method called stemming.
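
The three steps above can be sketched roughly as follows. Everything named here is an assumption made for illustration: the class PreprocessSketch, the tiny stop-word list, the choice of '.', '!' and '?' as separators, and the naive suffix-stripping stem method, which stands in for a real stemmer such as Porter's and is not necessarily what the tool uses.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class PreprocessSketch {

    // Hypothetical, deliberately small stop-word list; the tool's real list is not given.
    static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "a", "an", "the", "is", "are", "of", "in", "and", "to", "for"));

    // Step 1: split on assumed separators (., ! or ? followed by whitespace).
    static List<String> splitSentences(String text) {
        List<String> sentences = new ArrayList<>();
        for (String s : text.trim().split("(?<=[.!?])\\s+")) {
            if (!s.isEmpty()) {
                sentences.add(s);
            }
        }
        return sentences;
    }

    // Steps 2 and 3: lowercase, drop stop words, then stem what remains.
    static List<String> tokens(String sentence) {
        List<String> out = new ArrayList<>();
        for (String w : sentence.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (w.isEmpty() || STOP_WORDS.contains(w)) {
                continue;
            }
            out.add(stem(w));
        }
        return out;
    }

    // Naive suffix stripping, only a stand-in for a proper stemming algorithm.
    static String stem(String w) {
        if (w.length() > 5 && w.endsWith("ing")) return w.substring(0, w.length() - 3);
        if (w.length() > 4 && w.endsWith("ed")) return w.substring(0, w.length() - 2);
        if (w.length() > 3 && w.endsWith("s") && !w.endsWith("ss")) return w.substring(0, w.length() - 1);
        return w;
    }

    public static void main(String[] args) {
        String doc = "The cats walked in the garden. Dogs are jumping!";
        for (String sentence : splitSentences(doc)) {
            System.out.println(tokens(sentence));
        }
    }
}
```

A rule-based stripper like this will mis-stem many words, so a real implementation would normally swap in an established stemmer.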

After stemming, the occurrences of each word are counted, and the results show how many times each word occurs and the number of sentences in which it occurs.
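
The counting step might look something like the sketch below. It assumes the sentences have already been reduced to lists of stemmed words; the class WordStatsSketch and the method wordStats are hypothetical names, and the two-element array layout is just one convenient way to hold both numbers.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class WordStatsSketch {

    // For each word, returns {total occurrences, number of sentences containing the word}.
    static Map<String, int[]> wordStats(List<List<String>> sentences) {
        Map<String, int[]> stats = new TreeMap<>();
        for (List<String> sentence : sentences) {
            // Count every occurrence of every word.
            for (String w : sentence) {
                stats.computeIfAbsent(w, k -> new int[2])[0]++;
            }
            // Count each distinct word once per sentence.
            for (String w : new HashSet<>(sentence)) {
                stats.get(w)[1]++;
            }
        }
        return stats;
    }

    public static void main(String[] args) {
        List<List<String>> sentences = Arrays.asList(
                Arrays.asList("summary", "document", "document"),
                Arrays.asList("document", "sentence"));
        wordStats(sentences).forEach((word, c) ->
                System.out.println(word + ": " + c[0] + " occurrence(s) in " + c[1] + " sentence(s)"));
    }
}
```

With the sample input, the word "document" is counted three times across two sentences, which matches the two figures the tool is described as displaying.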

Features

  • It produces an abstract of a lengthy text document in about 10 lines.
  • It is written entirely in Java Swing and requires no database.
  • Just run the main class file summary.java to launch the UI.
  • For now it accepts only .txt files. I am trying to extend it to all supported formats and also to search results.
  • Added a PPT describing the modules involved in this app.

Additional Project Details

Intended Audience

End Users/Desktop

User Interface

Java Swing

Programming Language

Java

Registered

2013-09-07