Goal of this project is to have a NLP tool that would give statistical analysis results based on Google Ngram data.

Furthermore, it is now just a NetBeans project without a final JAR.
Furthermore, there will be a github version for anyone who wishes to contribute.
In the future versions, user will be able to convert a single word to numerical data, to be able to compare two words and get the comparison data, and to be able to do the same for the sentences, paragraphs and documents.

I will JAR-it once I decide that it can be called a final release.

This project was made by creating a corpus from the Google Ngrams data for English Language, version 20120701.
EOWL list of English words was used to filter-out the words from Ngrams data.
For each year, per word, the data was added and calculated to describe the average appearance of a word per document for a given year.
Before using this program, you MUST download the corpus.

Project Samples

Project Activity

See All Activity >

Follow Natural Language Analysis with Ngrams

Natural Language Analysis with Ngrams Web Site

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Natural Language Analysis with Ngrams!

Additional Project Details

Registered

2015-01-25