Natural Language Analysis with Ngrams download

Goal of this project is to have a NLP tool that would give statistical analysis results based on Google Ngram data.

Furthermore, it is now just a NetBeans project without a final JAR.
Furthermore, there will be a github version for anyone who wishes to contribute.
In the future versions, user will be able to convert a single word to numerical data, to be able to compare two words and get the comparison data, and to be able to do the same for the sentences, paragraphs and documents.

I will JAR-it once I decide that it can be called a final release.

This project was made by creating a corpus from the Google Ngrams data for English Language, version 20120701.
EOWL list of English words was used to filter-out the words from Ngrams data.
For each year, per word, the data was added and calculated to describe the average appearance of a word per document for a given year.
Before using this program, you MUST download the corpus.

Project Samples

Project Activity

See All Activity >

Follow Natural Language Analysis with Ngrams

Natural Language Analysis with Ngrams Web Site

Other Useful Business Software

Fully Managed MySQL, PostgreSQL, and SQL Server

Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free

Rate This Project

User Reviews

Be the first to post a review of Natural Language Analysis with Ngrams!

Additional Project Details

Registered

2015-01-25

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
kama.ai

A Responsible AI Agent platform providing accurate, accountable, and safe AI for your organization. As a Composite (hybrid) platform, it combines Knowledge Graph AI, governed Generative AI, and Intelligent Automation technologies. This combination gives you trusted answers that are accurate...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
Enterprise Bot

Enterprise Bot, based in Switzerland, is a pioneer in Conversational AI, Process Automation, and Generative AI. With the trust of esteemed enterprise giants across industries like Generali, SIX, SBB, DHL, and SWICA, Enterprise Bot is revolutionizing both customer and employee experiences....

See Software
Quaeris

Align analytics to your everyday business workflows. Your business relies on people, data and documents, but the process of using them is broken. QuaerisAI enables seamless downstream workflows across your People, Documents and Data Assets. Use natural language search on data, documents and...

See Software
GPT-4

GPT-4 (Generative Pre-trained Transformer 4) is a large-scale unsupervised language model, yet to be released by OpenAI. GPT-4 is the successor to GPT-3 and part of the GPT-n series of natural language processing models, and was trained on a dataset of 45TB of text to produce human-like text...

See Software