Similarity

Similarity is a Java toolkit for calculating similarity scores between text strings. It provides a collection of algorithms for word similarity, phrase similarity, sentence similarity, paragraph similarity, semantic comparison, sentiment tendency, and approximate word discovery. The project is designed to teach and apply natural language similarity methods while keeping the architecture practical and customizable. It includes approaches such as edit distance, cosine similarity, Euclidean distance, Jaccard similarity, Jaro distance, Jaro-Winkler distance, Manhattan distance, SimHash with Hamming distance, and Sørensen-Dice coefficient. It also supports Java dependency integration through Maven or Gradle workflows. It is useful for Chinese NLP projects, search features, duplicate detection, recommendation systems, and text analysis experiments.

Features

Java text similarity toolkit
Word, phrase, sentence, and paragraph comparison
Multiple distance algorithms
Sentiment tendency analysis
Approximate word discovery
Maven and Gradle integration

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Similarity

Similarity Web Site

Other Useful Business Software

MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free

Rate This Project

User Reviews

Be the first to post a review of Similarity!

Additional Project Details

Programming Language

Java

Related Categories

Java Libraries

Registered

21 hours ago

Similar Business Software

SurveyJS

SurveyJS is an embeddable, self-hosted, white-label form builder for teams building custom forms, surveys, questionnaires, and other data collection tools inside web applications. It runs entirely on the client and is fully compatible with all modern JavaScript frameworks, including React,...

See Software
Emotion

Emotion is a performant, flexible CSS-in-JS library designed for writing CSS styles using JavaScript, supporting both string-based and object-based styles while delivering a strong developer experience, complete with source maps, labels, and testing utilities. It offers two powerful usage...

See Software
SpreadJS

Deliver true Excel-like spreadsheet experiences, fast - with zero dependencies on Excel. Create financial apps, dashboards, charts, pivot tables, performance benchmarks, science lab notebooks, and other similar JavaScript spreadsheet applications. JavaScript spreadsheet components are software...

See Software
Polymer

The Polymer library provides a set of features for creating custom elements. These features are designed to make it easier and faster to make custom elements that work like standard DOM elements. Similar to standard DOM elements, Polymer elements can be instantiated using a constructor or...

See Software
DHTMLX

DHTMLX is a JavaScript UI library that provides a set of highly customizable and flexible components for building modern and responsive web applications. The library includes more than 30 UI components, such as Gantt, Scheduler, Kanban, diagrams, charts, grids, spreadsheets, calendars, trees,...

See Software
Webix

JavaScript UI library and framework for speeding up web development. JS Framework for cross-platform web Apps development 102 UI widgets and feature-rich CSS / HTML5 JavaScript controls. Save at least 3000+ development hours by using ready-made widgets and UI controls. Develop Web UI 30% faster....

See Software

Report inappropriate content

Similarity

Text similarity calculation Toolkit for Java

Get an email when there's a new version of Similarity

Features

Project Samples

Project Activity

Categories

License

Follow Similarity

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered