Redundancy-Aware Topic Modeling
Copy Paste Redundancy or Data Duplication are prevalent in many corpora.This redundancy has a negative impact on the quality of text mining and topic modeling in particular. This is a software package of a novel variant of Latent Dirichlet Allocation (LDA)
topic modeling, Red-LDA, which takes into account the inherent redundancy of corpora when
modeling content.
My site: http://www.cs.bgu.ac.il/~cohenrap/
Lab site: http://www.cs.bgu.ac.il/~nlpproj/
Sister project: http://sourceforge.net/projects/corpusredundanc/
Follow RedLDA
Other Useful Business Software
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of RedLDA!