BERTopic is a topic modeling technique that leverages transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions. BERTopic supports guided, supervised, semi-supervised, manual, long-document, hierarchical, class-based, dynamic, and online topic modeling. It even supports visualizations similar to LDAvis! Corresponding medium posts can be found here, here and here. For a more detailed overview, you can read the paper or see a brief overview. After having trained our BERTopic model, we can iteratively go through hundreds of topics to get a good understanding of the topics that were extracted. However, that takes quite some time and lacks a global representation. Instead, we can visualize the topics that were generated in a way very similar to LDAvis. By default, the main steps for topic modeling with BERTopic are sentence-transformers, UMAP, HDBSCAN, and c-TF-IDF run in sequence.

Features

  • It assumes some independence between these steps which makes BERTopic quite modular
  • The main steps for topic modeling with BERTopic are sentence-transformers
  • BERTopic has many functions
  • Fit the model and predict documents
  • Get all topic information
  • Generate topic labels
  • After having trained your BERTopic model, several are saved within your model

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow BERTopic

BERTopic Web Site

Other Useful Business Software
Go From AI Idea to AI App Fast Icon
Go From AI Idea to AI App Fast

One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of BERTopic!

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Large Language Models (LLM), Python Generative AI

Registered

2023-03-24