Showing 15 open source projects for "topic modeling"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Shift, the browser that merges all of your web apps into one powerful window. Icon
    Shift, the browser that merges all of your web apps into one powerful window.

    Your power browser.

    Streamline everything you do online when you install Shift and access thousands of apps without leaving your browser. Connect all of your Gmail, Outlook, and Office 365 accounts and manage everything from one centralized window. Build out your Shift browser with apps that integrate seamlessly so you have ultra-fast access to all the tools you use to stream, shop, work, browse, and stay connected. Shift brings it all together.
    Try for Free
  • 1
    BERTopic

    BERTopic

    Leveraging BERT and c-TF-IDF to create easily interpretable topics

    BERTopic is a topic modeling technique that leverages transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions. BERTopic supports guided, supervised, semi-supervised, manual, long-document, hierarchical, class-based, dynamic, and online topic modeling. It even supports visualizations similar to LDAvis! Corresponding medium posts can be found here, here and here. For a more detailed overview, you can read...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    gensim

    gensim

    Topic Modelling for Humans

    Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    ktrain

    ktrain

    ktrain is a Python library that makes deep learning AI more accessible

    ktrain is a Python library that makes deep learning and AI more accessible and easier to apply. ktrain is a lightweight wrapper for the deep learning library TensorFlow Keras (and other libraries) to help build, train, and deploy neural networks and other machine learning models. Inspired by ML framework extensions like fastai and ludwig, ktrain is designed to make deep learning and AI more accessible and easier to apply for both newcomers and experienced practitioners. With only a few lines...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    codeforces-go

    codeforces-go

    Solutions to Codeforces by Go

    Golang algorithm competition template library. Due to the complexity of algorithm knowledge points, it is necessary to classify the algorithms you have learned and the questions you have done. An algorithm template should cover the following points. Basic introduction to the algorithm (core idea, complexity, etc.) Reference links or book chapters (good material) Template code (can contain some comments, usage instructions) Template supplements (extra codes in common question types, modeling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 5
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods...
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    WEMax

    WEMax

    Work Efficiency Maximize Framework: Analyze work and line processes

    - Work in 2013 - The WEMax proposes a framework for the continuous performance improvement of manufacturing lines. The WEMax framework consists of two main activities: 1) assembly work process improvement, including time and motion study, and 2) improvement of line balance efficiency. Although there have been numerous studies on this topic, most of them deal with partial issues rather than the continuous performance improvement of the whole assembly line, which this paper addresses. To develop...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Reviz-it

    Software tools to re-tell stories in a better way and expand them

    ... to find which ones are inspiring. - Use the inspiring word clouds to rephrase the story in an original way, then expand it. Enrich with various text mining algorithms to retrieve automatically the different ways the same thing is said in a given context (series of publications on same topic or from same organization for example): latent semantic analysis, topic modeling, rule-based text mining, etc. This allows rewriting a text with the specific 'style' of a corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Twitter Research Data Collector
    It gives facility of collecting tweets through Twitter Streaming API w.r.t different search criteria and to save tweets in CSV and ARFF (WEKA) file formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Red Hat Ansible Automation Platform on Microsoft Azure Icon
    Red Hat Ansible Automation Platform on Microsoft Azure

    Red Hat Ansible Automation Platform on Azure allows you to quickly deploy, automate, and manage resources securely and at scale.

    Deploy Red Hat Ansible Automation Platform on Microsoft Azure for a strategic automation solution that allows you to orchestrate, govern and operationalize your Azure environment.
    Learn More
  • 10

    jLDADMM

    A Java package for the LDA and DMM topic models

    The Java package jLDADMM is released to provide alternative choices for topic modeling on normal or short texts. It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models. See the usage of jLDADMM in its website at http://jldadmm.sourceforge.net/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    RedLDA

    Redundancy Aware LDA Gibbs Sampler

    Redundancy-Aware Topic Modeling Copy Paste Redundancy or Data Duplication are prevalent in many corpora.This redundancy has a negative impact on the quality of text mining and topic modeling in particular. This is a software package of a novel variant of Latent Dirichlet Allocation (LDA) topic modeling, Red-LDA, which takes into account the inherent redundancy of corpora when modeling content. My site: http://www.cs.bgu.ac.il/~cohenrap/ Lab site: http://www.cs.bgu.ac.il/~nlpproj...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    TextProcessor

    A Java package to preprocess text datasets for posterior text analysis

    The TextProcessor Java package is a text processing toolkit, which provides some frequently used text processing functions such as stemming, removing stop-words, generating a term vocabulary, and calculating the term-doc frequency matrix. Basic topic mining models such as LDA and sparse NMF are also supported. The package can also generate feature files from a given text dataset with LDA and LIBSVM format for posterior procedures such as classification or clustering. The toolkit is also being...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Topic Model Alignment

    Aligns two LDA topic models

    This script aligns two topic models produced by MALLET (http://mallet.cs.umass.edu/) Reciprocal topic pairs are reported with JS divergence measure. Reciprocal pair (i,j) is defined when the distance of topic i from the first model (M1) and topic j from the second model (M2) is minimal for all pairs (i,k) for k in M2 and (l,j) for l in M1 (best match for both topics). This is useful for: 1. Qualitatively comparing different modeling parameters or algorithms 2. Identifying stable topics...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A graphical tool to discover topics from collections of text documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    K3Studio is the universal workbench for 2d/3d modeling, visualization and simulation. The main topic is simulation and visualization of automata networks, but it can be used for diagram drawing, flowcharting, presentation, as a CAD, GIS,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next