Search Results for "text classification" - Page 5

Showing 148 open source projects for "text classification"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1

    sgmweka

    Weka wrapper for the SGM toolkit for text classification and modeling.

    Weka wrapper for the SGM toolkit for text classification and modeling. Provides Sparse Generative Models for scalable and accurate text classification and modeling for use in high-speed and large-scale text mining. Has lower time complexity of classification than comparable software due to inference based on sparse model representation and use of an inverted index. The provided .zip file is in the Weka package format, giving access to text classification. ...
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    Speechalyzer

    Speechalyzer

    Process large speech data wrt transcription, labeling and annotation

    Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3

    Document Classification

    Document/Text Classification using Naive Bayes model.

    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Problem Description: 20 newsgroup Classification problem Bayesian learning for classifying net news text articles: Naive Bayes classifiers are among the most successful known algorithms for learning to classify text documents. We will provide a data set containing 20,000 newsgroup messages drawn from the 20 newsgroups. The dataset contains 1000 documents from each of the 20 newsgroups. 1.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Sequencia

    Protein Primary Sequnece Analysis

    ...SEQUENCIA Tool is an offline Tool of Primary Sequence Analysis, which is quite prevalent Topic for Researchers all over the world. Sequence Name, Sequence Length, Absorbance, Net charge, Iso electric charge, Amino acid composition, Amino acid classification, Aliphatic Index, Instability Index, Average Hydropathy etc are the Primary Sequence Analysis related Attributes for which this Tool exist. This work includes all the Physiochemical Properties- related to Primary Sequence Analysis under a common platform. JAVA, BIOJAVA were used under Platform Independent architecture. The Tool includes Properties of being offline where result can be stored in Text Format, here we can paste more than one sequence and also upload FASTA file to compute Parameters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    GTkNN

    GPU-based Textual kNN (GT-kNN)

    The following code is a parallel kNN implementation that uses GPUs for the high dimensional data in text classification. You can use it to classify documents using kNN or to generate meta-features based on the distances between a query document and its k nearest neigbors
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    JInsect
    The JINSECT toolkit is a Java-based toolkit and library that supports and demonstrates the use of n-gram graphs within Natural Language Processing applications, ranging from summarization and summary evaluation to text classification and indexing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    VADER

    VADER

    Lexicon and rule-based sentiment analysis tool

    VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool designed for analyzing the sentiment of text, particularly in social media and short text formats. It is optimized for quick and accurate analysis of positive, negative, and neutral sentiments.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 11

    Persica-A new Persian corpus for NLP

    This project presents a new corpus for NEWS text analysis in Persian

    Lack of multi-application text corpus despite of the surging text data is a serious bottleneck in the text mining and natural language processing especially in Persian language. This project presents a new corpus for NEWS articles analysis in Persian called Persica. NEWS analysis includes NEWS classification, topic discovery and classification, category classification and many more procedures.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12

    webtextanalysis

    Mining knowledge from text data

    This project aims to implement in java the following text mining techniques: Text Language Detection, Keywords and keyphrases extraction, Text Classification, Text Clustering, Single or multiple documents Summarization, Plagiarism Detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The Apelon DTS (Distributed Terminology System) is an integrated set of open source components that provides comprehensive terminology services in distributed application environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TextBlob

    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Fast Matrix for Java

    General purpose matrix utilities for Java in Parallel Computing

    Fast Matrix for Java (fm4j) is a general-purpose matrix utility library for computing with dense matrices. fm4j encapsulated different underlying implementations and select the optimal one in run-time depending on the size of the input matrix. Moreover, fm4j employs Java (Tm) Concurrency to take advantage of the computation power of multi-cor processors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    TextProcessor

    A Java package to preprocess text datasets for posterior text analysis

    The TextProcessor Java package is a text processing toolkit, which provides some frequently used text processing functions such as stemming, removing stop-words, generating a term vocabulary, and calculating the term-doc frequency matrix. Basic topic mining models such as LDA and sparse NMF are also supported. The package can also generate feature files from a given text dataset with LDA and LIBSVM format for posterior procedures such as classification or clustering. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    WaveSorter

    A powerful, versatile tool for offilne spike analysis and sorting

    ...It can be run in batch mode. It supports a wide array of binary file formats as well as ASCII text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    OWL Vocabulary Generator

    A tool to generate OWL files from Medical vocabularies.

    This tool takes native sources of SNOMED text files, LOINC text database and multi-hierachical file and RxNORM RRF files and generate OWL files meant for classification of agents of each vocabulary.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Ragnarok Online Site Engine (ROse) - Free Control Panel (CMS) for eAthena Ragnarok Online server Emulator. Written in PHP language. Use smarty php class for templates. contains many functions for manage eAthena dabases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    ProseVis

    ProseVis is a visualization tool for analyzing the sound of text.

    ...These features comprise sound including parts-of-speech, accent, phoneme, stress, tone, break index. ProseVis allows a reader to map the features extracted from OpenMary (http://mary.dfki.de/) Text-to-speech System and predictive classification data to the "original" text. We developed this project with the ultimate goal of facilitating a reader's ability to analyze and disseminate the results in human readable form. Research has shown that mapping the data to the text in its original form allows for the kind of human reading that literary scholars engage: words in the context of phrases, sentences, lines, stanzas, and paragraphs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This project aims to implement in java the following text mining techniques: Text Language Detection, Keywords and keyphrases extraction, Text Classification, Text Clustering, Single or multiple documents Summarization, Plagiarism Detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A fast, configurable faceted classification and browsing tool for large collection of unstructured data. Its major features include multi-faceted search, hit counts, term highlights and population of free text search results into facets. An AJAX app.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Jems
    Jems is an e-book management system. It is smart. It can automatically rename files & documents, classification, arrangement, etc. It also contains a search engine, that you can use it to find things you want.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Provides a set of tools for processing text, such as text extraction and classification. Classification implementations to be implemented include: Bayesian and Statistical (N-gram).
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo