Showing 368 open source projects for "text processing"

  • 1
    This project is a compilation of tools and libraries for text analytics tasks, mainly in Java. The tools range from simple wrappers to sophisticated mining tasks and are meant to improve the productivity of researchers and engineers.
    Downloads: 0 This Week
  • 2
    SemNotes

    Semantic note-taking tool for KDE

    SemNotes is a semantic note-taking tool for KDE4, built on top of Nepomuk-KDE. The tool is still under development, but it is already usable, provided that KDE4 is installed and Nepomuk is running.
    Downloads: 0 This Week
  • 3
    MTT Tools is a package of software solutions for text processing based on the Meaning-Text methodology (theory). Its general goals are to provide powerful tools for processing text and a framework for creating linguistic processors, among other things.
    Downloads: 0 This Week
  • 4
    Sanchay
    Sanchay is a collection of tools and APIs for language researchers. It includes implementations of several NLP algorithms, flexible APIs, several user-friendly annotation interfaces, and the Sanchay Query Language for language resources.
    Downloads: 0 This Week
  • 5
    OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
    Downloads: 0 This Week
  • 6
    Pylero
    Pylero is an open-source Python-based text generator.
    Downloads: 0 This Week
  • 7
    This project will be the core engine of many voice-based platforms. It can be integrated into your projects, websites, etc. to provide an Arabic speech service in which your servers interact with clients through Arabic speech recognition.
    Downloads: 0 This Week
  • 8
    OCR C++ library. Includes: contour recognition; vectorisation; matrix letter feature recognition; automatic page segmentation and rotation detection; SS3 ASM core; XML base; web-based GUI; 99.6% printed Unicode text recognition; letter base of up to 1200 letters.
    Downloads: 0 This Week
  • 9
    A natural language processing engine that allows Apache Lucene to process text.
    Downloads: 0 This Week
  • 10
    NLPTools-ES is a Spanish plugin for GATE (General Architecture for Text Engineering). It includes a tokenizer, sentence splitter, gazetteer, and POS tagger.
    Downloads: 0 This Week
  • 11
    The Lingual Quanta is an organization created by software engineers who are interested in Natural Language Processing technologies, with a focus on libraries useful for projects such as grammar checkers, text markup tools, etc.
    Downloads: 0 This Week
  • 12
    Voice is a text-to-speech program with many features, including: reads Text, Rich Text and Word documents aloud; custom greeting; professional document editor; clipboard monitoring and processing; good-looking animated character.
    Downloads: 0 This Week
  • 13
    Concrete Voice is a text-to-speech program. It can read the time, announce the weather, read text files, save text files to audio files, open any text file (supports all text encoding formats), and perform many more advanced tasks!
    Downloads: 0 This Week
  • 14
    MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system.
    Downloads: 0 This Week
  • 15
    iDocs is an intelligent document workflow project with text mining options.
    Downloads: 0 This Week
  • 16
    "Java Artificial Intelligence Markup Language PAD" is a tool that manages ProgramD AI (on local or remote machines) and AIML files with real-time previews. It also provides network support for testing AI capabilities over many network protocols.
    Downloads: 0 This Week
  • 17
    Auvai is a Java API and Java Swing based application for text-to-speech conversion of Unicode Tamil. The future direction of this API and application is to support text-to-speech conversion for all "Indic" languages.
    Downloads: 0 This Week
  • 18
    Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
    Downloads: 0 This Week
  • 19
    The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, such as text classification, text clustering, or information retrieval.
    Downloads: 4 This Week
  • 20
    JTextPro: A Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
  • 21
    Bi-gram applications based on language models produced by SRILM from a Chinese Wikipedia corpus, including a Chinese word segmenter, a word-based (not character-based) Traditional-Simplified Chinese converter, and a Chinese syllable-to-word converter.
    Downloads: 0 This Week
  • 22
    Collection of statistical language processing tools and modules for information retrieval, document classification, vectorization, pattern matching, and knowledge/text mining problems.
    Downloads: 0 This Week
  • 23
    Got any emails with obnoxious inline text? Long text stories with bad formatting? Files that OCR didn't quite convert right? RTF files and no easy way to read or modify them? Then eBookFormatter is for you!
    Downloads: 0 This Week
  • 24
    This is a toolkit for medical natural language processing (NLP). The core engine is general enough to be used in a variety of text processing domains, though the toolkit includes specific support for medical reports and patient de-identification.
    Downloads: 0 This Week
  • 25
    The Infomap NLP software performs automatic indexing of words and documents from free-text corpora, using a variant of LSA to enable information retrieval and other applications. It was developed by the Infomap Project at Stanford University's CSLI.
    Downloads: 0 This Week