Showing 144 open source projects for "text processing"

  • 1
    Mathematics formula renderer for Microsoft Word. Easy to use and really fast. Also includes a mathematics drawing toolbar for Microsoft Word and a math exercise storage database for Microsoft Access. Available in French and English.
    Downloads: 9 This Week
  • 2
    xBB-code is a PHP library for parsing and editing text formatted with BBCode.
    Downloads: 0 This Week
  • 3
    n-squared is a lightweight, super-powered notepad application that stores notes in an embedded database for easy searching. It has a tabbed interface, syntax highlighting, encryption, and more!
    Downloads: 0 This Week
  • 4
    This project is a highly configurable Virtual Unicode Keyboard written in Java. You might want to use it to type some words in a foreign character set with your mouse.
    Downloads: 0 This Week
  • 5
    MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. As an information extraction system, MutationFinder achieves high performance (99% precision, 81% recall on blind test data).
    Downloads: 0 This Week
  • 6
    iDocs is an intellectual document workflow project with text mining options.
    Downloads: 0 This Week
  • 7
    "Java Artificial Intelligence Markup Language PAD" is a tool that manages ProgramD AI (on local or remote machines) and AIML files with real-time previews. It also provides network support for testing AI capabilities over many network protocols.
    Downloads: 0 This Week
  • 8
    OOoLatex is no longer maintained. Please consider using TexMaths (http://roland65.free.fr/texmaths/). OOoLatex is a set of macros designed to provide LaTeX support in OpenOffice. Complex equations can be inserted as images, with the LaTeX code saved in the image attribute, while simpler equations are expanded into symbol characters and inserted as text.
    Downloads: 5 This Week
  • 9
    The Java Text Categorizing Library (JTCL) is a pure Java implementation of libTextCat, which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."
    Downloads: 0 This Week
  • 10
    WikiPDF is a MediaWiki extension based on Wiki2PDF that adds PDF/LaTeX features to MediaWiki. Wiki2PDF is a Python script that converts multiple articles of a MediaWiki-based wiki (pre-configured for use with www.wikipedia.org) into a single LaTeX or PDF file.
    Downloads: 0 This Week
  • 11
    Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. The planned direction for this API and application is to support Text to Speech conversion for all "Indic" languages.
    Downloads: 0 This Week
  • 12
    ASCIIMathML.js: a JavaScript script that converts ASCII math notation (and some LaTeX) to Presentation MathML while your webpage loads. Simple graphs are now also translated to SVG. Works with Firefox 2.0+ or with Internet Explorer 6/7+MathPlayer+Adobe SVGview.
    Downloads: 1 This Week
  • 13
    Open Office Server Daemon, based on an older daemon written in Python (oood). Open Office is unstable as a server (memory leaks, not multithreaded, ...); this daemon keeps it working long-term without requiring any changes to your code.
    Downloads: 0 This Week
  • 14
    hypKNOWsys aims to develop a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing).
    Downloads: 0 This Week
  • 15
    Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
    Downloads: 0 This Week
  • 16
    Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
    Downloads: 2 This Week
  • 17
    The Word Vector Tool is a simple but flexible Java library for creating word vector representations of text documents. Word vectors can be used for various text processing tasks, such as text classification, text clustering, or information retrieval.
    Downloads: 0 This Week
  • 18
    JTextPro: A Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
  • 19
    TMG - Text Mining for German-language documents
    Downloads: 0 This Week
  • 20
    Calc2LaTeX is an OpenOffice.org Calc (Spreadsheet) macro for converting tables. It makes creating tables in LaTeX very easy.
    Downloads: 10 This Week
  • 21
    This is a toolkit for medical natural language processing (NLP). The core engine is general enough to be used in a variety of text processing domains, though the toolkit includes specific support for medical reports and patient de-identification.
    Downloads: 0 This Week
  • 22
    AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.
    Downloads: 0 This Week
  • 23
    A toolkit for multiplexing annotations and managing features for textual annotation.
    Downloads: 0 This Week
  • 24
    LineSort is a plug-in tool that adds advanced sorting capabilities to BBEdit, the Macintosh text editor from Bare Bones Software.
    Downloads: 0 This Week
  • 25
    Research Description Language (RDL) is an XML application for describing and publishing scientific research efforts. Research Editor (REd) is a tool for editing RDL documents, and exporting them to LaTeX, PDF, etc.
    Downloads: 0 This Week