Showing 144 open source projects for "text processing"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 1
    OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    This is a library to extract raw unicode text from any written documents (office documents such as PDF, Word, OpenOffice, ...). It should be useful to developpers of search engine, text processing, corpus analysis, ....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Ub3rMath

    Ub3rMath

    Simple math parsing library for C++

    A math parsing library for C++ with a number of powerful features to allow flexible interpretation of mathematical formula in text form.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Mavscript ermöglicht es in einem Textdokument Berechnungen durchzuführen. Die eigentliche Berechnung verarbeitet das Algebraprogramm Yacas (oder der Java-Interpreter BeanShell).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    This project aims to be a dictionary manager for EDICT and CCEDICT like dictionaries, using GTK and Qt as GUIs (also looking for comandline operations).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    osm2postgresql is a Linux script loading openstreetmap data into a postgresql database setting up if needed a server & database (postgis+hstore). Advanced processing is performed, easing rendering. http://wiki.openstreetmap.org/wiki/Osm2postgresql
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    A set of Unix command line tools for quick and convenient batch processing of tabular text files (a.k.a., tab-delimited, csv, or flat file format) with a header line. Provides delimiter and compression detection, column reference by name. * tblmap: per-line ("map") computation: derive columns through an expression, delete, reorder, filter rows. * tblred: compute ("reduce") aggregations (e.g., sum, average) over groups defined by key columns
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Apolda is a plugin for the Gate framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL-ontology.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    transpose is a fast and efficient command-line tool that transposes (the rows become the columns, the columns become the rows) or reshapes a matrix in plain text. Inputs and outputs can be files or streams.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Kurdi Nus
    A portable, platform-independent, open source tool for converting different Kurdish scripts
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Java Data Processing Project aims to manipulate text file containing scientific data such as time/intensity scan or spectrum. Jdpp provide large flexibility to read all possible time duration and date format. It is originally design to treat MS data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Embeddable Predictive Text Library
    A C (and JavaScript) library providing predictive text functions. The API is very simple and provides dictionary autocomplete and partial/full matching. Sample cellphone-like examples are included.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Scheme Natural Language Toolkit (S-NLTK) is a Scheme R6RS library for language and text processing, and various tasks related to symbolic and statistical analysis of language data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Why is it quicker to express simple equations in plaintext that it is to write the equivalent LaTeX? Easylatex is a preprocessor to make writing LaTeX much quicker. Project activity mode (http://bayleshanks.com/pamv1 ): sporadic
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Sanchay
    Sanchay is a collection of tools and APIs for language researchers. It has some implementations of NLP algorithms, some flexible APIs, several user friendly annotation interfaces and Sanchay Query Language for language resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Pylero
    Pylero is an open-source Python-based text generator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    We are using a large archive of newspaper stories(GigaWordCorpus) as input to a parallel MPI program, and produce from that a list of top R terms of varying lengths M through N that are especially interesting. The program is done in C using MPI.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB