Showing 28 open source projects for "text processing"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Lesan

    Lesan

    New way to create web server and NoSQL data model

    Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Sedna Native XML Database

    Sedna Native XML Database

    Sedna is a native full-featured XML database management system.

    Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    A python script that uses wxwidgets. View or edit delimited data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Large Text File converter

    Java Based Heavy-duty utilitity to process large delimited text files

    TextZilla is a Multithreaded Java utility which can process huge size delimited text files to extract, convert, encode, decode, encrypt/decrypt text data from source and write it in desired output file or files. It provides fully extensible framework based on which Java classes can be created, for example it currently has MD5 conversion capability, based on same design classes for 3DES ,AES or any other Algo can be created. Another strength of this tool is in its configurability, it's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    LM - Report Asistent will be an utility, that helps dataminers, who use LISP-Miner, with writing analytical repostrs. Report Asisten will connect LISP-Miner with MS Word (R) editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    .Net-Control for PDF creation with dynamic Content. It includes a WYSIWYG-Editor to design the PDF-Documents and a Interface to Link SQL-Queries with Fields on the PDFs. Databases are Access(OleDb), MySql(-Client), MS Sql(-Client), all Odbc. Uses itext.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SynApp2
    SynApp2 builds feature-packed web applications and versatile PDF reports for MySQL and Oracle Database. The SynApp2 web application generator and MVC framework is written in PHP and JavaScript.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    FirebirdConfig
    FirebirdConfig provides you with a friendly user interface in order to configurate the firebird database operation modifying the firebird.conf file with the most intuitive and functional graphic.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    This project aims to develop and distribute the database for Japanese Kanji (or CJK Unified Ideographs) for Unicode and ISO/IEC 10646.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes : - An admin module to manage SCRUM Project and tree of requirements - A multi-user Web-based client - Tools for databases management (HSQLDB, PostGreSQL,...) UProm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Functional XML parsing framework: SAX/DOM and SXML parsers with support for XML Namespaces and validation. Related to SSAX are SXPath queries and SXML transformations, with applications to XML/HTML authoring and literate Scheme and XML programming.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    WikiPDF is a mediawiki extension based on Wiki2PDF that adds PDF/LaTeX features to mediawiki. Wiki2PDF is a python script to convert multiple articles of a mediawiki based wiki (pre-configured to use with www.wikipedia.org) to a single LaTeX or PDF file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Xax is an html and text editor for the development of offline compendiums of information to be viewed in a visual way. Xax mimics MediaWiki software to allow users an environment to avoid disconnections and other internet based errors when editing wikis
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    csvplus is a collection of Perl-based utilities for querying csv files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Lightweight system for running a weblog. Features multiple authors, topics, Trackback, RSS (amongst others). TruBlog comes with easy installation and strong caching mechanisms, it's localisable and produces a valid XHTML. Theming is done through CSS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    NovaX is a set of programs that is being devloped for small company's and beginners in the fields of HTML and programming. Coded in Python and C++, this is also a good replacement for MS Office. NOTE: This requires Python to be on your PC. ( Python.org )
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Project to create a unified FAQ XML format with all applicable software to convert it to various formats, such as multiple forms of HTML, TeX, PDF, text files, etc. Useful for most of "FAQ keepers" on various forums and discussion lists.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    HindiDict creates a Latex formatted Hindi dictionary from a text file. The entries will be sorted both by Hindi and by English.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    You may manage your bibliography with this tool: export to BibTeX and HTML, shortcut for citation in LaTeX-code, internationalization... Due to lack of time this project is stalled, please see JabRef on http://jabref.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The most powerful non-commercial translation memory software (TM tool) with enhanced capabilities, like networking/collaboration (http, rpc), encoding conversion, project management capabilities, email capability with attachments, file tree diff etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB