Showing 37 open source projects for "batch text processing"

View related business solutions
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Lesan

    Lesan

    New way to create web server and NoSQL data model

    Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    ArangoDB-Community/pyArango

    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    EfficientDynamoDb

    EfficientDynamoDb

    High-performance C# DynamoDb library

    EfficientDynamoDb is a high-performance, low-level DynamoDB client for .NET, designed to offer better efficiency and flexibility compared to AWS's official SDK. It provides improved memory management, reduced allocations, and a fluent API for seamless integration with DynamoDB services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 5
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    Cetus

    Cetus

    Cetus is a high performance middleware that provides routing

    ...Multi-process lock-free improves operating efficiency. Supports transparent backend connection pooling. Support SQL read-write separation. Support data sub-database. Support distributed transaction processing. Support insert batch operations. Support for conditional distinct operations. Enhanced SQL route parsing and injection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    RadonDB

    RadonDB

    RadonDB is an open source, cloud-native MySQL database

    RadonDB is a cloud-native database based on MySQL, and architected in fully distributed cluster that enable unlimited scalability (scale-out), capacity and performance. It supported distributed transaction that ensure high data consistency, and leveraged MySQL as storage engine for trusted data reliability. RadonDB is compatible with MySQL protocol, and sup-porting automatic table sharding as well as batch of automation feature for simplifying the maintenance and operation workflow. RadonDB...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Cut Your Data Warehouse Bill by 54% Icon
    Cut Your Data Warehouse Bill by 54%

    Migrate from Snowflake, Redshift, or Databricks with free tools. No SQL rewrites.

    BigQuery delivers 54% lower TCO with serverless scale and flexible pricing. Free migration tools handle the SQL translation automatically.
    Try Free
  • 10
    Cosmos DB Spark

    Cosmos DB Spark

    Apache Spark Connector for Azure Cosmos DB

    ...The connector allows you to easily read to and write from Azure Cosmos DB via Apache Spark DataFrames in Python and Scala. It also allows you to easily create a lambda architecture for batch-processing, stream-processing, and a serving layer while being globally replicated and minimizing the latency involved in working with big data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Apache PredictionIO

    Apache PredictionIO

    Machine learning server for developers and ML engineers

    ...Quickly build and deploy an engine as a web service on production with customizable templates; respond to dynamic queries in real-time once deployed as a web service; evaluate and tune multiple engine variants systematically; unify data from multiple platforms in batch or in real-time for comprehensive predictive analytics; speed up machine learning modeling with systematic processes and pre-built evaluation measures; support machine learning and data processing libraries such as Spark MLLib and OpenNLP; implement your own machine learning models and seamlessly incorporate them into your engine; simplify data infrastructure management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Sedna Native XML Database

    Sedna Native XML Database

    Sedna is a native full-featured XML database management system.

    Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    OstrivDB

    OstrivDB

    Embedded NoSQL object-oriented database engine for .NET and Mono

    OstrivDB is an embedded NoSQL object-oriented database engine for Microsoft .NET and Mono. OstrivDB was named after the Oster river (the little river in little town where author lives). Features: - Caching for Select queries. - Indexing. - Batch of queries. - Views. - InMemory objects and views. - No special query language (LINQ used). - Integrated sorting and paging. - Multithreaded data processing. - Custom input/output and serialization. Requirements: - Microsoft .NET Framework 4.0 or Mono (tested on FreeBSD 9.1 and Mono 2.11.4).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A python script that uses wxwidgets. View or edit delimited data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Large Text File converter

    Java Based Heavy-duty utilitity to process large delimited text files

    TextZilla is a Multithreaded Java utility which can process huge size delimited text files to extract, convert, encode, decode, encrypt/decrypt text data from source and write it in desired output file or files. It provides fully extensible framework based on which Java classes can be created, for example it currently has MD5 conversion capability, based on same design classes for 3DES ,AES or any other Algo can be created. Another strength of this tool is in its configurability, it's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    osm2postgresql is a Linux script loading openstreetmap data into a postgresql database setting up if needed a server & database (postgis+hstore). Advanced processing is performed, easing rendering. http://wiki.openstreetmap.org/wiki/Osm2postgresql
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    miMas - GUI tool for SQL developers

    miMas - GUI tool for SQL developers

    A UI tool for any database

    miMas is an intuitive GUI tool for SQL query development and data browsing. It provides a common set of features across all databases and can be installed on any Operating System which has a Java Runtime available. The tool uses JDBC drivers for connecting to a database. For most popular databases (Oracle, DB2, MySQL, MS SQL Server, Sybase, Cloudscape, PostgreSQL and Firebird), the JDBC driver is already included. For connecting to databases other than these, please refer to the tool's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SynApp2
    SynApp2 builds feature-packed web applications and versatile PDF reports for MySQL and Oracle Database. The SynApp2 web application generator and MVC framework is written in PHP and JavaScript.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    This project aims to develop and distribute the database for Japanese Kanji (or CJK Unified Ideographs) for Unicode and ISO/IEC 10646.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes : - An admin module to manage SCRUM Project and tree of requirements - A multi-user Web-based client - Tools for databases management (HSQLDB, PostGreSQL,...) UProm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Converts GIS data from CARIS NTX format into ESRI shapefile format, preserving virtually all features and attributes. Supports points, lines, polygons, text, spot heights and soundings. Includes a batch utility, an API, and an ArcView GIS Extension.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Functional XML parsing framework: SAX/DOM and SXML parsers with support for XML Namespaces and validation. Related to SSAX are SXPath queries and SXML transformations, with applications to XML/HTML authoring and literate Scheme and XML programming.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    WikiPDF is a mediawiki extension based on Wiki2PDF that adds PDF/LaTeX features to mediawiki. Wiki2PDF is a python script to convert multiple articles of a mediawiki based wiki (pre-configured to use with www.wikipedia.org) to a single LaTeX or PDF file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB