Showing 48 open source projects for "batch text processing"

View related business solutions
  • Push Code. Get a Production URL. Done. Icon
    Push Code. Get a Production URL. Done.

    Cloud Run deploys any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try Cloud Run Free
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Lesan

    Lesan

    New way to create web server and NoSQL data model

    Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    ArangoDB-Community/pyArango

    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    EfficientDynamoDb

    EfficientDynamoDb

    High-performance C# DynamoDb library

    EfficientDynamoDb is a high-performance, low-level DynamoDB client for .NET, designed to offer better efficiency and flexibility compared to AWS's official SDK. It provides improved memory management, reduced allocations, and a fluent API for seamless integration with DynamoDB services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 5
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    Cetus

    Cetus

    Cetus is a high performance middleware that provides routing

    ...Multi-process lock-free improves operating efficiency. Supports transparent backend connection pooling. Support SQL read-write separation. Support data sub-database. Support distributed transaction processing. Support insert batch operations. Support for conditional distinct operations. Enhanced SQL route parsing and injection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    RadonDB

    RadonDB

    RadonDB is an open source, cloud-native MySQL database

    RadonDB is a cloud-native database based on MySQL, and architected in fully distributed cluster that enable unlimited scalability (scale-out), capacity and performance. It supported distributed transaction that ensure high data consistency, and leveraged MySQL as storage engine for trusted data reliability. RadonDB is compatible with MySQL protocol, and sup-porting automatic table sharding as well as batch of automation feature for simplifying the maintenance and operation workflow. RadonDB...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 10
    Cosmos DB Spark

    Cosmos DB Spark

    Apache Spark Connector for Azure Cosmos DB

    ...The connector allows you to easily read to and write from Azure Cosmos DB via Apache Spark DataFrames in Python and Scala. It also allows you to easily create a lambda architecture for batch-processing, stream-processing, and a serving layer while being globally replicated and minimizing the latency involved in working with big data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    HeidiSQL
    HeidiSQL is a useful and reliable tool designed for web developers using the popular MySQL server, Microsoft SQL databases and PostgreSQL. Code repository and tickets have moved to https://github.com/HeidiSQL/HeidiSQL in Nov 2017.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Apache PredictionIO

    Apache PredictionIO

    Machine learning server for developers and ML engineers

    ...Quickly build and deploy an engine as a web service on production with customizable templates; respond to dynamic queries in real-time once deployed as a web service; evaluate and tune multiple engine variants systematically; unify data from multiple platforms in batch or in real-time for comprehensive predictive analytics; speed up machine learning modeling with systematic processes and pre-built evaluation measures; support machine learning and data processing libraries such as Spark MLLib and OpenNLP; implement your own machine learning models and seamlessly incorporate them into your engine; simplify data infrastructure management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    CSV SQL Console

    Executes SQL query and redirects output to new CSV file

    This tool is designed to join multiple CSV files into one CSV file with possibility to sort, group and filter data. Based on CSV/Text ODBC Driver it supports almost all futures of the SQL. Works offline. Does not require Microsoft Access, SQL Server and other heavy tools. See wiki for details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Sedna Native XML Database

    Sedna Native XML Database

    Sedna is a native full-featured XML database management system.

    Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    OstrivDB

    OstrivDB

    Embedded NoSQL object-oriented database engine for .NET and Mono

    OstrivDB is an embedded NoSQL object-oriented database engine for Microsoft .NET and Mono. OstrivDB was named after the Oster river (the little river in little town where author lives). Features: - Caching for Select queries. - Indexing. - Batch of queries. - Views. - InMemory objects and views. - No special query language (LINQ used). - Integrated sorting and paging. - Multithreaded data processing. - Custom input/output and serialization. Requirements: - Microsoft .NET Framework 4.0 or Mono (tested on FreeBSD 9.1 and Mono 2.11.4).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A python script that uses wxwidgets. View or edit delimited data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This is a handy small tool to execute batch SQL files in SQL server. Just add any number of SQL file in it and execute all in one sort. Tool will notify when a new file found in repository or any modification is done on added sql files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This project deals with developing a distributed infrastructure to handle and process sensor data including their provenance data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Large Text File converter

    Java Based Heavy-duty utilitity to process large delimited text files

    TextZilla is a Multithreaded Java utility which can process huge size delimited text files to extract, convert, encode, decode, encrypt/decrypt text data from source and write it in desired output file or files. It provides fully extensible framework based on which Java classes can be created, for example it currently has MD5 conversion capability, based on same design classes for 3DES ,AES or any other Algo can be created. Another strength of this tool is in its configurability, it's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LM - Report Asistent will be an utility, that helps dataminers, who use LISP-Miner, with writing analytical repostrs. Report Asisten will connect LISP-Miner with MS Word (R) editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    osm2postgresql is a Linux script loading openstreetmap data into a postgresql database setting up if needed a server & database (postgis+hstore). Advanced processing is performed, easing rendering. http://wiki.openstreetmap.org/wiki/Osm2postgresql
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    .Net-Control for PDF creation with dynamic Content. It includes a WYSIWYG-Editor to design the PDF-Documents and a Interface to Link SQL-Queries with Fields on the PDFs. Databases are Access(OleDb), MySql(-Client), MS Sql(-Client), all Odbc. Uses itext.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    miMas - GUI tool for SQL developers

    miMas - GUI tool for SQL developers

    A UI tool for any database

    miMas is an intuitive GUI tool for SQL query development and data browsing. It provides a common set of features across all databases and can be installed on any Operating System which has a Java Runtime available. The tool uses JDBC drivers for connecting to a database. For most popular databases (Oracle, DB2, MySQL, MS SQL Server, Sybase, Cloudscape, PostgreSQL and Firebird), the JDBC driver is already included. For connecting to databases other than these, please refer to the tool's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB