Showing 25 open source projects for "batch text processing"

  • 1
    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available through the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools or pip. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set...
    Downloads: 0 This Week
  • 2
    SageMaker Spark Container

    Docker image used to run data processing workloads

    Apache Spark™ is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.
    Downloads: 0 This Week
  • 3
    Atgen A2 Automation

    Atgen A2 is a radically simple Workload Automation & Scheduling tool.

    Atgen A2 is a radically simple IT automation solution that automates application builds, continuous testing, data parsing and processing, report generation, batch processing, job scheduling, and many other IT needs. Avoid running repetitive batch jobs and managing access to deploy and update your applications; automate in a language that approaches plain English, using SSH or WinRM, with no agents to install on remote systems. https://www.atgensoft.com/ You can install a released version of Atgen A2 via a package manager (yum) by downloading the latest release. ...
    Downloads: 0 This Week
  • 4
    2DFMDBarcodeCreator

    Create FMD style DataMatrix pack codes & DataMatrix aggregate codes

    Reads CSV files of FMD pack data, then creates and prints FMD DataMatrix codes for all packs in the file. Creates aggregate barcode files for packs within the same batch. The Aggregate function is for labelling outer packs, allowing codes to be checked without breaking a box seal and scanning packs individually.
    Downloads: 1 This Week
  • 5
    TEXT2DATA

    Text Analytics Platform

    Bring a text analytics platform that uses NLP (Natural Language Processing) and machine learning to your work environment. Extract essential information from your text documents and let artificial intelligence save you time. Get detailed and agile reports on your unstructured data.
    Downloads: 0 This Week
  • 6
    DSTK - Data Science TooKit 3

    Data and Text Mining Software for Everyone

    DSTK - Data Science Toolkit 3 is a set of data and text mining tools that follows the CRISP-DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, and modeling and evaluation for machine learning and algorithms. It is based on the older DSTK version (https://sourceforge.net/projects/dstk2/). The DSTK Engine is like R.
    Downloads: 0 This Week
  • 7
    Zoom Accounts

    Accounting and CRM components for Delphi

    Zoom Accounts is a set of Delphi components and application templates for creating integrated accounting and CRM systems for medium and larger enterprises. HB Reports is required to compile this project: https://sourceforge.net/projects/hb-reports/
    Downloads: 4 This Week
  • 8
    BPMN View

    Viewing and printing business process diagrams in a popular BPMN format

    A simple, free tool to view and print business process diagrams in the popular BPMN format.
    Downloads: 4 This Week
  • 9

    RPTParse

    Quartus project compilation reports parsing tool

    Altera Quartus creates .rpt files during the synthesis, place-and-route, and bitstream generation stages of FPGA project compilation. A verification engineer checks these reports, finds the warning messages, and puts them into a separate report. Big projects can contain many warnings, and searching for them manually is slow and tedious, which is why this parsing tool was created. The software parses a report file, classifies the warning types, and places them into its own report, where types of warnings and their...
    Downloads: 0 This Week
  • 10
    JBEAM

    Java-based batch scheduling, processing and monitoring framework

    JBEAM can be used to: (1) define a batch calendar; (2) schedule and execute end-of-day processing of business objects; (3) schedule execution of inbound and outbound interface jobs; (4) schedule any end-of-business-day or monthly reports; (5) monitor job status and execution.
    Downloads: 3 This Week
  • 11
    TeleScope

    XML Data Stream Broker/Replicator

    TeleScope is an efficient XML data stream broker, replicator and simple event processing (SEP) platform for intensive loads, written in C for the Fedora 17-18, Slackware 13-14, and Red Hat Enterprise Linux 6 (RHEL-6) Linux distributions. The platform is intended to operate on single number/word values and is not meant to be deployed for full-text XML stream analysis. TeleScope has an internal query language with a set of standard logical operators that allows relatively complex query expressions to be constructed. ...
    Downloads: 0 This Week
  • 12
    giServer

    giServer: the easy-to-use, extensible batch and integration server

    The giServer is an easy-to-use integration server for process automation and event-driven or scheduled execution of batch jobs. Instead of complex XML configuration files, it includes an elaborate GUI for batch job management. Some possible usage scenarios are: automatic processing of incoming data files; Big Data applications; process automation; data mining/aggregation applications; automatic reporting; processing and analysis of database records.
    Downloads: 0 This Week
  • 13
    CADminTools5

    License Tracker and User Database

    ...Most CAD software uses floating licenses; this tool gives you a web-based interface to see what's happening each day or week and to spot usage trends. It also connects with a user database, so you can track overall usage. The components are ASP web pages, an Access database, batch scripts, and text configuration files. No programming knowledge is required to use this. The default passwords for editing/deleting are editme/deleteme. More details at http://sourceforge.net/p/cadmintools5/wiki/
    Downloads: 0 This Week
  • 14

    QueLang

    QueLang is a tool for questionnaire design.

    ...QueLang Survey Manager can collect the results of answered questionnaires in a single database for mass statistical processing. Includes: full documentation, a GUI, a CLI, and Survey Manager. TODO: write a decent parser (I have to study for that); add some more macros; support answer images (instead of text only). You can always tell me what you want me to implement, and I will include it (if possible) in the next update!
    Downloads: 0 This Week
  • 15

    help-desk-ru

    System for registering and processing IT service requests.

    Designed for the support services of organizations that provide IT services. Provides configuration of client organizations and serviced facilities, and user management. Allows customer users to create service requests that contain a brief description, detailed text, the service object, and attached files. The support service then handles the requests, performing the necessary work and recording it in the request. The life cycle of a request has a number of states (statuses), transitions...
    Downloads: 0 This Week
  • 16

    DataStreamProcessor

    TPL (transfer, parse, load) tool for batch files.

    This is an enterprise-strength system for batch file processing, e.g. transferring, parsing and loading data using batch (text) files inside and outside the enterprise. The system is controlled through parameters and doesn't require any programming, code generation or code deployment. This is a heavy-duty back-end system with no GUI. Nonetheless, it's very easy to use, easier than most GUI-based ETLs, and even easier to install.
    Downloads: 0 This Week
  • 17
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
  • 18
    pdfInspect
    pdfInspect offers a flexible GUI for viewing the internal structure and content of a PDF file. It wraps the Apache PDFBox library and is an example of an application built with Superficial (http://superficial.sourceforge.net).
    Downloads: 0 This Week
  • 19
    AML Filter is an enterprise-class open source software solution for anti-money-laundering compliance and risk reduction. The system screens names and related data against AML watchlists with efficiency, accuracy and negligible false positives.
    Downloads: 0 This Week
  • 20
    SplitPDF (SplitPDF.jar) is a command-line-driven Java program that splits a PDF file by bookmarks into separate PDFs. The bookmark is used as the title of each newly created PDF. Extremely useful and fast in a batch processing environment.
    Downloads: 2 This Week
  • 21
    A pure Java office suite that is compatible with the MS file formats. Since it doesn't require native libraries, it can be loaded inside any browser and on any platform. Notably, it promotes Enterprise 2.0 by combining uEngine BPM and web office.
    Downloads: 0 This Week
  • 22
    A Java library for declining Russian personal names, surnames, patronymics, occupation titles and organization units.
    Downloads: 0 This Week
  • 23
    EDI-Knight is integration and translation software focusing on EDI (Electronic Data Interchange), CSV (Character Separated Values), and XML. There's a command line tool for batch processing as well as a Java/J2EE-conformant API.
    Downloads: 0 This Week
  • 24
    The Collective Identity Project's goal is to build a P2P infrastructure aimed at fostering true community thinking. Largely text-based and peer-reviewed, this software is an experiment to find a social convergence on a wide range of topics.
    Downloads: 0 This Week
  • 25
    A knowledge management system written in Java for JBoss 4.2.3 Server, with RichFaces 3.3.0BETA4. Includes file conversion from HTML to PDF and a rich:editor component that needs no special syntax.
    Downloads: 0 This Week