Showing 26 open source projects for "batch text processing"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    pdfcpu

    pdfcpu

    A PDF processor written in Go

    ...The main focus lies on strong support for batch processing and scripting via a rich command line. At the same time pdfcpu wants to make it easy to integrate PDF processing into your Go-based backend system by providing a robust command set. Always make sure your work is based on the latest commit! pdfcpu is still Alpha - bugfixes are committed on the fly and will be mentioned in the next release notes.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    Kapacitor

    Kapacitor

    Open source framework for processing, monitoring, and alerting

    Open source framework for processing, monitoring, and alerting on time series data. Kapacitor is a real-time data processing engine for monitoring and alerting, specifically designed to work with time-series data from InfluxDB.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    OpenAI Go

    OpenAI Go

    The official Go library for the OpenAI API

    ...It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also offers examples to help developers quickly set up projects and test different API calls. Designed for reliability and ease of use, it is maintained to stay aligned with the evolving OpenAI API specifications.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    gse

    gse

    Go efficient multilingual NLP and text segmentation

    Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text...
    Downloads: 4 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Memobase

    Memobase

    Fast backend for long-term AI user memory via structured profiles

    ...Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    unipdf

    unipdf

    Golang PDF library for creating and processing PDF files (pure go)

    UniDoc UniPDF is a PDF library for Go (golang) with capabilities for creating and reading, processing PDF files. The library is written and supported by FoxyUtils.com, where the library is used to power many of its services. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Asynq

    Asynq

    Simple, reliable, and efficient distributed task queue in Go

    Asynq is a Go library for queueing tasks and processing them asynchronously with workers. It's backed by Redis and is designed to be scalable yet easy to get started. Client puts tasks on a queue. Server pulls tasks off queues and starts a worker goroutine for each task. Tasks are processed concurrently by multiple workers. Task queues are used as a mechanism to distribute work across multiple machines. A system can consist of multiple worker servers and brokers, giving way to high...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Cloudreve

    Cloudreve

    Self-hosted file management and sharing system

    Cloudreve is a full-featured self-hosted file management and sharing system. It supports multi-cloud storage backends, user/group permissions, file previews/editing, offline downloading via Aria2, and a polished web/PWA frontend.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    KrillinAI

    KrillinAI

    Video translation and dubbing tool powered by LLMs

    ...It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed audio tracks. KrillinAI supports both landscape and portrait videos, which makes it suitable for a wide range of platforms — from YouTube to TikTok or other vertical-video sites — and ensures correct formatting and layout for the final video. The tool offers “one-click” workflows and desktop versions, lowering the barrier for users who may not be familiar with video editing or audio processing pipelines.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 10
    KubeAI

    KubeAI

    Private Open AI on Kubernetes

    Get inferencing running on Kubernetes: LLMs, Embeddings, Speech-to-Text. KubeAI serves an OpenAI compatible HTTP API. Admins can configure ML models by using the Model Kubernetes Custom Resources. KubeAI can be thought of as a Model Operator (See Operator Pattern) that manages vLLM and Ollama servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Weaviate

    Weaviate

    Weaviate is a cloud-native, modular, real-time vector search engine

    Weaviate in a nutshell: Weaviate is a vector search engine and vector database. Weaviate uses machine learning to vectorize and store data, and to find answers to natural language queries. With Weaviate you can also bring your custom ML models to production scale. Weaviate in detail: Weaviate is a low-latency vector search engine with out-of-the-box support for different media types (text, images, etc.). It offers Semantic Search, Question-Answer-Extraction, Classification, Customizable...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    Amazon ECS Container Agent

    Amazon ECS Container Agent

    Amazon Elastic Container Service Agent

    Run highly secure, reliable, and scalable containers. Launch thousands of containers across the cloud using your preferred continuous integration and delivery (CI/CD) and automation tools. Optimize your time with AWS Fargate serverless compute for containers, which eliminates the need to configure and manage control plane, nodes, and instances. Save up to 50 percent on compute costs with autonomous provisioning, auto-scaling, and pay-as-you-go pricing. Integrate seamlessly with AWS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    openwechat

    openwechat

    golang WeChat SDK

    ...Breakthrough login restrictions. No need to scan the QR code repeatedly to log in. Support multiple WeChat accounts to log in at the same time. Message reply, send text, pictures, files, emoji and other messages to designated objects (friends, groups) Hot login (no need to repeatedly scan the code to log in), custom message processing, file download, message anti-withdrawal. Obtain object information, set friend notes, pull friends into groups, etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    GoAWK

    GoAWK

    A POSIX-compliant AWK interpreter written in Go, with CSV support

    GoAWK now uses a bytecode compiler and includes native support for CSV files. AWK is a fascinating text processing language, and The AWK Programming Language is a wonderfully concise book describing it. The A, W, and K in AWK stand for the surnames of the three original creators: Alfred Aho, Peter Weinberger, and Brian Kernighan. Kernighan is also an author of The C Programming Language (“K&R”), and the two books have that same each-page-packs-a-punch feel.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv to include support for processing images. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Lingua-Go

    Lingua-Go

    The most accurate natural language detection library for Go

    Lingua-Go is a Golang implementation of the Lingua language detection library, providing efficient and accurate language identification for Go-based applications. Its task is simple: It tells you which language some text is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages. Language detection is often done as part of large machine-learning frameworks or natural language processing applications. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    QOR

    QOR

    Libraries written in Go abstracting features needed for business apps

    QOR is architected from the ground up to accelerate development and deployment of Content Management Systems, E-commerce Systems, and Business Applications. QOR is comprised of modules that abstract common features for such systems, such as a configurable and flexible back office, a content publishing system, a media library, and much more. We built QOR at The Plant to accelerate development for our clients, it is at the core of our engineering strategy, and we maintain it on an ongoing...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Slides

    Slides

    Terminal based presentation tool

    Slides in your terminal.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    VideoSrt

    VideoSrt

    Windows-GUI

    ...VideoSrtIt is written in Golanglanguage and developed based on lxn/walk Windows-GUI toolkit. Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. ...
    Downloads: 45 This Week
    Last Update:
    See Project
  • 20
    Pixel

    Pixel

    A hand-crafted 2D game library in Go

    ...See the requirements for the list of libraries necessary for compilation. Fast 2D graphics, sprites, and primitive shapes with immediate mode style IMDraw (circles, rectangles, lines, ...) Optimized drawing with Batch. Text drawing with text package. Audio through a separate Beep library. Simple and convenient API. Drawing a sprite to a window is as simple as sprite.Draw(window, matrix) Wanna know where the center of a window is? window.Bounds().Center() Full documentation and tutorial. Works on Linux, macOS and Windows. Window creation and manipulation (resizing, fullscreen, multiple windows, ...) ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    prose NLP

    prose NLP

    Golang library for text processing

    Prose is a natural language processing library for Go, designed for text analysis tasks like tokenization, named entity recognition, and dependency parsing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    RadonDB

    RadonDB

    RadonDB is an open source, cloud-native MySQL database

    RadonDB is a cloud-native database based on MySQL, and architected in fully distributed cluster that enable unlimited scalability (scale-out), capacity and performance. It supported distributed transaction that ensure high data consistency, and leveraged MySQL as storage engine for trusted data reliability. RadonDB is compatible with MySQL protocol, and sup-porting automatic table sharding as well as batch of automation feature for simplifying the maintenance and operation workflow. RadonDB...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Blackfriday

    Blackfriday

    Blackfriday, a markdown processor for Go

    Blackfriday is a Markdown processor implemented in Go. It is paranoid about its input (so you can safely feed it user-supplied data), it is fast, it supports common extensions (tables, smart punctuation substitutions, etc.), and it is safe for all utf-8 (unicode) input. HTML output is currently supported, along with Smartypants extensions. It started as a translation from C of Sundown. Blackfriday itself does nothing to protect against malicious content. If you are dealing with user-supplied...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    up

    up

    A tool for writing Linux pipes with instant live preview

    ...The main goal of the Ultimate Plumber is to help interactively and incrementally explore textual data in Linux, by making it easier to quickly build complex pipelines, thanks to a fast feedback loop. This is achieved by boosting any typical Linux text-processing utils such as grep, sort, cut, paste, awk, wc, perl, etc., etc., by providing a quick, interactive, scrollable preview of their results. To start using up, redirect any text-emitting command (or pipeline) into it. Use PgUp/PgDn and Ctrl-[←]/Ctrl-[→] for basic browsing through the command output. In the input box at the top of the screen, start writing any bash pipeline; then press Enter to execute the command you typed, and the Ultimate Plumber will immediately show you the output of the pipeline in the scrollable window.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Primitive Pictures

    Primitive Pictures

    Reproducing images with geometric primitives

    Primitive Pictures is an image processing command-line tool written in Go that reproduces images using geometric primitives (triangles, rectangles, ellipses, polygons, etc.). The core algorithm is iterative and “hill-climbing”: given a target image, it repeatedly finds the best single shape to add that will reduce the error between the current approximation and the target image, then draws that shape.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB