Showing 22 open source projects for "text batch processing tools"

  • 1
    OpenAI Go

    The official Go library for the OpenAI API

    ...It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also offers examples to help developers quickly set up projects and test different API calls. Designed for reliability and ease of use, it is maintained to stay aligned with the evolving OpenAI API specifications.
    Downloads: 1 This Week
  • 2
    MCScanX

    MCScanX: Multiple Collinearity Scan toolkit X version

    MCScanX is a toolkit for detecting gene synteny and collinearity, aiding in the evolutionary analysis of gene arrangements across multiple genomes. It extends the original MCScan algorithm by incorporating additional utilities for visualization and downstream analyses.
    Downloads: 7 This Week
  • 3
    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    OmniTools is a self-hosted web application that bundles a large collection of everyday utilities into a single clean interface you can run on your own infrastructure. It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. ...
    Downloads: 10 This Week
  • 4
    baoyu-skills

    Skills shared by Baoyu for improving daily work efficiency with Claude

    ...The system supports marketplace-style installation, where users can selectively install or update individual skills rather than a monolithic package. It integrates with various external services, including AI APIs and browser automation tools, to expand its capabilities beyond basic text processing.
    Downloads: 2 This Week
  • 5
    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    ...It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video generation, robotics-focused reasoning models, and Lyria for TTS and music generation. The Cookbook also includes tutorials on advanced API workflows such as grounding answers with external tools, batch-mode request handling, and live multimodal interactivity with LiveAPI. ...
    Downloads: 1 This Week
  • 6
    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 11 This Week
  • 7
    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 0 This Week
  • 8
    Ollama-rs

    A simple and easy-to-use library for interacting with the Ollama API

    Ollama-rs is a Rust library designed to provide a simple and efficient interface for interacting with the Ollama API, enabling developers to integrate local large language models into Rust applications. It follows the official Ollama API closely, ensuring compatibility while offering an idiomatic Rust experience with strong typing and asynchronous execution. The library supports a wide range of operations, including text generation, chat interactions, embeddings, and model management, making...
    Downloads: 0 This Week
  • 9
    OnlineToolsBook

    Online tool cheats, write a high-quality manual for online tools

    ...For someone who frequently resorts to ad-hoc web tools to solve tasks (text manipulation, image processing, conversion, utilities), OnlineToolsBook acts as an aggregator of “cheat sheets” or curated pointer collection rather than a specific application. The intention appears to be long-term: the repository can be updated to reflect new tools, remove broken ones, organize categories, or provide usage hints — so it becomes a living, crowd-maintained reference.
    Downloads: 0 This Week
  • 10
    p5.js

    Client-side JS platform for artists, designers and students to express

    p5.js is a JavaScript library for creative coding, with a focus on making coding accessible and inclusive for artists, designers, educators, beginners, and anyone else! p5.js is free and open-source because we believe software, and the tools to learn it, should be accessible to everyone. Using the metaphor of a sketch, p5.js has a full set of drawing functionality. However, you’re not limited to your drawing canvas. You can think of your whole browser page as your sketch, including HTML5 objects for text, input, video, webcam, and sound. p5.js is an interpretation of Processing for today’s web. ...
    Downloads: 1 This Week
  • 11
    LLPhant

    A comprehensive PHP Generative AI Framework

    LLPhant is a PHP-based generative AI framework designed to bring large language model capabilities into modern PHP applications with a structure inspired by frameworks like LangChain. It provides developers with a set of abstractions for building AI-powered features such as chat systems, retrieval pipelines, and automated workflows while remaining simple enough to integrate into existing Symfony or Laravel projects. The framework focuses on usability by offering straightforward APIs for...
    Downloads: 0 This Week
  • 12
    Transformers4Rec

    Transformers4Rec is a flexible and efficient library

    Transformers4Rec is an advanced recommendation system library that leverages Transformer models for sequential and session-based recommendations. The library works as a bridge between natural language processing (NLP) and recommender systems (RecSys) by integrating with one of the most popular NLP frameworks, Hugging Face Transformers (HF). Transformers4Rec makes state-of-the-art transformer architectures available for RecSys researchers and industry practitioners. Traditional recommendation...
    Downloads: 0 This Week
  • 13
    Terminals Are Sexy

    A curated list of Terminal frameworks, plugins & resources

    This repository is a curated, community-driven collection of tools, tips, themes, and resources that make working in the terminal more productive and enjoyable. It pulls together high-quality links to terminal emulators, shells, prompts, plugins, fonts, color schemes, and command-line utilities, alongside learning materials for sharpening everyday CLI skills. The list favors practical, well-maintained projects and is organized so you can quickly discover alternatives for common tasks (navigation, search, networking, monitoring, text processing, and more). ...
    Downloads: 1 This Week
  • 14
    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP) whose goal is to make it quick to implement NLP tasks and build complex models. A unified tabular data container simplifies data preprocessing. Built-in Loader and Pipe components cover multiple datasets, eliminating the need for preprocessing code. It also offers convenient NLP tools such as embedding loading (including ELMo and BERT) and an intermediate data cache.
    Downloads: 0 This Week
  • 15
    Pure Bash Bible

    A collection of pure bash alternatives to external processes

    pure-bash-bible is a collection of pure Bash scripting techniques that demonstrate how to accomplish common and complex tasks using only built-in Bash features. Its goal is to reduce reliance on external tools like sed, awk, or grep, which can slow down scripts and add unnecessary dependencies. The project is organized as a reference book of function-based code snippets, each showcasing practical solutions for string manipulation, text processing, file operations, and more. By relying exclusively on Bash built-ins, these methods can make scripts faster, more portable, and easier to maintain. ...
    Downloads: 4 This Week
  • 16
    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    PyTorch-NLP is a library for Natural Language Processing (NLP) in Python. It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. ...
    Downloads: 0 This Week
  • 17
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 200 This Week
  • 18
    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 0 This Week
  • 19
    JPDF Tools
    JPDF Tools is a GUI Java program built on the JPDF Export library. Its main aim is to create PDF files by inserting text, images, or tables. Users can also merge PDF files, split PDF files, and merge images into PDF files; conversion to and from PDF is planned.
    Downloads: 0 This Week
  • 20
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. It covers text normalization, phrase segmentation, text indexing, stop word lists, and common spelling mistakes.
    Downloads: 0 This Week
  • 21
    This is a small JSP tag library that lets you create PDF documents within your JSPs. Add the jar file to the lib folder under WEB-INF and the tld file to a tld folder under WEB-INF, and you are ready to use the tags.
    Downloads: 0 This Week
  • 22
    The Umber project provides simple, flexible, "earthy" Java tool libraries for developers. The tools supplement common tasks like XML handling, data processing, and PDF generation, but without the complex and arcane APIs of most modern implementations.
    Downloads: 0 This Week