Search Results for "processing" - Page 6

Showing 964 open source projects for "processing"

Filter: Python

1

TorchRL

A modular, primitive-first, python-first PyTorch library

TorchRL is an open-source Reinforcement Learning (RL) library for PyTorch. TorchRL provides PyTorch and python-first, low and high-level abstractions for RL that are intended to be efficient, modular, documented, and properly tested. The code is aimed at supporting research in RL. Most of it is written in Python in a highly modular way, such that researchers can easily swap components, transform them, or write new ones with little effort.

Downloads: 50 This Week

Last Update: 2026-02-05
See Project
2

NeMo Retriever Library

Document content and metadata extraction microservice

...It processes various document types by splitting them into components such as text, tables, charts, and images, and then applies OCR and contextual analysis to convert them into structured data formats. The system is built on NVIDIA NIM microservices, enabling high-performance parallel processing and efficient handling of large datasets. It supports multiple extraction strategies for different document formats, balancing accuracy and throughput depending on the use case. Additionally, it can generate embeddings for extracted content and integrate with vector databases like Milvus, making it well-suited for retrieval-augmented generation pipelines.

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
3

docext

An on-premises, OCR-free unstructured data extraction

...The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual and textual information directly from document images. This allows the system to detect and extract structured elements such as tables, signatures, key fields, and layout information while maintaining semantic understanding of the document content. ...

Downloads: 3 This Week

Last Update: 2026-03-12
See Project
4

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification

...Because the library integrates machine learning algorithms with signal processing tools, it enables researchers to develop complete audio analysis pipelines using a single framework.

Downloads: 3 This Week

Last Update: 2026-03-10
See Project
5

Automatic text summarizer

Module for automatic summarization of text documents and HTML pages

Sumy is a library and command-line utility for automatic text summarization. It provides multiple algorithms for extracting a summary from HTML pages or plain text, and it includes a simple evaluation framework for text summaries. The implemented summarization methods are described in the documentation. The maintainer also keeps a list of alternative implementations of the summarizers in various programming languages.

Downloads: 1 This Week

Last Update: 2026-02-14
See Project
6

loonflow

A workflow engine based on Django (Python)

...Scenario services), if your team has some development capability, it is recommended to use only the back-end engine and to build a customized front end for each scenario; these front ends can be spread across internal back-office management systems (such as personnel, operations and maintenance, monitoring, CMDB, etc.). Since version 1.1.x, loonflow comes with a front-end interface for creating and processing work orders, which can be used directly. Official versions are listed under the project's releases. It is recommended to use the latest version.

Downloads: 3 This Week

Last Update: 2026-03-24
See Project
7

txtai

Build AI-powered semantic search applications

txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...

Downloads: 8 This Week

Last Update: 2026-03-17
See Project
8

CogDB

Micro Graph Database for Python Applications

Cog is a lightweight, embedded graph database for Python that provides a simple interface for storing and querying graph-based data structures, making it useful for knowledge representation and graph analytics.

Downloads: 6 This Week

Last Update: 2 days ago
See Project
9

Frigate

NVR with realtime local object detection for IP cameras

A complete and local NVR designed for Home Assistant with AI object detection. Uses OpenCV and TensorFlow to perform realtime object detection locally for IP cameras. Use of a Google Coral Accelerator is optional, but highly recommended. The Coral will outperform even the best CPUs and can process 100+ FPS with very little overhead.

Downloads: 44 This Week

Last Update: 2026-03-19
See Project
10

Paperless-AI

AI-powered document analysis and tagging for Paperless-ngx

Paperless-AI is an AI-powered extension designed to enhance document management within Paperless-ngx by automating analysis, classification, and organization tasks. It continuously monitors incoming documents and processes them using various AI backends, enabling automatic assignment of titles, tags, document types, and correspondents. It integrates with multiple OpenAI-compatible services as well as local models, giving users flexibility in how document intelligence is handled. A key...

Downloads: 4 This Week

Last Update: 2026-03-17
See Project
11

API-for-Open-LLM

OpenAI-style API for open large language models

API-for-Open-LLM is a lightweight API server designed for deploying and serving open large language models (LLMs), offering a simple way to integrate LLMs into applications.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
12

newspaper4k

Python library for scraping and analyzing online news articles easily

...Newspaper4k also includes natural language processing capabilities that can generate summaries and identify keywords from extracted article text. Newspaper4k supports both single-article extraction and full news site processing, allowing users to build sources representing entire publications and iterate through their articles. It maintains compatibility with the original project so that existing code written for newspaper3k can continue working with minimal changes.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
13

E2M

E2M converts various file types (doc, docx, epub, html, htm, url

...The mirrored repository allows users to access the project’s codebase independently from its original hosting platform while preserving the development history and release artifacts. Systems like e2m often serve as middleware components that connect different software systems or facilitate data processing pipelines. By acting as a transformation layer, the software can support workflows such as converting data formats, integrating services, or bridging incompatible systems. The mirror hosted on SourceForge ensures that developers can continue accessing the project even if the primary repository becomes unavailable.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
14

RuView

Turn WiFi signals into real-time human sensing and spatial awareness.

...Unlike traditional vision systems, RuView operates without cameras, wearables, or cloud connectivity, making it a privacy-first sensing solution. The system runs on low-cost hardware such as ESP32 sensor meshes and performs signal processing and machine learning directly at the edge. By learning the RF signature of each environment over time, RuView adapts automatically to different spaces and improves its sensing accuracy. Designed for applications ranging from healthcare monitoring to disaster response, it enables spaces to gain spatial awareness using the radio signals already present in the environment.

Downloads: 82 This Week

Last Update: 2026-04-06
See Project
15

LLM-Aided OCR Project

Enhances Tesseract OCR output using LLMs (local or API)

...The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.

Downloads: 0 This Week

Last Update: 2026-03-22
See Project
16

Chinese-XLNet

Chinese XLNet pre-trained model

Chinese-XLNet is a Chinese language pre-trained model based on the XLNet architecture, providing an advanced foundation for natural language processing tasks in Mandarin and other Chinese dialects. Unlike traditional masked language modeling, XLNet uses a permutation language modeling objective that captures bidirectional context more effectively by training over all possible token orderings, yielding richer contextual representations. This model is trained on large-scale Chinese text datasets to learn linguistic patterns, long-range dependencies, and semantic nuance typical of Chinese writing, making it useful for tasks like text classification, question answering, named entity recognition, and language generation. ...

Downloads: 0 This Week

Last Update: 2026-01-15
See Project
17

Umi-OCR

OCR software, free and offline

...It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch processing of large sets of images or documents, PDF parsing, QR code detection, and layout-aware paragraph output. Users can interact with Umi-OCR through a graphical interface, command-line options, or HTTP interfaces, making it adaptable to both casual desktop usage and programmatic automation. Because the project is open source, developers can inspect, modify, and extend its capabilities, and plugins allow for different recognition engines or enhanced features.

Downloads: 41 This Week

Last Update: 2026-01-15
See Project
18

Colossal-AI

Making large AI models cheaper, faster and more accessible

The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing. Better performance has come with larger model sizes, which strain the memory limits of current accelerator hardware such as GPUs. Training large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine is rarely practical, so there is strong demand for training models in a distributed environment. ...

Downloads: 0 This Week

Last Update: 2025-05-28
See Project
19

Memobase

Fast backend for long-term AI user memory via structured profiles

...Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.

Downloads: 5 This Week

Last Update: 14 minutes ago
See Project
20

Dream Textures

Stable Diffusion built into Blender

...Inpaint to fix up images and convert existing textures into seamless ones automatically. Outpaint to increase the size of an image by extending it in any direction. Perform style transfer and create novel animations with Stable Diffusion as a post-processing step. Dream Textures has been tested with CUDA and Apple Silicon GPUs. Over 4GB of VRAM is recommended.

Downloads: 8 This Week

Last Update: 2024-08-26
See Project
21

Stanza

Stanford NLP Python library for many human languages

Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...

Downloads: 6 This Week

Last Update: 2026-02-26
See Project
22

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

...The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and generating trading signals based on quantitative models. The system supports real-time data streaming, allowing strategies to respond to market conditions as they evolve. QuantMuse also incorporates advanced risk management features, including portfolio monitoring, risk limits, and dynamic position sizing to control exposure.

Downloads: 2 This Week

Last Update: 2026-03-12
See Project
23

ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually.

Downloads: 2 This Week

Last Update: 2026-03-06
See Project
24

Open Gauss

Project-scoped Lean workflow orchestrator from Math, Inc.

Open Gauss is an enterprise-grade open-source relational database management system designed to handle large-scale data processing with high performance, reliability, and security. It is based on the PostgreSQL ecosystem but significantly extends its capabilities through architectural optimizations, AI-driven features, and enterprise-level enhancements. The database organizes data using the relational model, storing structured information in tables composed of rows and columns while supporting standard SQL for querying and management. ...

Downloads: 3 This Week

Last Update: 2026-04-05
See Project
25

Hugging Face - Speech To Speech

Open speech-to-speech models and pipelines by Hugging Face

This project from Hugging Face focuses on enabling direct speech-to-speech processing using modern machine learning models. It provides tools and reference implementations that allow audio input to be transformed into audio output without requiring an intermediate text representation. Hugging Face - Speech To Speech builds on recent advances in speech modeling, combining components such as speech recognition, translation, and synthesis into unified pipelines.

Downloads: 3 This Week

Last Update: 2026-03-18
See Project