Showing 61 open source projects for "data processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Data Formulator

    Data Formulator

    Create rich visualizations with AI

    To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals. To achieve this, analysts need not only proficiency in data transformation and visualization tools but also efforts to manage the branching history consisting of many different versions of data and charts. Recent LLM-powered AI systems have greatly improved visualization authoring experiences, for example by mitigating manual data transformation barriers via LLMs' code generation ability. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ThingsBoard

    ThingsBoard

    Device management, data collection, processing and visualization

    ...Define relations between your devices, assets, customers or any other entities. Collect and store telemetry data in a scalable and fault-tolerant way. Visualize your data with built-in or custom widgets and flexible dashboards. Share dashboards with your customers. Define data processing rule chains. Transform and normalize your device data. Raise alarms on incoming telemetry events, attribute updates, device inactivity, and user actions.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    ChatLab

    ChatLab

    Local-first AI chat analysis tool for insights from conversation data

    ...ChatLab emphasizes a local-first approach, meaning all chat data is processed and stored on the user’s device rather than being uploaded to external servers. It supports large-scale datasets through streaming parsing and multi-worker processing, allowing it to handle millions of messages efficiently. ChatLab also includes visualization features that present trends, activity patterns, and interaction metrics in a clear and accessible format.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Lesan

    Lesan

    New way to create web server and NoSQL data model

    Lesan is a multilingual text processing and translation library designed for natural language processing (NLP) applications. It provides tools for text normalization, tokenization, and translation across multiple languages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    OmniTools

    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Fincept Terminal

    Fincept Terminal

    FinceptTerminal is a modern finance application

    Fincept Terminal is an open-source financial intelligence platform aimed at bringing powerful market analysis and investment research tools to a broad audience without the prohibitive cost of proprietary terminals. The project provides both command-line and graphical interfaces that let users access real-time market data, economic indicators, and advanced analytics directly from a unified terminal environment, supporting stocks, forex, commodities, and more. Its architecture integrates Python, TypeScript, Rust, and React, reflecting both a robust data processing backend and a modern desktop UI experience. FinceptTerminal emphasizes AI-powered insights and automation, offering technical and fundamental analysis, sentiment data, and customizable workflows that help traders and analysts make informed decisions efficiently. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 7
    LiteParse

    LiteParse

    A fast, helpful, and open-source document parser

    ...It also includes mechanisms for validation and error handling, ensuring that outputs conform to expected schemas and reducing the need for manual postprocessing. The library is particularly useful for tasks such as data extraction, document processing, and building pipelines that require structured outputs from natural language input.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    AionUi

    AionUi

    Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

    ...Instead of forcing users to work in separate terminals for each tool, AionUi automatically detects installed CLI tools and provides a central visual workspace where sessions can run in parallel, contexts are preserved, and conversations are saved locally without sending data to external servers. It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.
    Downloads: 49 This Week
    Last Update:
    See Project
  • 9
    Documind

    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    ScreenPipe

    ScreenPipe

    AI app store powered by 24/7 desktop history. open source

    Screenpipe is an AI app store powered by continuous desktop history recording. It operates entirely locally, offering developers a platform to build, distribute, and monetize AI applications that leverage comprehensive contextual data from users' desktop activities. ​
    Downloads: 35 This Week
    Last Update:
    See Project
  • 11
    AI-Media2Doc

    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    ...It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. It separates client-side media handling from backend AI processing, reducing data exposure while still enabling transcription and document generation. AI-Media2Doc supports flexible customization through prompts, allowing users to tailor output styles based on their needs. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Fast CSV

    Fast CSV

    CSV parser and formatter for node

    A high-performance Node.js library for parsing and formatting CSV data efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Neuroglancer

    Neuroglancer

    WebGL-based viewer for volumetric data

    ...The viewer is built with a multi-threaded architecture, separating rendering and data processing to ensure smooth performance even with massive datasets. Extensively used in neuroscience research, Neuroglancer supports integration with tools.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    baoyu-skills

    baoyu-skills

    Skills shared by Baoyu for improving daily work efficiency with Claude

    baoyu-skills is a comprehensive plugin-based toolkit designed to enhance productivity within AI-assisted development environments, particularly Claude Code, by providing a wide range of modular “skills” for content generation, automation, and data processing. The project organizes its functionality into categories such as content creation, AI generation, and utility tools, enabling users to extend their workflows through reusable components. Each skill is implemented as a structured module with its own configuration, scripts, and execution logic, allowing for flexible customization and extension. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 16
    Gooo

    Gooo

    Toolkit for developing web applications in Vue, Templ, and Go

    Gooo is an open-source project that focuses on providing tools and utilities for interacting with and analyzing online content, particularly in the context of automation and data retrieval workflows. The repository appears to function as a lightweight utility toolkit that can be adapted for specific use cases such as scraping, automation, or content processing. It is structured to allow developers to customize and extend its functionality depending on their needs, rather than acting as a fully packaged end-user application. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Nexent

    Nexent

    Zero-code platform for building AI agents from natural language input

    ...It focuses on a zero-code approach, allowing users to define workflows and agent behavior purely through language prompts, significantly lowering the barrier to entry for AI development. Built on the MCP ecosystem, Nexent integrates a wide range of tools, models, and data sources into a unified environment for agent creation and execution. Nexent supports multi-agent collaboration, enabling multiple intelligent agents to interact and coordinate tasks within complex workflows. It also includes capabilities for data processing, knowledge tracing, and multimodal interaction, allowing agents to work with different input and output formats. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Matter AI

    Matter AI

    Matter AI is open-source AI Code Reviewer Agent

    Matter AI is an AI-powered platform designed to enhance productivity through automated content generation, data analysis, and decision support. It leverages machine learning models to process text, analyze patterns, and generate insights, making it suitable for businesses looking to optimize data-driven decision-making. Matter AI integrates with various data sources and provides customizable AI workflows tailored to different industries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Pluely

    Pluely

    The Open Source Alternative to Cluely

    ...The system focuses on orchestrating tasks performed by large language models and other AI components, allowing developers to define structured workflows where models interact with tools, APIs, and external systems. By providing a modular architecture for building AI pipelines, the platform enables developers to connect multiple processing steps such as data retrieval, prompt execution, analysis, and response generation. The project emphasizes flexibility, allowing developers to extend the platform with custom integrations and automation logic. This makes the framework suitable for building intelligent assistants, automated business workflows, and data-processing pipelines that rely on generative AI capabilities.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    douyin

    douyin

    Open source Douyin crawler for collecting and downloading public data

    DouyinCrawler is an open source data collection tool designed to gather publicly available information from the Douyin platform. It demonstrates how to build a Python-based web crawler combined with a graphical interface and command line functionality. It allows users to collect data from various types of Douyin content, including user profiles, videos, hashtags, and music pages. DouyinCrawler supports both automated scraping and batch operations to process multiple targets efficiently. It...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services. It is built for developers who need high-quality parsing from scans, photos, PDFs, office files,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    FastGPT

    FastGPT

    FastGPT is a knowledge-based platform built on the LLMs

    FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    OpenHealth

    OpenHealth

    AI health assistant for private, local data-driven insights mgmt

    ...A strong emphasis is placed on privacy, as the platform can run entirely locally, ensuring that sensitive medical data does not need to leave the user’s environment. OpenHealth also includes a data parsing layer that transforms raw medical inputs into structured datasets, making them usable for analysis and AI-driven insights. OpenHealth separates data ingestion, processing, and AI interaction, enabling flexibility in integrating different models and data sources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    dataline

    dataline

    AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake

    ...Once connected, users can generate tables, charts, and reports automatically based on queries produced by the AI engine. The platform is designed with a privacy-first architecture that stores data locally on the user’s device rather than sending it to external cloud services by default. It can also hide sensitive data from language models during processing, ensuring that only necessary metadata is used for query generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    hash-wasm

    hash-wasm

    Lightning fast hash functions using hand-tuned WebAssembly binaries

    ...The library supports a wide range of algorithms, including MD5, SHA variants, BLAKE, Argon2, bcrypt, and xxHash, making it suitable for applications ranging from security to data processing. By compiling optimized C implementations into WebAssembly, hash-wasm achieves significantly better performance compared to pure JavaScript alternatives while maintaining portability across platforms. It supports both simple one-shot hashing and advanced streaming modes, allowing developers to process large datasets incrementally. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB