Showing 117 open source projects for "data processing"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    HPCC Systems

    HPCC Systems

    End-to-end big data in a massively scalable supercomputing platform.

    HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing.
    Downloads: 57 This Week
    Last Update:
    See Project
  • 3
    TURGEN

    TURGEN

    Create your own Atari tapes

    With TURGEN, you can record your own tapes with software for Atari 8-bit computers, and transfer data from your PC or Mac to your Atari 8-bit computer using your data recorder and a cassette adapter. TURGEN converts digital images of Atari software to the original standard Atari tape records and also various "turbo" systems. You can create WAVE files, tape images, or send the signal directly to the sound card. Tools such as Wizard for files, Tape Side Creator, Dual Track Tape creator and Batch processing allow you to create tapes with convenience
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 4
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    ...DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Rescuvera

    Rescuvera

    Data Recovery Application for Linux Systems

    This program is a GUI that utilizes some console-based data recovery tools. It recovers data from memory cards, flash drives, and external and internal magnetic drives. It supports image creation and image processing. This program carries no warranty, and all responsibility rests with the user. https://github.com/shampuan/Rescuvera
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Breakpad

    Breakpad

    Mirror of Google Breakpad project

    Breakpad is an open source crash reporting system developed by Google that provides both client and server components to capture, analyze, and report software crashes across platforms. It is designed to help developers diagnose and fix crashes efficiently by generating detailed crash dumps, stack traces, and diagnostic data whenever an application fails unexpectedly. The framework includes libraries for embedding crash-handling functionality directly into applications and tools for processing and symbolizing crash dumps on the server side. Breakpad supports multiple operating systems, including Linux, macOS, and Windows, and integrates easily into existing build systems. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 511 This Week
    Last Update:
    See Project
  • 8
    zpaqfranz

    zpaqfranz

    Zpaq compatible archiver for Win, Linux, Free/OpenBSD, Solaris & MacOS

    ...Get forever storage of your files, managing critical backups with bulletproof archival solutions and enterprise-grade reliability Far more efficient than Time Machine or ZFS snapshots-perfect for VM backups and permanent archiving, effortlessly handling TBs and millions of files Optimized for cloud/NAS/USB with ultra-low bandwidth, military-grade encryption, and 1GB/s+ speeds on modern hardware GUI (Win/Linux/Mac) https://sourceforge.net/projects/catpaq Why choose catpaq/zpaqfranz? ✓ Complete: single/multi-file storage architecture ✓ Modern: SHA-2, SHA-3, BLAKE3, XXH3 and more ✓ Paranoid: anti-ransomware data verification with integrity checks ✓ Runs everywhere: TrueNAS, ARM-powered, even ESXi ✓ Lightning-fast: multi-core processing + hardware acceleration ✓ Deduplicated disk imaging ✓ Battle-tested: 15+ years of active development since 2009 https://github.com/fcorbelli/zpaqfranz https://www.francocorbelli.it/zpaqfranz 100% FOSS • forever free
    Downloads: 123 This Week
    Last Update:
    See Project
  • 9

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Downloads: 256 This Week
    Last Update:
    See Project
  • 11
    LSJ-Server

    LSJ-Server

    LSJ-Server-Terra (Java Multipurpose Sever connected with MariaDB)

    ...LSJ-Server (Terra) is a high-performance, lightweight socket server built with Pure Java and MariaDB. It offers a robust backend for IT maintenance and MMORPG/RTS games without heavy framework overhead. [Features] Pure Java & MariaDB: Fast data processing & persistence. Real-time Monitoring: GUI for live logs and user management. Optimized Networking: Low-latency TCP/IP support. [How to Use] Config: Set MariaDB and Port (default: 10815). Control: Use 'Start' and 'Stop' buttons. Manage: Monitor logs/users and use the bottom command field. https://github.com/s1213123/LSJ-Server/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CloudI: A Cloud at the lowest level
    CloudI is an open-source private cloud computing framework for efficient, secure, and internal data processing. CloudI provides scaling for previously unscalable source code with efficient fault-tolerant execution of ATS, C/C++, Erlang/Elixir, Go, Haskell, Java, JavaScript/node.js, OCaml, Perl, PHP, Python, Ruby, or Rust services. The bare essentials for efficient fault-tolerant processing on a cloud!
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    SageMaker Experiments Python SDK

    SageMaker Experiments Python SDK

    Experiment tracking and metric logging for Amazon SageMaker notebooks

    ...Each step in the workflow is described by a Trial Component. There is no relationship between Trial Components such as ordering. Trial Component: A description of a single step in a machine learning workflow. For example data cleaning, feature extraction, model training, model evaluation, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    GXSM

    GXSM

    Scanning Probe Microscopy Controller and Data Visualization Software

    GXSM -- Gnome X Scanning Microscopy: A multi-channel image and vector-probe data acquisition and visualization system designed for SPM techniques (STM,AFM..), but also SPA-LEED/LEED/LEEM data analysis. A plug-in interface allows any user add-on data-processing and special hardware and instrument support. Latest: NC-AFM and related explorative methods as SQDM can be configured. High-Speed external PAC-PLL hardware option with digital DSP link.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16

    dispy

    Distributed and Parallel Computing with/for Python.

    dispy is a generic and comprehensive, yet easy to use framework for creating and using compute clusters to execute computations in parallel across multiple processors in a single machine (SMP), among many machines in a cluster, grid or cloud. dispy is well suited for data parallel (SIMD) paradigm where a computation (Python function or standalone program) is evaluated with different (large) datasets independently. dispy supports public / private / hybrid cloud computing, fog / edge computing.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    wasmboy

    wasmboy

    Game Boy / Game Boy Color Emulator Library

    wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without loosing any data.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    ...It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible setup and Makefile-driven workflow streamline tasks like spinning up services, loading datasets, training models, and generating candidate lists. Because it’s built around Spark’s scalable primitives, Albedo can experiment on substantial snapshots of GitHub metadata rather than toy corpora. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    makedumpfile

    make a small dumpfile of kdump

    The makedumpfile project page moved to https://github.com/makedumpfile/makedumpfile. By using the makedumpfile command, it is possible to shorten the dumpfile of kdump.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 22
    Orange

    Orange

    OpenResty/Nginx Gateway for API monitoring and management

    ...In addition, various variables in the request can be extracted for subsequent processing in two ways.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    mod_psldap

    Apache LDAP Directory Manager

    mod_psldap is an Apache module for leveraging LDAP services built on the OpenLDAP library and the Apache APIs, to include web based A&A, web based updates to the LDAP store, server-side XSLT processing, and session management across servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    IDTE- ID3 Tag Editor

    IDTE- ID3 Tag Editor

    A Smarter Way to Tag FLAC, Vorbis, Lyrics, WMA, WavPack, and ID3 Tags

    ...It also includes a powerful audio player with support for 40+ lossy and lossless formats, including FLAC, ALAC, OGG, APE, MP3, and more. IDTE offers a wide range of advanced features: • Rename files using tag information • Export and manage tag data • Create and manage playlists • Search online for missing or incomplete tags • Fetch lyrics and album artwork • Full CUE sheet support • Batch processing and multi-file editing • SFX, visualizations, and more A new, minimalistic version of IDTE is also available for download: https://sourceforge.net/projects/idte ⚠️ Important Notice Tagging MP4 video files and WAV audio files using IDTE is not recommended. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 25
    confd

    confd

    Manage local application configuration files using templates from etcd

    confd is a lightweight configuration management tool focused on keeping local configuration files up-to-date using data stored in etcd, consul, dynamodb, redis, vault, zookeeper, aws ssm parameter store or env vars and processing template resources. confd is also focused on reloading applications to pick up new config file changes. Go 1.10 is required to build confd, which uses the new vendor directory. You should have a working etcd, or consul server up and running and the ability to add new keys. ...
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB