Showing 108 open source projects for "data processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    ...DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    ResIL

    ResIL

    Resilient Image Library - successor to DevIL

    ResIL started as a fork of DevIL and is actively maintained to use up-to-date libraries. The API has been updated to work with several threads. Countless fixes have been applied, much of the code base has actually been rewritten to enhance stability. Support for webp has been added. ResIL focuses on robust decoding so that even partially corrupted images can be displayed without crashes or exceptions.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    QScript

    QScript

    Expression and algorithm evaluator (built with Java)

    V2.1.1 compatible with Processing 2 & 3. A powerful yet simple to use scripting language that can be used to evaluate expressions and simple algorithms. In fact it is the only evaluator I know that supports complex numbers and vectors without the need for additional configuration. The emphasis is on simplicity and extensibility and the website provides detailed guides to help you get the most of QScript. It supports variables, selection (IF-THEN-ELSE) and repetition (WHILE-WEND and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    SimpleXlsxWriter

    C++ library for creating XLSX files for MS Excel 2007 and above.

    ...The main feature of this library is that it uses C++ standard file streams. On the one hand it results in almost unnoticeable memory and CPU resources consumption while processing (that may be very useful at saving a large data arrays), but on the other hand it makes unfeasible to edit data that were written. Hence, if using this library the structure of the future report should be known enough. The library is written in C++ with using STL functionality and based on the ZIP library (included), which has a free license: http://www.codeproject.com/Articles/7530/Zip-Utils-clean-elegant-simple-C-Win32 This library is distributed under the terms of the zlib license: http://www.zlib.net/zlib_license.html
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Neural Network Visualization

    Neural Network Visualization

    Project for processing neural networks and rendering to gain insights

    nn_vis is a minimalist visualization tool for neural networks written in Python using OpenGL and Pygame. It provides an interactive, graphical representation of how data flows through neural network layers, offering a unique educational experience for those new to deep learning or looking to explain it visually. By animating input, weights, activations, and outputs, the tool demystifies neural network operations and helps users intuitively grasp complex concepts. Its lightweight codebase is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    go-coffeeshop

    go-coffeeshop

    A practical event-driven microservices demo

    ...It also includes examples of integrating external systems and handling real-world scenarios such as order processing and inventory management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    QOR

    QOR

    Libraries written in Go abstracting features needed for business apps

    QOR is architected from the ground up to accelerate development and deployment of Content Management Systems, E-commerce Systems, and Business Applications. QOR is comprised of modules that abstract common features for such systems, such as a configurable and flexible back office, a content publishing system, a media library, and much more. We built QOR at The Plant to accelerate development for our clients, it is at the core of our engineering strategy, and we maintain it on an ongoing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Lingua

    Lingua

    The most accurate natural language detection library for Java

    Its task is simple: It tells you which language some provided textual data is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Augmentor.jl

    Augmentor.jl

    A fast image augmentation library in Julia for machine learning

    A fast library for increasing the number of training images by applying various transformations. Augmentor is a real-time image augmentation library designed to render the process of artificial dataset enlargement more convenient, less error prone, and easier to reproduce. It offers the user the ability to build a stochastic image-processing pipeline (or simply augmentation pipeline) using image operations as building blocks. In other words, an augmentation pipeline is little more but a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Whisper Library

    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin-database). It provides fast, reliable storage of numeric data over time. Whisper allows for higher resolution (seconds per point) of recent data to degrade into lower resolutions for long-term retention of historical data. Copies data from src in dst, if missing. Unlike whisper-merge, don't overwrite data that's already present in the target...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    CoolplaySpark

    CoolplaySpark

    Spark Cool Play: Spark source code analysis, Spark class library, etc.

    ...The project contains annotated examples, explanations, and exercises that guide learners through Spark’s architecture, execution model, and source code internals. It is particularly valuable for developers who want to strengthen their understanding of Spark by not only using it as a data processing engine but also exploring how its internals function. Through code analysis and commentary, CoolplaySpark helps readers connect theoretical concepts with practical implementation details. By combining book study with this repository, learners can develop both conceptual clarity and hands-on expertise in Spark’s core components.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Flux3D.jl

    Flux3D.jl

    3D computer vision library in Julia

    Flux3D.jl is a 3D vision library, written completely in Julia. This package utilizes Flux.jl and Zygote.jl as its building blocks for training 3D vision models and for supporting differentiation. This package also have support of CUDA GPU acceleration with CUDA.jl.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Graph4NLP

    Graph4NLP

    Graph4nlp is the library for the easy use of Graph Neural Networks

    Graph4NLP is an easy-to-use library for R&D at the intersection of Deep Learning on Graphs and Natural Language Processing (i.e., DLG4NLP). It provides both full implementations of state-of-the-art models for data scientists and also flexible interfaces to build customized models for researchers and developers with whole-pipeline support. Built upon highly-optimized runtime libraries including DGL , Graph4NLP has both high running efficiency and great extensibility.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    libpostal

    libpostal

    A C library for parsing/normalizing street addresses around the world

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. libpostal is a C library for parsing/normalizing street addresses around the world using statistical NLP and open data. The goal of this project is to understand location-based strings in every language, everywhere. Addresses and the locations they represent are essential for any application dealing with maps (place search, transportation, on-demand/delivery services,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    ...Because it’s crowdsourced, it reflects real issues practitioners have faced in production, not just theoretical cases. Using the list regularly helps harden applications against the fragile edges of text processing and user input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    earthengine-py-notebooks is a comprehensive collection of hundreds of Jupyter Python notebooks that serve as examples and tutorials for using the Google Earth Engine Python API. These notebooks are organized into thematic areas such as image processing, machine learning, visualization, filtering, and asset management, exposing users to real geospatial analysis tasks. The repository makes it easier to explore Earth Engine’s large geospatial data catalog, interactively display map layers, and generate visual insights without the need for external GIS software by leveraging interactive widgets and mapping libraries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NLP Architect

    NLP Architect

    A model library for exploring state-of-the-art deep learning

    ...The library contains NLP/NLU-related models per task, different neural network topologies (which are used in models), procedures for simplifying workflows in the library, pre-defined data processors and dataset loaders and misc utilities. The library is designed to be a tool for model development: data pre-processing, build model, train, validate, infer, save or load a model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 19
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP), the goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    PyCBC

    PyCBC

    Learn how to use PyCBC to analyze gravitational-wave data

    PyCBC is a software developed by a collaboration of LIGO, Virgo, and independent scientists. It is open source and freely available. We use PyCBC in the detection of gravitational waves from binary mergers such as GW150914. These examples explore how to analyze gravitational wave data, how we find potential signals and learn about them. Many of these tutorials will require you to make edits to config files as part of their exercises. At the moment this isn't easy to do on services like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Euler

    Euler

    A distributed graph deep learning framework.

    Data in the fields of text, speech, and images is easier to process into a grid-like type of Euclidean space, which is suitable for processing by existing deep learning models. Graph is a data type in non-Euclidean space and cannot be directly applied to existing methods, requiring a specially designed graph neural network system. Graph-based learning methods such as graph neural networks combine end-to-end learning with inductive reasoning, and are expected to solve a series of problems such as relational reasoning and interpretability that deep learning cannot handle.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Highland

    Highland

    High-level streams library for Node.js and the browser

    Highland is a JavaScript library designed to simplify working with streams and asynchronous data flows by providing a functional programming interface on top of Node.js streams. It allows developers to process data in a declarative and composable way, treating streams as sequences that can be transformed using familiar functional patterns such as map, filter, reduce, and flatMap. The library abstracts away much of the complexity of handling backpressure and asynchronous callbacks, enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Duckling (Old)

    Duckling (Old)

    Clojure library that parses text into structured data

    Duckling (the “old” archived version) is a natural language processing library (in Clojure) for parsing text to structured data — specifically, recognizing quantities such as dates, times, durations, measurements, currencies, etc., from free-form text. To use Duckling in your project, you just need two functions: load! to load the default configuration, and parse to parse a string. Duckling is a Clojure library that parses text into structured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    JSVerbalExpressions

    JSVerbalExpressions

    JavaScript Regular expressions made easy

    VerbalExpressions is a JavaScript library that helps to construct difficult regular expressions. Methods have a return type of VerbalExpression except where mentioned otherwise. If there is no mention of a method’s parameters, it is to be assumed that it has none. VerEx is the function that is exported from within VerbalExpressions.js and is to be the first method in chains that describe verbal expressions. Construct and return a new instance of VerbalExpression. Used to support the new...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 182 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB