232 projects for "data processing" with 2 filters applied:

  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    Flux3D.jl

    Flux3D.jl

    3D computer vision library in Julia

    Flux3D.jl is a 3D vision library, written completely in Julia. This package utilizes Flux.jl and Zygote.jl as its building blocks for training 3D vision models and for supporting differentiation. This package also have support of CUDA GPU acceleration with CUDA.jl.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    markdown-wasm

    markdown-wasm

    Very fast Markdown parser and HTML generator implemented

    ...The library exposes a simple API that allows developers to convert Markdown input into HTML output using either strings or binary data, making it flexible for different use cases. Its architecture prioritizes performance and minimalism, avoiding external dependencies and keeping bundle sizes small, which is particularly beneficial for web applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    ...Because it’s crowdsourced, it reflects real issues practitioners have faced in production, not just theoretical cases. Using the list regularly helps harden applications against the fragile edges of text processing and user input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    earthengine-py-notebooks is a comprehensive collection of hundreds of Jupyter Python notebooks that serve as examples and tutorials for using the Google Earth Engine Python API. These notebooks are organized into thematic areas such as image processing, machine learning, visualization, filtering, and asset management, exposing users to real geospatial analysis tasks. The repository makes it easier to explore Earth Engine’s large geospatial data catalog, interactively display map layers, and generate visual insights without the need for external GIS software by leveraging interactive widgets and mapping libraries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 6
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Muze

    Muze

    Composable data visualisation library for web

    ...The library emphasizes composability, meaning multiple layers and interactions can be combined to produce complex, multi-dimensional visualizations similar to those found in advanced analytics tools like Tableau. It includes an in-browser data model that manages transformations, filtering, and interactions, serving as a central source of truth for all visual elements. Muze also automatically generates cross-chart interactions, allowing multiple visualizations to respond to user input in a coordinated way. By leveraging WebAssembly for data processing, it achieves high performance even with large datasets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    PyCBC

    PyCBC

    Learn how to use PyCBC to analyze gravitational-wave data

    PyCBC is a software developed by a collaboration of LIGO, Virgo, and independent scientists. It is open source and freely available. We use PyCBC in the detection of gravitational waves from binary mergers such as GW150914. These examples explore how to analyze gravitational wave data, how we find potential signals and learn about them. Many of these tutorials will require you to make edits to config files as part of their exercises. At the moment this isn't easy to do on services like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Forms

    Forms

    An easy way to create, parse and validate forms in node.js

    ...It integrates with popular Node.js web frameworks, enabling seamless handling of form submissions and data processing. The design emphasizes reusability and maintainability, allowing developers to define forms once and use them across different parts of an application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    ...It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible setup and Makefile-driven workflow streamline tasks like spinning up services, loading datasets, training models, and generating candidate lists. Because it’s built around Spark’s scalable primitives, Albedo can experiment on substantial snapshots of GitHub metadata rather than toy corpora. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    AhoCorasickDoubleArrayTrie

    AhoCorasickDoubleArrayTrie

    An extremely fast implementation of Aho Corasick algorithm

    ...This makes it a strong fit for tasks like content filtering, entity/term spotting, dictionary-based annotation, or high-throughput log/text processing. In short, it’s a specialized, speed-focused library for industrial-strength multi-keyword matching in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DSPatch

    DSPatch

    The Refreshingly Simple C++ Dataflow Framework

    Webite: http://flowbasedprogramming.com DSPatch, pronounced "dispatch", is a powerful C++ dataflow framework. DSPatch is not limited to any particular domain or data type, from reactive programming to stream processing, DSPatch's generic, object-oriented API allows you to create virtually any dataflow system imaginable. *See also:* DSPatcher ( https://github.com/MarcusTomlinson/DSPatcher ): A cross-platform graphical tool for building DSPatch circuits. DSPatchables ( https://github.com/MarcusTomlinson/DSPatchables ): A DSPatch component repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TP-COBOL-DEBUGGER

    TP-COBOL-DEBUGGER

    A COBOL debugger for GnuCOBOL/OpenCOBOL written in GnuCOBOL

    A COBOL debugger for GnuCOBOL written in GnuCOBOL. Works with both current GnuCOBOL and old GnuCOBOL/OpenCOBOL 1.1; could be used for other vendors with slightly modifications, too. Take a look at https://gnucobol.altervista.org/tp-cobol-debugger/
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    JuliaFEM.jl

    JuliaFEM.jl

    The JuliaFEM software library is a framework

    The JuliaFEM software library is a framework that allows for the distributed processing of large Finite Element Models across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. The JuliaFEM software library is a framework that allows for the distributed processing of large Finite Element Models across clusters of computers using simple programming models. It is designed...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Twint

    Twint

    An advanced Twitter scraping & OSINT tool written in Python

    Twint is an advanced open-source Twitter scraping and OSINT tool written in Python that extracts tweets, user data, followers, likes, and more—without relying on Twitter’s API—making it highly useful for researchers, analysts, and hobbyists who want to bypass rate limits and access public Twitter data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Highland

    Highland

    High-level streams library for Node.js and the browser

    Highland is a JavaScript library designed to simplify working with streams and asynchronous data flows by providing a functional programming interface on top of Node.js streams. It allows developers to process data in a declarative and composable way, treating streams as sequences that can be transformed using familiar functional patterns such as map, filter, reduce, and flatMap. The library abstracts away much of the complexity of handling backpressure and asynchronous callbacks, enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Duckling (Old)

    Duckling (Old)

    Clojure library that parses text into structured data

    Duckling (the “old” archived version) is a natural language processing library (in Clojure) for parsing text to structured data — specifically, recognizing quantities such as dates, times, durations, measurements, currencies, etc., from free-form text. To use Duckling in your project, you just need two functions: load! to load the default configuration, and parse to parse a string. Duckling is a Clojure library that parses text into structured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Kalibr Allan

    Kalibr Allan

    IMU Allan standard deviation charts

    kalibr_allan is a utility repository that provides scripts and tools for calculating IMU noise parameters for use in Kalibr and other IMU filtering systems. While manufacturers typically provide “white noise” values in IMU datasheets, the bias instability and random walk parameters must be determined experimentally. This project enables users to compute those values using Allan variance analysis from recorded IMU data. The workflow involves recording IMU measurements with the device...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 183 This Week
    Last Update:
    See Project
  • 21
    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extensible in Java to display data from any data sources (as database). Can generate complete homepages (tree of HTML-s, images, etc.)
    Leader badge
    Downloads: 83 This Week
    Last Update:
    See Project
  • 22
    Tiramisu

    Tiramisu

    Polyhedral compiler for expressing fast and portable data algorithms

    Tiramisu is a compiler for expressing fast and portable data parallel computations. It provides a simple C++ API for expressing algorithms (Tiramisu expressions) and how these algorithms should be optimized by the compiler. Tiramisu can be used in areas such as linear and tensor algebra, deep learning, image processing, stencil computations and machine learning. The Tiramisu compiler is based on the polyhedral model thus it can express a large set of loop optimizations and data layout transformations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    cnn-text-classification-tf

    cnn-text-classification-tf

    Convolutional Neural Network for Text Classification in Tensorflow

    The cnn-text-classification-tf repository by Denny Britz is a well-known educational implementation of convolutional neural networks for text classification using TensorFlow, aimed at helping developers and researchers understand how CNNs can be applied to natural language processing tasks. Based loosely on Kim’s influential paper on CNNs for sentence classification, this codebase demonstrates how to preprocess text data, convert words into learned embeddings, and apply multiple convolution filters to extract n-gram features that are then pooled and fed into a classifier. The project includes scripts for training, evaluation, and data handling, making it easy to run experiments on datasets such as movie reviews or other labeled text collections. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    S-MVP

    S-MVP

    Optimized version of MVP, using annotation generics to simplify code

    ...Complete the writing of repetitive modules, use ASpect+GradlePlugin to complete horizontal AOP programming+Javassist dynamic bytecode injection+Tinker to achieve hot repair+Retrofit to achieve elegant network operations+RxJava to easily play with data processing. In MVP, Presenter completely separates Model and View, and the main program logic is implemented in Presenter. Moreover, the Presenter is not directly related to the specific View, but interacts through the defined interface (we only need to pass parameters according to the interface when testing alone), so that the Presenter can be kept unchanged when changing the View. i.e. reuse! ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Tables

    Tables

    Bulma themed, VueJS powered Datatable with server-side loading

    Data Table package with server-side processing, unlimited exporting and VueJS components. Quickly build any complex table based on a JSON template. This package can work independently of the Enso ecosystem. The front-end assets that utilize this API are present in the tables package.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB