Showing 49 open source projects for "raw"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    File Sorter for Photographers

    File Sorter for Photographers

    Organize files/images from a csv or xlsx file.

    A user-friendly application to efficiently sort all types of files from a source folder into a destination folder based on a list of filenames provided in an Excel or CSV file.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    ClickVisual

    ClickVisual

    Log analytic and data visualize platform built on clickhouse

    ClickVisual is a lightweight browser-based logs analytics and logs search platform for ClickHouse.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    CleanVision

    CleanVision

    Automatically find issues in image datasets

    ...The quality of machine learning models hinges on the quality of the data used to train them, but it is hard to manually identify all of the low-quality data in a big dataset. CleanVision helps you automatically identify common types of data issues lurking in image datasets. This package currently detects issues in the raw images themselves, making it a useful tool for any computer vision task such as: classification, segmentation, object detection, pose estimation, keypoint detection, generative modeling, etc.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    NCDatasets.jl

    NCDatasets.jl

    Load and create NetCDF files in Julia

    NCDatasets allows one to read and create netCDF files. NetCDF data set and attribute list behave like Julia dictionaries and variables like Julia arrays. This package implements the CommonDataModel.jl interface, which means that the datasets can be accessed in the same way as GRIB files opened with GRIBDatasets.jl.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Mercury

    Mercury

    Convert Python notebook to web app and share with non-technical users

    Turn Python notebooks to web applications with open-source Mercury framework. Hide code and add interactive widgets. Non-technical users can tweak widgets and execute notebook with new parameters. The core of Mercury is Open Source under AGPLv3. We provide Mercury Pro with additional features, dedicated support and friendly commercial license. Mercury is a perfect tool to convert Python notebook to interactive web application and share with non-programmers. You define interactive widgets for...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Foxglove Studio

    Foxglove Studio

    Robotics visualization and debugging

    ...Experience the world as your robot does. Visualize images and point clouds, overlay bounding boxes, add classification labels and planned movements, and drill down into your data with plots or raw message views. Upload recordings to your private data lake for easy storage, searching, and analysis. Stream recorded data directly into Foxglove Studio to get insights into your robots' behavior. We're long-time fans and beneficiaries of open source software. Join our community on Github and Slack to contribute bug reports, feature requests, or pull requests.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    visual-explainer

    visual-explainer

    Agent skill + prompt templates that generate rich HTML pages

    ...The project includes prompt templates and automation logic that enable coding agents to generate visual summaries such as diff reviews, architecture overviews, plan audits, and structured data tables. Its primary goal is to bridge the readability gap between raw machine output and stakeholder-friendly documentation. By producing styled web pages instead of plain text logs, visual-explainer improves communication in engineering and AI workflows where clarity is critical. The tool is particularly useful in environments that rely on autonomous agents or CI pipelines that generate dense technical output.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    ...Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 9
    CxxWrap

    CxxWrap

    Package to make C++ libraries available in Julia

    ...This dynamic library is then loaded into Julia, where the Julia part of this package uses the data provided through a C interface to generate functions accessible from Julia. The functions are passed to Julia either as raw function pointers (for regular C++ functions that don't need argument or return type conversion) or std::functions (for lambda expressions and automatic conversion of arguments and return types). The Julia side of this package wraps all this into Julia methods automatically.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    F1 Race Replay

    F1 Race Replay

    An interactive Formula 1 race visualisation and data analysis tool

    ...Users can scrub through time, jump between cars, and overlay performance graphs such as speed, sector times, and gap differentials to evaluate performance trends across laps. This deep dive capability turns passive viewing into active exploration, empowering enthusiasts and professionals to discover insights usually hidden in raw data. The viewer also supports annotations and bookmark capabilities so users can mark moments of interest for future review or comparison.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    ggstatsplot

    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in its turn can suggest a different visualization method, and so on and so forth. Bayesian hypothesis-testing. The central idea of {ggstatsplot} is simple: combine these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MemProcFS Analyzer

    MemProcFS Analyzer

    Automated Forensic Analysis of Windows Memory Dumps for DFIR

    ...The Analyzer layer adds higher-level parsing and extraction routines—for example, carving strings, locating injected modules, enumerating handles, or reconstructing network sockets—so investigators can go from raw memory to actionable evidence more quickly. It emphasizes automation and reproducibility: parsers can be chained, results exported, and reports templated to fit incident workflows. Because memory contains transient but critical traces of running malware or misuse, the project focuses on robust parsing in the face of corruption and mismatched OS versions.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    AutoGluon

    AutoGluon

    AutoGluon: AutoML for Image, Text, and Tabular Data

    AutoGluon enables easy-to-use and easy-to-extend AutoML with a focus on automated stack ensembling, deep learning, and real-world applications spanning image, text, and tabular data. Intended for both ML beginners and experts, AutoGluon enables you to quickly prototype deep learning and classical ML solutions for your raw data with a few lines of code. Automatically utilize state-of-the-art techniques (where appropriate) without expert knowledge. Leverage automatic hyperparameter tuning, model selection/ensembling, architecture search, and data processing. Easily improve/tune your bespoke models and data pipelines, or customize AutoGluon for your use-case. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Fondant

    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components. It’s built for use in research and production, empowering data scientists to streamline dataset curation and preprocessing workflows efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Timesketch

    Timesketch

    Collaborative forensic timeline analysis

    ...Powerful search, aggregations, and saved views help you pivot quickly, highlight anomalies, and preserve investigative steps for later review. The system supports tagging, sketch notes, and story building so teams can annotate findings and share context without losing the raw data trail. Integrations with popular DFIR pipelines make ingestion repeatable, while role-based access and audit logs support enterprise workflows. By combining scale, collaboration, and reproducibility, Timesketch moves incident response beyond ad-hoc spreadsheets to a durable, team-oriented investigation record.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    ...On the other hand, using multiple notebooks makes it harder to compare the results and to keep an overview. On top of that, refactoring the code for every test can be quite time-consuming. How many times have you conducted the same action to pre-process a raw dataset? How many times have you copy-and-pasted code from an old repository to re-use it in a new use case? ATOM is here to help solve these common issues. The package acts as a wrapper of the whole machine learning pipeline, helping the data scientist to rapidly find a good model for his problem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Ridgepole

    Ridgepole

    Ridgepole is a tool to manage DB schema. It defines DB schema

    Ridgepole is a database schema management tool that treats your schema as code by expressing it in a Ruby DSL and applying diffs to keep databases in sync. You describe the desired state in a “Schemafile” (tables, columns, indexes, constraints), and Ridgepole compares it with the live database to generate only the necessary changes. This diff-and-apply approach makes schema changes repeatable and reviewable, avoiding hand-written migrations for routine structural edits. It supports multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    mzMatch is a Java collection of small commandline tools specific for metabolomics MS data analysis. The tools are built on top of the PeakML core library, providing mass spectrometry specific functionality and access to the PeakML file format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Orbit

    Orbit

    ORBIT : Operating Business Intelligence Tool

    ORBIT : Operating Business Intelligence Tool Making Data Accessible Through Centralized Database Access ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining flexibility for advanced queries.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 20
    Feathr

    Feathr

    A scalable, unified data and AI engineering platform for enterprise

    Feathr is a data and AI engineering platform that is widely used in production at LinkedIn for many years and was open sourced in 2022. It is currently a project under LF AI & Data Foundation. Define data and feature transformations based on raw data sources (batch and streaming) using Pythonic APIs. Register transformations by names and get transformed data(features) for various use cases including AI modeling, compliance, go-to-market and more. Share transformations and data(features) across team and company. Feathr is particularly useful in AI modeling where it automatically computes your feature transformations and joins them to your training data, using point-in-time-correct semantics to avoid data leakage, and supports materializing and deploying your features for use online in production.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    HiPlot

    HiPlot

    HiPlot makes understanding high dimensional data easy

    ...Because it renders as self-contained HTML, you can embed the visualization in notebooks, export it, or serve it as a lightweight web app for teammates. HiPlot also offers summary statistics, correlation hints, and outlier highlighting to surface patterns that aren’t obvious from raw tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    HP93kShmooExcel

    HP93kShmooExcel

    Excel macro to display HP 93000 Shmoo

    HP93000 Shmoo Excel Tools read the shmoo files in hp93000 shmoo text-file format, and show them or overlay them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Luminance HDR

    Luminance HDR

    Complete solution for HDR photography

    ...Its graphical user interface, based on Qt5, runs on a variety of platforms, such as Microsoft Windows, Mac OS X 10.9 and later and several Unix flavors (Linux, FreeBSD and others). Input images can be supplied in multiple formats, from JPEG to RAW files. In the same way, output can be saved in many different formats as well, from JPEG to TIFF (both 8 bit and 16 bit per channel), enabling all the power of your post processing tools.
    Leader badge
    Downloads: 328 This Week
    Last Update:
    See Project
  • 24
    Graph Capture

    Graph Capture

    Fast 2D charting from raw data

    ...From text files or programs, that haven't visual charts themselves, data is passed to "Graph Capture" and plotted there. But you have to write Perl RegEx expressions to extract X and Y-values from raw data. Otherwise, you can request suitable filter templates here https://www.facebook.com/vlad.dubov.77. For making RegEx expressions short docs are also included in the program. You can find "Help: regular expressions in Perl" in the installation folder or in the program's top bar. The program can plot charts from a static file ( just drag and drop the file) or from a dynamically written file - by using a plugin that "connects" to the source. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DataKit

    DataKit

    Connect processes into powerful data pipelines

    ...DataKit is a tool to orchestrate applications using a Git-like dataflow. It revisits the UNIX pipeline concept, with a modern twist: streams of tree-structured data instead of raw text. DataKit allows you to define complex build pipelines over version-controlled data. DataKit is currently used as the coordination layer for HyperKit, the hypervisor component of Docker for Mac and Windows, and for the DataKitCI continuous integration system. src contains the main DataKit service. This is a Git-like database to which other services can connect. ci contains DataKitCI, a continuous integration system that uses DataKit to monitor repositories and store build results. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB