Showing 67 open source projects for "raw"

View related business solutions
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    CleanVision

    CleanVision

    Automatically find issues in image datasets

    ...The quality of machine learning models hinges on the quality of the data used to train them, but it is hard to manually identify all of the low-quality data in a big dataset. CleanVision helps you automatically identify common types of data issues lurking in image datasets. This package currently detects issues in the raw images themselves, making it a useful tool for any computer vision task such as: classification, segmentation, object detection, pose estimation, keypoint detection, generative modeling, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ClickVisual

    ClickVisual

    Log analytic and data visualize platform built on clickhouse

    ClickVisual is a lightweight browser-based logs analytics and logs search platform for ClickHouse.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Mercury

    Mercury

    Convert Python notebook to web app and share with non-technical users

    Turn Python notebooks to web applications with open-source Mercury framework. Hide code and add interactive widgets. Non-technical users can tweak widgets and execute notebook with new parameters. The core of Mercury is Open Source under AGPLv3. We provide Mercury Pro with additional features, dedicated support and friendly commercial license. Mercury is a perfect tool to convert Python notebook to interactive web application and share with non-programmers. You define interactive widgets for...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    MemProcFS Analyzer

    MemProcFS Analyzer

    Automated Forensic Analysis of Windows Memory Dumps for DFIR

    ...The Analyzer layer adds higher-level parsing and extraction routines—for example, carving strings, locating injected modules, enumerating handles, or reconstructing network sockets—so investigators can go from raw memory to actionable evidence more quickly. It emphasizes automation and reproducibility: parsers can be chained, results exported, and reports templated to fit incident workflows. Because memory contains transient but critical traces of running malware or misuse, the project focuses on robust parsing in the face of corruption and mismatched OS versions.
    Downloads: 18 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    OpenBB Terminal

    OpenBB Terminal

    Investment research for everyone, anywhere

    ...The terminal allows for users to import their own proprietary datasets to use on our econometric menu. In addition, users are allowed to export any type of data to any type of format whether that is raw data in Excel or an image in PNG. This is ideal for finance content creation. Create notebook templates (through papermill) which can be run on different tickers. This level of automation allows to speed up the development of your investment thesis and reduce human error.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    ggstatsplot

    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in its turn can suggest a different visualization method, and so on and so forth. Bayesian hypothesis-testing. The central idea of {ggstatsplot} is simple: combine these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    File Sorter for Photographers

    File Sorter for Photographers

    Organize files/images from a csv or xlsx file.

    A user-friendly application to efficiently sort all types of files from a source folder into a destination folder based on a list of filenames provided in an Excel or CSV file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    visual-explainer

    visual-explainer

    Agent skill + prompt templates that generate rich HTML pages

    ...The project includes prompt templates and automation logic that enable coding agents to generate visual summaries such as diff reviews, architecture overviews, plan audits, and structured data tables. Its primary goal is to bridge the readability gap between raw machine output and stakeholder-friendly documentation. By producing styled web pages instead of plain text logs, visual-explainer improves communication in engineering and AI workflows where clarity is critical. The tool is particularly useful in environments that rely on autonomous agents or CI pipelines that generate dense technical output.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    F1 Race Replay

    F1 Race Replay

    An interactive Formula 1 race visualisation and data analysis tool

    ...Users can scrub through time, jump between cars, and overlay performance graphs such as speed, sector times, and gap differentials to evaluate performance trends across laps. This deep dive capability turns passive viewing into active exploration, empowering enthusiasts and professionals to discover insights usually hidden in raw data. The viewer also supports annotations and bookmark capabilities so users can mark moments of interest for future review or comparison.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    NCDatasets.jl

    NCDatasets.jl

    Load and create NetCDF files in Julia

    NCDatasets allows one to read and create netCDF files. NetCDF data set and attribute list behave like Julia dictionaries and variables like Julia arrays. This package implements the CommonDataModel.jl interface, which means that the datasets can be accessed in the same way as GRIB files opened with GRIBDatasets.jl.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    ...Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    wger

    wger

    Self hosted FLOSS fitness/workout, nutrition and weight tracker

    ...The application will calculate the nutritional values ​​(total energy, proteins, carbohydrates, etc.) of the entire plan and of each of the meals. Enter the weights and reps you've done for each exercise to generate diagrams that let you see at a glance how well you're doing. Of course, the raw numbers are still accessible.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Fondant

    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components. It’s built for use in research and production, empowering data scientists to streamline dataset curation and preprocessing workflows efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Tally

    Tally

    Let agents classify your bank transactions

    Tally is an open-source, AI-assisted tool designed to automate the classification of personal financial transactions, helping users turn raw bank data into meaningful categories without manual tagging. At its core, Tally pairs a local rule engine with large language models so that an AI assistant (like Claude Code, Copilot, or any CLI agent) interprets, suggests, and categorizes expenses, savings, subscriptions, and income events based on your own rules and behavior. It generates human-readable reports and can produce HTML, JSON, or Markdown outputs to suit dashboards or personal finance workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Timesketch

    Timesketch

    Collaborative forensic timeline analysis

    ...Powerful search, aggregations, and saved views help you pivot quickly, highlight anomalies, and preserve investigative steps for later review. The system supports tagging, sketch notes, and story building so teams can annotate findings and share context without losing the raw data trail. Integrations with popular DFIR pipelines make ingestion repeatable, while role-based access and audit logs support enterprise workflows. By combining scale, collaboration, and reproducibility, Timesketch moves incident response beyond ad-hoc spreadsheets to a durable, team-oriented investigation record.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CxxWrap

    CxxWrap

    Package to make C++ libraries available in Julia

    ...This dynamic library is then loaded into Julia, where the Julia part of this package uses the data provided through a C interface to generate functions accessible from Julia. The functions are passed to Julia either as raw function pointers (for regular C++ functions that don't need argument or return type conversion) or std::functions (for lambda expressions and automatic conversion of arguments and return types). The Julia side of this package wraps all this into Julia methods automatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    ...On the other hand, using multiple notebooks makes it harder to compare the results and to keep an overview. On top of that, refactoring the code for every test can be quite time-consuming. How many times have you conducted the same action to pre-process a raw dataset? How many times have you copy-and-pasted code from an old repository to re-use it in a new use case? ATOM is here to help solve these common issues. The package acts as a wrapper of the whole machine learning pipeline, helping the data scientist to rapidly find a good model for his problem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Foxglove Studio

    Foxglove Studio

    Robotics visualization and debugging

    ...Experience the world as your robot does. Visualize images and point clouds, overlay bounding boxes, add classification labels and planned movements, and drill down into your data with plots or raw message views. Upload recordings to your private data lake for easy storage, searching, and analysis. Stream recorded data directly into Foxglove Studio to get insights into your robots' behavior. We're long-time fans and beneficiaries of open source software. Join our community on Github and Slack to contribute bug reports, feature requests, or pull requests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Whenever

    Whenever

    Cron jobs in Ruby

    Whenever (javan/whenever) is a Ruby gem that provides a clean, Ruby DSL for defining and managing cron jobs within your application. Rather than editing system cron tables by hand, you describe scheduled tasks in a schedule.rb file using methods like every, runner, command, or rake. The gem then compiles this schedule into a standard crontab format and installs it for you, handling all the necessary quoting, environment setup, and command path resolution. It supports specifying execution...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    AutoGluon

    AutoGluon

    AutoGluon: AutoML for Image, Text, and Tabular Data

    AutoGluon enables easy-to-use and easy-to-extend AutoML with a focus on automated stack ensembling, deep learning, and real-world applications spanning image, text, and tabular data. Intended for both ML beginners and experts, AutoGluon enables you to quickly prototype deep learning and classical ML solutions for your raw data with a few lines of code. Automatically utilize state-of-the-art techniques (where appropriate) without expert knowledge. Leverage automatic hyperparameter tuning, model selection/ensembling, architecture search, and data processing. Easily improve/tune your bespoke models and data pipelines, or customize AutoGluon for your use-case. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Ridgepole

    Ridgepole

    Ridgepole is a tool to manage DB schema. It defines DB schema

    Ridgepole is a database schema management tool that treats your schema as code by expressing it in a Ruby DSL and applying diffs to keep databases in sync. You describe the desired state in a “Schemafile” (tables, columns, indexes, constraints), and Ridgepole compares it with the live database to generate only the necessary changes. This diff-and-apply approach makes schema changes repeatable and reviewable, avoiding hand-written migrations for routine structural edits. It supports multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    OHLC Gen

    A software that generates synthetic data on various commodities

    ...You can generate data for various financial instruments such as cryptocurrencies, stocks, forex and precious metals in 1 minute, 15 minutes, 1 hour and 1 day periods. The software offers three data options: “Raw Data” (raw OHLC data), “Normalized Data” (normalized structure) and “Synthetic Data” (synthetic data suitable for real price characteristics). You can examine the created CSV files, access error and guide information from the log section, and also select the desired instruments in Yahoo Finance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    mzMatch is a Java collection of small commandline tools specific for metabolomics MS data analysis. The tools are built on top of the PeakML core library, providing mass spectrometry specific functionality and access to the PeakML file format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SignServer
    The SignServer is an application for server side signatures called by other systems. It is flexible and can be customized to specific needs. The SignServer have a ready to use TimeStamp server and signers for PDF, XML, ODF, PGP, OOXML and MRTD (ePassport DS).
    Downloads: 31 This Week
    Last Update:
    See Project
  • 25
    ResCarta

    ResCarta

    Archive your personal history

    ResCarta Toolkit offers an open source solution to creating, storing, viewing, and searching digital collections. Applications in the toolkit let users create and edit metadata, convert data to open standard ResCarta format, index and host collections.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB