Showing 47 open source projects for "raw"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    CleanVision

    CleanVision

    Automatically find issues in image datasets

    ...The quality of machine learning models hinges on the quality of the data used to train them, but it is hard to manually identify all of the low-quality data in a big dataset. CleanVision helps you automatically identify common types of data issues lurking in image datasets. This package currently detects issues in the raw images themselves, making it a useful tool for any computer vision task such as: classification, segmentation, object detection, pose estimation, keypoint detection, generative modeling, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ClickVisual

    ClickVisual

    Log analytic and data visualize platform built on clickhouse

    ClickVisual is a lightweight browser-based logs analytics and logs search platform for ClickHouse.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OpenBB Terminal

    OpenBB Terminal

    Investment research for everyone, anywhere

    ...The terminal allows for users to import their own proprietary datasets to use on our econometric menu. In addition, users are allowed to export any type of data to any type of format whether that is raw data in Excel or an image in PNG. This is ideal for finance content creation. Create notebook templates (through papermill) which can be run on different tickers. This level of automation allows to speed up the development of your investment thesis and reduce human error.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    ggstatsplot

    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in its turn can suggest a different visualization method, and so on and so forth. Bayesian hypothesis-testing. The central idea of {ggstatsplot} is simple: combine these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    F1 Race Replay

    F1 Race Replay

    An interactive Formula 1 race visualisation and data analysis tool

    ...Users can scrub through time, jump between cars, and overlay performance graphs such as speed, sector times, and gap differentials to evaluate performance trends across laps. This deep dive capability turns passive viewing into active exploration, empowering enthusiasts and professionals to discover insights usually hidden in raw data. The viewer also supports annotations and bookmark capabilities so users can mark moments of interest for future review or comparison.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    NCDatasets.jl

    NCDatasets.jl

    Load and create NetCDF files in Julia

    NCDatasets allows one to read and create netCDF files. NetCDF data set and attribute list behave like Julia dictionaries and variables like Julia arrays. This package implements the CommonDataModel.jl interface, which means that the datasets can be accessed in the same way as GRIB files opened with GRIBDatasets.jl.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    ...Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    wger

    wger

    Self hosted FLOSS fitness/workout, nutrition and weight tracker

    ...The application will calculate the nutritional values ​​(total energy, proteins, carbohydrates, etc.) of the entire plan and of each of the meals. Enter the weights and reps you've done for each exercise to generate diagrams that let you see at a glance how well you're doing. Of course, the raw numbers are still accessible.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    visual-explainer

    visual-explainer

    Agent skill + prompt templates that generate rich HTML pages

    ...The project includes prompt templates and automation logic that enable coding agents to generate visual summaries such as diff reviews, architecture overviews, plan audits, and structured data tables. Its primary goal is to bridge the readability gap between raw machine output and stakeholder-friendly documentation. By producing styled web pages instead of plain text logs, visual-explainer improves communication in engineering and AI workflows where clarity is critical. The tool is particularly useful in environments that rely on autonomous agents or CI pipelines that generate dense technical output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Fondant

    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components. It’s built for use in research and production, empowering data scientists to streamline dataset curation and preprocessing workflows efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Timesketch

    Timesketch

    Collaborative forensic timeline analysis

    ...Powerful search, aggregations, and saved views help you pivot quickly, highlight anomalies, and preserve investigative steps for later review. The system supports tagging, sketch notes, and story building so teams can annotate findings and share context without losing the raw data trail. Integrations with popular DFIR pipelines make ingestion repeatable, while role-based access and audit logs support enterprise workflows. By combining scale, collaboration, and reproducibility, Timesketch moves incident response beyond ad-hoc spreadsheets to a durable, team-oriented investigation record.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Tally

    Tally

    Let agents classify your bank transactions

    Tally is an open-source, AI-assisted tool designed to automate the classification of personal financial transactions, helping users turn raw bank data into meaningful categories without manual tagging. At its core, Tally pairs a local rule engine with large language models so that an AI assistant (like Claude Code, Copilot, or any CLI agent) interprets, suggests, and categorizes expenses, savings, subscriptions, and income events based on your own rules and behavior. It generates human-readable reports and can produce HTML, JSON, or Markdown outputs to suit dashboards or personal finance workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    CxxWrap

    CxxWrap

    Package to make C++ libraries available in Julia

    ...This dynamic library is then loaded into Julia, where the Julia part of this package uses the data provided through a C interface to generate functions accessible from Julia. The functions are passed to Julia either as raw function pointers (for regular C++ functions that don't need argument or return type conversion) or std::functions (for lambda expressions and automatic conversion of arguments and return types). The Julia side of this package wraps all this into Julia methods automatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    Automated Tool for Optimized Modelling

    ...On the other hand, using multiple notebooks makes it harder to compare the results and to keep an overview. On top of that, refactoring the code for every test can be quite time-consuming. How many times have you conducted the same action to pre-process a raw dataset? How many times have you copy-and-pasted code from an old repository to re-use it in a new use case? ATOM is here to help solve these common issues. The package acts as a wrapper of the whole machine learning pipeline, helping the data scientist to rapidly find a good model for his problem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Foxglove Studio

    Foxglove Studio

    Robotics visualization and debugging

    ...Experience the world as your robot does. Visualize images and point clouds, overlay bounding boxes, add classification labels and planned movements, and drill down into your data with plots or raw message views. Upload recordings to your private data lake for easy storage, searching, and analysis. Stream recorded data directly into Foxglove Studio to get insights into your robots' behavior. We're long-time fans and beneficiaries of open source software. Join our community on Github and Slack to contribute bug reports, feature requests, or pull requests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Whenever

    Whenever

    Cron jobs in Ruby

    Whenever (javan/whenever) is a Ruby gem that provides a clean, Ruby DSL for defining and managing cron jobs within your application. Rather than editing system cron tables by hand, you describe scheduled tasks in a schedule.rb file using methods like every, runner, command, or rake. The gem then compiles this schedule into a standard crontab format and installs it for you, handling all the necessary quoting, environment setup, and command path resolution. It supports specifying execution...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AutoGluon

    AutoGluon

    AutoGluon: AutoML for Image, Text, and Tabular Data

    AutoGluon enables easy-to-use and easy-to-extend AutoML with a focus on automated stack ensembling, deep learning, and real-world applications spanning image, text, and tabular data. Intended for both ML beginners and experts, AutoGluon enables you to quickly prototype deep learning and classical ML solutions for your raw data with a few lines of code. Automatically utilize state-of-the-art techniques (where appropriate) without expert knowledge. Leverage automatic hyperparameter tuning, model selection/ensembling, architecture search, and data processing. Easily improve/tune your bespoke models and data pipelines, or customize AutoGluon for your use-case. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Ridgepole

    Ridgepole

    Ridgepole is a tool to manage DB schema. It defines DB schema

    Ridgepole is a database schema management tool that treats your schema as code by expressing it in a Ruby DSL and applying diffs to keep databases in sync. You describe the desired state in a “Schemafile” (tables, columns, indexes, constraints), and Ridgepole compares it with the live database to generate only the necessary changes. This diff-and-apply approach makes schema changes repeatable and reviewable, avoiding hand-written migrations for routine structural edits. It supports multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    mzMatch is a Java collection of small commandline tools specific for metabolomics MS data analysis. The tools are built on top of the PeakML core library, providing mass spectrometry specific functionality and access to the PeakML file format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    beCPG PLM (Product Lifecycle Management)

    beCPG PLM (Product Lifecycle Management)

    New Product Development, Projects and Quality management

    ...We differentiate ourselves from the competitors with a comprehensive and user-friendly software offered at a competitive price. In a few words, beCPG offers the following features: - Product repository to manage finished products, recipes, raw materials and packaging with their technical and regulatory data - Formulation to calculate automatically allergens, ingredients, nutrient facts, costs, labeling, ... - Product specification generator for clients, R&D and production - Project management to manage new product development from ideas until market launch - Customer complaints
    Downloads: 13 This Week
    Last Update:
    See Project
  • 21
    Orbit

    Orbit

    ORBIT : Operating Business Intelligence Tool

    ORBIT : Operating Business Intelligence Tool Making Data Accessible Through Centralized Database Access ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining flexibility for advanced queries.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Open Crypto Tracker

    Open Crypto Tracker

    Bitcoin Alts portfolio tracker, email / text / alexa / telegram alerts

    100% FREE / open source / PRIVATE cryptocurrency portfolio tracker. Email / text / alexa / telegram price alerts, price charts, mining calcs, leverage / gain / loss / balance stats, news feeds +more. Privately track Bitcoin / Ethereum / unlimited cryptocurrencies. Customize as many assets / markets / alerts / charts as you want. Over 50 Exchanges / 40 Trading Pairs Supported (exchanges / pairings list at bottom of README.txt): https://tinyurl.com/ct-readme Nearly Unlimited Assets...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    coronavirus

    coronavirus

    The coronavirus dataset

    ...Relevant and updated information about the virus, such as summary of new cases by country and total number of cases by region can be retrieved from this package. The raw data is pulled and arranged by the Johns Hopkins University Center for Systems Science and Engineering, which is gathered from various leading sources including the World Health Organization, China CDC, US CDC, European Centre for Disease Prevention and Control, and Australia Government Department of Health.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Feathr

    Feathr

    A scalable, unified data and AI engineering platform for enterprise

    Feathr is a data and AI engineering platform that is widely used in production at LinkedIn for many years and was open sourced in 2022. It is currently a project under LF AI & Data Foundation. Define data and feature transformations based on raw data sources (batch and streaming) using Pythonic APIs. Register transformations by names and get transformed data(features) for various use cases including AI modeling, compliance, go-to-market and more. Share transformations and data(features) across team and company. Feathr is particularly useful in AI modeling where it automatically computes your feature transformations and joins them to your training data, using point-in-time-correct semantics to avoid data leakage, and supports materializing and deploying your features for use online in production.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Bots-ediint open source edi translator

    Bots-ediint open source edi translator

    open source edi translator

    Bots-ediint is a file translator for EDI (Electronic Data Interchange). Translate files and databases data format to another one: edifact, x12, tradacoms, xml, json, raw, database (postgres, sqlite, mysql, oracle, ...) and more. Bots-ediint is based on Bots https://bots.sourceforge.net Support Python 2 and Python 3 AS2 communications are carried out with my external software that could work in standalone and/or in a Bots channel to send and receive AS2 messages (SYNC and ASYNC) Help the future to be an open world.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB