Showing 69 open source projects for "raw data viewer"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    Dash Data Agent

    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of content

    ...The system then executes those queries against a database and interprets the results, returning human-friendly insights not just raw rows, while learning from errors and successes to reduce repeated mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenBB

    OpenBB

    Investment Research for Everyone, Everywhere

    ...Create charts directly from raw data in seconds. Create charts directly from raw data in seconds. Customize your dashboards to build your dream terminal, integrate with your private datasets and bring your own fine-tuned AI copilots.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    OpenHealth

    OpenHealth

    AI health assistant for private, local data-driven insights mgmt

    ...A strong emphasis is placed on privacy, as the platform can run entirely locally, ensuring that sensitive medical data does not need to leave the user’s environment. OpenHealth also includes a data parsing layer that transforms raw medical inputs into structured datasets, making them usable for analysis and AI-driven insights. OpenHealth separates data ingestion, processing, and AI interaction, enabling flexibility in integrating different models and data sources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    graphify

    graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)

    ...Overall, graphify serves as a bridge between raw data and visual insight.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Label Studio

    Label Studio

    Label Studio is a multi-type data labeling and annotation tool

    ...It can be used to prepare raw data or improve existing training data to get more accurate ML models. The frontend part of Label Studio app lies in the frontend/ folder and written in React JSX. Multi-user labeling sign up and login, when you create an annotation it's tied to your account. Configurable label formats let you customize the visual interface to meet your specific labeling needs.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 6
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    ...Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Humanizer Skill

    Humanizer Skill

    Claude Code skill that removes signs of AI-generated writing from text

    Humanizer Skill is a utility library focused on transforming technical or machine-oriented text into expressions that are more natural, readable, and “human-friendly.” It provides a suite of algorithms that convert timestamps, identifiers, file sizes, code tokens, and structured data into phrases that resemble typical human phrasing rather than compact machine output. For example, date and time values can be expressed as relative terms (“two hours ago”), and file sizes can be shown in intuitive units instead of raw bytes. It also includes functions for transforming camelCase, snake_case, or PascalCase identifiers into spaced and capitalized representations suitable for user interfaces, reports, or documentation. ...
    Downloads: 123 This Week
    Last Update:
    See Project
  • 8
    DeepAnalyze

    DeepAnalyze

    Autonomous LLM agent for end-to-end data science workflows

    DeepAnalyze is an open source project that introduces an agentic large language model designed to perform autonomous data science tasks from start to finish. It is built to handle the entire data science pipeline, including data preparation, analysis, modeling, visualization, and report generation without requiring continuous human guidance. DeepAnalyze is capable of conducting open-ended data research across multiple data formats such as structured tables, semi-structured files, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Claude Context

    Claude Context

    Code search MCP for Claude Code

    ...It also provides tools for organizing and managing context, making it easier to maintain structured knowledge bases. Overall, Claude-context acts as a bridge between raw data and AI models, improving the relevance and accuracy of generated outputs.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Wiseflow

    Wiseflow

    Enhance any agent's browser use skill

    Wiseflow is an open-source information extraction and knowledge discovery system designed to collect, filter, and organize valuable information from large volumes of online content. The platform continuously monitors specified sources such as websites, social platforms, and other digital channels to identify relevant data according to user-defined interests or topics. By combining web crawling, content parsing, and large language model analysis, the system extracts concise insights from raw information streams and converts them into structured data that can be stored or analyzed. This automated workflow helps reduce the noise associated with large information ecosystems and highlights the most important insights for users. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Instructor Python

    Instructor Python

    Structured outputs for llms

    Instructor is a Python library that bridges OpenAI responses with structured data validation using Pydantic models. It lets developers specify expected output schemas and ensures that the responses from OpenAI APIs are automatically parsed and validated against those models. This makes integrating LLMs into structured workflows safer and more predictable, especially in production applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    EverMemOS

    EverMemOS

    Long-term memory OS for AI with structured recall and context awarenes

    ...EverMemOS goes beyond simple retrieval by actively applying stored knowledge to current tasks, improving personalization and consistency. EverMemOS uses a multi-stage memory lifecycle to convert raw dialogue into structured semantic data, supporting long-horizon reasoning and adaptive behavior across sessions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    webclaw

    webclaw

    Fast, local-first web content extraction for LLMs

    webclaw is a high-performance web content extraction tool designed specifically for AI agents and large language models, focusing on delivering clean, structured data instead of raw HTML. It is built in Rust and operates without a headless browser, using advanced techniques such as TLS fingerprinting to bypass common scraping barriers and mimic real browser behavior. The tool addresses a major inefficiency in AI workflows by removing irrelevant elements like navigation menus, ads, and scripts, significantly reducing token usage when feeding data into language models. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    AutoGluon

    AutoGluon

    AutoGluon: AutoML for Image, Text, and Tabular Data

    AutoGluon enables easy-to-use and easy-to-extend AutoML with a focus on automated stack ensembling, deep learning, and real-world applications spanning image, text, and tabular data. Intended for both ML beginners and experts, AutoGluon enables you to quickly prototype deep learning and classical ML solutions for your raw data with a few lines of code. Automatically utilize state-of-the-art techniques (where appropriate) without expert knowledge. Leverage automatic hyperparameter tuning, model selection/ensembling, architecture search, and data processing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Rill

    Rill

    Fast SQL-based BI tool for real-time dashboards and analytics

    Rill is an operational BI tool that turns raw datasets into fast, interactive dashboards using SQL and a code-first approach. It helps data teams move from data lake to insight quickly, without the complexity of traditional BI systems. With an embedded in-memory database powered by DuckDB or ClickHouse, queries run in milliseconds, enabling real-time exploration and analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    cognee

    cognee

    Deterministic LLMs Outputs for AI Applications and AI Agents

    ...Any kind of data works; unstructured text or raw media files, PDFs, tables, presentations, JSON files, and so many more. Add small or large files, or many files at once. We map out a knowledge graph from all the facts and relationships we extract from your data. Then, we establish graph topology and connect related knowledge clusters, enabling the LLM to "understand" the data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LiteParse

    LiteParse

    A fast, helpful, and open-source document parser

    LiteParse is an open-source lightweight parsing library designed to extract structured data from unstructured text using large language models in an efficient and cost-effective manner. It focuses on simplifying the process of turning raw text into structured outputs such as JSON by providing a streamlined interface for prompt-based parsing. The system is designed to minimize overhead, making it suitable for applications where performance and cost are critical considerations. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    LLM Scraper

    LLM Scraper

    Extract structured data from webpages using LLM-powered scraping

    LLM Scraper is a TypeScript library designed to extract structured data from webpages using large language models. Instead of relying on fragile HTML selectors or manual parsing rules, the tool interprets webpage content with language models and converts it into structured data according to a defined schema. Developers can specify the data structure using tools such as Zod or JSON Schema, enabling the model to extract relevant information directly into typed objects. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Bespoke Curator

    Bespoke Curator

    Synthetic data curation for post-training and data extraction

    Curator is an open-source Python library designed to build synthetic data pipelines for training and evaluating machine learning models, particularly large language models. The system helps developers generate, transform, and curate high-quality datasets by combining automated generation with structured validation and filtering. It supports workflows where models are used to produce synthetic examples that can later be refined into reliable training datasets for reasoning, question...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Featuretools

    Featuretools

    An open source python library for automated feature engineering

    An open source Python framework for automated feature engineering. Featuretools automatically creates features from temporal and relational datasets. Featuretools uses DFS for automated feature engineering. You can combine your raw data with what you know about your data to build meaningful features for machine learning and predictive modeling. Featuretools provides APIs to ensure only valid data is used for calculations, keeping your feature vectors safe from common label leakage problems. You can specify prediction times row-by-row. Featuretools come with a library of low-level functions that can be stacked to create features. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    pixelmatch

    pixelmatch

    The smallest, simplest JavaScript pixel-level image comparison library

    ...Features accurate anti-aliased pixels detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers). Compares two images, writes the output diff and returns the number of mismatched pixels.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    Deep Lake

    Deep Lake

    Data Lake for Deep Learning. Build, manage, and query datasets

    ...Use one API to upload, download, and stream datasets to/from AWS S3/S3-compatible storage, GCP, Activeloop cloud, or local storage. Store images, audios and videos in their native compression. Deeplake automatically decompresses them to raw data only when needed, e.g., when training a model. Treat your cloud datasets as if they are a collection of NumPy arrays in your system's memory. Slice them, index them, or iterate through them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PasteGuard

    PasteGuard

    Masks sensitive data and secrets before they reach AI

    PasteGuard is an open-source privacy proxy that protects sensitive information like personal data and API secrets by detecting and masking them before they reach large language model APIs such as OpenAI or Anthropic Claude. It sits between an application and the LLM provider, automatically replacing names, emails, tokens, and other personally identifiable information (PII) with placeholders so that external services never see raw sensitive values, and then optionally unmasking them in the returned output. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Framelink MCP for Figma

    Framelink MCP for Figma

    MCP server enabling AI coding tools to access Figma design data

    Figma-Context-MCP is an open source server that connects Figma design data with AI-powered coding tools through the Model Context Protocol (MCP). It allows coding assistants to retrieve structured information from Figma files so they can better translate visual designs into working code. Instead of relying on screenshots or manual descriptions, Figma-Context-MCP accesses layout, styling, and component metadata directly from the Figma API and presents it in a simplified format optimized for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Omi

    Omi

    AI that sees your screen and listens to conversations

    The Omi project is an open-source AI wearable ecosystem developed by Based Hardware that combines hardware, software, and cloud infrastructure to create a persistent “second brain” for capturing and processing real-world interactions. It is designed as a system that continuously listens to conversations and monitors screen activity, converting this input into structured data such as transcripts, summaries, and actionable insights in real time. The platform operates across multiple environments, including wearable devices, mobile apps, and desktop applications, ensuring seamless integration into a user’s daily workflow. At its core, omi uses a pipeline of speech-to-text systems, large language models, and memory storage services to transform raw audio and context into meaningful outputs like tasks and reminders. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB