Showing 22 open source projects for "params-validate"

  • 1
    Union Pandera

    Light-weight, flexible, expressive statistical data testing library

    ...A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases. Validate the functions that produce your data by automatically generating test cases for them. Integrate seamlessly with the Python ecosystem. Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time. Identify the critical points in your data pipeline, and validate data going in and out of them. ...
    Downloads: 0 This Week
  • 2
    Encord Active

    The toolkit to test, validate, and evaluate your models and surface

    Encord Active is an open-source toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling to supercharge model performance. Encord Active has been designed as an all-in-one open source toolkit for improving your data quality and model performance. Use the intuitive UI to explore your data or access all the functionality programmatically.
    Downloads: 0 This Week
  • 3
    CSV Lint

    CSV Lint plug-in for Notepad++ for syntax highlighting

    CSV Lint is a plug-in for Notepad++ that provides syntax highlighting, CSV validation, automatic column and datatype detection, and support for fixed-width datasets. It can change datetime formats and decimal separators, sort data, count unique values, and convert to XML, JSON, SQL, etc. A plugin for data cleaning and working with messy data files. Use CSV Lint for metadata discovery, technical data validation, and reformatting of tabular data files. It is not meant to be a replacement for spreadsheet programs like Excel or SPSS, but...
    Downloads: 28 This Week
  • 4
    pointblank

    Data quality assessment and metadata reporting for data frames

    With the pointblank package it’s really easy to methodically validate your data whether in the form of data frames or as database tables. On top of the validation toolset, the package gives you the means to provide and keep up-to-date with the information that defines your tables. For table validation, the agent object works with a large collection of simple (yet powerful!) validation functions. We can enable much more sophisticated validation checks by using custom expressions, segmenting the data, and by selective mutations of the target table. ...
    Downloads: 1 This Week
  • 5
    Superstruct

    A simple and composable way to validate data in JavaScript

    This project is a lightweight validation library for JavaScript and TypeScript that helps you define data “shapes” and validate runtime values against them. Instead of relying only on compile-time typing, it focuses on the reality that many apps receive unknown input from APIs, forms, and external systems, and those values need runtime checks. Its API is intentionally familiar if you’ve used TypeScript, Flow, Go structs, or GraphQL schemas, but the output is oriented around runtime correctness. ...
    Downloads: 0 This Week
  • 6
    Automa.jl

    A Julia code generator for regular expressions

    Automa is a regex-to-Julia compiler. By compiling regex to Julia code in the form of Expr objects, Automa provides facilities to create efficient and robust regex-based lexers, tokenizers and parsers using Julia's metaprogramming capabilities. You can view Automa as a regex engine that can insert arbitrary Julia code into its input-matching process, which will be executed when certain parts of the regex match an input.
    Downloads: 0 This Week
  • 7
    Datumaro

    Dataset Management Framework, a Python library and a CLI tool to build

    ...It supports importing and exporting annotations and images across a wide variety of standards like COCO, PASCAL VOC, YOLO, ImageNet, Cityscapes, and many more, enabling easy integration with different training pipelines and tools. Datumaro makes it easy to merge datasets, split them into training/validation/test subsets, filter or transform annotations, and validate annotation quality — all while preserving metadata and supporting detailed statistics. It’s especially useful when you’re dealing with heterogeneous data sources or need to prepare complex datasets for machine learning workflows, freeing you from writing custom scripts for every format conversion.
    Downloads: 0 This Week
  • 8
    whylogs

    The open standard for data logging

    whylogs is an open-source library for logging any kind of data. With whylogs, users can generate summaries of their datasets (called whylogs profiles), which they can use to track changes in their dataset, create data constraints to check whether their data looks the way it should, and quickly visualize key summary statistics about their datasets. whylogs profiles are the core of the whylogs library. They capture key statistical properties of data, such as the distribution (far beyond...
    Downloads: 0 This Week
  • 9
    pprof

    pprof is a tool for visualization and analysis of profiling data

    ...It supports multiple profile types (CPU, heap, allocations, contention, etc.) and can present data as text tables, call graphs (via Graphviz/dot), flame graphs, and interactive web UIs. The tool helps developers find hot paths, quantify resource usage, and compare profiles across runs to validate performance changes. It is widely used in Go but also has bindings and exporters for other ecosystems, and the repository includes a Go package for reading and writing profiles programmatically. The pprof command can operate on local files or fetch from targets exposing profiling endpoints, supporting iterative, production-oriented workflows. ...
    Downloads: 0 This Week
  • 10
    nichenetr

    NicheNet: predict active ligand-target links between interacting cells

    nichenetr: the R implementation of the NicheNet method. The goal of NicheNet is to study intercellular communication from a computational perspective. NicheNet uses human or mouse gene expression data of interacting cells as input and combines this with a prior model that integrates existing knowledge on ligand-to-target signaling paths. This makes it possible to predict ligand-receptor interactions that might drive gene expression changes in cells of interest. This model of prior information on...
    Downloads: 0 This Week
  • 11
    Sysdig

    Linux system exploration and troubleshooting tool

    ...Unify threat detection and incident response across containers, Kubernetes, and cloud with out-of-the-box Falco rules leveraging syscalls, Kubernetes audit logs and cloud logs. Gain deep insight with container and Kubernetes monitoring that is fully Prometheus compatible. Validate compliance against standards like PCI, NIST and SOC2 for containers, hosts, Kubernetes and cloud. Sysdig created Falco, the open standard for runtime threat detection for containers, hosts, Kubernetes and cloud.
    Downloads: 0 This Week
  • 12
    Swiple

    Swiple enables you to easily observe, understand, validate data

    Swiple is an automated data monitoring platform that helps analytics and data engineering teams seamlessly monitor the quality of their data. With automated data analysis and profiling, scheduling and alerting, teams can resolve data quality issues before they impact mission critical resources. Experience hassle-free integration with Swiple's zero-infrastructure and zero-code setup. Seamlessly incorporate data quality checks into your existing workflows without any coding or infrastructure...
    Downloads: 0 This Week
  • 13
    Aestel

    Applications for data management

    ...The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data model. (Postgres and Oracle have been tested). The two applications have been initially developed for small molecule drug discovery research. However, they can be extended for use in other data domains.
    Downloads: 0 This Week
  • 14
    SQLBucket

    Lightweight library to write, orchestrate and test your SQL ETL

    SQLBucket is a lightweight framework to help write, orchestrate and validate SQL data pipelines. It gives the possibility to set variables and introduces some control flow using the fantastic Jinja2 library. It also implements a very simplistic unit and integration test framework where you can validate the results of your ETL in the form of SQL checks. With SQLBucket, you can apply TDD principles when writing data pipelines.
    Downloads: 0 This Week
  • 15

    MS Data Miner

    MS Data Miner - A web-based proteomics software

    Data processing and analysis in proteomic studies are a significant challenge and very time consuming. MS Data Miner (MDM) is a freely available web-based software to analyze, process, validate, compare, and display output files from MS software, including Mascot (Matrix Science), Mascot Distiller (Matrix Science) and ProteinPilot (AB Sciex). It was developed to significantly decrease the data analysis time and to handle large proteomic studies. This open-source system includes a spectra validation system and an automatic screenshot generation tool for Mascot-assigned spectra. ...
    Downloads: 0 This Week
  • 16
    BGP Looking-glass

    Open Source BGP Looking-glass

    BigPlg allows the collection of the IPv4 Internet routing table, which may be viewed by service providers and/or ISP customers to validate IP routing across autonomous systems. Routing updates may also have attributes aliased to become (graph) readable. This function exists to allow the exporting of RIB updates to Splunk, or any such data collector. A beneficial side effect of collecting BGP routing information is the ability to track BGP updates/events, local or remote (upstream). ...
    Downloads: 0 This Week
  • 17
    EasyDataQuality for Pentaho Kettle

    EasyDataQuality for Pentaho Data Integration in Kettle

    EasyDQ plugins for Contact cleansing in Pentaho Data Integration in Kettle.
    Downloads: 0 This Week
  • 18
    ns-2 Trace Toolkit

    This toolkit allows you to analyze and validate ns-2 traces

    ns-2 Trace Toolkit was developed to help users manage and debug ns-2 trace files. It is particularly useful for users who generate ns-2 trace files with other tools such as VanetMobiSim or SUMO (Simulation of Urban Mobility). With ns-2 Trace Toolkit, users can debug their trace files by generating histograms for: (1) node speed, (2) node x-position, and (3) node y-position. These histograms can give hints about the concentration of nodes. Additionally, the tool allows users to simulate...
    Downloads: 0 This Week
  • 19
    The SPASE Toolkit contains a set of command-line applications which can be used to generate, validate, referentially check, use and organize resource descriptions written in SPASE XML.
    Downloads: 0 This Week
  • 20
    Datacleaning Open Source
    A group of subprojects for data cleaning, mainly as a step of a data mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
  • 21
    This is a Java-based RESTful web service designed to interface with the proprietary MD (Melissa Data) Windows database. MD provides several interfaces to validate addresses, phone numbers, email addresses, zip codes, and more.
    Downloads: 0 This Week
  • 22
    RadianceML aims to integrate the Radiance lighting simulation system into XML environments. It provides a set of XSL transformations to import and export native Radiance scenes, and a schema for validating scenes and editing them in XML editors.
    Downloads: 0 This Week