Showing 20 open source projects for "params-validate"

View related business solutions
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • Deploy Apps in Seconds with Cloud Run Icon
    Deploy Apps in Seconds with Cloud Run

    Host and run your applications without the need to manage infrastructure. Scales up from and down to zero automatically.

    Cloud Run is the fastest way to deploy containerized apps. Push your code in Go, Python, Node.js, Java, or any language and Cloud Run builds and deploys it automatically. Get fast autoscaling, pay only when your code runs, and skip the infrastructure headaches. Two million requests free per month. And new customers get $300 in free credit.
    Try Cloud Run Free
  • 1
    Union Pandera

    Union Pandera

    Light-weight, flexible, expressive statistical data testing library

    ...A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases. Validate the functions that produce your data by automatically generating test cases for them. Integrate seamlessly with the Python ecosystem. Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time. Identify the critical points in your data pipeline, and validate data going in and out of them. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Encord Active

    Encord Active

    The toolkit to test, validate, and evaluate your models and surface

    Encord Active is an open-source toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling to supercharge model performance. Encord Active has been designed as a all-in-one open source toolkit for improving your data quality and model performance. Use the intuitive UI to explore your data or access all the functionalities programmatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    pointblank

    pointblank

    Data quality assessment and metadata reporting for data frames

    With the pointblank package it’s really easy to methodically validate your data whether in the form of data frames or as database tables. On top of the validation toolset, the package gives you the means to provide and keep up-to-date with the information that defines your tables. For table validation, the agent object works with a large collection of simple (yet powerful!) validation functions. We can enable much more sophisticated validation checks by using custom expressions, segmenting the data, and by selective mutations of the target table. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    CSV Lint

    CSV Lint

    CSV Lint plug-in for Notepad++ for syntax highlighting

    CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files. Use CSV Lint for metadata discovery, technical data validation, and reformatting on tabular data files. It is not meant to be a replacement for spreadsheet programs like Excel or SPSS, but...
    Downloads: 28 This Week
    Last Update:
    See Project
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 5
    Superstruct

    Superstruct

    A simple and composable way to validate data in JavaScript

    This project is a lightweight validation library for JavaScript and TypeScript that helps you define data “shapes” and validate runtime values against them. Instead of relying only on compile-time typing, it focuses on the reality that many apps receive unknown input from APIs, forms, and external systems, and those values need runtime checks. Its API is intentionally familiar if you’ve used TypeScript, Flow, Go structs, or GraphQL schemas, but the output is oriented around runtime correctness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Automa.jl

    Automa.jl

    A julia code generator for regular expressions

    Automa is a regex-to-Julia compiler. By compiling regex to Julia code in the form of Expr objects, Automa provides facilities to create efficient and robust regex-based lexers, tokenizers and parsers using Julia's metaprogramming capabilities. You can view Automa as a regex engine that can insert arbitrary Julia code into its input-matching process, which will be executed when certain parts of the regex match an input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Datumaro

    Datumaro

    Dataset Management Framework, a Python library and a CLI tool to build

    ...It supports importing and exporting annotations and images across a wide variety of standards like COCO, PASCAL VOC, YOLO, ImageNet, Cityscapes, and many more, enabling easy integration with different training pipelines and tools. Datumaro makes it easy to merge datasets, split them into training/validation/test subsets, filter or transform annotations, and validate annotation quality — all while preserving metadata and supporting detailed statistics. It’s especially useful when you’re dealing with heterogeneous data sources or need to prepare complex datasets for machine learning workflows, freeing you from writing custom scripts for every format conversion.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    whylogs

    whylogs

    The open standard for data logging

    whylogs is an open-source library for logging any kind of data. With whylogs, users are able to generate summaries of their datasets (called whylogs profiles) which they can use to track changes in their dataset Create data constraints to know whether their data looks the way it should. Quickly visualize key summary statistics about their datasets. whylogs profiles are the core of the whylogs library. They capture key statistical properties of data, such as the distribution (far beyond...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    pprof

    pprof

    pprof is a tool for visualization and analysis of profiling data

    ...It supports multiple profile types (CPU, heap, allocations, contention, etc.) and can present data as text tables, call graphs (via Graphviz/dot), flame graphs, and interactive web UIs. The tool helps developers find hot paths, quantify resource usage, and compare profiles across runs to validate performance changes. It is widely used in Go but also has bindings and exporters for other ecosystems, and the repository includes a Go package for reading and writing profiles programmatically. The pprof command can operate on local files or fetch from targets exposing profiling endpoints, supporting iterative, production-oriented workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build AI Apps with Gemini 3 on Vertex AI Icon
    Build AI Apps with Gemini 3 on Vertex AI

    Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.

    Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.
    Try Vertex AI Free
  • 10
    nichenetr

    nichenetr

    NicheNet: predict active ligand-target links between interacting cells

    nichenetr: the R implementation of the NicheNet method. The goal of NicheNet is to study intercellular communication from a computational perspective. NicheNet uses human or mouse gene expression data of interacting cells as input and combines this with a prior model that integrates existing knowledge on ligand-to-target signaling paths. This allows to predict ligand-receptor interactions that might drive gene expression changes in cells of interest. This model of prior information on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Sysdig

    Sysdig

    Linux system exploration and troubleshooting tool

    ...Unify threat detection and incident response across containers, Kubernetes, and cloud with out-of-the-box Falco rules leveraging syscalls, Kubernetes audit logs and cloud logs. Gain deep insight with container and Kubernetes monitoring that is fully Prometheus compatible. Validate compliance against standards like PCI, NIST and SOC2 for containers, hosts, Kubernetes and cloud. Sysdig created Falco, the open standard for runtime threat detection for containers, hosts, Kubernetes and cloud.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Swiple

    Swiple

    Swiple enables you to easily observe, understand, validate data

    Swiple is an automated data monitoring platform that helps analytics and data engineering teams seamlessly monitor the quality of their data. With automated data analysis and profiling, scheduling and alerting, teams can resolve data quality issues before they impact mission critical resources. Experience hassle-free integration with Swiple's zero-infrastructure and zero-code setup. Seamlessly incorporate data quality checks into your existing workflows without any coding or infrastructure...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Aestel

    Aestel

    Applications for data management

    ...The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data model. (Postgres and Oracle have been tested). The two applications have been initially developed for small molecule drug discovery research. However, they can be extended for use in other data domains.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SQLBucket

    SQLBucket

    Lightweight library to write, orchestrate and test your SQL ETL

    SQLBucket is a lightweight framework to help write, orchestrate and validate SQL data pipelines. It gives the possibility to set variables and introduces some control flow using the fantastic Jinja2 library. It also implements a very simplistic unit and integration test framework where you can validate the results of your ETL in the form of SQL checks. With SQLBucket, you can apply TDD principles when writing data pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    BGP Looking-glass

    BGP Looking-glass

    Open Source BGP Looking-glass

    BigPlg allows the collection of the IPv4 Internet routing table which may be viewed by service providers and/or ISP customers to validate IP routing across autonomous systems. Routing updates may also have attributes aliased to become (graph) readable. This function exists to allow the exporting of RIB updates to Splunk, or any such data collector. A beneficial side-effect of collecting BGP routing information is the ability to track BGP updates/events local or remote(upstream). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    EasyDataQuality for Pentaho Kettle

    EasyDataQuality for Pentaho Kettle

    EasyDataQuality for Pentaho Data Integration in Kettle

    EasyDQ plugins for Contact cleansing in Pentaho Data Integration in Kettle.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ns-2 Trace Toolkit

    ns-2 Trace Toolkit

    This toolkit allows you to analyze and validate ns-2 traces

    ns-2 Trace Toolkit was developed to help users to manage and debug ns-2 trace files. It is particularly useful for users that generate ns-2 trace files with other tools such as VanetMobiSim or SUMO (Simulation of Urban Mobility). With ns-2 Trace Toolkit, users can debug their trace files by generating histograms for: (1) node speed, (2) node x-position, and (3) node y-position. These histograms can give hits about concentration of nodes. Additionally, the tool allows users to simulate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The SPASE Toolkit contains a set of command-line applications which can be used to generate, validate, referentially check, use and organize resource descriptions written in SPASE XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Datacleaning Open Source
    A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    RadianceML aims to integrate the Radiance lighting simulation system into XML environments. A set of XSL transformations is used to import and export native Radiance scenes, a schema to validate scenes and use XML-editors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB