Showing 55 open source projects for "talend data quality"

View related business solutions
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 1
    data-diff

    data-diff

    Efficiently diff rows across two different databases

    ...Replicating data at scale, across hundreds of tables, with low latency and at a reasonable infrastructure cost is a hard problem, and most data teams we’ve talked to, have faced data quality issues in their replication processes. The hard truth is that the quality of the replication is the quality of the data. Since copying entire datasets in batch is often infeasible at the modern data scale, businesses rely on the Change Data Capture (CDC) approach of replicating data using a continuous stream of updates.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Datacap

    Datacap

    DataCap is integrated software for data transformation

    Datacap is an open-source data catalog and governance tool that helps organizations manage and document their data assets. It provides metadata management, lineage tracking, and collaboration features to ensure data transparency and quality. Datacap is designed for teams that need a lightweight, self-hosted solution to organize and govern their data ecosystems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Evil Seed

    Evil Seed

    A Gem for creating partial anonymized dumps of your database

    Evil Seed is a Ruby tool for seeding databases with realistic, localized, and structured test data. It integrates with Rails and uses Faker, but allows more advanced customization like data relationships and repeatable sequences. It’s ideal for developers who need high-quality sample data for testing or demos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Grafana

    Grafana

    Leading open-source visualization and observability platform

    Grafana OSS is a leading open-source visualization and observability platform that lets you query, visualize, alert on, and explore your data—regardless of where it’s stored. With support for 100+ data source plugins (such as Prometheus, Loki, Elasticsearch, InfluxDB, SQL/NoSQL databases, OTel, and more), you can unify metrics, logs, traces, and other observability signals in one place. Grafana OSS empowers you to build dynamic, reusable dashboards with rich visualizations, template...
    Downloads: 37 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 5
    HOTOSM Tasking Manager

    HOTOSM Tasking Manager

    The tool to team up for mapping in OpenStreetMap

    Tasking Manager is a web platform developed by Humanitarian OpenStreetMap Team (HOT) to coordinate collaborative mapping projects. It divides mapping tasks into smaller sections that contributors can work on independently, streamlining efforts in disaster response, development planning, and humanitarian aid.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    TypeORM

    TypeORM

    ORM for TypeScript and JavaScript (ES7, ES6, ES5)

    ...Its goal is to always support the latest JavaScript features and provide additional features that help you to develop any kind of application that uses databases - from small applications with a few tables to large scale enterprise applications with multiple databases. TypeORM supports both Active Record and Data Mapper patterns, unlike all other JavaScript ORMs currently in existence, which means you can write high-quality, loosely coupled, scalable, maintainable applications the most productive way. TypeORM is highly influenced by other ORMs, such as Hibernate, Doctrine and Entity Framework. Supports both DataMapper and ActiveRecord (your choice). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Jailer

    Jailer

    Database Subsetter and Relational Data Browser

    Database Subsetting and Relational Data Browsing Tool. Navigate bidirectionally through the database by following foreign-key-based or user-defined relationships. Exports consistent, referentially intact row-sets from relational databases. Removes data w/o violating integrity. Generates topologically sorted SQL-DML, hierarchically structured XML. and DbUnit datasets.
    Leader badge
    Downloads: 123 This Week
    Last Update:
    See Project
  • 8
    SymmetricDS

    SymmetricDS

    Database Replication, Filtered Synchronization, and Transformation

    SymmetricDS is database replication and file synchronization software. It is a platform-independent, web-enabled, and database-agnostic synchronization tool. With initial data load, bidirectional change data capture, conflict resolution, and data transformation, it is a comprehensive data replication solution.
    Leader badge
    Downloads: 212 This Week
    Last Update:
    See Project
  • 9
    i-doit - Open Source CMDB & ITSM Tool

    i-doit - Open Source CMDB & ITSM Tool

    ITIL conform CMDB solution for documenting your entire IT landscape

    i-doit is a web based IT documentation and CMDB. i-doit documents IT-systems and their changes, defines emergency plans, displays vital information and helps to ensure a stable and efficient IT operation: The technical documentation: This means that all information can be organised (details flexibly adjusted), stored and maintained in one place. This allows simple Helpdesk, Support and Planning functions and provides the basis for quality assurance and certification. i-doit allows services to be documented as a logical grouping of objects placed in relation to each other. This provides the basis for (ITIL) processes such as Change and Problem Management, helps to control and manage the SLA (Service Level Agreement) or the communication with users. Its interfaces to other ITSM applications (like Nagios, Request Tracker etc.) allow the entering of data only once in the ITSM environment. i-doit maintains one holistic view of the operational situation.
    Downloads: 23 This Week
    Last Update:
    See Project
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 10
    sentinel

    sentinel

    Sentinel is a filesystem-backed document DBMS written in Rust.

    Sentinel is a filesystem-backed document DBMS built in Rust that prioritizes compliance, transparency, and auditability over raw performance. Unlike traditional databases, every document is a plain JSON file, making your data immediately forensic-friendly and Git-versionable. Perfect for regulated industries requiring GDPR, SOC 2, HIPAA, or PCI-DSS compliance. Sentinel provides async operations with automatic BLAKE3 hashing and optional Ed25519 signatures for cryptographic integrity. Data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Aestel

    Aestel

    Applications for data management

    "Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    DBFeeder

    DBFeeder

    Highly Customizable Test Data Generator

    DBFeeder is a great tool to generate synthetic testdata for Oracle Databases and it is ideal for companies who wants to outsource development. Thanks to his original approach, data can be highly customizable and it even fits primary and foreign keys constraints of tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    QuickRedis

    QuickRedis

    QuickRedis is a free forever redis gui tool

    QuickRedis is a free forever Redis Desktop manager. It supports direct connection, sentinel, and cluster mode, supports multiple languages, supports hundreds of millions of keys, and has an amazing UI. Supports both Windows, Mac OS X and Linux platform.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 15
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Redux-ORM

    Redux-ORM

    A small, simple and immutable ORM to manage relational data

    A small, simple and immutable ORM to manage relational data in your Redux store. Redux-ORM is fast. All queries are lazily evaluated and come with built-in support for memorization. It provides a clean abstraction over low-level updates, protecting the state from accidental mutations. With nearly 100% branch coverage, the library is thoroughly tested to ensure rock-solid code quality. No matter where your data comes from, Redux-ORM will stay out of its way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    The Object-Role Modeling (ORM) standard version 2, associated schemas and generation tools, and a reference implementation in the form of the Natural Object-Role Modeling Architect for Visual Studio (NORMA) product.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SQL Exporter

    SQL Exporter

    Database agnostic SQL exporter for Prometheus

    ...SQL queries are grouped into collectors -- logical groups of queries, e.g., query stats or I/O stats, mapped to the metrics they populate. Collectors may be DBMS-specific (e.g,. MySQL InnoDB stats) or custom, deployment-specific (e.g., pricing data freshness). This means you can quickly and easily set up custom collectors to measure data quality, whatever that might mean in your specific case.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    DataCleaner

    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    Spawner is a generator of sample/test data for databases. It can be configured to output delimited text or SQL insert statements. It can also insert directly into a MySQL 5.x database. Includes many field types, most of which are configurable.
    Leader badge
    Downloads: 87 This Week
    Last Update:
    See Project
  • 21
    LightProfiler

    LightProfiler

    Profiler for Oracle extended SQL trace files

    LightProfiler – application for performance analysis of the Oracle databases. It generates detailed resource profile for extended SQL trace files (10046 event), containing information about consuming of response time (by events, by cursors, etc.), data files usage, error analysis (SQL, PL/SQL) and much more. Also it contain tools for additional processing of trace files (extract session data, splitting files) and for management of database's sessions (disconnecting, tracing, monitor...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    UI To the Hadoop HBase Project
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DB Sanity
    DB Sanity performs checks on a database for verifying data consistency and correctness of aplication deployments. It is invoked by command line and creates an HTML report with aggregated diagnostic summaries and listings of faulty database entries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Avicenna

    Avicenna

    Location Data Visualisation and Editing

    Avicenna is an application for visualizing location data stored in excel or .csv table files on a map or in a table view. It provides several export functions (such as single pictures, database dumps and even a HTML-export that enables you to use Avicenna on your website). It was developed as part of the course "Techniken der Projektentwicklung" in summer semester 2012 at the University of Bielefeld by Team 2 and is licensed under the GNU Affero General Public License version 3. This...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Database to CSV

    "Database to CSV" project stores/loads data between database and CSVs

    "Database to CSV" allows to back-up data from database(Oracle, MySQL, SQL Server) in the form of CSV files. Each table data is stored in a separate CSV file. It also allows to load data from CSV files into the database. Program can be run from UI or from command line. There's one more advanced feature which allows merging of data from CSV file to Database. It is only available from command line. (It will take foreign keys and unique keys into consideration and then try to insert...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB