Alternatives to OpenRefine

Compare OpenRefine alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to OpenRefine in 2024. Compare features, ratings, user reviews, pricing, and more from OpenRefine competitors and alternatives in order to make an informed decision for your business.

  • 1
    DataBuck

    DataBuck

    FirstEigen

    (Bank CFO) “I don’t have confidence and trust in our data. We keep discovering hidden risks”. Since 70% of data initiatives fail due to unreliable data (Gartner research), are you risking your reputation by trusting the accuracy of your data that you share with your business stakeholders and partners? Data Trust Scores must be measured in Data Lakes, warehouses, and throughout the pipeline, to ensure the data is trustworthy and fit for use. It typically takes 4-6 weeks of manual effort just to set a file or table for validation. Then, the rules have to be constantly updated as the data evolves. The only scalable option is to automate data validation rules discovery and rules maintenance. DataBuck is an autonomous, self-learning, Data Observability, Quality, Trustability and Data Matching tool. It reduces effort by 90% and errors by 70%. "What took my team of 10 Engineers 2 years to do, DataBuck could complete it in less than 8 hours." (VP, Enterprise Data Office, a US bank)
    Compare vs. OpenRefine View Software
    Visit Website
  • 2
    Zuar Runner

    Zuar Runner

    Zuar, Inc.

    Utilizing the data that's spread across your organization shouldn't be so difficult! With Zuar Runner you can automate the flow of data from hundreds of potential sources into a single destination. Collect, transform, model, warehouse, report, monitor and distribute: it's all managed by Zuar Runner. Pull data from Amazon/AWS products, Google products, Microsoft products, Avionte, Backblaze, BioTrackTHC, Box, Centro, Citrix, Coupa, DigitalOcean, Dropbox, CSV, Eventbrite, Facebook Ads, FTP, Firebase, Fullstory, GitHub, Hadoop, Hubic, Hubspot, IMAP, Jenzabar, Jira, JSON, Koofr, LeafLogix, Mailchimp, MariaDB, Marketo, MEGA, Metrc, OneDrive, MongoDB, MySQL, Netsuite, OpenDrive, Oracle, Paycom, pCloud, Pipedrive, PostgreSQL, put.io, Quickbooks, RingCentral, Salesforce, Seafile, Shopify, Skybox, Snowflake, Sugar CRM, SugarSync, Tableau, Tamarac, Tardigrade, Treez, Wurk, XML Tables, Yandex Disk, Zendesk, Zoho, and more!
  • 3
    TiMi

    TiMi

    TIMi

    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and easier than ever before. The heart of TIMi’s Integrated Platform. TIMi’s ultimate real-time AUTO-ML engine. 3D VR segmentation and visualization. Unlimited self service business Intelligence. TIMi is several orders of magnitude faster than any other solution to do the 2 most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee you a work in all serenity and without unexpected extra costs. Thanks to an original & unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 4
    Data Ladder

    Data Ladder

    Data Ladder

    Data Ladder is a data quality and cleansing company dedicated to helping you "get the most out of your data" through data matching, profiling, deduplication, and enrichment. We strive to keep things simple and understandable in our product offerings to give our customers the best solution and customer service at an excellent price. Our products are in use across the Fortune 500 and we are proud of our reputation of listening to our customers and rapidly improving our products. Our user-friendly, powerful software helps business users across industries manage data more effectively and drive their bottom line. Our data quality software suite, DataMatch Enterprise, was proven to find approximately 12% to 300% more matches than leading software companies IBM and SAS in 15 different studies. With over 10 years of R&D and counting, we are constantly improving our data quality software solutions. This ongoing dedication has led to more than 4000 installations worldwide.
  • 5
    Verodat

    Verodat

    Verodat

    Verodat is a SaaS platform that gathers, prepares, enriches and connects your business data to AI Analytics tools. For outcomes you can trust. Verodat automates data cleansing & consolidates data into a clean, trustworthy data layer to feed downstream reporting. Manages data requests to suppliers. Monitors the data workflow to identify bottlenecks & resolve issues. Generates an audit trail to evidence quality assurance for every data row. Customize validation & governance to suit your organization. Reduces data prep time by 60%, allowing data analysts to focus on insights. The central KPI Dashboard reports key metrics on your data pipeline, allowing you to identify bottlenecks, resolve issues and improve performance. The flexible rules engine allows users to easily create validation and testing to suit your organization's needs. With out of the box connections to Snowflake, Azure and other cloud systems, it's easy to integrate with your existing tools.
  • 6
    Trillium Quality
    Rapidly transform high-volume, disconnected data into trusted and actionable business insights with scalable enterprise data quality. Trillium Quality is a versatile, powerful data quality solution that supports your rapidly changing business needs, data sources and enterprise infrastructures – including big data and cloud. Its data cleansing and standardization features automatically understand global data, such as customer, product and financial data, in any context – making pre-formatting and pre-processing unnecessary. Trillium Quality services deploy in batch or in real-time, on-premises or in the cloud, using the same rule sets and standards across an unlimited number of applications and systems. Open APIs let you seamlessly connect to custom and third-party applications, while controlling and managing data quality services centrally from one location.
  • 7
    Experian Data Quality
    Experian Data Quality is a recognized industry leader of data quality and data quality management solutions. Our comprehensive solutions validate, standardize, enrich, profile, and monitor your customer data so that it is fit for purpose. With flexible SaaS and on-premise deployment models, our software is customizable to every environment and any vision. Keep address data up to date and maintain the integrity of contact information over time with real-time address verification solutions. Analyze, transform, and control your data using comprehensive data quality management solutions - develop data processing rules that are unique to your business. Improve mobile/SMS marketing efforts and connect with customers using phone validation tools from Experian Data Quality.
  • 8
    WinPure MDM
    WinPure™ MDM is a master data management solution that aligns with your business to achieve a single view of your data with functions and features to help you manage your data. The features are ala-carte from all of the clean & match enterprise edition, repurposed specifically for simple web based data prep, and MDM operations. Data in dozens of different formats, dozens of simple and powerful ways to clean, standardize and to transform data. Industry leading data matching and error-tolerant technologies. Simple and configurable survivorship technology. General benefits include lower cost and faster time to market. Simple to use, minimal training and minimal implementation. Better business outcomes, faster MDM or systems deployment. Faster and more accurate batch loads, simple and accessible data prep tools. Flexible and effective interconnectivity with other internal and external database and systems via API. Faster time to synergies for M&A.
  • 9
    Syniti Data Matching
    Build a more connected business, drive growth, and leverage new technologies at scale with Syniti’s data matching solutions. No matter the shape or source of your data, our matching software accurately matches, deduplicates, unifies, and harmonizes data using intelligent, proprietary algorithms. Through innovation in data quality, Syniti’s matching solutions move beyond the traditional boundaries and empower data-driven businesses. Accelerate data harmonization by 90% and experience a 75% reduction in the amount of time spent on de-duplication on your journey to SAP S/4HANA. Perform deduplication, matching, and lookup on billions of records in only 5 minutes with performance-ready processing and out-of-the-box-ready solutions that don't require already-clean data. AI, proprietary algorithms, and steep customization maximize matches across complex datasets and minimize false positives.
  • 10
    Ataccama ONE
    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 11
    Great Expectations

    Great Expectations

    Great Expectations

    Great Expectations is a shared, open standard for data quality. It helps data teams eliminate pipeline debt, through data testing, documentation, and profiling. We recommend deploying within a virtual environment. If you’re not familiar with pip, virtual environments, notebooks, or git, you may want to check out the Supporting. There are many amazing companies using great expectations these days. Check out some of our case studies with companies that we've worked closely with to understand how they are using great expectations in their data stack. Great expectations cloud is a fully managed SaaS offering. We're taking on new private alpha members for great expectations cloud, a fully managed SaaS offering. Alpha members get first access to new features and input to the roadmap.
  • 12
    datuum.ai
    AI-powered data integration tool that helps streamline the process of customer data onboarding. It allows for easy and fast automated data integration from various sources without coding, reducing preparation time to just a few minutes. With Datuum, organizations can efficiently extract, ingest, transform, migrate, and establish a single source of truth for their data, while integrating it into their existing data storage. Datuum is a no-code product and can reduce up to 80% of the time spent on data-related tasks, freeing up time for organizations to focus on generating insights and improving the customer experience. With over 40 years of experience in data management and operations, we at Datuum have incorporated our expertise into the core of our product, addressing the key challenges faced by data engineers and managers and ensuring that the platform is user-friendly, even for non-technical specialists.
  • 13
    Union Pandera
    Pandera provides a simple, flexible, and extensible data-testing framework for validating not only your data but also the functions that produce them. Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time. Identify the critical points in your data pipeline, and validate data going in and out of them. Validate the functions that produce your data by automatically generating test cases for them. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases.
  • 14
    WinPure Clean & Match
    WinPure Clean & Match is WinPure’s award-winning data cleansing and data matching software suite, specially designed to increase the accuracy of business or consumer data. This software suite is ideal for cleaning, correcting and deduplicating mailing lists, databases, spreadsheets and CRMs. WinPure™ Clean & Match will help save your business time and money. * Increase the accuracy of virtually ANY list, spreadsheet, database, CRM, etc. * Locally installed Windows software so no need to worry about security as all processing is done on your own systems * Save hours of valuable time cleaning and removing duplicated records from your lists or databases using built-in sophisticated fuzzy and phonetic match algorithms. * Affordable licences available with World Class Support & Training. * Free Demo with Live Online Training available.
  • 15
    HighByte Intelligence Hub
    HighByte Intelligence Hub is the first DataOps solution purpose-built for industrial data. It provides manufacturers with a low-code software solution to accelerate and scale the usage of operational data throughout the extended enterprise by contextualizing, standardizing, and securing this valuable information. HighByte Intelligence Hub runs at the Edge, scales from embedded to server-grade computing platforms, connects devices and applications via a wide range of open standards and native connections, processes streaming data through standard models, and delivers contextualized and correlated information to the applications that require it. Use HighByte Intelligence Hub to reduce system integration time from months to hours, accelerate data curation and preparation for AI and ML applications, improve system-wide security and data governance, and reduce Cloud ingest, processing, and storage costs and complexity. Build a digital infrastructure that is ready for scale.
    Starting Price: 17,500 per year
  • 16
    Talend Data Fabric
    Talend Data Fabric’s suite of cloud services efficiently handles all your integration and integrity challenges — on-premises or in the cloud, any source, any endpoint. Deliver trusted data at the moment you need it — for every user, every time. Ingest and integrate data, applications, files, events and APIs from any source or endpoint to any location, on-premise and in the cloud, easier and faster with an intuitive interface and no coding. Embed quality into data management and guarantee ironclad regulatory compliance with a thoroughly collaborative, pervasive and cohesive approach to data governance. Make the most informed decisions based on high quality, trustworthy data derived from batch and real-time processing and bolstered with market-leading data cleaning and enrichment tools. Get more value from your data by making it available internally and externally. Extensive self-service capabilities make building APIs easy— improve customer engagement.
  • 17
    Data Cleansing & Matching

    Data Cleansing & Matching

    Data Cleansing & Matching

    This Matching software contains a matching engine that can transform, standardize your data, remove, compare two projects and match records from marketing lists and databases. Our data matching software is built with the most up-to-date technologies. Test our data scrubbing tools or use our data matching software today. Insert new record and update the main database and get statistics. Insert new record and update the main database only. Insert a new record into the main database and get statistics. Update the main database with matching information and get statistics. Remove match from the main database and get statistics.
    Starting Price: $99.95 one-time payment
  • 18
    matchit

    matchit

    360Science

    The foundation of our matching software, matchit® is designed specifically to deliver results that mirror human-like perception, at scale and without preprocessing. Using Artificial Intelligence, a proprietary phonetic algorithm, lexicons, and a contextual scoring engine, matchit defeats the errors, inconsistencies, and challenges commonly found in contact and business data. Conventional matching solutions require a user to define matching logic, which is a combination of functions and off-the-shelf fuzzy algorithms, used to produce an alphanumeric value. This alphanumeric value, or ‘match key’, forms the basis for comparing two records together and ultimately finding matches. Unlike conventional matching solutions, matchit doesn’t rely on a single comparison between match keys to find a match. Instead, matchit evaluates records contextually, running a variety of comparisons and scoring them individually to grade similarity between all the relevant elements that make up your data.
  • 19
    Waaila

    Waaila

    Cross Masters

    Waaila is a comprehensive application for automatic data quality monitoring, supported by a global community of hundreds of analysts, and helps to prevent disastrous scenarios caused by poor data quality and measurement. Validate your data and take control of your analytics and measuring. They need to be precise in order to utilize their full potential therefore it requires validation and monitoring. The quality of the data is key for serving its true purpose and leveraging it for business growth. The higher quality, the more efficient the marketing strategy. Rely on the quality and accuracy of your data and make confident data-driven decisions to achieve the best results. Save time, and energy, and attain better results with automated validation. Fast attack discovery prevents huge impacts and opens new opportunities. Easy navigation and application management contribute to fast data validation and effective processes, leading to quickly discovering and solving the issue.
    Starting Price: $19.99 per month
  • 20
    CLEAN_Data

    CLEAN_Data

    Runner EDQ

    CLEAN_Data is a collection of enterprise data quality solutions for managing the challenging and ever changing profiles of employee, customer, vendor, student, and alumni contact data. Our CLEAN_Data solutions are crucial in managing your enterprise data integrity requirements. Whether you are processing your data in real-time, batch, or connecting data systems, Runner EDQ has an integrated data solution your organization can rely on. CLEAN_Address is the integrated address verification solution that corrects and standardizes postal addresses within Oracle®, Ellucian® and other enterprise systems (ERP, SIS, HCM, CRM, MDM). Our seamless integration provides address correction in real-time at the point of entry and for existing data via batch and change of address processing. Real time address verification in all address entry pages using native fields in your SIS or CRM. Integrated batch processing corrects and formats your existing address records.
  • 21
    D&B Connect

    D&B Connect

    Dun & Bradstreet

    Realize the true potential of your first-party data. D&B Connect is a customizable, self-service master data management solution built to scale. Eliminate data silos across the organization and bring all your data together using the D&B Connect family of products. Benchmark, cleanse, and enrich your data using our database of hundreds of millions of records. The result is an interconnected, single source of truth that empowers your teams to make more confident business decisions. Drive growth and reduce risk with data you can trust. With a clean, complete data foundation, your sales and marketing teams can align territories with a full view of account relationships. Reduce internal conflict and confusion over incomplete or bad data. Strengthen segmentation and targeting. Increase personalization and the quality/quantity of marketing-sourced leads. Improve accuracy of reporting and ROI analysis.
  • 22
    Flowcore

    Flowcore

    Flowcore

    The Flowcore platform provides you with event streaming and event sourcing in a single, easy-to-use service. Data flow and replayable storage, designed for developers at data-driven startups and enterprises that aim to stay at the forefront of innovation and growth. All your data operations are efficiently persisted, ensuring no valuable data is ever lost. Immediate transformations and reclassifications of your data, loading it seamlessly to any required destination. Break free from rigid data structures. Flowcore's scalable architecture adapts to your growth, handling increasing volumes of data with ease. By simplifying and streamlining backend data processes, your engineering teams can focus on what they do best, creating innovative products. Integrate AI technologies more effectively, enriching your products with smart, data-driven solutions. Flowcore is built with developers in mind, but its benefits extend beyond the dev team.
    Starting Price: $10/month
  • 23
    Cloudingo

    Cloudingo

    Symphonic Source

    From deduping to importing and even migrating data, Cloudingo makes it super easy to manage your customer data. Salesforce is great for managing customers. But it misses the mark when it comes to data quality. Customer data that doesn’t make sense, duplicate records, reports that are a little… off. Sound familiar? Merging dupes one-by-one, native solutions, custom code, and spreadsheets can only go so far. You shouldn’t have to think twice about the quality of your customer data. Or spend lots of time cleaning and managing Salesforce. You’ve spent too long risking relationships, losing opportunities, and dealing with clutter. It’s time to fix it. Imagine a tool, just one, that turns your dirty, confusing, unreliable Salesforce data into an efficient, lead-nurturing, sales-producing machine.
    Starting Price: $1096 per year
  • 24
    DemandTools

    DemandTools

    Validity

    The #1 global data quality tool thousands of Salesforce administrators trust. Improve overall productivity in managing large data sets. Identify and deduplicate data within any database table. Perform multi-table mass manipulation and standardization of Salesforce objects. Bolster Lead conversion with a robust, customizable toolset. With its feature-rich data quality toolset, you can use DemandTools to cleanse, standardize, compare records, and more. With Validity Connect, you will have access to the EmailConnect module to verify email addresses on Contacts and Leads in bulk. Manage all aspects of your data in bulk with repeatable processes instead of record by record or need by need. Dedupe, standardize, and assign records automatically as they come in from spreadsheets, end user entry, and integrations. Get clean data to improve the performance of sales, marketing, and support, as well as the revenue and retention they generate.
  • 25
    DataMatch

    DataMatch

    Data Ladder

    DataMatch Enterprise™ solution is a highly visual data cleansing application specifically designed to resolve customer and contact data quality issues. The platform leverages multiple proprietary and standard algorithms to identify phonetic, fuzzy, miskeyed, abbreviated, and domain-specific variations. Build scalable configurations for deduplication & record linkage, suppression, enhancement, extraction, and standardization of business and customer data and create a Single Source of Truth to maximize the impact of your data across the enterprise.
  • 26
    Firstlogic

    Firstlogic

    Firstlogic

    Validate and verify your address data by checking them against official Postal Authority databases. Increase delivery rates, minimize returned mail and realize postal discounts. Connect address datasources to our enterprise-class cleansing transforms. Then, you'll be ready to validate and verify your address data. Increase delivery rates, minimize returned mail and realize postal discounts. Identify individual data elements within your address data and break them out into their component parts. Eliminate common spelling mistakes & format address data to comply with industry standards & improve mail delivery. Confirm an address’s existence against the official USPS address database. Check whether the address is residential or business and if the address is deliverable using USPS Delivery Point Validation (DPV). Merge validated data back to multiple disparate data sources or produce customized output files to use in your organization's workflow.
  • 27
    Datactics

    Datactics

    Datactics

    Profile, cleanse, match and deduplicate data in drag-and-drop rules studio. Lo-code UI means no programming skill required, putting power in the hands of subject matter experts. Add AI & machine learning to your existing data management processes In order to reduce manual effort and increase accuracy, providing full transparency on machine-led decisions with human-in-the-loop. Offering award-winning data quality and matching capabilities across multiple industries, our self-service solutions are rapidly configured within weeks with specialist assistance available from Datactics data engineers. With Datactics you can easily measure data to regulatory & industry standards, fix breaches in bulk and push into reporting tools, with full visibility and audit trail for Chief Risk Officers. Augment data matching into Legal Entity Masters for Client Lifecycle Management.
  • 28
    iCEDQ

    iCEDQ

    Torana

    iCEDQ is a DataOps platform for testing and monitoring. iCEDQ is an agile rules engine for automated ETL Testing, Data Migration Testing, and Big Data Testing. It improves the productivity and shortens project timelines of testing data warehouse and ETL projects with powerful features. Identify data issues in your Data Warehouse, Big Data and Data Migration Projects. Use the iCEDQ platform to completely transform your ETL and Data Warehouse Testing landscape by automating it end to end by letting the user focus on analyzing and fixing the issues. The very first edition of iCEDQ designed to test and validate any volume of data using our in-memory engine. It supports complex validation with the help of SQL and Groovy. It is designed for high-performance Data Warehouse Testing. It scales based on the number of cores on the server and is 5X faster than the standard edition.
  • 29
    BiG EVAL

    BiG EVAL

    BiG EVAL

    The BiG EVAL solution platform provides powerful software tools needed to assure and improve data quality during the whole lifecycle of information. BiG EVAL's data quality management and data testing software tools are based on the BiG EVAL platform - a comprehensive code base aimed for high performance and high flexibility data validation. All features provided were built by practical experience based on the cooperation with our customers. Assuring a high data quality during the whole life cycle of your data is a crucial part of your data governance and is very important to get the most business value out of your data. This is where the automation solution BiG EVAL DQM comes in and supports you in all tasks regarding data quality management. Ongoing quality checks validate your enterprise data continuously, provide a quality metric and supports you in solving the quality issues. BiG EVAL DTA lets you automate testing tasks in your data oriented project.
  • 30
    Anomalo

    Anomalo

    Anomalo

    Anomalo helps you get ahead of data issues by automatically detecting them as soon as they appear in your data and before anyone else is impacted. Detect, root-cause, and resolve issues quickly – allowing everyone to feel confident in the data driving your business. Connect Anomalo to your Enterprise Data Warehouse and begin monitoring the tables you care about within minutes. Our advanced machine learning will automatically learn the historical structure and patterns of your data, allowing us to alert you to many issues without the need to create rules or set thresholds.‍ You can also fine-tune and direct our monitoring in a couple of clicks via Anomalo’s No Code UI. Detecting an issue is not enough. Anomalo’s alerts offer rich visualizations and statistical summaries of what’s happening to allow you to quickly understand the magnitude and implications of the problem.‍
  • 31
    LinkageWiz

    LinkageWiz

    LinkageWiz

    Powerful Probabilistic Data Matching algorithms are used, using common identifiers such as name, date of birth, sex, address, SSN, business name and many others. Data can be imported from a wide range of desktop and corporate database systems. Data matching software will enable the detection of up to 99% or higher of all potential matches. For business this can represent considerable extra potential revenue or cost savings, increased fraud detection and, for medical research can mean the difference between a successful research project and one that failed to report any significant findings. LinkageWiz is fast, user friendly and represents outstanding value as it bundles many of the features provided by many other separate products into a single stand-alone package.
    Starting Price: $199 one-time payment
  • 32
    Syniti Data Quality
    Data has the power to disrupt markets and break new boundaries, but only when it’s trusted and understood. By leveraging our AI/ML-enhanced, cloud-based solution built with 25 years of best practices and proven data quality reports, stakeholders in your organization can work together to crowdsource data excellence. Quickly identify data quality issues and expedite remediation with embedded best practices and hundreds of pre-built reports. Cleanse data in advance of, or during, data migration, and track data quality in real-time with customizable data intelligence dashboards. Continuously monitor data objects and automatically initiate remediation workflows and direct them to the appropriate data owners. Consolidate data in a single, cloud-based platform and reuse knowledge to accelerate future data initiatives. Minimize effort and improve outcomes with every data stakeholder working in a single system.
  • 33
    SAP Data Services
    Maximize the value of all your organization’s structured and unstructured data with exceptional functionalities for data integration, quality, and cleansing. SAP Data Services software improves the quality of data across the enterprise. As part of the information management layer of SAP’s Business Technology Platform, it delivers trusted,relevant, and timely information to drive better business outcomes. Transform your data into a trusted, ever-ready resource for business insight and use it to streamline processes and maximize efficiency. Gain contextual insight and unlock the true value of your data by creating a complete view of your information with access to data of any size and from any source. Improve decision-making and operational efficiency by standardizing and matching data to reduce duplicates, identify relationships, and correct quality issues proactively. Unify critical data on premise, in the cloud, or within Big Data by using intuitive tools.
  • 34
    Shinydocs

    Shinydocs

    Shinydocs

    Across industries and around the world, organizations are struggling to get a handle on their data. Don’t fall behind; stay ahead of the curve with intelligent solutions. Shinydocs makes it easier than ever to locate, secure and understand your data. We simplify and automate records management processes so people can find what they need when they need it. Most importantly, your employees won’t need additional training or have to change the way they work. Our cognitive suite analyzes all of your data at machine speeds. With its many robust built-in tools, you can demystify your data and get meaningful insights so you can make better business decisions. Our flagship product, Shinydrive helps organizations realize the full potential of its ECM investment and extract 100% of the value of its managed data. We deliver on the promise of ECM and provide the same exceptional execution into Data Management in the cloud.
  • 35
    Informatica MDM

    Informatica MDM

    Informatica

    Our market-leading, multidomain solution supports any master data domain, implementation style, and use case, in the cloud or on premises. Integrates best-in-class data integration, data quality, business process management, and data privacy. Tackle complex issues head-on with trusted views of business-critical master data. Automatically link master, transaction, and interaction data relationships across master data domains. Increase accuracy of data records with contact data verification, B2B, and B2C enrichment services. Update multiple master data records, dynamic data models, and collaborative workflows with one click. Reduce maintenance costs and speed deployment with AI-powered match tuning and rule recommendations. Increase productivity using search and pre-configured, highly granular charts and dashboards. Create high-quality data that helps you improve business outcomes with trusted, relevant information.
  • 36
    CleanCRM

    CleanCRM

    ActivePrime

    CleanCRM is a data cleansing tool for your CRM. To dedupe data, you shouldn’t have to work manually. Our tool changes your workflow, deduping in bulk. Do in minutes what would normally takes hours or days to complete. Dedupe data with ease. Not all data cleansing tools are the same. With CleanCRM, you’ll experience a quick and easy way to dedupe. With cleaner, more reliable data, employees will use the CRM more, increasing adoption rates. Watch the video to see how it works. Our data cleansing tool embeds directly into your CRM. You won’t have to log into another system. You can run a deduplication scan in minutes, without the tediousness of importing and exporting data. You can dedupe all records: accounts, contacts, and leads. Then you’ll have a chance to review all results and take action. The process automatically labels duplicate sets for quick review and edits. Get back time and resources with this intelligent tool.
  • 37
    Egon

    Egon

    Ware Place

    Address quality software and geocoding. Validate, deduplicate and maintain accurate and deliverable address data. The data quality demonstrates the accuracy and completeness with which certain data represent the effective entity they refer to. Working for postal address verification and data quality means verifying, optimising and integrating the data in any address database so that it is reliable and functional to the purpose it was created for. In transports such as shipments, in data entry such as geomarketing, and in statistics such as mapping: there are numbers of sectors and operations which are based on the use of postal addresses. Quality archives and databases guarantee considerable economic and logistics savings for enterprise whose key to success is based on operations tuning. This is an added value which should not be underestimated to make work easier and more efficient. Egon is a data quality system online available directly by the web.
  • 38
    QuerySurge
    QuerySurge leverages AI to automate the data validation and ETL testing of Big Data, Data Warehouses, Business Intelligence Reports and Enterprise Apps/ERPs with full DevOps functionality for continuous testing. Use Cases - Data Warehouse & ETL Testing - Hadoop & NoSQL Testing - DevOps for Data / Continuous Testing - Data Migration Testing - BI Report Testing - Enterprise App/ERP Testing QuerySurge Features - Projects: Multi-project support - AI: automatically create datas validation tests based on data mappings - Smart Query Wizards: Create tests visually, without writing SQL - Data Quality at Speed: Automate the launch, execution, comparison & see results quickly - Test across 200+ platforms: Data Warehouses, Hadoop & NoSQL lakes, databases, flat files, XML, JSON, BI Reports - DevOps for Data & Continuous Testing: RESTful API with 60+ calls & integration with all mainstream solutions - Data Analytics & Data Intelligence:  Analytics dashboard & reports
  • 39
    Match2Lists

    Match2Lists

    Match2Lists

    Match2Lists is the fastest, easiest and most accurate way to Match, Merge and De-duplicate your data. With Our Match2D&B option, you can enrich your data with Dun & Bradstreet information on-demand. In just minutes, you can cleanse your data of duplicates and blend raw data from different sources into powerful information. Our first objective is maximum match results for our customers. Prior to creating Match2Lists, we ran analytics and data visualisation companies and used most "fuzzy" matching software on the market. Unsatisfied by their low match results, we spent 10 years developing the most advanced data matching logic. Our second objective is time: enable our customers to spend less time matching and cleansing data and more time analysing and executing. So we implemented our advanced matching logic on the fast in-memory cloud computing architecture we could find, capable of matching 200 million records in 30 seconds.
    Starting Price: $95 per month
  • 40
    Senzing

    Senzing

    Senzing

    Senzing® entity resolution API software provides the most advanced, affordable, and easy-to-use data matching and relationship detection capabilities available. With Senzing software, you can automatically resolve records into common entities in real time as new data is received. The complete view of all records related to every person or organization, across all of your internal and external data sources, can help you reduce costs and enable new revenue opportunities. Companies use Senzing entity resolution API to provide highly accurate views of people, organizations, and their relationships. You can deploy the Senzing entity resolution API on premises or in cloud-native deployments. Data remains in your ecosystem and never flows to Senzing. A free proof of concept can be completed in one day on AWS or on BareMetal. Senzing makes human-intelligent decisions without any pre-training or pre-tuning.
  • 41
    YData

    YData

    YData

    Adopting data-centric AI has never been easier with automated data quality profiling and synthetic data generation. We help data scientists to unlock data's full potential. YData Fabric empowers users to easily understand and manage data assets, synthetic data for fast data access, and pipelines for iterative and scalable flows. Better data, and more reliable models delivered at scale. Automate data profiling for simple and fast exploratory data analysis. Upload and connect to your datasets through an easily configurable interface. Generate synthetic data that mimics the statistical properties and behavior of the real data. Protect your sensitive data, augment your datasets, and improve the efficiency of your models by replacing real data or enriching it with synthetic data. Refine and improve processes with pipelines, consume the data, clean it, transform your data, and work its quality to boost machine learning models' performance.
  • 42
    DataMotto

    DataMotto

    DataMotto

    Your data almost always requires preprocessing to be ready for your needs. Our AI automates the tedious task of preparing and cleansing your data, saving you hours of work. Data analysts spend 80% of their time preprocessing and cleaning data for insights, a tedious, manual task. AI is a game-changer. Transform text columns like customer feedback into 0-5 numeric ratings. Identify patterns in customer feedback and create a new column for sentiment analysis. Remove unnecessary columns to focus on impactful data. Enriched with external data for comprehensive insights. Unreliable data leads to misguided decisions. Preparing high-quality, clean data should be the first priority in your data-driven decision-making process. Rest assured, we do not utilize your data to enhance our AI agents; your information remains strictly yours. We store your data with the most reliable and trusted cloud providers.
    Starting Price: $29 per month
  • 43
    Sweephy

    Sweephy

    Sweephy

    No-code data cleaning, preparing, and ML platform. Specialized development for business cases & on-premise setup for data privacy. Start to use Sweephy's free modules. No-code machine learning-powered tools. Just give the data and keywords that you are checking for. Our model can create a report based on keywords. It doesn't just check the words in the text, our model is classifying semantically and grammatically. Let us find similar or the same records in your database. Create a unified user database from different data sources with Sweephy Dedupu API. With Sweephy API, easily create object detection models by finetuning pre-trained models. Just send us some use cases, and we will create an appropriate model for you. Such as classifying documents, pdfs, receipts, or invoices. Just upload the image dataset. Our model will clean the noise on the image easily or we can create a finetuned model for your business case.
    Starting Price: €59 per month
  • 44
    Tableau Prep
    Tableau Prep changes the way traditional data prep is performed in an organization. By providing a visual and direct way to combine, shape and clean data, Tableau Prep makes it easier for analysts and business users to start their analysis, faster. Tableau Prep is comprised of two products: Tableau Prep Builder for building your data flows, and Tableau Prep Conductor for scheduling, monitoring and managing flows across the organization. Three coordinated views let you see row-level data, profiles of each column, and your entire data preparation process. Pick which view to interact with based on the task at hand. If you want to edit a value, you select and directly edit. Change your join type, and see the result right away. With each action, you instantly see your data change, even on millions of rows of data. Tableau Prep Builder gives you the freedom to re-order steps and experiment without consequence.
    Starting Price: $70 per user per month
  • 45
    INQDATA

    INQDATA

    INQDATA

    Cloud-based Data Science platform delivering intelligently curated and cleansed data, ready to be consumed. Firms face significant challenges, resource constraints, and high costs when managing their data before they can start adding any value. The data is ingested, cleansed, stored, accessed, and only then analyzed. But the analysis is where the value is. Our solution allows clients to focus on core business activities, not on the expensive, resource heavy data lifecycle. We take care of that. Cloud-native platform for real-time streaming analytics that fully leverages the benefits of cloud architecture to enable INQDATA to deliver fast scalable historical and real-time data without the complexity of infrastructure.
  • 46
    Email Hippo

    Email Hippo

    Email Hippo

    Email Hippo provides fast, accurate and secure email verification software, accessed via web app or API. The CORE product allows users to import lists of up to 500,000 emails and verify them directly within a self-service web app. MORE is an API product that can be used to check the validity of an email address in real time, looking at up to 74 data points for maximum accuracy. With ASSESS, users can check email addresses for common pre-fraud indicators. Email Hippo has provided email verification since 2000 and became ISO27001 certified in 2017.
    Starting Price: $10.00/one-time
  • 47
    Crux

    Crux

    Crux

    Find out why the heavy hitters are using the Crux external data automation platform to scale external data integration, transformation, and observability without increasing headcount. Our cloud-native data integration technology accelerates the ingestion, preparation, observability and ongoing delivery of any external dataset. The result is that we can ensure you get quality data in the right place, in the right format when you need it. Leverage automatic schema detection, delivery schedule inference, and lifecycle management to build pipelines from any external data source quickly. Enhance discoverability throughout your organization through a private catalog of linked and matched data products. Enrich, validate, and transform any dataset to quickly combine it with other data sources and accelerate analytics.
  • 48
    Tamr

    Tamr

    Tamr

    Tamr’s next-generation data mastering platform integrates machine learning with human feedback to break down data silos and continuously clean and deliver accurate data across your business. Tamr works with leading organizations around the world to solve their toughest data challenges. Tackle problems like duplicate records and errors to create a complete view of your data – from customers to product to suppliers. Next-generation data mastering integrates machine learning with human feedback to deliver clean data to drive business decisions. Feed clean data to analytics tools and operational systems, with 80% less effort than traditional approaches. From Customer 360 to reference data management, Tamr helps financial firms stay data-driven and accelerate business outcomes. Tamr helps the public sector meet mission requirements sooner through reduced manual workflows for data entity resolution.
  • 49
    Integrate.io

    Integrate.io

    Integrate.io

    Unify Your Data Stack: Experience the first no-code data pipeline platform and power enlightened decision making. Integrate.io is the only complete set of data solutions & connectors for easy building and managing of clean, secure data pipelines. Increase your data team's output with all of the simple, powerful tools & connectors you’ll ever need in one no-code data integration platform. Empower any size team to consistently deliver projects on-time & under budget. We ensure your success by partnering with you to truly understand your needs & desired outcomes. Our only goal is to help you overachieve yours. Integrate.io's Platform includes: -No-Code ETL & Reverse ETL: Drag & drop no-code data pipelines with 220+ out-of-the-box data transformations -Easy ELT & CDC :The Fastest Data Replication On The Market -Automated API Generation: Build Automated, Secure APIs in Minutes - Data Warehouse Monitoring: Finally Understand Your Warehouse Spend - FREE Data Observability: Custom
  • 50
    Airbyte

    Airbyte

    Airbyte

    Get all your ELT data pipelines running in minutes, even your custom ones. Let your team focus on insights and innovation. Unify your data integration pipelines in one open-source ELT platform. Airbyte addresses all your data team's connector needs, however custom they are and whatever your scale. The data integration platform that can scale with your custom or high-volume needs. From high-volume databases to the long tail of API sources. Leverage Airbyte’s long tail of high-quality connectors that adapt to schema and API changes. Extensible to unify all native & custom ELT. Edit pre-built open-source connectors, or build new ones with our connector development kit in a few hours. Transparent and scalable pricing. Finally, a transparent and predictable cost-based pricing that scales with your data needs. You don’t need to worry about volume anymore. No more need for custom systems for your in-house scripts or database replication.
    Starting Price: $2.50 per credit