Best Data Management Software - Page 92

Compare the Top Data Management Software as of June 2026 - Page 92

  • 1
    Graph Engine

    Microsoft

    Graph Engine (GE) is a distributed in-memory data processing engine, underpinned by a strongly-typed RAM store and a general distributed computation engine. The distributed RAM store provides a globally addressable, high-performance key-value store over a cluster of machines. Through the RAM store, GE enables fast random data access over a large distributed data set. This capability for fast data exploration and distributed parallel computing makes GE a natural platform for processing large graphs. GE supports both low-latency online query processing and high-throughput offline analytics on billion-node graphs. Schema matters when data must be processed efficiently: strongly-typed data modeling is crucial for compact data storage, fast data access, and clear data semantics. GE is good at managing billions of run-time objects of varied sizes, and every byte counts as the number of objects grows large. GE provides fast memory allocation and reallocation with high memory utilization ratios.
  • 2
    AnzoGraph DB

    Cambridge Semantics

    With a large collection of analytical features, AnzoGraph DB can enhance your analytical framework. AnzoGraph DB is a Massively Parallel Processing (MPP) native graph database built for data harmonization and analytics. It scales horizontally and is designed for online analytics. Take on data harmonization and linked data challenges with AnzoGraph DB, a market-leading analytical graph database. AnzoGraph DB provides industrialized online performance for enterprise-scale graph applications. AnzoGraph DB uses familiar SPARQL*/OWL for semantic graphs but also supports Labeled Property Graphs (LPGs). Access to many analytical, machine learning, and data science capabilities helps you achieve new insights, delivered at unparalleled speed and scale. Use context and relationships between data as first-class citizens in your analysis. Ultra-fast data loading and analytical queries.
  • 3
    Sparksee

    Sparsity Technologies

    Sparksee (formerly known as DEX) makes space and performance compatible, combining a small footprint with fast analysis of large networks. It is natively available for .NET, C++, Python, Objective-C, and Java, and covers the whole spectrum of operating systems. The graph is represented through bitmap data structures that allow high compression rates. Each bitmap is partitioned into chunks that fit into disk pages to improve I/O locality. Using bitmaps, operations are computed with binary logic instructions that simplify execution in pipelined processors. Full native indexing allows extremely fast access to each of the graph data structures. Node adjacencies are represented by bitmaps to minimize their footprint. Advanced I/O policies minimize the number of times each data page is brought into memory. Each value in the database is represented only once, avoiding unnecessary replication.
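The idea of storing node adjacencies as bitmaps and answering graph questions with binary logic can be illustrated with a minimal sketch. This is not Sparksee's API or storage format: the function names (`make_bitmap`, `common_neighbors`, and so on) and the tiny example graph are hypothetical, and plain Python integers stand in for compressed, page-partitioned bitmaps.

```python
# Hypothetical sketch: each node's neighbor set is one bitmap (a Python int),
# where bit i is set when node i is adjacent. Not Sparksee's actual API.

def make_bitmap(node_ids):
    """Build a bitmap with one bit set per node id."""
    bitmap = 0
    for node_id in node_ids:
        bitmap |= 1 << node_id
    return bitmap


def bitmap_to_ids(bitmap):
    """Return the sorted node ids whose bits are set."""
    ids = []
    position = 0
    while bitmap:
        if bitmap & 1:
            ids.append(position)
        bitmap >>= 1
        position += 1
    return ids


# Example undirected graph (assumed for illustration): node -> neighbor bitmap.
adjacency = {
    0: make_bitmap([1, 2, 3]),
    1: make_bitmap([0, 2]),
    2: make_bitmap([0, 1, 3]),
    3: make_bitmap([0, 2]),
}


def common_neighbors(a, b):
    """Nodes adjacent to both a and b: a single bitwise AND."""
    return bitmap_to_ids(adjacency[a] & adjacency[b])


def neighbors_of_either(a, b):
    """Nodes adjacent to a or b, excluding a and b themselves: OR, then mask."""
    combined = adjacency[a] | adjacency[b]
    return bitmap_to_ids(combined & ~make_bitmap([a, b]))
```

For instance, `common_neighbors(0, 2)` intersects two neighbor sets with one AND instead of comparing lists element by element, which is the kind of operation the entry attributes to binary logic instructions.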
  • 4
    TerminusDB

    TerminusDB

    Making data collaboration easy. If you are a developer looking to innovate or a data person looking for version control, we make collaboration work for everyone. TerminusDB is an open-source knowledge graph database that provides reliable, private & efficient revision control & collaboration. If you want to collaborate with colleagues or build data-intensive applications, nothing will make you more productive. TerminusDB provides the full suite of revision control features. TerminusHub allows users to manage access to databases and collaboratively work on shared resources. Flexible data storage, sharing, and versioning capabilities. Collaboration for your team or integrated into your app. Work locally then sync when you push your changes. Easy querying, cleaning, and visualization. Integrate powerful version control and collaboration for your enterprise and individual customers. Make it easy for remote data teams to work together on data projects.
  • 5
    TIBCO Graph Database
    To unveil the true value of constantly evolving business data, you need to understand the relationships in data in a much more profound way. Unlike other databases, a graph database puts relationships at the forefront, using Graph theory and Linear Algebra to traverse and show how complex data webs, data sources, and data points relate. TIBCO® Graph Database allows you to discover, store, and convert complex dynamic data into meaningful insights. Enable users to rapidly build data and computational models that establish dynamic relationships among organizational silos. These knowledge graphs deliver value by connecting your organization’s vast array of data and revealing relationships that let you accelerate optimization of assets and processes. Combined OLTP and OLAP features in a single enterprise-grade database. Optimistic ACID level transaction properties with native storage and access.
  • 6
    Vendia

    Vendia

    Vendia is a SaaS platform that makes it easy for companies and organizations to share code and data across companies, clouds, regions, accounts, and technology stacks. Vendia's unique architecture offers a distributed data model that goes everywhere you need it to, and its serverless design enables it to scale seamlessly. Vendia helps businesses create a complete portrait of their data, for example to track and trace items in a supply chain. Often that information spans business parties, such as suppliers, logistics, affiliates, and others. These might be different legal entities, different departments within the same enterprise, or even the same department divided by its adoption of different public cloud services, such as one team using AWS and another using Azure.
  • 7
    Enlyft

    Enlyft

    Enlyft helps B2B companies generate better leads, close more deals, and acquire more customers, faster. Enlyft’s AI-driven customer intelligence platform leverages machine learning to profile and predict the buying behavior of millions of companies worldwide, based on technology use, hundreds of business attributes, and real-time buyer intent signals. Increase sales by quickly discovering, prioritizing, and engaging with prospects likely to buy your solution. Enlyft’s proprietary data platform contains real-time information on company firmographics, technology usage, buying intent signals, and hundreds of additional account attributes. Leverage dedicated machine-learning-based models to predict future outcomes by combining Enlyft’s comprehensive account insights with your customer history. Seamlessly integrate account insights into popular B2B sales and marketing platforms like Salesforce, HubSpot, Dynamics 365, LinkedIn, and more. Enrich records and keep data fresh.
  • 8
    TiMi

    TIMi

    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and more easily than ever before. The heart of TIMi’s Integrated Platform. TIMi’s ultimate real-time AUTO-ML engine. 3D VR segmentation and visualization. Unlimited self-service business intelligence. TIMi is several orders of magnitude faster than any other solution at the two most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee that you can work with peace of mind and without unexpected extra costs. Thanks to an original and unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 9
    IBM DataStage
    Accelerate AI innovation with cloud-native data integration on IBM Cloud Pak for Data. AI-powered data integration, anywhere. Your AI and analytics are only as good as the data that fuels them. With a modern container-based architecture, IBM® DataStage® for IBM Cloud Pak® for Data delivers that high-quality data. It combines industry-leading data integration with DataOps, governance, and analytics on a single data and AI platform. Automation accelerates administrative tasks to help reduce TCO. AI-based design accelerators and out-of-the-box integration with DataOps and data science services speed AI innovation. Parallelism and multicloud integration let you deliver trusted data at scale across hybrid or multicloud environments. Manage the data and analytics lifecycle on the IBM Cloud Pak for Data platform. Services include data science, event messaging, data virtualization, and data warehousing. Parallel engine and automated load balancing.
  • 10
    Delta Lake

    Delta Lake

    Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple data pipelines reading and writing data concurrently, and without transactions, data engineers have to go through a tedious process to ensure data integrity. Delta Lake brings ACID transactions to your data lakes. It provides serializability, the strongest isolation level. Learn more at Diving into Delta Lake: Unpacking the Transaction Log. In big data, even the metadata itself can be "big data". Delta Lake treats metadata just like data, leveraging Spark's distributed processing power to handle all its metadata. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files with ease. Delta Lake provides snapshots of data, enabling developers to access and revert to earlier versions of data for audits, rollbacks, or to reproduce experiments.
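The relationship between a transaction log and versioned snapshots can be sketched in a few lines. This is a simplified illustration of the log-replay idea only, not Delta Lake's API or on-disk format: the `VersionedTable` class, its methods, and the row-level `"add"` / `"remove"` actions are assumptions made for the example.

```python
# Hypothetical sketch: an append-only commit log from which any earlier
# table version can be rebuilt by replay. Not Delta Lake's API.

class VersionedTable:
    def __init__(self):
        # Each entry is one committed batch of (action, row) pairs.
        self._log = []

    def commit(self, actions):
        """Append one batch of actions as a single commit; return its version."""
        self._log.append(list(actions))
        return len(self._log) - 1

    def snapshot(self, version=None):
        """Rebuild the table as of `version` (latest when omitted)."""
        if version is None:
            version = len(self._log) - 1
        if version < -1 or version >= len(self._log):
            raise ValueError(f"unknown version: {version}")
        rows = []
        # Replay every commit up to and including the requested version.
        for batch in self._log[: version + 1]:
            for action, row in batch:
                if action == "add":
                    rows.append(row)
                elif action == "remove":
                    rows.remove(row)
        return rows


table = VersionedTable()
v0 = table.commit([("add", {"id": 1}), ("add", {"id": 2})])
v1 = table.commit([("remove", {"id": 1})])
```

Here `table.snapshot(v0)` still returns both rows after the later removal, which mirrors how earlier versions remain available for audits or rollbacks.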
  • 11
    Rocket Data Intelligence
    Rocket® Data Intelligence (RDI) delivers comprehensive visibility into enterprise data across mainframe, distributed, and cloud environments. It automatically discovers metadata, lineage, and data relationships so organizations can see where critical data resides, how it moves, and which applications and processes rely on it. RDI supports legacy and modern platforms, including Db2, VSAM, IMS, Adabas, Datacom, relational databases, ETL tools like Informatica and DataStage, code such as COBOL, Python, and Java, and cloud data stores. RDI provides enterprise-grade capabilities including automated data discovery and code parsing, impact analysis, lineage filtering, role/LOB-based categorization and governance, workflow management, business glossary, and dependency mapping. By unifying data asset visibility across hybrid environments, RDI reduces operational risk and accelerates data modernization, compliance reporting, discovery, and rationalization initiatives.
  • 12
    Kylo

    Teradata

    Kylo is an open source, enterprise-ready data lake management software platform for self-service data ingest and data preparation, with integrated metadata management, governance, security, and best practices inspired by Think Big's 150+ big data implementation projects. Self-service data ingest with data cleansing, validation, and automatic profiling. Wrangle data with visual SQL and an interactive transform through a simple user interface. Search and explore data and metadata, view lineage, and profile statistics. Monitor health of feeds and services in the data lake. Track SLAs and troubleshoot performance. Design batch or streaming pipeline templates in Apache NiFi and register them with Kylo to enable user self-service. Organizations can expend significant engineering effort moving data into Hadoop yet struggle to maintain governance and data quality. Kylo dramatically simplifies data ingest by shifting ingest to data owners through a simple guided UI.
  • 13
    Tokern

    Tokern

    Open source data governance suite for databases and data lakes. Tokern is a simple-to-use toolkit to collect, organize, and analyze a data lake's metadata. Run as a command-line app for quick tasks. Run as a service for continuous collection of metadata. Analyze lineage, access control, and PII datasets using reporting dashboards or programmatically in Jupyter notebooks. Improve the ROI of your data, comply with regulations like HIPAA, CCPA, and GDPR, and protect critical data from insider threats with confidence. Centralized metadata management of users, datasets, and jobs powers other data governance features. Track column-level data lineage for Snowflake, AWS Redshift, and BigQuery. Build lineage from query history or ETL scripts. Explore lineage using interactive graphs or programmatically using APIs or SDKs.
  • 14
    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services, enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allowing integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets, and provide collaboration capabilities around these data assets for data scientists, analysts, and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, and object references, and can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 15
    Truedat

    Bluetab Solutions

    Truedat is an open source data governance business solution developed by Bluetab Solutions to help our clients become data-driven companies. We help define business processes, roles, and responsibilities. We also help put processes into practice. Integration and customization of Truedat's open source components to support data governance processes. We guarantee the support and maintenance of the process and software of the solution modules we install. Based on our experience, we have developed a solution that covers the need for data governance, allowing you to manage and control highly complex and changing data architectures. The rapidly increasing migration of enterprise IT platforms to cloud, multi-cloud, and hybrid architectures increases the sources, complexity, and types of data, and therefore raises the need for Truedat. Our solution comes from more than 8 years of experience in data governance consulting and development projects.
  • 16
    Parrot Analytics

    Parrot Analytics

    Helping media companies get smarter. The world’s largest studios, networks and OTT platforms apply our 360 degree view of content to optimize monetization decisions. Partner with Parrot Analytics to understand how to harness demand measurement to compete and thrive in the global attention economy. Our DEMAND360 platform captures consumption and engagement data from billions of TV fans around the world each day to provide unprecedented insight into global cross-platform audience demand. DEMAND360 captures demand signals in every country on the planet. Our language-agnostic platform uncovers global demand for local productions. We measure total market demand across SVOD, AVOD, linear and cable. We have created a holistic global measurement standard that integrates signals across consumer research data sources, P2P streaming/downloads and social media.
  • 17
    Factiva

    Dow Jones

    Gain unique insights from the world’s most comprehensive collection of news and data. Rely on an unrivaled selection of global news and data accessible via a powerful research platform, on mobile devices or integrated via advanced feeds and APIs. Power strategic decisions, uncover competitive advantage and deliver actionable intelligence with global news, data and insights. Generate deeper insights, improve sentiment analysis, uncover hidden relationships, accurately forecast and enrich data visualizations with news data derived from advanced analytics models. Keep track of your brand's reputation and stay ahead of potential issues by leveraging a database of content from 200 countries in 28 languages, comprehensive monitoring tools and curation solutions. Gather market intelligence, monitor competitors and provide strategic guidance with trusted world news, detailed company and executive data and multi-channel delivery.
  • 18
    Privacera

    Privacera

    At the intersection of data governance, privacy, and security, Privacera’s unified data access governance platform maximizes the value of data by providing secure data access control and governance across hybrid- and multi-cloud environments. The hybrid platform centralizes access and natively enforces policies across multiple cloud services—AWS, Azure, Google Cloud, Databricks, Snowflake, Starburst and more—to democratize trusted data enterprise-wide without compromising compliance with regulations such as GDPR, CCPA, LGPD, or HIPAA. Trusted by Fortune 500 customers across finance, insurance, retail, healthcare, media, public and the federal sector, Privacera is the industry’s leading data access governance platform that delivers unmatched scalability, elasticity, and performance. Headquartered in Fremont, California, Privacera was founded in 2016 to manage cloud data privacy and security by the creators of Apache Ranger™ and Apache Atlas™.
  • 19
    Oracle Coherence
    Oracle Coherence is the industry-leading in-memory data grid solution that enables organizations to predictably scale mission-critical applications by providing fast access to frequently used data. As data volumes and customer expectations increase, driven by the “internet of things”, social, mobile, cloud, and always-connected devices, so does the need to handle more data in real time, offload over-burdened shared data services, and provide availability guarantees. The latest release of Oracle Coherence, 14.1.1, adds a patented scalable messaging implementation, support for polyglot grid-side programming on GraalVM, distributed tracing in the grid, and certification on JDK 11. Coherence stores each piece of data within multiple members (one primary and one or more backup copies), and doesn't consider any mutating operation complete until the backup(s) are successfully created. This ensures that your data grid can tolerate failure at any level, from a single JVM to a whole data center.
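The primary-plus-backup behavior described in the Coherence entry, where a mutation counts as complete only once the backups hold it, can be sketched as follows. This is an illustration of the general technique under simplifying assumptions, not Coherence's API: the `Member` and `ReplicatedMap` classes are hypothetical, everything runs in one process, and a failed member is modeled with a simple `alive` flag.

```python
# Hypothetical sketch of synchronous primary/backup replication.
# Not Oracle Coherence's API; names and failure model are assumptions.

class Member:
    """One storage node holding a copy of the data."""

    def __init__(self, name):
        self.name = name
        self.alive = True
        self.store = {}

    def write(self, key, value):
        if not self.alive:
            raise ConnectionError(f"{self.name} is down")
        self.store[key] = value


class ReplicatedMap:
    def __init__(self, primary, backups):
        self.primary = primary
        self.backups = list(backups)

    def put(self, key, value):
        """Return only after the primary and every backup hold the value."""
        self.primary.write(key, value)
        for backup in self.backups:
            # A failed backup write raises, so the caller never sees
            # the mutation reported as complete.
            backup.write(key, value)
        return True

    def get(self, key):
        """Read from the primary, falling back to the first live backup."""
        for member in [self.primary, *self.backups]:
            if member.alive:
                return member.store[key]
        raise ConnectionError("no live member holds the data")


primary, backup = Member("primary"), Member("backup")
grid = ReplicatedMap(primary, [backup])
grid.put("order-42", "shipped")
primary.alive = False  # simulate losing the primary
```

After the simulated failure, `grid.get("order-42")` is served from the backup, because the earlier `put` did not return until the backup copy existed.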
  • 20
    Microsoft Power Query
    Power Query is the easiest way to connect, extract, transform, and load data from a wide range of sources. Power Query is a data transformation and data preparation engine. Power Query comes with a graphical interface for getting data from sources and a Power Query Editor for applying transformations. Because the engine is available in many products and services, the destination where the data will be stored depends on where Power Query was used. Using Power Query, you can perform the extract, transform, and load (ETL) processing of data. Power Query is Microsoft’s data connectivity and data preparation technology that lets you seamlessly access data stored in hundreds of sources and reshape it to fit your needs, all with an easy-to-use, engaging, no-code experience. Power Query supports hundreds of data sources with built-in connectors, generic interfaces (such as REST APIs, ODBC, OLE DB, and OData), and the Power Query SDK to build your own connectors.
  • 21
    DeepSee

    DeepSee

    Putting humans back in charge of the automation. DeepSee empowers knowledge workers with AI techniques to turn data into powerful business assets. Solving real problems for real people. Knowledge is power, and equipping subject-matter experts with the right tools to sift through all the noise has never been more critical to business success. DeepSee created the Knowledge Process Automation (KPA) platform to mine unstructured data, operationalize AI-powered insights, and automate results into real-time action for the enterprise. We’re putting deep knowledge and the power of AI back into human hands. For enterprises across every major business sector, driving strong performance isn’t just about tracking KPIs. Today, competitive advantage is fueled by understanding trends, predictions, and outliers. The DeepSee platform extracts, processes, and transforms untapped data into these key competitive insights in real time — eliminating complexities between analysis and action.
  • 22
    QEDIT

    QEDIT

    QEDIT is an enterprise-ready, cross-organizational data collaboration platform engineered for the new data economy. We leverage the latest innovations in privacy-enhancing technology to help you safely monetize data assets, improve business analytics processes and gain actionable insights from 2nd parties in a risk-free environment. Our highly scalable, cloud-hosted platform seamlessly integrates with legacy database systems so you can be up and running in no time. QEDIT provides you with timely, business-critical intelligence through a configurable dashboard, advanced reporting functionality, real-time notifications and more. QEDIT empowers companies to engage in regulatory-compliant data collaboration to accelerate growth, mitigate risk and solve complex business problems. QEDIT is an enterprise-ready, secure data collaboration platform that enables companies to share intelligence and monetize data insights derived from external sources, without revealing confidential information.
  • 23
    SAS Data Loader for Hadoop
    Load your data into or out of Hadoop and data lakes. Prep it so it's ready for reports, visualizations, or advanced analytics, all inside the data lakes. And do it all yourself, quickly and easily. Makes it easy to access, transform, and manage data stored in Hadoop or data lakes with a web-based interface that reduces training requirements. Built from the ground up to manage big data on Hadoop or in data lakes; not repurposed from existing IT-focused tools. Lets you group multiple directives to run simultaneously or one after the other. Schedule and automate directives using the exposed Public API. Enables you to share and secure directives. Call them from SAS Data Integration Studio, uniting technical and nontechnical user activities. Includes built-in directives: casing, gender and pattern analysis, field extraction, match-merge, and cluster-survive. Profiling runs in parallel on the Hadoop cluster for better performance.
  • 24
    Sentrana

    Sentrana

    Whether your data is trapped in silos or you’re generating data at the edge, Sentrana gives you the flexibility to create AI and data engineering pipelines wherever your data is. And you can share your AI, Data, and Pipelines with anyone anywhere. With Sentrana, you can achieve newfound agility to effortlessly move between compute environments, while all your data and your work replicates automatically to wherever you want. Sentrana provides a large inventory of building blocks from which you can stitch together custom AI and Data Engineering pipelines. Rapidly assemble and test many different pipelines to create the AI you need. Turn your data into AI with near-zero effort and cost. Since Sentrana is an open platform, newer cutting-edge AI building blocks that are emerging every day are put right at your fingertips. Sentrana turns the Pipelines and AI models you create into re-executable building blocks that anyone on your team can hook into their own pipelines.
  • 25
    Talend Data Preparation
    Quickly prepare data for trusted insights throughout the organization. Data and business analysts spend too much time cleaning data instead of analyzing it. Talend Data Preparation provides a self-service, browser-based, point-and-click tool to quickly identify errors and apply rules that you can easily reuse and share, even across massive data sets. Our intuitive UI and self-service data preparation and curation functionality make it possible for anyone to do data profiling, cleansing, and enriching in real time. Users can share preparations and curated datasets, and embed data preparations into batch, bulk, and live data integration scenarios. Talend lets you turn ad-hoc data enrichment and analysis jobs into fully managed, reusable processes. Operationalize data preparation from virtually any data source, including Teradata, AWS, Salesforce, and Marketo, always using the latest datasets. Talend Data Preparation puts data governance in your hands.
  • 26
    Binary Demand

    Binary Demand

    Data is the fuel for any successful sales and marketing strategy. Data deteriorates by 2% every month. The relevance of data collated via email marketing naturally degrades by about 22.5% every year. The accuracy of your data can make or break a business’s marketing strategy, so an accurate, live database is indispensable. Binary Demand’s global contact database can help you overhaul your marketing campaigns and strategies. Your collated data deteriorates over time. Binary Demand provides custom solutions that prevent wastage of your data by making up for its natural degradation. Our customised data solutions include standardisation, de-duping, cleansing, verification, etc. This helps in creating a list of probable customers based on criteria such as geography, company size, job titles, industry, etc. Our high-accuracy, low-cost model makes us the best ROI-generating list partner in the marketplace.
  • 27
    Mongoose

    Mongoose

    Let's face it, writing MongoDB validation, casting and business logic boilerplate is a drag. That's why we wrote Mongoose. Now say we like fuzzy kittens and want to record every kitten we ever meet in MongoDB. The first thing we need to do is include mongoose in our project and open a connection to the test database on our locally running instance of MongoDB. We have a pending connection to the test database running on localhost. We now need to get notified if we connect successfully or if a connection error occurs. Mongoose documents represent a one-to-one mapping to documents as stored in MongoDB. Each document is an instance of its Model. Subdocuments are documents embedded in other documents. In Mongoose, this means you can nest schemas in other schemas. Mongoose has two distinct notions of subdocuments: arrays of subdocuments and single nested subdocuments.
  • 28
    DataPreparator

    DataPreparator

    DataPreparator is a free software tool designed to assist with common tasks of data preparation (or data preprocessing) in data analysis and data mining. DataPreparator can assist you with exploring and preparing data in various ways prior to data analysis or data mining. It includes operators for cleaning, discretization, numeration, scaling, attribute selection, missing values, outliers, statistics, visualization, balancing, sampling, row selection, and several other tasks. Data access from text files, relational databases, and Excel workbooks. Handling of large volumes of data (data sets are not stored in computer memory, with the exception of Excel workbooks and result sets of some databases whose drivers do not support data streaming). Standalone tool, independent of any other tools. User-friendly graphical user interface. Operator chaining to create sequences of preprocessing transformations (operator tree). Creation of a model tree for test/execution data.
  • 29
    SAS MDM
    Integrate master data management technologies with those in SAS 9.4. SAS MDM is a web-based application that is accessed through the SAS Data Management Console. It provides a single, accurate, and unified view of corporate data, integrating information from various data sources into one master record. SAS® Data Remediation and SAS® Task Manager work together with SAS MDM as well as with other software offerings, such as SAS® Data Management and SAS® Data Quality. SAS Data Remediation enables users to manage and correct issues triggered by business rules in SAS MDM batch jobs and real-time processes. SAS Task Manager is a complementary application that integrates with SAS Workflow technologies, giving users direct access to a workflow that might have been initiated from another SAS application. Users can start, stop, and transition workflows that have been uploaded to the SAS Workflow server environment.
  • 30
    Zaloni Arena
    End-to-end DataOps built on an agile platform that improves and safeguards your data assets. Arena is the premier augmented data management platform. Our active data catalog enables self-service data enrichment and consumption to quickly control complex data environments. Customizable workflows that increase the accuracy and reliability of every data set. Use machine-learning to identify and align master data assets for better data decisioning. Complete lineage with detailed visualizations alongside masking and tokenization for superior security. We make data management easy. Arena catalogs your data, wherever it is and our extensible connections enable analytics to happen across your preferred tools. Conquer data sprawl challenges: Our software drives business and analytics success while providing the controls and extensibility needed across today’s decentralized, multi-cloud data complexity.