Alternatives to Nomic Atlas

Compare Nomic Atlas alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Nomic Atlas in 2024. Compare features, ratings, user reviews, pricing, and more from Nomic Atlas competitors and alternatives in order to make an informed decision for your business.

  • 1
    OneTrust Privacy & Data Governance Cloud
    Go beyond compliance and build trust through transparency, choice, and control. People demand greater control of their data, unlocking an opportunity for organizations to use these moments to build trust and deliver more valuable experiences. We provide privacy and data governance automation to help organizations better understand their data across the business, meet regulatory requirements, and operationalize risk mitigation to provide transparency and choice to individuals. Achieve data privacy compliance faster and build trust in your organization. Our platform helps break down silos across processes, workflows, and teams to operationalize regulatory compliance and enable trusted data use. Build proactive privacy programs rooted in global best practices, not reactive to individual regulations. Gain visibility into unknown risks to drive mitigation and risk-based decision making. Respect individual choice and embed privacy and security by default into the data lifecycle.
  • 2
    MANTA

    MANTA

    Manta

    Manta is the world-class automated approach to visualize, optimize, and modernize how data moves through your organization through code-level lineage. By automatically scanning your data environment with the power of 50+ out-of-the-box scanners, Manta builds a powerful map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily: • Increase productivity • Accelerate development • Shorten time-to-market • Reduce costs and manual effort • Run instant and accurate root cause and impact analyses • Scope and perform effective cloud migrations • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more) • Increase data quality • Enhance data privacy and data security
  • 3
    Pinecone

    Pinecone

    Pinecone

    Long-term memory for AI. The Pinecone vector database makes it easy to build high-performance vector search applications. Developer-friendly, fully managed, and easily scalable without infrastructure hassles. Once you have vector embeddings, manage and search through them in Pinecone to power semantic search, recommenders, and other applications that rely on relevant information retrieval. Ultra-low query latency, even with billions of items. Give users a great experience. Live index updates when you add, edit, or delete data. Your data is ready right away. Combine vector search with metadata filters for more relevant and faster results. Launch, use, and scale your vector search service with our easy API, without worrying about infrastructure or algorithms. We'll keep it running smoothly and securely.
  • 4
    Zilliz Cloud
    Zilliz Cloud is a fully managed vector database based on the popular open-source Milvus. Zilliz Cloud helps to unlock high-performance similarity searches with no previous experience or extra effort needed for infrastructure management. It is ultra-fast and enables 10x faster vector retrieval, a feat unparalleled by any other vector database management system. Zilliz includes support for multiple vector search indexes, built-in filtering, and complete data encryption in transit, a requirement for enterprise-grade applications. Zilliz is a cost-effective way to build similarity search, recommender systems, and anomaly detection into applications to keep that competitive edge.
  • 5
    Qdrant

    Qdrant

    Qdrant

    Qdrant is a vector similarity engine & vector database. It deploys as an API service providing search for the nearest high-dimensional vectors. With Qdrant, embeddings or neural network encoders can be turned into full-fledged applications for matching, searching, recommending, and much more! Provides the OpenAPI v3 specification to generate a client library in almost any programming language. Alternatively utilise ready-made client for Python or other programming languages with additional functionality. Implement a unique custom modification of the HNSW algorithm for Approximate Nearest Neighbor Search. Search with a State-of-the-Art speed and apply search filters without compromising on results. Support additional payload associated with vectors. Not only stores payload but also allows filter results based on payload values.
  • 6
    Embeddinghub

    Embeddinghub

    Featureform

    Operationalize your embeddings with one simple tool. Experience a comprehensive database designed to provide embedding functionality that, until now, required multiple platforms. Elevate your machine learning quickly and painlessly through Embeddinghub. Embeddings are dense, numerical representations of real-world objects and relationships, expressed as vectors. They are often created by first defining a supervised machine learning problem, known as a "surrogate problem." Embeddings intend to capture the semantics of the inputs they were derived from, subsequently getting shared and reused for improved learning across machine learning models. Embeddinghub lets you achieve this in a streamlined, intuitive way.
    Starting Price: Free
  • 7
    Chroma

    Chroma

    Chroma

    Chroma is an AI-native open-source embedding database. Chroma has all the tools you need to use embeddings. Chroma is building the database that learns. Pick up an issue, create a PR, or participate in our Discord and let the community know what features you would like.
    Starting Price: Free
  • 8
    Deep Lake

    Deep Lake

    activeloop

    Generative AI may be new, but we've been building for this day for the past 5 years. Deep Lake thus combines the power of both data lakes and vector databases to build and fine-tune enterprise-grade, LLM-based solutions, and iteratively improve them over time. Vector search does not resolve retrieval. To solve it, you need a serverless query for multi-modal data, including embeddings or metadata. Filter, search, & more from the cloud or your laptop. Visualize and understand your data, as well as the embeddings. Track & compare versions over time to improve your data & your model. Competitive businesses are not built on OpenAI APIs. Fine-tune your LLMs on your data. Efficiently stream data from remote storage to the GPUs as models are trained. Deep Lake datasets are visualized right in your browser or Jupyter Notebook. Instantly retrieve different versions of your data, materialize new datasets via queries on the fly, and stream them to PyTorch or TensorFlow.
    Starting Price: $995 per month
  • 9
    Weaviate

    Weaviate

    Weaviate

    Weaviate is an open-source vector database. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects. Whether you bring your own vectors or use one of the vectorization modules, you can index billions of data objects to search through. Combine multiple search techniques, such as keyword-based and vector search, to provide state-of-the-art search experiences. Improve your search results by piping them through LLM models like GPT-3 to create next-gen search experiences. Beyond search, Weaviate's next-gen vector database can power a wide range of innovative apps. Perform lightning-fast pure vector similarity search over raw vectors or data objects, even with filters. Combine keyword-based search with vector search techniques for state-of-the-art results. Use any generative model in combination with your data, for example to do Q&A over your dataset.
    Starting Price: Free
  • 10
    Superlinked

    Superlinked

    Superlinked

    Combine semantic relevance and user feedback to reliably retrieve the optimal document chunks in your retrieval augmented generation system. Combine semantic relevance and document freshness in your search system, because more recent results tend to be more accurate. Build a real-time personalized ecommerce product feed with user vectors constructed from SKU embeddings the user interacted with. Discover behavioral clusters of your customers using a vector index in your data warehouse. Describe and load your data, use spaces to construct your indices and run queries - all in-memory within a Python notebook.
  • 11
    LanceDB

    LanceDB

    LanceDB

    LanceDB is a developer-friendly, open source database for AI. From hyperscalable vector search and advanced retrieval for RAG to streaming training data and interactive exploration of large-scale AI datasets, LanceDB is the best foundation for your AI application. Installs in seconds and fits seamlessly into your existing data and AI toolchain. An embedded database (think SQLite or DuckDB) with native object storage integration, LanceDB can be deployed anywhere and easily scales to zero when not in use. From rapid prototyping to hyper-scale production, LanceDB delivers blazing-fast performance for search, analytics, and training for multimodal AI data. Leading AI companies have indexed billions of vectors and petabytes of text, images, and videos, at a fraction of the cost of other vector databases. More than just embedding. Filter, select, and stream training data directly from object storage to keep GPU utilization high.
    Starting Price: $16.03 per month
  • 12
    AtlasMap

    AtlasMap

    AtlasMap

    AtlasMap is a data mapping solution with interactive web based user interface, that simplifies configuring integrations between Java, XML, CSV and JSON data sources. You can design your data mapping with AtlasMap Data Mapper UI canvas, and then run that data mapping via runtime engine. In addition to plain Java API provided by runtime engine, camel-atlasmap Component is also available to perform data mapping as a part of Apache Camel route. There is also a Camel Quarkus extension available. The easiest way to use AtlasMap Data Mapper UI is the standalone mode. Or you can use it through a VS Code plugin. Historically, the AtlasMap Data Mapper UI was designed to work within Syndesis UI and it’s still a best way to experience full benefits of integrated typed data mapping with UI. You can install and run Syndesis by following the Syndesis Developer Handbook. You will find the AtlasMap Data Mapper UI under the integrations panel after selecting or adding an integration with a Data Mapper.
  • 13
    Marqo

    Marqo

    Marqo

    Marqo is more than a vector database, it's an end-to-end vector search engine. Vector generation, storage, and retrieval are handled out of the box through a single API. No need to bring your own embeddings. Accelerate your development cycle with Marqo. Index documents and begin searching in just a few lines of code. Create multimodal indexes and search combinations of images and text with ease. Choose from a range of open source models or bring your own. Build interesting and complex queries with ease. With Marqo you can compose queries with multiple weighted components. With Marqo, input pre-processing, machine learning inference, and storage are all included out of the box. Run Marqo in a Docker image on your laptop or scale it up to dozens of GPU inference nodes in the cloud. Marqo can be scaled to provide low-latency searches against multi-terabyte indexes. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images.
    Starting Price: $86.58 per month
  • 14
    Metal

    Metal

    Metal

    Metal is your production-ready, fully-managed, ML retrieval platform. Use Metal to find meaning in your unstructured data with embeddings. Metal is a managed service that allows you to build AI products without the hassle of managing infrastructure. Integrations with OpenAI, CLIP, and more. Easily process & chunk your documents. Take advantage of our system in production. Easily plug into the MetalRetriever. Simple /search endpoint for running ANN queries. Get started with a free account. Metal API Keys to use our API & SDKs. With your API Key, you can use authenticate by populating the headers. Learn how to use our Typescript SDK to implement Metal into your application. Although we love TypeScript, you can of course utilize this library in JavaScript. Mechanism to fine-tune your spp programmatically. Indexed vector database of your embeddings. Resources that represent your specific ML use-case.
    Starting Price: $25 per month
  • 15
    Feedonomics

    Feedonomics

    Feedonomics

    Feedonomics is the #1 full-service product feed platform. Easily publish optimized product listings on hundreds of ad channels and marketplaces, such as Google Shopping, Facebook, Amazon, eBay, Walmart, and more. Feedonomics gives you the freedom to focus on other areas of your business. As a merchant or an agency that supports one, you shouldn’t have to dedicate time and resources to cleaning up product data, setting up exports, troubleshooting errors, and staying up-to-date with feed requirements just to manage your product feeds. What does "full-service" mean? It means we offer a powerful feed management platform, PLUS a team of feed specialists who help with the following: - Catalog optimization and product categorization, integration setup, feed refresh scheduling, maintenance, resolving feed-based errors, and more. - 24/7 support - Data governance and error alerts - Personalized service and solutions for your specific data needs
  • 16
    Magnitude Angles
    Empower your business to answer the questions that matter most with self-service operational analytics and ready-to-run business reports across core processes. What if there was a way to really understand what’s going on in your organization? A way to not only report on events, but to react in real time to insights surfaced from deep within your supply chain, finance, manufacturing and distribution processes? Change the way you respond to the ever-shifting business landscape. Magnitude Angles helps you uncover insights previously locked deep in your SAP or Oracle ERP system and streamlines the data analysis process. Traditional BI tools understand rows, tables, and columns, but they have no concept of materials, orders, or cash. Angles is built on top of a context-aware, process-rich business data model that translates complex ERP data architectures into self-service business analytics, putting data closer to decision and helping turn data into insight, and insight into action.
  • 17
    Talend Data Integration
    Talend Data Integration lets you connect and manage all your data, no matter where it lives. Use more than 1,000 connectors and components to connect virtually any data source with virtually any data environment, in the cloud or on premises. Easily develop and deploy reusable data pipelines with a drag-and-drop interface that’s 10 times faster than hand-coding. Talend has always supported scaling massive data sets to advanced data analytics or Spark platforms. We also partner with leading cloud service providers, data warehouses, and analytics platforms, including Amazon Web Services, Microsoft Azure, Google Cloud Platform, Snowflake, and Databricks. With Talend, data quality is embedded into every step of the data integration processes. Discover, highlight, and fix issues as data moves through your systems, before inconsistencies can disrupt or impact crucial decisions. Connect to data where it lives, use it where you need it.
  • 18
    Aggua

    Aggua

    Aggua

    Aggua is a data fabric augmented AI platform that enables data and business teams Access to their data, creating Trust and giving practical Data Insights, for a more holistic, data-centric decision-making. Instead of wondering what is going on underneath the hood of your organization's data stack, become immediately informed with a few clicks. Get access to data cost insights, data lineage and documentation without needing to take time out of your data engineer's workday. Instead of spending a lot of time tracing what a data type change will break in your data pipelines, tables and infrastructure, with automated lineage, your data architects and engineers can spend less time manually going through logs and DAGs and more time actually making the changes to infrastructure.
  • 19
    Milvus

    Milvus

    The Milvus Project

    Vector database built for scalable similarity search. Open-source, highly scalable, and blazing fast. Store, index, and manage massive embedding vectors generated by deep neural networks and other machine learning (ML) models. With Milvus vector database, you can create a large-scale similarity search service in less than a minute. Simple and intuitive SDKs are also available for a variety of different languages. Milvus is hardware efficient and provides advanced indexing algorithms, achieving a 10x performance boost in retrieval speed. Milvus vector database has been battle-tested by over a thousand enterprise users in a variety of use cases. With extensive isolation of individual system components, Milvus is highly resilient and reliable. The distributed and high-throughput nature of Milvus makes it a natural fit for serving large-scale vector data. Milvus vector database adopts a systemic approach to cloud-nativity, separating compute from storage.
    Starting Price: Free
  • 20
    Azure AI Search

    Azure AI Search

    Microsoft

    Deliver high-quality responses with a vector database built for advanced retrieval augmented generation (RAG) and modern search. Focus on exponential growth with an enterprise-ready vector database that comes with security, compliance, and responsible AI practices built in. Build better applications with sophisticated retrieval strategies backed by decades of research and customer validation. Quickly deploy your generative AI app with seamless platform and data integrations for data sources, AI models, and frameworks. Automatically upload data from a wide range of supported Azure and third-party sources. Streamline vector data processing with built-in extraction, chunking, enrichment, and vectorization, all in one flow. Support for multivector, hybrid, multilingual, and metadata filtering. Move beyond vector-only search with keyword match scoring, reranking, geospatial search, and autocomplete.
    Starting Price: $0.11 per hour
  • 21
    SuperDuperDB

    SuperDuperDB

    SuperDuperDB

    Build and manage AI applications easily without needing to move your data to complex pipelines and specialized vector databases. Integrate AI and vector search directly with your database including real-time inference and model training. A single scalable deployment of all your AI models and APIs which is automatically kept up-to-date as new data is processed immediately. No need to introduce an additional database and duplicate your data to use vector search and build on top of it. SuperDuperDB enables vector search in your existing database. Integrate and combine models from Sklearn, PyTorch, and HuggingFace with AI APIs such as OpenAI to build even the most complex AI applications and workflows. Deploy all your AI models to automatically compute outputs (inference) in your datastore in a single environment with simple Python commands.
  • 22
    Astra DB

    Astra DB

    DataStax

    Astra DB from DataStax is vector database for developers that need to get accurate Generative AI applications into production, quickly and efficiently. Built on Apache Cassandra, Astra DB is the only vector database that can make vector updates immediately available to applications and scale to the largest real-time data and streaming workloads, securely on any cloud. Astra DB offers unprecedented serverless, pay as you go pricing and the flexibility of multi-cloud and open-source. You can store up to 80GB and/or perform 20 million operations per month. Securely connect to VPC peering and private links. Manage your encryption keys with your own key management and SAML SSO secure account accessibility. You can deploy on AWS, GCP, or Azure while still maintaining open-source Cassandra compatibility.
  • 23
    MyScale

    MyScale

    MyScale

    MyScale is an innovative AI database that seamlessly integrates vector search with SQL analytics, delivering a comprehensive, fully managed, and high-performance solution. Key Features: - Superior Data Capacity and Performance: Each MyScale pod supports 5 million 768-dimensional data points with exceptional accuracy, enabling over 150 queries per second (QPS). - Rapid Data Ingestion: Import up to 5 million data points in under 30 minutes, reducing waiting time and enabling faster utilization of your vector data. - Flexible Indexing: MyScale allows you to create multiple tables with unique vector indexes, efficiently managing diverse vector data within a single cluster. - Effortless Data Import and Backup: Seamlessly import/export data from/to S3 or other compatible storage systems, ensuring smooth data management and backup processes. With MyScale, unleash the power of advanced AI database capabilities for efficient and effective data analysis.
  • 24
    Vald

    Vald

    Vald

    Vald is a highly scalable distributed fast approximate nearest neighbor dense vector search engine. Vald is designed and implemented based on the Cloud-Native architecture. It uses the fastest ANN Algorithm NGT to search neighbors. Vald has automatic vector indexing and index backup, and horizontal scaling which made for searching from billions of feature vector data. Vald is easy to use, feature-rich and highly customizable as you needed. Usually the graph requires locking during indexing, which cause stop-the-world. But Vald uses distributed index graph so it continues to work during indexing. Vald implements its own highly customizable Ingress/Egress filter. Which can be configured to fit the gRPC interface. Horizontal scalable on memory and cpu for your demand. Vald supports to auto backup feature using Object Storage or Persistent Volume which enables disaster recovery.
    Starting Price: Free
  • 25
    Vespa

    Vespa

    Vespa.ai

    Vespa is forBig Data + AI, online. At any scale, with unbeatable performance. To build production-worthy online applications that combine data and AI, you need more than point solutions: You need a platform that integrates data and compute to achieve true scalability and availability - and which does this without limiting your freedom to innovate. Only Vespa does this. Vespa is a fully featured search engine and vector database. It supports vector search (ANN), lexical search, and search in structured data, all in the same query. Users can easily build recommendation applications on Vespa. Integrated machine-learned model inference allows you to apply AI to make sense of your data in real-time. Together with Vespa's proven scaling and high availability, this empowers you to create production-ready search applications at any scale and with any combination of features.
    Starting Price: Free
  • 26
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 27
    Supabase

    Supabase

    Supabase

    Create a backend in less than 2 minutes. Start your project with a Postgres database, authentication, instant APIs, real-time subscriptions and storage. Build faster and focus on your products. Every project is a full Postgres database, the world's most trusted relational database. Add user sign-ups and logins, securing your data with Row Level Security. Store, organize and serve large files. Any media, including videos and images. Write custom code and cron jobs without deploying or scaling servers. There are many example apps and starter projects to get going. We introspect your database to provide APIs instantly. Stop building repetitive CRUD endpoints and focus on your product. Type definitions built directly from your database schema. Use Supabase in the browser without a build process. Develop locally and push to production when you're ready. Manage Supabase projects from your local machine.
    Starting Price: $25 per month
  • 28
    Barracuda PST Enterprise
    PST files are often used by end-users as personal email archives. As a result, they are typically scattered widely across end-user devices and network storage, which makes them notoriously difficult to discover and manage consistently or effectively. Thanks to improvements to Microsoft Exchange and Office 365, your users no longer need to store data locally within PST files. However, legacy PST files still contain important data that you need to bring back under control. Barracuda PST Enterprise is designed specifically to help you address this challenge. You may have many terabytes of data stored in PST files located on end-user devices and on network servers. And these files are notoriously unreliable, since they are often corrupted and easily misplaced. The costs associated with storing, securing, and managing these large files, in terms of both system resources and IT administrative overhead, are significant.
  • 29
    Diffgram Data Labeling
    Your AI Data Platform Quality Training Data for Enterprise Data Labeling Software for Machine Learning Free on your Kubernetes Cluster Up to 3 Users. TRUSTED BY 5,000 HAPPY USERS WORLDWIDE Images, Video, Text Spatial Tools Quadratic Curves, Cuboids, Segmentation, Box, Polygons, Lines, Keypoints, Classification Tags, and More Use the exact spatial tool you need. All tools are easy to use, fully editable, and powerful ways to represent your data. All tools are available in Video. Attribute Tools More Meaning. More degrees of freedom through: Radio buttons. Multiple select. Date pickers. Sliders. Conditional logic. Directional Vectors. And more! You can capture complex knowledge and encode it into your AI. Streaming Data Automation Up to 10x Faster then manual labeling
    Starting Price: Free
  • 30
    Tictag

    Tictag

    Tictag

    Your AI deserves the best data. With industry-leading 99% accuracy, take the stress out of getting your machine learning datasets on Tictag with our unique mobile data platform and Truetag quality control. Tictag's first-of-its-kind mobile data platform combines a user-friendly interface with gamified elements to produce the highest quality datasets, powered by our proprietary Truetag quality control system. This is technology-enhanced labeling at its best. Tictag efficiently collects and labels the most complex and intricate of datasets with near-100% accuracy for AI and ML models in short turnarounds. Data labeling has never been faster or easier. Do it once and do it right. Tictag's technology-augmented Truetag quality control ensures your data is exactly as you need it. Through Tictag, your data needs, in turn, help people who need another source of income, or a way to learn new skills.
  • 31
    Superb AI

    Superb AI

    Superb AI

    Superb AI provides a new generation machine learning data platform to AI teams so that they can build better AI in less time. The Superb AI Suite is an enterprise SaaS platform built to help ML engineers, product teams, researchers and data annotators create efficient training data workflows, saving time and money. Majority of ML teams spend more than 50% of their time managing training datasets Superb AI can help. On average, our customers have reduced the time it takes to start training models by 80%. Fully managed workforce, powerful labeling tools, training data quality control, pre-trained model predictions, advanced auto-labeling, filter and search your datasets, data source integration, robust developer tools, ML workflow integrations, and much more. Training data management just got easier with Superb AI. Superb AI offers enterprise-level features for every layer in an ML organization.
  • 32
    Adeptia Connect

    Adeptia Connect

    Adeptia Inc.

    Adeptia Connect helps enterprises streamline and accelerate their data onboarding process by up to 80%, making organizations easy to do business with. Through a self-service approach, Adeptia Connect lets business users onboard data, thus accelerating service delivery and fast-forwarding revenues.
    Starting Price: $3000.00/month
  • 33
    Hive Data
    Create training datasets for computer vision models with our fully managed solution. We believe that data labeling is the most important factor in building effective deep learning models. We are committed to being the field's leading data labeling platform and helping companies take full advantage of AI's capabilities. Organize your media with discrete categories. Identify items of interest with one or many bounding boxes. Like bounding boxes, but with additional precision. Annotate objects with accurate width, depth, and height. Classify each pixel of an image. Mark individual points in an image. Annotate straight lines in an image. Measure, yaw, pitch, and roll of an item of interest. Annotate timestamps in video and audio content. Annotate freeform lines in an image.
    Starting Price: $25 per 1,000 annotations
  • 34
    Ab Initio

    Ab Initio

    Ab Initio

    Data arrives from every direction, growing in scale and complexity. Hidden in the data is knowledge and insight that is full of potential. Such potential is only fully realized when it permeates through to every decision and action the organization takes, second by second. As the business changes, so does the data itself, resulting in new knowledge and insight. A cycle is formed, learn and adapt. Industries as far ranging as financial services, healthcare, telecommunications, manufacturing, transportation, and entertainment have recognized the opportunity. Getting there is both challenging and exciting. Success demands new levels of speed and agility in understanding, managing, and processing vast amounts of continuously changing data. Complex organizations require a high performance data platform that is built for automation and self-service, that thrives amid change and adapts to new realities, and that can solve the toughest data processing and data management challenges.
  • 35
    WSO2 Enterprise Service Bus
    The WSO2 integration runtime engine is capable of playing multiple roles in your enterprise architecture. It can act as an ESB or a microservices integrator. When deployed as an ESB, it caters to your message routing, transformation, message mediation, service orchestration, as well as service and API hosting needs. Header-based, content-based, rule-based, and priority-based routing. Implement Enterprise Integration Patterns (EIPs), database integration, event stream integrations. Transform messages with XSLT 1.0/2.0, XPath, XQuery, and Smooks. Visual data mapping, CSV, JSON, and XML transformation connectors. Supports any RDBMS, CSV, Excel, ODS, Cassandra, and Google spreadsheets. Supports OData v4 protocol for any RDBMS and Cassandra data sources. Supports MSSQL, DB2, Oracle, OpenEdge, TerraData, MySQL, PostgreSQL/EnterpriseDB, H2, Derby or any database with a JDBC driver. Support for nested queries across data sources.
  • 36
    Io-Tahoe

    Io-Tahoe

    Io-Tahoe

    Thinking data automation 'first' enables your company to move faster and more effectively to making digital transformation a reality in the enterprise. Data Automation helps you serve customers better. It enables you to operate more efficiently. It unburdens your employees from mundane, repetitive work, allowing them to focus on solving problems and creating value. Accelerate data projects by enabling non-IT resources and data experts to consolidate, categorize,and master data. Power digital transformations by automating a significant portion of data management through human-guided machine learning. Increase the velocity of business outcomes with complete, accurate data curated automatically for data visualization tools and analytic insights. It works with your company’s core systems and existing applications, and automates data management, communication and response triggering. Once in place, Data Automation by Io-Tahoe delivers repetitive, high-volume data tasks efficiently.
  • 37
    HVR

    HVR

    HVR

    A subscription includes everything you need for efficient high-volume data replication and integration. Low-impact data movement even at high volumes with Log-Based Change Data Capture (CDC) and a unique compression algorithm. RESTful APIs enable workflow automation, saving time and streamlining processes. HVR has a variety of security features. Plus it uniquely enables data routing through a firewall proxy in hybrid environments. Supports multi and bi-directional data movement, giving you the freedom to design and optimize your data flows. Everything you need for your data replication project is included under once license. We surround our customers with in-depth training, accessible support, and documentation to foster success. Be confident your data is accurate and in-sync with our Data Validation and Live Compare feature. Everything you need for your data replication project is included under once license.
  • 38
    Liquid Data Mapper

    Liquid Data Mapper

    Liquid Technologies

    Liquid Data Mapper is a flexible data transformation and mapping tool providing the power and flexibility to produce any combination of mapping, even combining data from multiple sources. Provides a graphical visualization of data mapping between multiple data sources and targets. Execute transform within Liquid Studio or generate mapping code for use in your own application. Step through debugger with breakpoints, stack trace, state and variable watch windows. Liquid Studio Data Mapper provides an intuitive graphical interface for designing data transforms. The transforms can map multiple data input sources to multiple data output targets while performing conditional logic, filtering, string manipulation, statistical functions and much more.
    Starting Price: $149 one-time payment
  • 39
    BridgeHead

    BridgeHead

    BridgeHead

    Helping healthcare organizations to consolidate vital patient, clinical and administrative data, safeguard that information, and make it easily accessible to staff as part of a 360-degree view. Eliminate your data silos by aggregating patient and administrative information from across the healthcare enterprise. Optimize the storage of your healthcare data by utilizing all of your storage assets and reducing the total cost of ownership. Safeguard your healthcare data and ensure efficient recovery in the event of cyber attack, loss, corruption or disaster. Enhance care and improve outcomes by providing your staff with easy access to a 360-degree view of a patient’s history. Retire obsolete or legacy applications while retaining access to the data they contain- saving time, money and resources. Digitally transform your end-to-end clinical testing processes & accelerate testing operations, at scale.
  • 40
    Opsware

    Opsware

    Opsware

    Legal and engineering teams are using Opsware to embed privacy workflows across their organizations to simplify compliance with modern privacy laws. No coding is required. Eliminate the tedious work that comes with building data maps and managing vendors. Discover data sources using our fast and reliable data discovery and classification solutions. Trust our automation technology to complete your customers' data requests and consent changes across every data system and vendor. Get your teams on one tool and workflow. Track the flow of personal data across your company to operationalize compliance and get visibility into risk. We cover the workflows required to build out processing records with a single click. Your company data is fully controlled by you. Provision our virtual appliance on-premise so you can ensure that personal data doesn't leave your secure data context. Every business is data first and every company collects personal data.
  • 41
    Complyon

    Complyon

    Complyon

    We help, You comply. Make compliance an asset and improve your business through Complyon’s governance, compliance and risk management software. Our tools ensure your compliance. Data mapping Reuse, optimize and connect your dataflows to save time and secure your information. Reporting. Generate up-to-date and protocol-ready reports in seconds, covering everything from systems to risks. Decentralizing compliance. A central platform allows your compliance to be trusted by management, while being simple to update, validate and administrate. Improve your compliance with our tailor-made workflows. Central governance. Central governance and business unit input provides all the right data to secure compliance for GDPR and other regulations you need to abide by. Data flow analysis. Understand the complete overview of your data through the interconnection of activities, systems and processes, including everything from third parties and policies to legal basis and retention rules.
  • 42
    KDB.AI
    KDB.AI is a powerful knowledge-based vector database and search engine that allows developers to build scalable, reliable and real-time applications by providing advanced search, recommendation and personalization for AI applications. Vector databases are a new wave of data management designed for generative AI, IoT and time-series applications. Here's why they matter, what makes them different, how they work, the new use cases they're designed for, and how to get started.
  • 43
    CrateDB

    CrateDB

    CrateDB

    The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.
  • 44
    Heartex

    Heartex

    Heartex

    Data labeling software that makes your AI smart — Data labeling tool for various data types — Automatically label up-to 95% of your dataset using Machine Learning and Active Learning — Manage training data in one place. Control quality, and privacy
  • 45
    Synthesis AI

    Synthesis AI

    Synthesis AI

    A synthetic data platform for ML engineers to enable the development of more capable AI models. Simple APIs provide on-demand generation of perfectly-labeled, diverse, and photoreal images. Highly-scalable cloud-based generation platform delivers millions of perfectly labeled images. On-demand data enables new data-centric approaches to develop more performant models. An expanded set of pixel-perfect labels including segmentation maps, dense 2D/3D landmarks, depth maps, surface normals, and much more. Rapidly design, test, and refine your products before building hardware. Prototype different imaging modalities, camera placements, and lens types to optimize your system. Reduce bias in your models associated with misbalanced data sets while preserving privacy. Ensure equal representation across identities, facial attributes, pose, camera, lighting, and much more. We have worked with world-class customers across many use cases.
  • 46
    Encord

    Encord

    Encord

    Achieve peak model performance with the best data. Create & manage training data for any visual modality, debug models and boost performance, and make foundation models your own. Expert review, QA and QC workflows help you deliver higher quality datasets to your artificial intelligence teams, helping improve model performance. Connect your data and models with Encord's Python SDK and API access to create automated pipelines for continuously training ML models. Improve model accuracy by identifying errors and biases in your data, labels and models.
  • 47
    Gretel

    Gretel

    Gretel.ai

    Privacy engineering tools delivered to you as APIs. Synthesize and transform data in minutes. Build trust with your users and community. Gretel’s APIs grant immediate access to creating anonymized or synthetic datasets so you can work safely with data while preserving privacy. Keeping the pace with development velocity requires faster access to data. Gretel is accelerating access to data with data privacy tools that bypass blockers and fuel Machine Learning and AI applications. Keep your data contained by running Gretel containers in your own environment or scale out workloads to the cloud in seconds with Gretel Cloud runners. Using our cloud GPUs makes it radically more effortless for developers to train and generate synthetic data. Scale workloads automatically with no infrastructure to set up and manage. Invite team members to collaborate on cloud projects and share data across teams.
  • 48
    Label Studio

    Label Studio

    Label Studio

    The most flexible data annotation tool. Quickly installable. Build custom UIs or use pre-built labeling templates. Configurable layouts and templates adapt to your dataset and workflow. Detect objects on images, boxes, polygons, circular, and key points supported. Partition the image into multiple segments. Use ML models to pre-label and optimize the process. Webhooks, Python SDK, and API allow you to authenticate, create projects, import tasks, manage model predictions, and more. Save time by using predictions to assist your labeling process with ML backend integration. Connect to cloud object storage and label data there directly with S3 and GCP. Prepare and manage your dataset in our Data Manager using advanced filters. Support multiple projects, use cases, and data types in one platform. Start typing in the config, and you can quickly preview the labeling interface. At the bottom of the page, you have live serialization updates of what Label Studio expects as an input.
  • 49
    Automaton AI

    Automaton AI

    Automaton AI

    With Automaton AI’s ADVIT, create, manage and develop high-quality training data and DNN models all in one place. Optimize the data automatically and prepare it for each phase of the computer vision pipeline. Automate the data labeling processes and streamline data pipelines in-house. Manage the structured and unstructured video/image/text datasets in runtime and perform automatic functions that refine your data in preparation for each step of the deep learning pipeline. Upon accurate data labeling and QA, you can train your own model. DNN training needs hyperparameter tuning like batch size, learning, rate, etc. Optimize and transfer learning on trained models to increase accuracy. Post-training, take the model to production. ADVIT also does model versioning. Model development and accuracy parameters can be tracked in run-time. Increase the model accuracy with a pre-trained DNN model for auto-labeling.
  • 50
    Sapien

    Sapien

    Sapien

    High-quality training data is essential for all large language models, whether you build the data yourself or use pre-existing models. A human-in-the-loop labeling process delivers real-time feedback for fine-tuning datasets to build the most performant and differentiated AI models. We provide precise data labeling with faster human input to enhance the robustness and input diversity to improve the adaptability of LLMs for your enterprise applications. Our labeler management allows us to segment teams— you only pay for the level of experience and skill sets your data labelling project requires. Sapien can quickly scale labelling operations up and down for annotation projects large and small. Human intelligence at scale. We can customize labeling models to handle your specific data types, formats, and annotation requirements.