Alternatives to Cribl Search

Compare Cribl Search alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Cribl Search in 2026. Compare features, ratings, user reviews, pricing, and more from Cribl Search competitors and alternatives in order to make an informed decision for your business.

  • 1
    SureSync

    Software Pursuits

    SureSync Pro is a file replication and synchronization application that provides one-way and multi-way processing in both scheduled and real-time modes. The Communications Agent provides real-time monitors, delta copies via Remote Differential Compression, TCP communications, compression, and encryption. SureSync Managed File Transfer (MFT) adds file locking, archiving, enhanced logging/status, blob storage support in Azure and Amazon clouds, and a next-generation intelligent transfer engine. File locking enables real-time multi-way collaborative environments with protection against users changing the same file in multiple offices at the same time. SQL Protection simplifies backups of critical SQL databases. SureSync’s comprehensive enterprise-grade feature set can help solve any file synchronization, replication, and archiving challenge.
  • 2
    Amazon S3
    Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. This means customers of all sizes and industries can use it to store and protect any amount of data for a range of use cases, such as data lakes, websites, mobile applications, backup and restore, archive, enterprise applications, IoT devices, and big data analytics. Amazon S3 provides easy-to-use management features so you can organize your data and configure finely tuned access controls to meet your specific business, organizational, and compliance requirements. Amazon S3 is designed for 99.999999999% (11 9’s) of durability, and stores data for millions of applications for companies all around the world. Scale your storage resources up and down to meet fluctuating demands, without upfront investments or resource procurement cycles.
  • 3
    Azure AI Search
    Deliver high-quality responses with a vector database built for advanced retrieval augmented generation (RAG) and modern search. Focus on exponential growth with an enterprise-ready vector database that comes with security, compliance, and responsible AI practices built in. Build better applications with sophisticated retrieval strategies backed by decades of research and customer validation. Quickly deploy your generative AI app with seamless platform and data integrations for data sources, AI models, and frameworks. Automatically upload data from a wide range of supported Azure and third-party sources. Streamline vector data processing with built-in extraction, chunking, enrichment, and vectorization, all in one flow. Support for multivector, hybrid, multilingual, and metadata filtering. Move beyond vector-only search with keyword match scoring, reranking, geospatial search, and autocomplete.
    Starting Price: $0.11 per hour
  • 4
    lakeFS

    Treeverse

    lakeFS enables you to manage your data lake the way you manage your code. Run parallel pipelines for experimentation and CI/CD for your data. It simplifies the lives of engineers, data scientists, and analysts who are transforming the world with data. lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build repeatable, atomic, and versioned data lake operations, from complex ETL jobs to data science and analytics. lakeFS supports AWS S3, Azure Blob Storage, and Google Cloud Storage (GCS) as its underlying storage service. It is API compatible with S3 and works seamlessly with all modern data frameworks such as Spark, Hive, AWS Athena, Presto, etc. lakeFS provides a Git-like branching and committing model that scales to exabytes of data by utilizing S3, GCS, or Azure Blob for storage.
  • 5
    Azure Blob Storage
    Massively scalable and secure object storage for cloud-native workloads, archives, data lakes, high-performance computing, and machine learning. Azure Blob Storage helps you create data lakes for your analytics needs, and provides storage to build powerful cloud-native and mobile apps. Optimize costs with tiered storage for your long-term data, and flexibly scale up for high-performance computing and machine learning workloads. Blob storage is built from the ground up to support the scale, security, and availability needs of mobile, web, and cloud-native application developers. Use it as a cornerstone for serverless architectures such as Azure Functions. Blob storage supports the most popular development frameworks, including Java, .NET, Python, and Node.js, and is the only cloud storage service that offers a premium, SSD-based object storage tier for low-latency and interactive scenarios.
    Starting Price: $0.00099
  • 6
    Symantec Cloud Workload Protection
    Many applications and services running in public clouds use Amazon S3 buckets and Azure Blob storage. Over time, storage can become contaminated with malware, misconfigured buckets can allow data breaches, and unclassified sensitive data can result in compliance violations and fines. CWP for Storage automatically discovers and scans Amazon S3 buckets and Azure Blobs to keep cloud storage clean and secure. CWP for Storage DLP applies Symantec DLP policy to Amazon S3 to discover and classify sensitive information. AWS Tags can be applied as needed for remediation and further actions in time. Cloud security posture management (CSPM) for Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). Containers improve agility; however, they also bring public cloud security challenges and vulnerabilities that increase risk.
  • 7
    Data Lakes on AWS
    Many Amazon Web Services (AWS) customers require a data storage and analytics solution that offers more agility and flexibility than traditional data management systems. A data lake is a new and increasingly popular way to store and analyze data because it allows companies to manage multiple data types from a wide variety of sources, and store this data, structured and unstructured, in a centralized repository. The AWS Cloud provides many of the building blocks required to help customers implement a secure, flexible, and cost-effective data lake. These include AWS managed services that help ingest, store, find, process, and analyze both structured and unstructured data. To support our customers as they build data lakes, AWS offers the data lake solution, which is an automated reference implementation that deploys a highly available, cost-effective data lake architecture on the AWS Cloud along with a user-friendly console for searching and requesting datasets.
  • 8
    Cloud Storage Manager

    SmiKar Software

    Azure storage consumption is growing at an incredible pace, even faster than originally predicted. Organizations have an ever-growing data footprint and are therefore eager to take advantage of Azure and its limitless supply of storage and resources. However, as an organization’s storage requirements grow, it’s easy to lose track of where all the storage is being consumed, which also means the Azure storage cost keeps going up, often causing cost blowout. With Cloud Storage Manager you will be able to instantly see where all your storage is going, allowing you to take back control and save money. Cloud Storage Manager provides you with an Azure Explorer-like view of all your Azure Blobs and what resides in your Azure Files. From this view you can see details of each individual Blob, including Blob size, the dates the Azure Blob was created and last modified, and which storage tier the Blob is currently in.
  • 9
    Azure Storage Explorer
    Manage your storage accounts in multiple subscriptions across all Azure regions, Azure Stack, and Azure Government. Add new features and capabilities with extensions to manage even more of your cloud storage needs. Accessible, intuitive, and feature-rich graphical user interface (GUI) for full management of cloud storage resources. Securely access your data using Azure AD and fine-tuned access control list (ACL) permissions. Efficiently connect and manage your Azure storage service accounts and resources across subscriptions and organizations. Create, delete, view, edit, and manage resources for Azure Storage, Azure Data Lake Storage, and Azure managed disks. Seamlessly view, search, and interact with your data and resources using an intuitive interface. Improved accessibility with multiple screen reader options, high contrast themes, and hot keys on Windows and macOS.
  • 10
    Azure FXT Edge Filer
    Create cloud-integrated hybrid storage that works with your existing network-attached storage (NAS) and Azure Blob Storage. This on-premises caching appliance optimizes access to data in your datacenter, in Azure, or across a wide-area network (WAN). A combination of software and hardware, Microsoft Azure FXT Edge Filer delivers high throughput and low latency for hybrid storage infrastructure supporting high-performance computing (HPC) workloads. Scale-out clustering provides non-disruptive NAS performance scaling. Join up to 24 FXT nodes per cluster to scale to millions of IOPS and hundreds of GB/s. When you need performance and scale in file-based workloads, Azure FXT Edge Filer keeps your data on the fastest path to processing resources. Managing data storage is easy with Azure FXT Edge Filer. Shift aging data to Azure Blob Storage to keep it easily accessible with minimal latency. Balance on-premises and cloud storage.
  • 11
    Electrik.Ai

    Electrik.Ai

    Automatically ingest marketing data into any data warehouse or cloud file storage of your choice, such as BigQuery, Snowflake, Redshift, Azure SQL, AWS S3, Azure Data Lake, or Google Cloud Storage, with our fully managed ETL pipelines in the cloud. Our hosted marketing data warehouse integrates all your marketing data and provides ad insights, cross-channel attribution, content insights, competitor insights, and more. Our customer data platform performs identity resolution in real time across data sources, enabling a unified view of the customer and their journey. Electrik.AI is a cloud-based marketing analytics software and full-service platform. Electrik.AI’s Google Analytics Hit Data Extractor enriches and extracts the unsampled hit-level data sent to Google Analytics from the website or application and periodically ships it to your desired destination database/data warehouse or file/data lake.
    Starting Price: $49 per month
  • 12
    Google Cloud Search
    With Cloud Search, we’re bringing the best of Google Search to your business and delivering true enterprise search. Whether integrated with G Suite or used as stand-alone to connect to all your third-party applications and data platforms, Cloud Search helps your employees quickly, easily, and securely find information across the business. Searching through your company’s data should be easier. Cloud Search utilizes machine learning to bring instant query suggestions and surface the most relevant results across more than 100 different content platforms — in over 100 different languages. What Google does for the web, Cloud Search does for enterprise search and for your business. Cloud Search delivers enterprise search through robust SDKs and ready-to-use APIs to help you scalably index vast amounts of data from any source. With 100+ connectors, you can index your third-party content from dozens of enterprise sources.
  • 13
    Elastic Cloud
    Enterprise search, observability, and security for the cloud. Quickly and easily find information, gain insights, and protect your technology investment whether you run on Amazon Web Services, Google Cloud, or Microsoft Azure. We handle the maintenance and upkeep, so you can focus on gaining the insights that help you run your business. Configuration and deployment are a breeze. Simple scaling, custom plugins, and architecture optimized for log and time series data are only a taste of what’s possible. Get the complete Elastic experience with features like machine learning, Canvas, APM, index lifecycle management, Elastic App Search, Elastic Workplace Search, and more — exclusively available here. Logging and metrics are just the start. Bring your diverse data together to address security, observability, and other critical use cases.
    Starting Price: $16 per month
  • 14
    Guild AI

    Guild AI

    Guild AI is an open-source experiment tracking toolkit designed to bring systematic control to machine learning workflows, enabling users to build better models faster. It automatically captures every detail of training runs as unique experiments, facilitating comprehensive tracking and analysis. Users can compare and analyze runs to deepen their understanding and incrementally improve models. Guild AI simplifies hyperparameter tuning by applying state-of-the-art algorithms through straightforward commands, eliminating the need for complex trial setups. It also supports the automation of pipelines, accelerating model development, reducing errors, and providing measurable results. The toolkit is platform-agnostic, running on all major operating systems and integrating seamlessly with existing software engineering tools. Guild AI supports various remote storage types, including Amazon S3, Google Cloud Storage, Azure Blob Storage, and SSH servers.
  • 15
    Amazon CloudSearch
    Amazon CloudSearch is a managed service in the AWS Cloud that makes it simple and cost-effective to set up, manage, and scale a search solution for your website or application. Amazon CloudSearch supports 34 languages and popular search features such as highlighting, autocomplete, and geospatial search. With Amazon CloudSearch, you can quickly add rich search capabilities to your website or application. You don't need to become a search expert or worry about hardware provisioning, setup, and maintenance. With a few clicks in the AWS Management Console, you can create a search domain and upload the data that you want to make searchable, and Amazon CloudSearch will automatically provision the required resources and deploy a highly tuned search index. You can easily change your search parameters, fine tune search relevance, and apply new settings at any time. As your volume of data and traffic fluctuates, Amazon CloudSearch seamlessly scales to meet your needs.
  • 16
    CubeBackup

    CubeBackup

    CubeBackup is a Google Workspace backup application to secure your company data across the entire domain. It backs up all data with version history to local storage or your private cloud storage. CubeBackup allows you to back up Gmail, Google Drive, Shared Drives, Contacts, Calendar, and Sites data to on-premises storage such as a local disk, NAS, SAN, or file server. If you prefer, data can also be stored in your company’s private cloud storage like Amazon S3, Google Cloud, Azure Blob Storage, and Backblaze B2. Unlike Google Drive, which limits file version history to only 30 days, CubeBackup can restore Google Drive and Shared Drive files to any previous version. In fact, CubeBackup can restore entire projects, with complete file and folder structure, to any previous state. Don’t leave your data in someone else’s hands. Unlike most other Google Workspace cloud backup providers who physically control your data, CubeBackup allows you to manage your own backups using local storage.
    Starting Price: $2 per user per year
  • 17
    Deep Lake

    activeloop

    Generative AI may be new, but we've been building for this day for the past 5 years. Deep Lake thus combines the power of both data lakes and vector databases to build and fine-tune enterprise-grade, LLM-based solutions, and iteratively improve them over time. Vector search does not resolve retrieval. To solve it, you need a serverless query for multi-modal data, including embeddings or metadata. Filter, search, & more from the cloud or your laptop. Visualize and understand your data, as well as the embeddings. Track & compare versions over time to improve your data & your model. Competitive businesses are not built on OpenAI APIs. Fine-tune your LLMs on your data. Efficiently stream data from remote storage to the GPUs as models are trained. Deep Lake datasets are visualized right in your browser or Jupyter Notebook. Instantly retrieve different versions of your data, materialize new datasets via queries on the fly, and stream them to PyTorch or TensorFlow.
    Starting Price: $995 per month
  • 18
    Cazena

    Cazena

    Cazena’s Instant Data Lake accelerates time to analytics and AI/ML from months to minutes. Powered by its patented automated data platform, Cazena delivers the first SaaS experience for data lakes. Zero operations required. Enterprises need a data lake that easily supports all of their data and tools for analytics, machine learning and AI. To be effective, a data lake must offer secure data ingestion, flexible data storage, access and identity management, tool integration, optimization and more. Cloud data lakes are complicated to do yourself, which is why they require expensive teams. Cazena’s Instant Cloud Data Lakes are instantly production-ready for data loading and analytics. Everything is automated, supported on Cazena’s SaaS Platform with continuous Ops and self-service access via the Cazena SaaS Console. Cazena's Instant Data Lakes are turnkey and production-ready for secure data ingest, storage and analytics.
  • 19
    BigLake

    Google

    BigLake is a storage engine that unifies data warehouses and lakes by enabling BigQuery and open-source frameworks like Spark to access data with fine-grained access control. BigLake provides accelerated query performance across multi-cloud storage and open formats such as Apache Iceberg. Store a single copy of data with uniform features across data warehouses & lakes. Fine-grained access control and multi-cloud governance over distributed data. Seamless integration with open-source analytics tools and open data formats. Unlock analytics on distributed data regardless of where and how it’s stored, while choosing the best analytics tools, open source or cloud-native over a single copy of data. Fine-grained access control across open source engines like Apache Spark, Presto, and Trino, and open formats such as Parquet. Performant queries over data lakes powered by BigQuery. Integrates with Dataplex to provide management at scale, including logical data organization.
    Starting Price: $5 per TB
  • 20
    Amazon OpenSearch Service
    Increase operational excellence by using a popular open source solution, managed by AWS. Audit and secure your data with a data center and network architecture with built-in certifications. Systematically detect potential threats and react to a system’s state through machine learning, alerting, and visualization. Optimize time and resources for strategic work. Securely unlock real-time search, monitoring, and analysis of business and operational data. Amazon OpenSearch Service makes it easy for you to perform interactive log analytics, real-time application monitoring, website search, and more. OpenSearch is an open source, distributed search and analytics suite derived from Elasticsearch. Amazon OpenSearch Service offers the latest versions of OpenSearch, support for 19 versions of Elasticsearch (1.5 to 7.10 versions), as well as visualization capabilities powered by OpenSearch dashboards and Kibana.
    Starting Price: $0.036 per hour
  • 21
    Neum AI

    Neum AI

    No one wants their AI to respond with out-of-date information to a customer. Neum AI helps companies have accurate and up-to-date context in their AI applications. Use built-in connectors for data sources like Amazon S3 and Azure Blob Storage, and vector stores like Pinecone and Weaviate, to set up your data pipelines in minutes. Supercharge your data pipeline by transforming and embedding your data with built-in connectors for embedding models like OpenAI and Replicate, and serverless functions like Azure Functions and AWS Lambda. Leverage role-based access controls to make sure only the right people can access specific vectors. Bring your own embedding models, vector stores, and sources. Ask us about how you can even run Neum AI in your own cloud.
  • 22
    NooBaa

    Red Hat

    NooBaa is a software-driven infrastructure that enables agility, flexibility, and hybrid cloud capabilities. A deployment takes 5 minutes from download to an operational system. With unprecedented flexibility, pay-as-you-go pricing, and incredible management simplicity, NooBaa represents an entirely new approach to managing the explosive growth of data. NooBaa can consume data from AWS S3, Microsoft Azure Blobs, Google Storage, or any AWS S3-compatible private cloud storage. Eliminate vendor lock-in, allowing your application software stack to be independent of the underlying infrastructure. This independence also creates the interoperability required for fast migration or expansion of workloads. It allows you to run a specific workload on a specific platform, without worrying about the storage. NooBaa provides an AWS S3-compatible API, the de facto standard, independent of any specific vendor or location.
  • 23
    Inogic SharePoint Security Sync
    Sync Dynamics 365 CRM and SharePoint security privileges to diminish security risk while storing documents/attachments in SharePoint. Restrict a user’s level of access in SharePoint to the same level that is assigned to them in Dynamics 365 CRM. Furthermore, replicate any changes made to Dynamics 365 CRM security privileges in SharePoint. Integrate with SharePoint, Dropbox, or Azure Blob as the cloud storage location for your files and attachments. Drag and drop multiple files and folders to upload them all at once. Generate anonymous links to the documents to share outside of the organization. Directly email the files as attachments or links to documents from within CRM. Upload, rename, delete, and search files from cloud storage. Bulk migrate Note/Email/Sales Literature attachments to the configured cloud storage. Use the Security Template to control user privileges for the various actions discussed above.
  • 24
    Dataplex Universal Catalog
    Dataplex Universal Catalog is Google Cloud’s intelligent governance platform for data and AI artifacts. It centralizes discovery, management, and monitoring across data lakes, warehouses, and databases, giving teams unified access to trusted data. With Vertex AI integration, users can instantly find datasets, models, features, and related assets in one search experience. It supports semantic search, data lineage, quality checks, and profiling to improve trust and compliance. Integrated with BigQuery and BigLake, it enables end-to-end governance for both proprietary and open lakehouse environments. Dataplex Universal Catalog helps organizations democratize data access, enforce governance, and accelerate analytics and AI initiatives.
    Starting Price: $0.060 per hour
  • 25
    Cribl Lake
    Storage that doesn’t lock data in. Get up and running fast with a managed data lake. Easily store, access, and retrieve data, without being a data expert. Cribl Lake keeps you from drowning in data. Easily store, manage, enforce policy on, and access data when you need. Dive into the future with open formats and unified retention, security, and access control policies. Let Cribl handle the heavy lifting so data can be usable and valuable to the teams and tools that need it. Minutes, not months to get up and running with Cribl Lake. Zero configuration with automated provisioning and out-of-the-box integrations. Streamline workflows with Stream and Edge for powerful data ingestion and routing. Cribl Search unifies queries no matter where data is stored, so you can get value from data without delays. Take an easy path to collect and store data for long-term retention. Comply with legal and business requirements for data retention by defining specific retention periods.
  • 26
    Vertex AI Search
    Google Cloud's Vertex AI Search is a comprehensive, enterprise-grade search and retrieval platform that leverages Google's advanced AI technologies to deliver high-quality search experiences across various applications. It enables organizations to build secure, scalable search solutions for websites, intranets, and generative AI applications. It supports both structured and unstructured data, offering capabilities such as semantic search, vector search, and Retrieval Augmented Generation (RAG) systems, which combine large language models with data retrieval to enhance the accuracy and relevance of AI-generated responses. Vertex AI Search integrates seamlessly with Google's Document AI suite, facilitating efficient document understanding and processing. It also provides specialized solutions tailored to specific industries, including retail, media, and healthcare, to address unique search and recommendation needs.
  • 27
    Quest LiteSpeed for SQL Server
    Get high-speed, storage-efficient backup and restore for SQL Server databases, with up to 85 percent savings in backup size and duration compared to competing solutions. LiteSpeed for SQL Server makes it possible, with minimal effort and risk. Ensure the correct SQL Server data is restored and available as quickly as possible with a wide variety of backup and recovery options. Integrate directly with Microsoft Azure Blob storage and Amazon S3, as well as IBM TSM, for cloud-based backup and restore with on-premises and virtualized cloud SQL Servers. Achieve dramatic reductions in SQL Server backup/restore times and storage costs. Choose the best combination of CPU resource utilization and backup storage size reduction for your environment with eight compression levels. Save time managing and monitoring your SQL Server backup and recovery. Define, schedule, and manage all your backup and recovery jobs from one central location.
  • 28
    Amazon Security Lake
    Amazon Security Lake automatically centralizes security data from AWS environments, SaaS providers, on-premises, and cloud sources into a purpose-built data lake stored in your account. With Security Lake, you can get a more complete understanding of your security data across your entire organization. You can also improve the protection of your workloads, applications, and data. Security Lake has adopted the Open Cybersecurity Schema Framework (OCSF), an open standard. With OCSF support, the service normalizes and combines security data from AWS and a broad range of enterprise security data sources. Use your preferred analytics tools to analyze your security data while retaining complete control and ownership over that data. Centralize data visibility from cloud and on-premises sources across your accounts and AWS Regions. Streamline your data management at scale by normalizing your security data to an open standard.
    Starting Price: $0.75 per GB per month
  • 29
    Google Cloud Data Fusion
    Open core, delivering hybrid and multi-cloud integration. Data Fusion is built using open source project CDAP, and this open core ensures data pipeline portability for users. CDAP’s broad integration with on-premises and public cloud platforms gives Cloud Data Fusion users the ability to break down silos and deliver insights that were previously inaccessible. Integrated with Google’s industry-leading big data tools. Data Fusion’s integration with Google Cloud simplifies data security and ensures data is immediately available for analysis. Whether you’re curating a data lake with Cloud Storage and Dataproc, moving data into BigQuery for data warehousing, or transforming data to land it in a relational store like Cloud Spanner, Cloud Data Fusion’s integration makes development and iteration fast and easy.
  • 30
    Postgresus

    Postgresus

    Postgresus is a free, open source, and self-hosted tool to back up PostgreSQL. Make backups with different storages (S3, Google Drive, FTP, etc.) and notifications about progress (Slack, Discord, Telegram, etc.). Key features:
    - Scheduled backups for multiple PostgreSQL databases
    - Storage targets: local disk, S3, Cloudflare R2, Google Drive, Azure Blob, NAS, etc.
    - Notifications about backup status via email, Telegram, Slack, Discord, MS Teams, and customizable webhooks
    - Works with both self-hosted PostgreSQL and managed services (RDS, Cloud SQL, Azure Database for PostgreSQL, etc.)
    - Runs as a single Docker container or via Helm on Kubernetes; can also be installed via a shell script
    - Team management with different workspaces, RBAC, and audit logs
    - Encryption for secrets and backup files
  • 31
    Alibaba Cloud Data Lake Formation
    A data lake is a centralized repository used for big data and AI computing. It allows you to store structured and unstructured data at any scale. Data Lake Formation (DLF) is a key component of the cloud-native data lake framework. DLF provides an easy way to build a cloud-native data lake. It seamlessly integrates with a variety of compute engines and allows you to manage the metadata in data lakes in a centralized manner and control enterprise-class permissions. Systematically collects structured, semi-structured, and unstructured data and supports massive data storage. Uses an architecture that separates computing from storage. You can plan resources on demand at low costs. This improves data processing efficiency to meet the rapidly changing business requirements. DLF can automatically discover and collect metadata from multiple engines and manage the metadata in a centralized manner to solve the data silo issues.
  • 32
    S3 Drive

    Callback Technologies

    S3 Drive connects to any standard S3 cloud data store, enabling you to work virtually with cloud files as if they are right on your local file system. Access, update, edit, and save files stored in any storage service compatible with the S3 API, such as Amazon S3, Google Cloud Storage, Microsoft Azure Blob Storage, IBM Cloud Object Storage, MinIO, Backblaze B2, Wasabi, DigitalOcean, and more. S3 Drive adds a local cache layer on top of the S3 API, saving files locally and uploading them automatically, so you don’t have to upload and download files each time. Powerful capabilities:
    - Store multiple connection profiles for a quick, convenient connection.
    - S3 Drive offers FIPS mode.
    - Run S3 Drive as a Windows service or desktop application.
    - Use S3 Drive as a desktop application or from the command line.
    - S3 Drive supports Windows Arm64.
    - Available for Windows, Linux, and macOS.
    S3 Drive is trusted by the biggest technology companies in the world.
  • 33
    SISCIN

    Waterford Technologies

    SISCIN is a file analysis, archiving, and compliance solution hosted in Azure. It provides a single dashboard for full visibility of your entire file server data. It allows the creation of policies based on data profile for retention, deduplication, or archiving, enabling full control in managing your file data. Flexible storage control lets you archive directly to the cloud or locally, giving organizations the performance and scalability of the cloud with their existing server infrastructure. SISCIN also includes Vue-X Search, which provides advanced content indexing and search capabilities to analyze, identify, locate, retrieve, and delete data for DSAR or e-Discovery management.
    Starting Price: $125 per month
  • 34
    AWS HealthLake
    Extract meaning from unstructured data with integrated Amazon Comprehend Medical for easy search and querying. Make predictions on health data using Amazon Athena queries, Amazon SageMaker ML models, and Amazon QuickSight analytics. Support interoperable standards such as the Fast Healthcare Interoperability Resources (FHIR). Run medical imaging applications in the cloud to increase scale and reduce costs. AWS HealthLake is a HIPAA-eligible service offering healthcare and life sciences companies a chronological view of individual or patient population health data for query and analytics at scale. Analyze population health trends, predict outcomes, and manage costs with advanced analytics tools and ML models. Identify opportunities to close gaps in care and deliver targeted interventions with a longitudinal view of patient journeys. Apply advanced analytics and ML to newly structured data to optimize appointment scheduling and reduce unnecessary procedures.
  • 35
    SearchBlox

    SearchBlox

    SearchBlox Software

    We simplify search for complex enterprises. Data isn’t just getting bigger. It’s becoming more connected, which makes data-driven decision making more complicated than ever. We build intuitive and intelligent insight engines on open source technologies. Our enterprise search products securely deliver the right data to the right user at the right time. Avoid vendor lock-in with an annual subscription. Our transparent, upfront annual pricing allows you to anticipate your spend and avoid surprises, even in the cloud. You won’t see the phrase “Contact Us for Pricing” anywhere on our website. Search that’s as easy for you to install as it is for your customers to use. Increasingly, visitors use search to navigate websites. And if they don’t find what they’re looking for right away, they leave. SearchBlox Site Search provides fast, accurate search and a sophisticated customer experience that boosts conversions.
  • 36
    Sphinx

    Sphinx

    Sphinx

    Sphinx is an open source full text search server, designed from the ground up with performance, relevance (aka search quality), and integration simplicity in mind. It's written in C++ and works on Linux (RedHat, Ubuntu, etc), Windows, macOS, Solaris, FreeBSD, and a few other systems. Sphinx lets you either batch index and search data stored in an SQL database, NoSQL storage, or just files quickly and easily, or index and search data on the fly, working with Sphinx pretty much as with a database server. A variety of text processing features enable fine-tuning Sphinx for your particular application requirements, and a number of relevance functions ensure you can tweak search quality as well. Searching via SphinxAPI is as simple as 3 lines of code, and querying via SphinxQL is even simpler, with search queries expressed in good old SQL. Sphinx indexes up to 10-15 MB of text per second per single CPU core, that is 60+ MB/sec per server (on a dedicated indexing machine).
  • 37
    Archon Data Store

    Archon Data Store

    Platform 3 Solutions

    Archon Data Store is a next-generation enterprise data archiving platform designed to help organizations manage rapid data growth, reduce legacy application costs, and meet global compliance standards. Built on a modern Lakehouse architecture, Archon Data Store unifies data lakes and data warehouses to deliver secure, scalable, and analytics-ready archival storage. The platform supports on-premise, cloud, and hybrid deployments with AES-256 encryption, audit trails, metadata governance, and role-based access control. Archon Data Store offers intelligent storage tiering, high-performance querying, and seamless integration with BI tools. It enables efficient application decommissioning, cloud migration, and digital modernization while transforming archived data into a strategic asset. With Archon Data Store, organizations can ensure long-term compliance, optimize storage costs, and unlock AI-driven insights from historical data.
  • 38
    Apache DataFusion

    Apache DataFusion

    Apache Software Foundation

    Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.
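    To make "projection and filter pushdown" concrete, here is a minimal sketch in plain Python. It does not use DataFusion's API; `RowGroup`, `scan`, and the sample log rows are hypothetical names invented for illustration. It assumes each block of rows carries min/max statistics, as Parquet row groups do, so a scan can skip whole blocks that cannot match the filter and keep only the requested columns.

```python
from dataclasses import dataclass


@dataclass
class RowGroup:
    """Hypothetical block of rows with min/max statistics on `ts`."""
    min_ts: int
    max_ts: int
    rows: list  # list of dicts, one per row


def scan(row_groups, columns, min_ts):
    """Return projected rows with ts >= min_ts, skipping groups by stats."""
    groups_read = 0
    out = []
    for group in row_groups:
        if group.max_ts < min_ts:
            continue  # filter pushdown: no row in this group can match
        groups_read += 1
        for row in group.rows:
            if row["ts"] >= min_ts:
                # projection pushdown: keep only the requested columns
                out.append({c: row[c] for c in columns})
    return out, groups_read


groups = [
    RowGroup(0, 9, [{"ts": 3, "level": "info", "msg": "a"},
                    {"ts": 9, "level": "warn", "msg": "b"}]),
    RowGroup(10, 19, [{"ts": 12, "level": "error", "msg": "c"},
                      {"ts": 19, "level": "info", "msg": "d"}]),
    RowGroup(20, 29, [{"ts": 25, "level": "error", "msg": "e"}]),
]
rows, groups_read = scan(groups, columns=["ts", "level"], min_ts=15)
```

    In this toy run the first group is never read, because its maximum `ts` is below the filter value, and the `msg` column is never materialized. A real engine makes these decisions in its query planner and optimizer rather than in a hand-written loop.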
  • 39
    Delta Lake

    Delta Lake

    Delta Lake

    Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple data pipelines reading and writing data concurrently, and data engineers have to go through a tedious process to ensure data integrity, due to the lack of transactions. Delta Lake brings ACID transactions to your data lakes. It provides serializability, the strongest isolation level. Learn more at Diving into Delta Lake: Unpacking the Transaction Log. In big data, even the metadata itself can be "big data". Delta Lake treats metadata just like data, leveraging Spark's distributed processing power to handle all its metadata. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files with ease. Delta Lake provides snapshots of data, enabling developers to access and revert to earlier versions of data for audits, rollbacks, or to reproduce experiments.
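    The snapshot and rollback behavior described above can be pictured with a toy append-only commit log. This is a sketch in plain Python, not the Delta Lake API or its on-disk log format; `ToyTableLog` and the file names are hypothetical. It assumes each commit records which data files were added and removed, so any earlier version can be rebuilt by replaying the log up to that commit.

```python
from dataclasses import dataclass, field


@dataclass
class ToyTableLog:
    """Append-only commit log; each commit lists files added and removed."""
    commits: list = field(default_factory=list)

    def commit(self, add=(), remove=()):
        # Each commit is recorded as a single log entry.
        self.commits.append({"add": set(add), "remove": set(remove)})
        return len(self.commits) - 1  # version number of this commit

    def snapshot(self, version=None):
        # Replay the log up to `version` to find the live data files.
        if version is None:
            version = len(self.commits) - 1
        files = set()
        for entry in self.commits[: version + 1]:
            files |= entry["add"]
            files -= entry["remove"]
        return files


log = ToyTableLog()
v0 = log.commit(add=["part-0.parquet"])
v1 = log.commit(add=["part-1.parquet"])
v2 = log.commit(add=["part-2.parquet"], remove=["part-0.parquet"])

latest = log.snapshot()      # files visible at the newest version
as_of_v1 = log.snapshot(v1)  # "time travel" to an earlier version
```

    Because earlier entries are never rewritten, reading version `v1` after `v2` has been committed still returns the files that were live at `v1`, which is the property that makes audits and rollbacks possible.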
  • 40
    ParadeDB

    ParadeDB

    ParadeDB

    ParadeDB brings column-oriented storage and vectorized query execution to Postgres tables. Users can choose between row and column-oriented storage at table creation time. Column-oriented tables are stored as Parquet files and are managed by Delta Lake. Search by keyword with BM25 scoring, configurable tokenizers, and multi-language support. Search by semantic meaning with support for sparse and dense vectors. Surface results with higher accuracy by combining the strengths of full text and similarity search. ParadeDB is ACID-compliant with concurrency controls across all transactions. ParadeDB integrates with the Postgres ecosystem, including clients, extensions, and libraries.
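    For readers unfamiliar with BM25 scoring, the following is a minimal sketch of the textbook Okapi BM25 formula in plain Python. It is not ParadeDB's implementation or API; the function name `bm25_scores`, the whitespace tokenizer, the sample documents, and the parameter defaults `k1=1.2` and `b=0.75` are assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each document in `docs` against `query` with Okapi BM25."""
    tokenized = [doc.lower().split() for doc in docs]  # naive tokenizer
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Rarer terms get a higher inverse document frequency.
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            # Term frequency saturates and is normalized by document length.
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    "postgres full text search",
    "vector similarity search for postgres",
    "backup and archive policies",
]
scores = bm25_scores("full text search", docs)
```

    The first document matches all three query terms and ranks highest, the second matches only "search", and the third matches nothing and scores zero. Configurable tokenizers and multi-language support would replace the naive `split()` used here.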
  • 41
    Rubrik

    Rubrik

    Rubrik

    A logical air gap prevents attackers from discovering your backups while our append-only file system ensures backup data can't be encrypted. You can keep unauthorized users out with globally-enforced multi-factor authentication. From backup frequency and retention to replication and archival, replace hundreds or thousands of backup jobs with just a few policies. Apply the same policies to all your workloads across on-premises and cloud. Archive your data to your public cloud provider’s blob storage service. Quickly access archived data with real-time predictive search. Search across your entire environment, down to the file level, and select the right point in time to recover. Reduce recovery time from days and weeks to hours or less. Rubrik and Microsoft have joined forces to help you build a cyber-resilient business. Reduce the risk of backup data breach, loss, or theft by storing immutable copies of your data in a Rubrik-hosted cloud environment, isolated from your core workloads.
  • 42
    ELCA Smart Data Lake Builder
    Classical Data Lakes are often reduced to basic but cheap raw data storage, neglecting significant aspects like transformation, data quality and security. These topics are left to data scientists, who end up spending up to 80% of their time acquiring, understanding and cleaning data before they can start using their core competencies. In addition, classical Data Lakes are often implemented by separate departments using different standards and tools, which makes it harder to implement comprehensive analytical use cases. Smart Data Lakes solve these various issues by providing architectural and methodical guidelines, together with an efficient tool to build a strong high-quality data foundation. Smart Data Lakes are at the core of any modern analytics platform. Their structure easily integrates prevalent Data Science tools and open source technologies, as well as AI and ML. Their storage is cheap and scalable, supporting both unstructured data and complex data structures.
  • 43
    Amazon Kendra
    Amazon Kendra is a highly accurate and easy to use enterprise search service that’s powered by machine learning. Kendra delivers powerful natural language search capabilities to your websites and applications so your end users can more easily find the information they need within the vast amount of content spread across your company. Use natural language questions instead of just simple keywords to get the answers you’re looking for, whether that is a precise answer, an FAQ, or an entire document. Say goodbye to sifting through long lists of links and hoping one has the information you need. Get rid of information silos. Kendra lets you easily add content from file systems, SharePoint, intranet sites, file sharing services, and more, into a centralized location so you can quickly search all of your information to find the best answer. Your search results get better over time, because Kendra’s machine learning algorithms learn which results your users find most valuable.
    Starting Price: $2.50 per hour
  • 44
    Rinalogy Search
    Almost any search query applied to Big Data returns a very large number of results that are often practically impossible to review. Every user has specific needs. Finding information based on a user query and general data statistics does not produce useful results. eDiscovery, healthcare, financial services, crime, consulting, academia and other fields need to be able to quickly find accurate information. Rinalogy Search is a next generation search tool that uses machine learning to interactively learn from each user to return personalized results based on the user’s feedback in real time. Rinalogy Search returns relevancy scores for individual documents in the results for each query. Rinalogy Search can be deployed in clients’ IT infrastructure, close to your data and behind your firewall. Rinalogy allows users to define the level of importance of search concepts by assigning weights to them, which helps you find the results you are looking for.
    Starting Price: $50 per month
  • 45
    Atolio

    Atolio

    Atolio

    Atolio is an AI-powered enterprise search engine that keeps your data in your cloud. It enables you to ask questions about your knowledge and receive intelligent, permission-aware answers, without your IP leaving your control. Atolio is designed and built for the enterprise, offering secure, self-hosted deployment on AWS, Azure, or GCP, ensuring enterprise-grade security and compliance. It provides AI-driven knowledge discovery, allowing you to find what you need, when you need it. Atolio scours enterprise applications to surface relevant documents and identify internal experts, fostering collaboration and informed decision-making. With seamless integration across Office 365, Google Workspace, Slack, Salesforce, ServiceNow, and more, Atolio unifies your search experience, helping teams work smarter, not harder. It works with your model and cloud of choice, using LLMs that don’t train on your data, so you’ll be confident that your IP stays safe and in your control.
  • 46
    Attach2Dynamics
    Attach2Dynamics is a document management solution that provides seamless attachment management across multiple cloud storage services, such as SharePoint, Dropbox, and Azure Blob Storage, from within Dynamics 365 CRM. It lets you drag and drop, browse, and select multiple files or a folder at once to upload to the cloud storage of your choice. It has an easy-to-view UI for viewing all the files and folders in the configured cloud storage against the current record. Users can also rename, create, email, delete, and preview files and folders, and generate a shareable link to a file or folder to include in an email directly from within Dynamics 365 CRM.
  • 47
    ExpanDrive

    ExpanDrive

    ExpanDrive

    The Smartest Way to Connect to the Cloud. Seamless access to cloud storage from within any application. ExpanDrive adds cloud storage like Google Drive, Dropbox, Amazon S3, SFTP (SSH), Box, OneDrive and SharePoint to Finder and Explorer. Don’t bother with an extra app just to move data around. ExpanDrive connects cloud storage to every application on your computer including Office 365, Photoshop, and VS Code. Choose files for offline access and work without an internet connection. Synchronization to the cloud takes place automatically when you’re back online. Other files are accessed on-demand from the cloud, taking no disk space. Major storage providers have left you behind and we’re here to help. ExpanDrive adds native cloud storage access into Linux for all major Linux distributions, including Ubuntu, Linux Mint, CentOS, Red Hat, and more. ExpanDrive hooks into Spotlight Search on Mac and Windows file search. Quickly search your remote storage for whatever you’re looking for.
  • 48
    Coreviz

    Coreviz

    Coreviz

    CoreViz Studio is a visual-AI platform that helps users automatically understand, organize, edit, search, tag, generate, and collaborate on images and videos without writing code. It supports natural-language search (RAG style) so you can describe what you’re looking for and find matching visual content, and it provides tools for background removal, object removal, upscaling/enhancement, and image editing via text instructions. It also has features for tagging and organizing media, detecting visual similarity across your library, and using specialized AI models trained for domain-specific tasks (e.g., forensic, medical, industrial) for more accurate results. CoreViz integrates with external storage and import sources like Google Drive, Dropbox, and data lakes, plus supports custom workflows and collaboration across teams and organizations, including real-time sharing and custom layout of processes.
    Starting Price: $15 per month
  • 49
    QuikFynd

    QuikFynd

    QuikFynd

    This software will enable you to quickly search, organize and share all of your files on your desktop and across your multiple cloud drives. You have valuable data stored on your computers and it is growing rapidly. You also have many files in cloud drives, but the bulk of your documents and pictures still remain on your computer. It is very difficult to know where all the files are, and you could be spending precious time trying to find the right file. With QuikFynd, you can instantly find files on your computer, connected network drives, removable storage and across cloud drives. Beautifully designed and easy to use, QuikFynd is also very powerful. You can search by keywords in documents and objects in images, and it even tags your documents automatically using machine learning.
    Starting Price: $2.80 per month
  • 50
    Apache Doris

    Apache Doris

    The Apache Software Foundation

    Apache Doris is a modern data warehouse for real-time analytics. It delivers lightning-fast analytics on real-time data at scale. Push-based micro-batch and pull-based streaming data ingestion within a second. Storage engine with real-time upsert, append and pre-aggregation. Optimized for high-concurrency and high-throughput queries with a columnar storage engine, MPP architecture, cost-based query optimizer, and vectorized execution engine. Federated querying of data lakes such as Hive, Iceberg and Hudi, and databases such as MySQL and PostgreSQL. Compound data types such as Array, Map and JSON. Variant data type to support auto data type inference of JSON data. NGram Bloom filter and inverted index for text searches. Distributed design for linear scalability. Workload isolation and tiered storage for efficient resource management. Supports shared-nothing clusters as well as separation of storage and compute.
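    As a rough illustration of how an n-gram Bloom filter can speed up text searches, here is a toy version in plain Python. It is not Apache Doris code or configuration; the class name `NGramBloomFilter`, the trigram size, the bit-array size, and the SHA-256-based hashing are all assumptions made for the sketch. The idea is that a block of text is summarized by the n-grams it contains, so a substring query can skip any block whose filter shows that some n-gram of the pattern is missing.

```python
import hashlib


class NGramBloomFilter:
    """Toy Bloom filter over character n-grams (hypothetical, for illustration)."""

    def __init__(self, n=3, size_bits=1024, num_hashes=3):
        self.n, self.size, self.k = n, size_bits, num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _ngrams(self, text):
        return {text[i:i + self.n] for i in range(len(text) - self.n + 1)}

    def _positions(self, gram):
        # Derive k bit positions per n-gram from seeded SHA-256 digests.
        for seed in range(self.k):
            digest = hashlib.sha256(f"{seed}:{gram}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, text):
        for gram in self._ngrams(text):
            for pos in self._positions(gram):
                self.bits |= 1 << pos

    def might_contain(self, pattern):
        # False means the pattern is definitely absent; True means "maybe".
        return all(
            (self.bits >> pos) & 1
            for gram in self._ngrams(pattern)
            for pos in self._positions(gram)
        )


block_filter = NGramBloomFilter()
block_filter.add("connection refused by host")
maybe_hit = block_filter.might_contain("refused")
```

    A `True` result only means the block has to be scanned, since Bloom filters can return false positives but never false negatives. Patterns shorter than `n` have no n-grams to check, so the filter cannot rule them out and always answers "maybe".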