Showing 33 open source projects for "high"

  • 1
    pandas

    Fast, flexible and powerful Python data analysis toolkit

    ...pandas is continuously being developed to be a fundamental high-level building block for doing practical, real-world data analysis in Python, as well as a powerful and flexible open source data analysis/manipulation tool for any language.
    Downloads: 67 This Week
  • 2
    LinDB

    LinDB is a scalable, high-performance, high-availability database

    LinDB is a scalable, high-performance, high-availability distributed time series database. A single server can easily support more than one million write TPS. With fundamental techniques like efficient compression storage and parallel computing, LinDB delivers highly optimized query performance. The multi-channel replication protocol supports any number of nodes and ensures the system's availability.
    Downloads: 0 This Week
  • 3
    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT-native database with high performance for data management and analysis, deployable on the edge and in the cloud. Thanks to its lightweight architecture, high performance and rich feature set, together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in industrial IoT fields.
    Downloads: 0 This Week
  • 4
    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    DuckDB is a high-performance analytical database system. It is designed to be fast, reliable and easy to use. DuckDB provides a rich SQL dialect, with support far beyond basic SQL. DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs), and more. For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website.
    Downloads: 68 This Week
  • 5
    Pandas Profiling

    Create HTML profiling reports from pandas DataFrame objects

    ...Comprehensive and automatic list of potential data quality issues (high correlation, skewness, uniformity, zeros, missing values, constant values, among others).
    Downloads: 0 This Week
  • 6
    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 4 This Week
  • 7
    SageMaker Spark Container

    Docker image used to run data processing workloads

    Apache Spark™ is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing. The SageMaker Spark Container is a Docker image used to run batch data processing workloads on Amazon SageMaker using the Apache Spark framework. ...
    Downloads: 0 This Week
  • 8
    .NET for Apache Spark

    A free, open-source, and cross-platform big data analytics framework

    .NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations.
    Downloads: 0 This Week
  • 9
    clusterProfiler

    A universal enrichment tool for interpreting omics data

    clusterProfiler is an R/Bioconductor package that provides a unified workflow for functional enrichment analysis to interpret high-throughput omics results. It supports both over-representation analysis and gene set enrichment analysis, letting you work with unranked gene lists or ranked statistics from differential pipelines. The package connects to multiple knowledge bases—such as Gene Ontology, KEGG, Reactome, Disease Ontology, MeSH and others—through a consistent interface so you can query different biological lenses without rewriting code. ...
    Downloads: 1 This Week
  • 10
    dlib

    Toolkit for making machine learning and data analysis applications

    Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real-world problems. It is used in both industry and academia in a wide range of domains including robotics, embedded devices, mobile phones, and large high-performance computing environments. Dlib's open source licensing allows you to use it in any application, free of charge. It has good unit test coverage: the ratio of unit test lines of code to library lines of code is about 1 to 4. The library is tested regularly on MS Windows, Linux, and Mac OS X systems. No other packages are required to use the library; only the APIs provided by an out-of-the-box OS are needed. ...
    Downloads: 3 This Week
  • 11
    Sweetviz

    Visualize and compare datasets, target values and associations

    Sweetviz is an open-source Python library that generates beautiful, high-density visualizations to kickstart EDA (Exploratory Data Analysis) with just two lines of code. Output is a fully self-contained HTML application. The system is built around quickly visualizing target values and comparing datasets. Its goal is to help quick analysis of target characteristics, training vs testing data, and other such data characterization tasks.
    Downloads: 0 This Week
  • 12
    POCO

    Cross-platform C++ libraries for building network applications

    The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems. Whether building automation systems, industrial automation, IoT platforms, air traffic management systems, enterprise IT application and infrastructure management, security and network analytics, automotive infotainment and telematics, financial or healthcare, C++ developers have been trusting the POCO C++...
    Downloads: 0 This Week
  • 13
    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 12 This Week
  • 14
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 1 This Week
  • 15
    SciDAVis is a user-friendly data analysis and visualization program primarily aimed at high-quality plotting of scientific data. It strives to combine an intuitive, easy-to-use graphical user interface with powerful features such as Python scriptability.
    Downloads: 1,271 This Week
  • 16

    MS-Helios

    MS-Helios: A Circos wrapper to visualize multi-omic datasets

    ...We present MS-Helios, an easy-to-use command-line tool with multiple built-in data processing functions, allowing non-expert users to construct CPMs or, in general terms, circular plots with a non-genomic basis. MS-Helios automatically generates data and configuration files to create high-quality, publishable circular plots with Circos.
    Downloads: 0 This Week
  • 17
    TensorBase

    TensorBase is a new big data warehousing with modern efforts

    ...After some thought, we decided to temporarily leave the general data warehousing field. If you want to learn how a database system can be built, how to apply modern Rust to the high-performance field, or how to embed a lightweight data analysis system into your own larger one, you can still try, ask about, or contribute to TensorBase. The committers are still around the community. We will help you with all kinds of interesting things pursued in the project by us and maybe by you. We still maintain the project and look forward to meeting more database geniuses in this world, although no new features will be added in the near future.
    Downloads: 0 This Week
  • 18

    TSA_CRAFT

    Automatic command line tool for TSA data analysis

    DSF is a high-throughput platform for TSA assays that screen various conditions affecting protein stability. To facilitate TSA data analysis, we developed an automatic tool, "TSA-CRAFT". TSA-CRAFT was developed by integrating Perl scripts and Gnuplot. The Perl scripts manage the entire workflow and the data processing procedures of TSA data analysis, while Gnuplot handles curve fitting and result presentation.
    Downloads: 3 This Week
  • 19
    OpenUnderwriter (Insurance Distribution)

    Feature rich insurance quote & buy system for underwriters and brokers

    ...Our goal is simple: we are focused on enabling insurers and brokers to perform more effectively by providing a powerful, adaptable and robust open source insurance software toolset that removes the traditional barriers associated with insurance software adoption, including high cost of ownership, slow time to market and restrictive long-term contracts.
    Downloads: 1 This Week
  • 20
    Central Proteomics Facilities Pipeline

    A data analysis pipeline for shotgun mass-spectrometry proteomics.

    ...However: * Limited fixes have been made on the main master branch by Phil Charles at the University of Oxford. * As of 2019, limited maintenance is being carried out by myself (now in the BioHPC high performance computing group at UTSW), to allow CPFP to install and run on RHEL 7 / CentOS 7, with updated TPP 5.1.0 and search tools. This updated code can be found in the utsw-biohpc branch of the git repository, and may become a 2.2.0 release if time and testing allows. -- David Trudgian - 2019/01/03
    Downloads: 0 This Week
  • 21
    HCS Analyzer

    High Content Screening Analyzer

    HCS Analyzer is an open source software dedicated to High Content Screening data processing and analysis.
    Downloads: 0 This Week
  • 22
    GlycoPAT

    Comprehensive, open-source tool for glycoproteomics MS data analysis

    GlycoPAT is a modular, open-source, MATLAB-based toolbox for shotgun mass spectrometry based glycoproteomics data analysis. This program provides a novel platform for the streamlined analysis of traditional LC-MSn based high-throughput experimental data for the identification of site-specific N- and O-linked glycosylation on various proteins. More details are available at https://www.VirtualGlycome.org/glycopat and the source code is available at https://github.com/kaichengub/GlycoPAT (see both links for details). If you use this software, please cite: Gang Liu, Kai Cheng, Chi Y. ...
    Downloads: 0 This Week
  • 23

    Canvas-2DGC

    A user-extensible, Freemat-based 1DGC and GCxGC data analysis tool.

    ...The first goal of this project is to provide a highly intelligent analytical tool for reducing complex chromatographic data analysis, either 1D or 2D, to just a few simple and intuitive steps. The second goal is to provide a fast and practical way for ordinary users to customize or extend functionality through high-level Freemat scripting. Freemat, one of the open source Matlab clones, was chosen for this purpose because its syntax is the most compatible with Matlab and it is lightweight, which makes it well suited to ordinary users. Contact xguan@jnxtec.com for support.
    Downloads: 1 This Week
  • 24

    Bamformatics

    Toolkit and GUI for sequencing data analysis

    The Bamformatics project aims to provide a coherent and consistent approach to the analysis of high-throughput sequencing data. Its toolkit includes, among others, programs to identify variants and to compute various types of genomic tracks. It also provides a graphical user interface to facilitate general bioinformatic workflows. The project wiki contains further details.
    Downloads: 0 This Week
  • 25
    IAP.G2P

    IAP - the Integrated Analysis Pipeline

    The Integrated Analysis Platform (IAP) has been designed and developed to support the analysis of large-scale image data sets from different camera systems. It aims to bridge different data domains and to integrate different approaches to data analysis and post-processing.
    Downloads: 4 This Week