Showing 809 open source projects for "python data analysis"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Picsart Enterprise Background Removal API for Stunning eCommerce Visuals Icon
    Picsart Enterprise Background Removal API for Stunning eCommerce Visuals

    Instantly remove the background from your images in just one click.

    With our Remove Background API tool, you can access the transformative capabilities of automation , which will allow you to turn any photo asset into compelling product imagery. With elevated visuals quality on your digital platforms, you can captivate your audience, and therefore achieve higher engagement and sales.
    Learn More
  • 1
    QGIS

    QGIS

    Cross platform (lin/win/mac) geographical information system (GIS)

    ... and functionalities. QGIS provides a continously growing number of capabilities provided by core functions and plugins. You can visualize, manage, edit, analyze data, and compose printable maps. We also collect case studies of how people all over the world use QGIS for their GIS work. You can view combinations of vector and raster data (in 2D or 3D) in different formats and projections without conversion to an internal or common format.
    Downloads: 82 This Week
    Last Update:
    See Project
  • 2
    Avogadro 2

    Avogadro 2

    Avogadro libraries provide 3D rendering, visualization, and analysis

    AvogadroLibs is the core C++ library behind Avogadro 2, an open-source molecular editor and visualization platform used in chemistry, materials science, and education. It provides the essential tools for constructing, analyzing, and visualizing molecular structures in 2D and 3D. Designed for extensibility, AvogadroLibs supports plugins for quantum chemistry computations, molecular mechanics, and surface rendering. It interfaces with multiple chemistry formats and data sources, making...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 3
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and...
    Downloads: 35 This Week
    Last Update:
    See Project
  • 4
    LAStools

    LAStools

    efficient tools for LiDAR processing

    LAStools is a collection of efficient, multi-core, scriptable tools for processing LiDAR data. It supports various formats, including LAS, LAZ, Terrasolid BIN, and ESRI Shapefiles, providing a comprehensive suite for LiDAR data management and analysis.
    Downloads: 14 This Week
    Last Update:
    See Project
  • MongoDB Atlas | Run databases anywhere Icon
    MongoDB Atlas | Run databases anywhere

    Ensure the availability of your data with coverage across AWS, Azure, and GCP on MongoDB Atlas—the multi-cloud database for every enterprise.

    MongoDB Atlas allows you to build and run modern applications across 125+ cloud regions, spanning AWS, Azure, and Google Cloud. Its multi-cloud clusters enable seamless data distribution and automated failover between cloud providers, ensuring high availability and flexibility without added complexity.
    Learn More
  • 5
    DwarFS

    DwarFS

    A fast high compression read-only file system for Linux, Windows

    The Deduplicating Warp-speed Advanced Read-only File System. A fast high compression read-only file system for Linux and Windows. DwarFS is a read-only file system with a focus on achieving very high compression ratios in particular for very redundant data. This probably doesn't sound very exciting, because if it's redundant, it should compress well. However, I found that other read-only, compressed file systems don't do a very good job at making use of this redundancy. See here...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 6
    dlib

    dlib

    Toolkit for making machine learning and data analysis applications

    Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems. It is used in both industry and academia in a wide range of domains including robotics, embedded devices, mobile phones, and large high performance computing environments. Dlib's open source licensing allows you to use it in any application, free of charge. Good unit test coverage, the ratio of unit test lines of code to library lines of code is...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    ROOT

    ROOT

    Analyzing, storing and visualizing big data, scientifically

    ROOT is a unified software package for the storage, processing, and analysis of scientific data: from its acquisition to the final visualization in the form of highly customizable, publication-ready plots. It is reliable, performant and well supported, easy to use and obtain, and strives to maximize the quantity and impact of scientific results obtained per unit cost, both of human effort and computing resources. ROOT provides a very efficient storage system for data models, that demonstrated...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    Open3D

    Open3D

    A modern library for 3D data processing

    Open3D is an open-source library that supports rapid development of software that deals with 3D data. The Open3D frontend exposes a set of carefully selected data structures and algorithms in both C++ and Python. The backend is highly optimized and is set up for parallelization. Open3D was developed from a clean slate with a small and carefully considered set of dependencies. It can be set up on different platforms and compiled from source with minimal effort. The code is clean, consistently...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9
    DuckDB

    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ... data analysis, e.g. Joining & aggregate multiple large tables. Concurrent large changes, to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. Run make in the root directory to compile the sources. For development, use make debug to build a non-optimized debug version.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    TensorFlow

    TensorFlow

    TensorFlow is an open source library for machine learning

    Originally developed by Google for internal use, TensorFlow is an open source platform for machine learning. Available across all common operating systems (desktop, server and mobile), TensorFlow provides stable APIs for Python and C as well as APIs that are not guaranteed to be backwards compatible or are 3rd party for a variety of other languages. The platform can be easily deployed on multiple CPUs, GPUs and Google's proprietary chip, the tensor processing unit (TPU). TensorFlow...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    SimpleITK

    SimpleITK

    A layer built on top of the Insight Toolkit (ITK)

    ... be applicable to SimpleITK. Unlike ITK's support of n-dimensional spatio-temporal images, SimpleITK supports 2D, 3D and 4D images. The dimensionality refers to spatiotemporal dimensions, the voxels can be n-dimensional vectors. Open-source multi-dimensional image analysis in Python, R, Java, C#, Lua, Ruby, TCL and C++. Developed by the Insight Toolkit community for the biomedical sciences and beyond.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12

    Protocol Buffers

    Google's data interchange format

    Protocol Buffers are Google’s fast and simple, language- and platform-neutral, extensible mechanism for serializing structured data. It allows you to define how your data should be structured once, and then using a special generated source code, you can then easily write and read your structured data to and from a variety of data streams and using a variety of languages. Protocol Buffers currently supports a wide array of languages, including C++, Java, Python, Ruby, and many others with more...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    ArcticDB

    ArcticDB

    ArcticDB is a high performance, serverless DataFrame database

    Built for the modern Python Data Science ecosystem, ArcticDB transforms your ability to handle complex real-world data with an Incredibly fast proven Petabyte scale. ArcticDB is designed with quant users in mind. It allows you to self-manage your data leveraging your preferred infrastructure. Giving you the keys to protect your most valuable asset. Supports large concurrent writes to many tables ensuring datasets can be onboarded fast and in the most convenient format. Scale-out architecture...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    XGBoost

    XGBoost

    Scalable and Flexible Gradient Boosting

    XGBoost is an optimized distributed gradient boosting library, designed to be scalable, flexible, portable and highly efficient. It supports regression, classification, ranking and user defined objectives, and runs on all major operating systems and cloud platforms. XGBoost works by implementing machine learning algorithms under the Gradient Boosting framework. It also offers parallel tree boosting (GBDT, GBRT or GBM) that can quickly and accurately solve many data science problems. XGBoost...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    Watchman

    Watchman

    A file watching service that records when files change

    Watchman purpose is to watch files and record when there are changes. It can trigger actions (rebuilding assets, for example) when there is a change in matching files. The watchman executable has both the client and the server components of the watchman service. When running watchman, it will attempt to communicate with your existing server instance (each user has their own persistent process), and will attempt to start it if it doesn’t exist. There are some options that affect how watchman...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    ccapi

    ccapi

    A header-only C++ library for interacting with crypto exchanges

    A header-only C++ library for streaming market data and executing trades directly from cryptocurrency exchanges (i.e. the connections are between your server and the exchange server without anything in-between).
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Perspective

    Perspective

    A data visualization and analytics component

    Perspective is a high-performance data visualization library for building real-time, interactive analytics dashboards. Developed by FINOS, it supports WebAssembly-powered pivot tables and can handle large streaming datasets with speed and flexibility. Perspective is ideal for fintech, trading, and IoT applications where insights from live data need to be visualized, sliced, and explored quickly in a browser.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Apache Doris

    Apache Doris

    MPP-based interactive SQL data warehousing for reporting and analysis

    Apache Doris is a modern MPP analytical database product. It can provide sub-second queries and efficient real-time data analysis. With it's distributed architecture, up to 10PB level datasets will be well supported and easy to operate. Apache Doris can meet various data analysis demands, including history data reports, real-time data analysis, interactive data analysis, and exploratory data analysis. Make your data analysis easier! Support standard SQL language, compatible with MySQL protocol...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    osm2pgsql

    osm2pgsql

    Import OpenStreetMap data into a PostgreSQL/PostGIS database

    osm2pgsql is a powerful tool for importing OpenStreetMap (OSM) data into a PostgreSQL/PostGIS database, enabling geographic data analysis and map rendering. It supports various rendering schemas like "flex" and "lua" to customize how data is loaded and indexed. Designed for performance and scalability, osm2pgsql is widely used in map tile generation pipelines and by GIS professionals handling large-scale spatial datasets.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Hotspot

    Hotspot

    The Linux perf GUI for performance analysis

    This project is a KDAB R&D effort to create a standalone GUI for performance data. As the first goal, we want to provide a UI like KCachegrind around Linux perf. Looking ahead, we intend to support various other performance data formats under this umbrella. The main feature of the hotspot is visualizing a perf.data file graphically. The timeline allows filtering the results by time, process, or thread. The data views will update accordingly. You can also launch perf from the hotspot, to profile...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Pedalboard

    Pedalboard

    A Python library for audio

    pedalboard is a Python library for working with audio: reading, writing, rendering, adding effects, and more. It supports the most popular audio file formats and a number of common audio effects out of the box and also allows the use of VST3® and Audio Unit formats for loading third-party software instruments and effects. pedalboard was built by Spotify’s Audio Intelligence Lab to enable using studio-quality audio effects from within Python and TensorFlow. Internally at Spotify, pedalboard...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Soufflé

    Soufflé

    Datalog variant for tool designers crafting analyses in Horn clauses

    Rapid prototyping for your analysis problems with logic; enabling deep design-space explorations; designed for large-scale static analysis; e.g., points-to analysis for Java, taint-analysis, and security checks. Futamura projections/partial evaluation for effective translation to parallel C++; optimized staged compilation; specialized data-structures for logical relations. Efficient translation to parallel C++ of Datalog programs (CAV'16, CC'16) Efficient interpretation using de-specialization...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Fast Binary Encoding (FBE)

    Fast Binary Encoding (FBE)

    Fast Binary Encoding is fast and universal serialization solution

    Fast Binary Encoding is ultra fast and universal serialization solution for C++, C#, Go, Java, JavaScript, Kotlin, Python, Ruby, and Swift. Fast Binary Encoding allows to description of any domain models, business objects, complex data structures, client/server requests & responses and generate native code for different programming languages and platforms.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    chDB

    chDB

    chDB is an in-process OLAP SQL Engine

    chDB is an in-process SQL OLAP Engine powered by ClickHouse. It is developed by ClickHouse, Inc and open-source contributors.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    manticoresearch

    manticoresearch

    Easy to use open source fast database for search

    ... over HTTP and uses the MySQL protocol (you can use your preferred MySQL client). JSON over HTTP: to provide a more programmatic way to manage your data and schemas, Manticore provides a HTTP JSON protocol. Written fully in C++: starts fast, doesn't take much RAM, and low-level optimizations provide good performance. Can sync from MySQL/PostgreSQL/ODBC/xml/csv out of the box. Not fully ACID-compliant, but supports transactions and binlog for safe writes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.