Showing 48 open source projects for "python data analysis"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Mimesis

    Mimesis

    High-performance fake data generator for Python

    Mimesis is an open source high-performance fake data generator for Python, able to provide data for various purposes in various languages. It's currently the fastest fake data generator for Python, and supports many different data providers that can produce data related to people, food, transportation, internet and many more. Mimesis is really easy to use, with everything you need just an import away.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 3
    PerfView

    PerfView

    PerfView is a CPU and memory performance-analysis tool

    PerfView is a free performance analysis tool that helps isolate CPU and memory-related performance issues. It is a Windows tool, but it also has some support for analyzing data collected on Linux machines. It works for a wide variety of scenarios, but has a number of special features for investigating performance issues in code written for the .NET runtime. If you are unfamiliar with PerfView, there are PerfView video tutorials.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 4
    Deepchecks

    Deepchecks

    Test Suites for validating ML models & data

    Deepchecks is the leading tool for testing and for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompany you through various validation and testing needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models. While you’re in the research phase, and want to validate your data, find potential methodological problems, and/or validate...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Atheris

    Atheris

    A Coverage-Guided, Native Python Fuzzer

    ...The tool integrates smoothly with Python’s packaging and unit-test ecosystems, so you can wrap existing tests as fuzz targets and keep results understandable. It supports structured input strategies and custom mutators, which is especially helpful for text and data formats common in Python workloads. In practice, Atheris compresses weeks of edge-case brainstorming into hours of automated exploration with actionable, minimized reproductions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Computer Science Flash Cards

    Computer Science Flash Cards

    Mini website for testing both general CS knowledge and enforce coding

    This repository collects concise flash cards that cover the core ideas of a traditional computer science curriculum with a focus on interview readiness. The cards distill topics like time and space complexity, classic data structures, algorithmic paradigms, operating systems, networking, and databases into short, testable prompts. They are designed for spaced-repetition style study so you can cycle frequently through fundamentals until recall feels automatic. Many cards point at canonical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    sqlmap

    sqlmap

    Automatic SQL injection and database takeover tool

    sqlmap is a powerful, feature-filled, open source penetration testing tool. It makes detecting and exploiting SQL injection flaws and taking over the database servers an automated process. sqlmap comes with a great range of features that along with its powerful detection engine make it the ultimate penetration tester. It offers full support for MySQL, Oracle, PostgreSQL, Microsoft SQL Server, Microsoft Access, IBM DB2, SQLite, Firebird, and many other database management systems. It also...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    Gatling Studio

    Gatling Studio

    Desktop app that helps you craft Gatling load-testing simulations

    Gatling Studio is a native desktop application created to simplify the creation of Gatling load-testing simulations from real browser traffic. Instead of hand-writing scenarios in Scala or Java from scratch, you use Gatling Studio to drive a Chromium-based browser through a typical user journey. The tool captures all HTTP and HTTPS requests and responses, saves them as a HAR file, and then automatically converts that data into a structured Gatling project with groups, pauses, and exec...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    TextTest is an application-independent tool for text-based functional testing. This means running a batch-mode binary in lots of different ways, and using the text output produced as a means of controlling the behaviour of that application.
    Leader badge
    Downloads: 102 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Classic HWUT - rename of previous HWUT

    Classic HWUT - rename of previous HWUT

    Software Unit Tests (Language Independent Approach)

    Automation of Unit and System Tests. Tests can be implemented in any language and on many platforms. The flexible approach enables the inclusion of many types of tests, such as memory leak checks (using valgrind), coding rule checks, complexity checks, etc. Tests are run by a simple call to hwut in a base directory of a project. In particular for C, HWUT supports make file generation using 'sos' and 'sols' modes. Remote control-able function stubs may be generated using the 'stub' mode....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CAT

    CAT

    CAT is the basic component of the server project

    ...In addition to metrics, it enables tracing—propagating context across RPC boundaries so problems like latency spikes or failed calls can be traced end-to-end. Alert rules and anomaly detection can be defined to notify teams proactively. The system supports multiple data backends and ingestion pipelines to collect data from JVM, C/C++, Python, and other ecosystems. With the collected data, Cat supports analysis of hotspots, trending anomalies, and capacity planning to drive continuous reliability improvements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    GoReplay

    GoReplay

    The Swiss Army knife for testing and monitoring

    Increase confidence in code deployments, configuration changes and infrastructure changes with GoReplay! GoReplay is an innovative open source tool that captures and replays live HTTP traffic, using it to continuously test your system with real data. Now you don’t have to put up with risks that come with putting a third-party component in the critical path. With GoReplay you can analyze and record your application traffic without affecting it whatsoever. Use it for shadowing, load testing, monitoring and detailed analysis. You can trust GoReplay to accurately emulate production environments so you can make changes and deploy with confidence.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    speedtest-cli

    speedtest-cli

    Command line interface for testing internet bandwidth using speedtest

    Command line interface for testing internet bandwidth using speedtest.net. It is not a goal of this application to be a reliable latency reporting tool. Latency reported by this tool should not be relied on as a value indicative of ICMP style latency. It is a relative value used for determining the lowest latency server for performing the actual speed test against. Speedtest CLI brings the trusted technology and global server network behind Speedtest to the command line. Measure internet...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Frontend Regression Validator (FRED)

    Frontend Regression Validator (FRED)

    Visual regression tool used to compare baseline and updated instances

    Visual regression tool used to compare baseline and updated instances of a website in a deployment pipeline. FRED is an opensource visual regression tool used to compare two instances of a website. FRED is responsible for automatic visual regression testing, with the purpose of ensuring that functionality is not broken by comparing a current(baseline) and an updated version of a website. The visual analysis computes the Normalized Mean Squared error and the Structural Similarity Index on the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hiper

    Hiper

    A statistical analysis tool for performance testing

    Hiper is an open-source command-line tool designed for statistical analysis of web performance and page load behavior during performance testing. The tool repeatedly loads a specified webpage multiple times and gathers detailed timing metrics in order to produce more reliable performance measurements than single-run benchmarks. By averaging data across multiple page loads, Hiper helps developers understand whether performance optimizations actually improve real-world page loading behavior. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    pysourceinfo

    pysourceinfo

    RTTI for Python Source and Binary Files

    The 'pysourceinfo' package provides source information on Python runtime objects based on 'inspect', 'sys', 'os', and 'imp'. The covered objects include packages, modules, functions, methods, scripts, and classes by two views: - File System View - packages, modules, and linenumbers - based on files and paths - Runtime Object View - callables, classes, and containers - based on in-memory RTTI / introspection The supported platforms are: - Linux, BSD, Unix, OS-X, Cygwin, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Sploit

    Sploit

    SploitFrameWork *BETA*

    WELCOME TO THE SPLOITFRAMEWORK SPLOIT COMES WITH A PAYLOAD OF THINGS FOR ALL YOUR HACKING NEEDS, SUCH AS 1. Email Phishing 2. Spear Phishing 3. Spoofing 4. Credential Harvester 5. Kayloggers 6. Tabnabbing And More . . .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Specstractor

    Tool chain for the automated extraction/analysis of system invariants

    Specstractor is an automated tool chain for the extraction and analysis of system specifications in the form of system invariants. These automatically generated invariants yield useful insight into the actual system behavior and can reveal flaws and missing elements in the developer-maintained specifications. Specstractor consists of two main components, the Extractor, which automatically extracts invariants from automatically generated test cases of a system using data mining, and Insight, a web based tool that gives the user high level views and visualizations of the resulting invariants. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SW Test Automation Framework
    The Software Testing Automation Framework (STAF) is a framework designed to improve the level of reuse and automation in test cases and test environments. The goal of STAF is to provide a complete end-to-end automation solution for testers.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 21
    CANStream

    CANStream

    An applicaton for CAN bus coms development, testing and validation

    ...Thanks to its powerful built-in mathematical expression evaluator, CANStream can also behave as a real control system feeding back the test device or commanding a third party device with context sensible data. Extended data logging and data analysis features of CANStream provide a comprehensive solution for testing and results analysis CANStream extensively uses the PCAN-Basic API developed by PEAK System for the PCAN-USB adapter You will therefore need to have PCAN-USB adapter and at least one free USB port available to make a complete use of CANStream. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SNMP Simulator is a software that would act like a multitude of real physical devices from SNMP Manager's point of view. Simulator builds and uses a database of physical devices' SNMP footprints to respond like their real counterparts do. Project moved to GitHub: https://github.com/etingof/snmpsim
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    cutplace
    Cutplace validates tabular data (CSV, fixed format) according to an interface control document (ICD). The ICD acts as executable specification and can be described using popular spreadsheet applications (Calc, Excel).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    LittleDarwin

    Mutation Analysis Framework for Java

    Mutation testing has been proven to be a reliable method to estimate the quality of a test suite. Most of current mutation tools require the architecture of their target system to be in a certain way in order to apply the technique; thus making it impossible to use them in complex build environments. LittleDarwin is a mutation testing tool designed primarily with the premise of easy deployment in complex systems; in order to provide mutation testing where other tools fail.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    AWK Statistics

    awkStatistics generates statistics from csv-data with awk.

    AwkStatistics is a script which is used to generate statistical data for csv data by using awk. It takes two columns from the input file, one is used to identify the row(option -i) and the other one is the column where the values are taken from (option -v). The script was designed for performance engineers to be able to analyze CSV-Data on the fly and have a good possibility to automate the analysis process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB