Showing 41 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Mimesis

    Mimesis

    High-performance fake data generator for Python

    Mimesis is an open source high-performance fake data generator for Python, able to provide data for various purposes in various languages. It's currently the fastest fake data generator for Python, and supports many different data providers that can produce data related to people, food, transportation, internet and many more. Mimesis is really easy to use, with everything you need just an import away.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 3
    PerfView

    PerfView

    PerfView is a CPU and memory performance-analysis tool

    PerfView is a free performance analysis tool that helps isolate CPU and memory-related performance issues. It is a Windows tool, but it also has some support for analyzing data collected on Linux machines. It works for a wide variety of scenarios, but has a number of special features for investigating performance issues in code written for the .NET runtime. If you are unfamiliar with PerfView, there are PerfView video tutorials.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 4
    Deepchecks

    Deepchecks

    Test Suites for validating ML models & data

    Deepchecks is the leading tool for testing and for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompany you through various validation and testing needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models. While you’re in the research phase, and want to validate your data, find potential methodological problems, and/or validate...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Pacu

    Pacu

    The AWS exploitation framework, designed for testing security

    ...What used to take days to manually enumerate can be now be achieved in minutes. There are currently over 35 modules that range from reconnaissance, persistence, privilege escalation, enumeration, data exfiltration, log manipulation, and miscellaneous general exploitation.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    Atheris

    Atheris

    A Coverage-Guided, Native Python Fuzzer

    ...The tool integrates smoothly with Python’s packaging and unit-test ecosystems, so you can wrap existing tests as fuzz targets and keep results understandable. It supports structured input strategies and custom mutators, which is especially helpful for text and data formats common in Python workloads. In practice, Atheris compresses weeks of edge-case brainstorming into hours of automated exploration with actionable, minimized reproductions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Computer Science Flash Cards

    Computer Science Flash Cards

    Mini website for testing both general CS knowledge and enforce coding

    This repository collects concise flash cards that cover the core ideas of a traditional computer science curriculum with a focus on interview readiness. The cards distill topics like time and space complexity, classic data structures, algorithmic paradigms, operating systems, networking, and databases into short, testable prompts. They are designed for spaced-repetition style study so you can cycle frequently through fundamentals until recall feels automatic. Many cards point at canonical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    sqlmap

    sqlmap

    Automatic SQL injection and database takeover tool

    sqlmap is a powerful, feature-filled, open source penetration testing tool. It makes detecting and exploiting SQL injection flaws and taking over the database servers an automated process. sqlmap comes with a great range of features that along with its powerful detection engine make it the ultimate penetration tester. It offers full support for MySQL, Oracle, PostgreSQL, Microsoft SQL Server, Microsoft Access, IBM DB2, SQLite, Firebird, and many other database management systems. It also...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 9
    Gatling Studio

    Gatling Studio

    Desktop app that helps you craft Gatling load-testing simulations

    Gatling Studio is a native desktop application created to simplify the creation of Gatling load-testing simulations from real browser traffic. Instead of hand-writing scenarios in Scala or Java from scratch, you use Gatling Studio to drive a Chromium-based browser through a typical user journey. The tool captures all HTTP and HTTPS requests and responses, saves them as a HAR file, and then automatically converts that data into a structured Gatling project with groups, pauses, and exec...
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10

    DBD2

    A multi-threaded, multi-database tcp-based database insertion app.

    DBD2 inserts database records from TCP input. It is a companion project to Syslgod2 and an application in its own right. It ships with a backgrounding client for generic use. The client sends whatever information the user puts on the command-line to the back-end daemon via TCP. Upon successful transmission, the client exits. On failure, it backs off and retries until its life-time timer expires. The back-end daemon accepts data from Syslogd2 or its own client and inserts that data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 12
    CAT

    CAT

    CAT is the basic component of the server project

    ...In addition to metrics, it enables tracing—propagating context across RPC boundaries so problems like latency spikes or failed calls can be traced end-to-end. Alert rules and anomaly detection can be defined to notify teams proactively. The system supports multiple data backends and ingestion pipelines to collect data from JVM, C/C++, Python, and other ecosystems. With the collected data, Cat supports analysis of hotspots, trending anomalies, and capacity planning to drive continuous reliability improvements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    GoReplay

    GoReplay

    The Swiss Army knife for testing and monitoring

    Increase confidence in code deployments, configuration changes and infrastructure changes with GoReplay! GoReplay is an innovative open source tool that captures and replays live HTTP traffic, using it to continuously test your system with real data. Now you don’t have to put up with risks that come with putting a third-party component in the critical path. With GoReplay you can analyze and record your application traffic without affecting it whatsoever. Use it for shadowing, load testing, monitoring and detailed analysis. You can trust GoReplay to accurately emulate production environments so you can make changes and deploy with confidence.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    speedtest-cli

    speedtest-cli

    Command line interface for testing internet bandwidth using speedtest

    Command line interface for testing internet bandwidth using speedtest.net. It is not a goal of this application to be a reliable latency reporting tool. Latency reported by this tool should not be relied on as a value indicative of ICMP style latency. It is a relative value used for determining the lowest latency server for performing the actual speed test against. Speedtest CLI brings the trusted technology and global server network behind Speedtest to the command line. Measure internet...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Frontend Regression Validator (FRED)

    Frontend Regression Validator (FRED)

    Visual regression tool used to compare baseline and updated instances

    Visual regression tool used to compare baseline and updated instances of a website in a deployment pipeline. FRED is an opensource visual regression tool used to compare two instances of a website. FRED is responsible for automatic visual regression testing, with the purpose of ensuring that functionality is not broken by comparing a current(baseline) and an updated version of a website. The visual analysis computes the Normalized Mean Squared error and the Structural Similarity Index on the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hiper

    Hiper

    A statistical analysis tool for performance testing

    Hiper is an open-source command-line tool designed for statistical analysis of web performance and page load behavior during performance testing. The tool repeatedly loads a specified webpage multiple times and gathers detailed timing metrics in order to produce more reliable performance measurements than single-run benchmarks. By averaging data across multiple page loads, Hiper helps developers understand whether performance optimizations actually improve real-world page loading behavior. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    pysourceinfo

    pysourceinfo

    RTTI for Python Source and Binary Files

    The 'pysourceinfo' package provides source information on Python runtime objects based on 'inspect', 'sys', 'os', and 'imp'. The covered objects include packages, modules, functions, methods, scripts, and classes by two views: - File System View - packages, modules, and linenumbers - based on files and paths - Runtime Object View - callables, classes, and containers - based on in-memory RTTI / introspection The supported platforms are: - Linux, BSD, Unix, OS-X, Cygwin, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Sploit

    Sploit

    SploitFrameWork *BETA*

    WELCOME TO THE SPLOITFRAMEWORK SPLOIT COMES WITH A PAYLOAD OF THINGS FOR ALL YOUR HACKING NEEDS, SUCH AS 1. Email Phishing 2. Spear Phishing 3. Spoofing 4. Credential Harvester 5. Kayloggers 6. Tabnabbing And More . . .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SW Test Automation Framework
    The Software Testing Automation Framework (STAF) is a framework designed to improve the level of reuse and automation in test cases and test environments. The goal of STAF is to provide a complete end-to-end automation solution for testers.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 20
    SNMP Simulator is a software that would act like a multitude of real physical devices from SNMP Manager's point of view. Simulator builds and uses a database of physical devices' SNMP footprints to respond like their real counterparts do. Project moved to GitHub: https://github.com/etingof/snmpsim
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    cutplace
    Cutplace validates tabular data (CSV, fixed format) according to an interface control document (ICD). The ICD acts as executable specification and can be described using popular spreadsheet applications (Calc, Excel).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    LittleDarwin

    Mutation Analysis Framework for Java

    Mutation testing has been proven to be a reliable method to estimate the quality of a test suite. Most of current mutation tools require the architecture of their target system to be in a certain way in order to apply the technique; thus making it impossible to use them in complex build environments. LittleDarwin is a mutation testing tool designed primarily with the premise of easy deployment in complex systems; in order to provide mutation testing where other tools fail.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    AWK Statistics

    awkStatistics generates statistics from csv-data with awk.

    AwkStatistics is a script which is used to generate statistical data for csv data by using awk. It takes two columns from the input file, one is used to identify the row(option -i) and the other one is the column where the values are taken from (option -v). The script was designed for performance engineers to be able to analyze CSV-Data on the fly and have a good possibility to automate the analysis process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    misspell-check

    misspell-check

    A program to check source code for mispelings...

    A program to check source code for mispelings...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Analysis tools for scale test data generated by The Grinder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB