Showing 505 open source projects for "python data analysis"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1

    newsscrape

    news headline collecting for analysis in determining the category

    newsscrape is web scraping for news headline to analyse on how it relates to a news category. - It extracts RSS feed from Google News. - Each news headline is matched against Google News category like Entertainment, Sports, etc. - Called from scheduler to collect this data at 5 minutes interval and be accumulated in a database. - It contains R statistical computing scripts to learn the pattern on words in the headline resulting a particular category. - To test its accuracy in predicting...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Bifrozt

    Bifrozt

    High interaction honeypot solution for Linux based systems

    NOTICE: The format of this project has been changed from ISO to using ansible and has been moved to GitHub. Github link: https://github.com/Bifrozt/bifrozt-ansible
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    vtrunkd

    Network link bonding/trunking/aggregation and multichannel VPN daemon

    vtrunkd - network link bonding and multichannel VPN. Vtrunkd is a Linux VPN daemon used to combine several connection paths into one aggregated channel. Features latency, reordering and jitter management, behaviour analysis optimizations for encapsulated protocols, bufferbloat control, packet redundancy, and multiple cpu cores utilization. Up to 30 heterogenous links bonding supported. Used for live streaming, LTE/3G/Wi-Fi link bonding. 32/64-bit, x86, MIPS and ARM supported. Supports python...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    IAD dispatch web scraper

    A very simple web scraper for taxi dispatch data.

    ... in the past hour. This information is updated a few times every hour, not in real time. Goal: Write a very, very, very (did I mention very?) simple program to pull data from a simple website and plot the information on a graph. Additional work can be done to build a small database and do analytics on the data. Motivation: What is the best time of day to wait for passengers as a taxi driver? How long are the expected wait times based on time of day?
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 5
    SteppedStateMachine

    SteppedStateMachine

    Creates and operates a stepped state machine

    Implements a stepped state machine, i.e. a state machine which executes a single state transition at a time. Because of this, no data, e.g. state data, can be stored between executions. Instead, any such data must be stored in persistent storage between executions. This permits operation of the state machine as a CGI program in a web server. A WSGI or fastCGI or other such web server is not required. Received symbols may be received from sources outside the state machine, or may...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    dotCODES_Source_Control_for_VS

    dotCODES_Source_Control_for_VS

    The dotCODES Source Control Maintenance Mainframe (SCM2)

    The dotCODES Source Control Maintenance Mainframe for Visual Studio is an administrator console application for developing dotCODES components. Built upon a Python foundation, the program is used to create data center routines (Unix packages) and maintain enterprise cloud services (CGI scripts/Apache) by means of building dotCODES runtimes and deploying them to and from the client server.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    must: A More Useful Syslog Test tool

    A syslog message generator that replicates real syslog messages

    Using templated message formats with customisable placeholders, run in configurable sequences that can selectively reuse data between steps, must allows more intelligent testing of syslog receivers with realistic data, as well as longer soak testing and stress testing. must was created to fill a gap found when trying to stress test Splunk as real, indexable and meaningful data was needed. must will (eventually) be provided as a standalone tool that uses XML configs (for quick use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Reconcile: Web Credibility Evaluation

    Reconcile: Web Credibility Evaluation

    Robust online credibility evaluation of web content

    ... a great, positive social impact. Dataset from our mTurk studies is now available for public use! Our dataset contains 15750 evaluations of 5543 pages from 2041 participants with some additional information about website characteristics and basic demographic features of users. If You would like to gain access to this data-set or learn more about data structure please contact our administrator (konrad@pjwstk.edu.pl) stating Your name, academic degree and university affiliation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 10
    Cookie Cruncher For Chrome

    Cookie Cruncher For Chrome

    Cookie Cruncher is an easy way to remove all cookies in Chrome.

    Cookie Cruncher For Chrome is a simple, quick and effective way to safely delete all cookies in Chrome with a single click. Here is our VirusTotal link: https://www.virustotal.com/en/file/01266a35134fa5f4b687cfb91b6e84967f659a23b8435e60bef8bbe114e8fd1b/analysis/1438536098/ Note: Jiangmin and Zillya both flag the software as malicious, but this is a false positive and is to be expected, only 2 out of 55 antiviruses flag the software as "malicious". We think this is because our software...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    power-console-admin

    Unix/Linux console admin for PowerDNS server

    Easy to use PowerDNS admin for Unix/Linux console. Work with SQLite3, MySQL, PostgreSQL backend. Easy way to create domain, add/update record data - simultaneously forward and reverse zone. Both IPv4 and IPv6 addresses. In the future maybe with ncurces UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PyTables - Hierarchical datasets
    The goal of PyTables is to enable the end user to efficiently and easily manipulate large datasets (both homogenous, i.e. arrays, and heterogenous, i.e. tables) on a persistent, hierarchical way.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13

    python-web_excavator

    Genral Data Mining API: Only write html parsing code.

    A general web scraper that uses the requests library to communicate with the website. Scraper() contains a parser object, which you can add parsing handles to. ParseHandle() is the code mining for you data from an html source. Repo: https://github.com/crispycret/web_excavator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Roxy Fileman

    Web file manager

    Free file browser for CKEditor and TinyMCE. Can be used in any web application. Ready for use with ASP .NET and PHP - all scripts are included. User unterface completely separated from the server side logic. Language independant server side scipts - can be implemented in any programming language - PHP, ASP .NET, Python etc. Latest versions is available at http://www.roxyfileman.com/download
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Big Brother Bot (B3)
    BigBrotherBot (B3) is a cross-platform, cross-game game administration bot. Features in-game administration of game servers, multiple user access levels, and database storage. Currently include parsers for: Call of Duty, Urban Terror and more!
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    The goal of this project is to provide a Content Provider plugin for the iQser GIN Platform (www.iqser.com) as a semantic middleware to synchronize and transform data objects of LinkedIn for integration and analysis requirements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    The goal of this project is to provide a free Content Provider plugin for the iQser GIN Platform (www.iqser.com) as a semantic middleware to synchronize and transform data objects from Facebook for integration and analysis requirements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    ReorJS

    Distributed Computing with JavaScript

    Create your own distributed computer that can distributed javascript based applications to any computer with a web browser, headless browser or node.js installation. For more information and updates please see our website - http://reorjs.com.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Domain Analyzer Security Tool

    Finds all the security information for a given domain name

    Domain analyzer is a security analysis tool which automatically discovers and reports information about the given domain. Its main purpose is to analyze domains in an unattended way.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    AWK Statistics

    awkStatistics generates statistics from csv-data with awk.

    AwkStatistics is a script which is used to generate statistical data for csv data by using awk. It takes two columns from the input file, one is used to identify the row(option -i) and the other one is the column where the values are taken from (option -v). The script was designed for performance engineers to be able to analyze CSV-Data on the fly and have a good possibility to automate the analysis process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    SE Auditor

    Free SEO audit software.

    SE Auditor is a program for analyzing web pages for search engines. SE Auditor is application that you can use to view statistical data about your website, in order to improve its position within the Web search results. SE Auditor is addressed to SEO professionals, website designers, developers, website testers and owners. SE Auditor enables you to check meta description, keywords, sitemap, the number of links and keyword consistency, the text/HTML ratio and many more ranking...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    Job Crawler

    Job Data Collection - Web Crawler

    ... on these figures, and performs a detailed analysis for the employment situation of the states of the USA. What is the hot job in your state? This report is going to explain how to design and implement solution for Job data collection system. It also includes some links for source code, class diagram, algorithm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Flightdeck-UI is a project that utilizes the ideas from the design of aircraft controls and instruments for creating computer user interfaces, such as for monitoring.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    FlightFeather's goal is "social networking for everyone". This means that anyone should have a chance to run a popular social networking site -- on minimal hardware, and without wasting bandwidth.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.