Showing 1049 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Picsart Enterprise Background Removal API for Stunning eCommerce Visuals Icon
    Picsart Enterprise Background Removal API for Stunning eCommerce Visuals

    Instantly remove the background from your images in just one click.

    With our Remove Background API tool, you can access the transformative capabilities of automation , which will allow you to turn any photo asset into compelling product imagery. With elevated visuals quality on your digital platforms, you can captivate your audience, and therefore achieve higher engagement and sales.
    Learn More
  • 1
    NTK RTMP SERVER

    NTK RTMP SERVER

    Naam Tamilar Web TV Live Streamer

    Naam Tamilar RTMP Server This project updated as open source for future use of Naam Tamilar Political Party. To contribute to the party and in some case if there is any possibilities if i cannot support them for long term. I thought of sharing this source code so in future it may be helpful for the community and party in which other software developers can help them to upgrade. This source is forked from - https://github.com/arut/nginx-rtmp-module and modified with multiple broadcast...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    ... JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy. License: R, RStudio, NLTK, SciPy, SKLearn, MatPlotLib, Weka, ... each has their own licenses.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    EasyPHP Devserver & Webserver

    EasyPHP Devserver & Webserver

    A complete WAMP environment for PHP development & personal web hosting

    EasyPHP Devserver & Webserver Develop with Devserver & Host with Webserver PHP, Apache, MySQL, Nginx, PhpMyAdmin, Xdebug, PostgreSQL, MongoDB, Python, Ruby... for Windows. Devserver installs a complete and ready-to-use development environment. Devserver is portable, modular, fully configurable and easy to update and extend. Webserver turns your computer into a ready-to-use personal web hosting server. You can host whatever you want directly on your computer and share it on internet like any...
    Downloads: 31 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    phoneutria
    A Java Web crawler: multi-threaded, scalable, with high performance, extensible and polite. It can be used to crawl and index any web or enterprise domain and is configurable through a XML configuration file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AlienVault OSSIM

    AlienVault OSSIM

    Open Source SIEM

    OSSIM, AlienVault’s Open Source Security Information and Event Management (SIEM) product, provides event collection, normalization and correlation. For more advanced functionality, AlienVault Unified Security Management (USM) builds on OSSIM with these additional capabilities: * Log management * Advanced threat detection with a continuously updated library of pre-built correlation rules * Actionable threat intelligence updates from AlienVault Labs Security Research Team * Rich...
    Leader badge
    Downloads: 78 This Week
    Last Update:
    See Project
  • 7
    A simple but relatively complete DNS and DHCP implementation based on a MySQL (Or other) database with both servers drawing information from the same source thus eliminating one of the headaches of administering a small network.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    mediaTUM is free software written in Python for archiving and retrieval of images, documents and other research data. It was originally developed in the framework of the DFG project IntegraTUM and is continuously expanded with new functionalities as required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OpenWebSpider
    OpenWebSpider is an Open Source multi-threaded Web Spider (robot, crawler) and search engine with a lot of interesting features!
    Downloads: 5 This Week
    Last Update:
    See Project
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 10
    Proxenet

    Proxenet

    The ONLY hacker friendly proxy for webapp pentests

    Proxenet is a hacker-friendly, DIY web proxy designed for web application penetration testing. Written in C, it allows interaction with higher-level languages like Python and Ruby for on-the-fly modification of HTTP requests and responses.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Websocket Server in a Pure Data patch

    Websocket Server in a Pure Data patch

    a single patch for direct communications between Pd and web browsers

    NEW : pd-vanilla version requiring only mrpeach external library ! download link (and notes) : https://sourceforge.net/projects/websocketserverinapatch/files/ "websocket-server in a patch.pd" single Pd file contains a reusable and fully documented websocket server, meant as an easy way to experiment with Pure Data <-> web-browsers communications through websockets, without the need of other software (like node.js or python) or any specific external to compile (just one deken library...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Reactive Extensions for JavaScript

    Reactive Extensions for JavaScript

    An API for asynchronous programming with observable streams

    An API for asynchronous programming with observable streams. The Observer pattern done right. ReactiveX is a combination of the best ideas from. The Observer pattern, the Iterator pattern, and functional programming. ReactiveX is everywhere, and it's meant for everything. Available for idiomatic Java, Scala, C#, C++, Clojure, JavaScript, Python, Groovy, JRuby, and others. Embrace ReactiveX's asynchronicity, enabling concurrency and implementation independence. Manipulate UI events and API...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14

    Ganglia

    Scalable, distributed monitoring system for high-performance computing

    Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. Supports clusters up to 2000 nodes in size.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 15
    Cub Linux

    Cub Linux

    Chromium + Ubuntu = Cub Linux

    The best of Chromium and Ubuntu. Cub Linux is a project to replicate the Chromium OS experience on an Ubuntu Linux base system. Cub Linux is free to download and use forever.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 16
    WebDAVSurfer

    WebDAVSurfer

    WebDAV client 64-bit works with Plone 5, Apache and more

    GUI WebDAV Client for Linux and Windows 10. Includes PROPFIND, PROPPATCH, LOCK ,UNLOCK ,VERSION-CONTROL,REPORT. HTTP(S) with Basic Authentication and PKI client and server Certificates. Works with Plone, Zope, Apache + mod_dav, PyWebDAV, PyDAV, Tamino. 64-bit wxPython used. Upload files or from Web. Update properties. Tested with Plone 5.04
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    IAD dispatch web scraper

    A very simple web scraper for taxi dispatch data.

    Introduction: The Dulles International Airport (IAD) near Washington, D.C. has a taxi service provided by the Washington Flyer. Taxi cabs are leased by drivers and rides are regulated using a queue system. Drivers enter a corral near the Arrival gate and wait for dispatchers to announce passengers. There is a website that displays useful information about the queue. The number of taxis waiting in queue, the wait time of the last vehicle out, and the number of taxis to exit the corral in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    O.N.O.S

    O.N.O.S

    Open Network Object System

    ... want, connect your arduino to the usb and run the python program on the pc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    PAMIE

    A Python class to allow the user to automate Internet Explorer

    Python Automation Module (class) for Internet Explorer (PAM.py). Originally written as a simple Python module. This new Python class starting with 2.0 allows the user to automate Internet Explorer browser for QA testing, development testing, or web scraping. This python class only runs on Windows (only) and automates Internet Explorer using the COM object, there is no support for Firefox, Chrome, Safari or Flex at this time. This is not an Application. Also check out the original...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Simple yet powerful multi-threaded object-oriented CGI/FastCGI/WSGI/mod_python/html-templating modules for Python. This project has moved to GitHub: https://github.com/jribbens/jonpy
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Flashlight

    Flashlight

    The missing Spotlight plugin system

    Search the web, save a note, add a reminder, or do over 200 other things, just by typing into your Mac's Spotlight search box. Or write your own plugins. Flashlight is an open platform for Spotlight, which allows developers to write plugins that extend OS X's built-in search with new features. Flashlight is an unofficial Spotlight API that allows you to programmatically process queries and add additional results. It's very rough right now, and a horrendous hack, but a fun proof of concept. Once...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    a Project with All the bells and whistles to allow the average user to fully benefit from HTTP,DNS,FTP,SSH through python, allowing quick and easy deploying of servers without compiling, or installing anything but our favorite language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SteppedStateMachine

    SteppedStateMachine

    Creates and operates a stepped state machine

    Implements a stepped state machine, i.e. a state machine which executes a single state transition at a time. Because of this, no data, e.g. state data, can be stored between executions. Instead, any such data must be stored in persistent storage between executions. This permits operation of the state machine as a CGI program in a web server. A WSGI or fastCGI or other such web server is not required. Received symbols may be received from sources outside the state machine, or may...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MultiPlink

    MultiPlink

    [UPDATE 28-10-2015] FINAL MULTIPLINK SSH LIFETIME v2.0 UP-5 "FAREWELL"

    Download Link Utama: http://sourceforge.net/projects/multiplink/files/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar/download Jika di Halaman Download, Link belum tersedia, klik link alternatif : https://www.dropbox.com/s/znptxp2gxhihpax/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar?dl=0 Lalu Download & Install juga Minor...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    dotCODES_Source_Control_for_VS

    dotCODES_Source_Control_for_VS

    The dotCODES Source Control Maintenance Mainframe (SCM2)

    The dotCODES Source Control Maintenance Mainframe for Visual Studio is an administrator console application for developing dotCODES components. Built upon a Python foundation, the program is used to create data center routines (Unix packages) and maintain enterprise cloud services (CGI scripts/Apache) by means of building dotCODES runtimes and deploying them to and from the client server.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.