Showing 1049 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Sales CRM and Pipeline Management Software | Pipedrive Icon
    Sales CRM and Pipeline Management Software | Pipedrive

    The easy and effective CRM for closing deals

    Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
    Try it for free
  • 1
    Poor Http

    Poor Http

    WSGI Server, WSGI Connector, Python doc generator

    Poor Http Server is standalone wsgi server, which is designed for using python web applications. Unlike other projects, this is not framework, but single server, light wsgi connector, and python doc generator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Java-based framework for decoupling back-end services and front-end interfaces. Browse and interact with a database, a class library, a network, a log file, or any live java object as though it were a filesystem. (It works with filesystems too!)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    PyCancerDB

    Cancer Proteomics Database display and management

    PyCancerDB is a source code distribution providing a Web-based interface for browsing and updating the Cancer Proteomics Database, together with scripts for maintaining the database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    WebCollector

    WebCollector is an open source web crawler framework based on Java.

    WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes. Github: https://github.com/CrawlScript/WebCollector Demo: https://github.com/CrawlScript/WebCollector/blob/master/YahooCrawler.java
    Downloads: 1 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 5

    python-web_excavator

    Genral Data Mining API: Only write html parsing code.

    A general web scraper that uses the requests library to communicate with the website. Scraper() contains a parser object, which you can add parsing handles to. ParseHandle() is the code mining for you data from an html source. Repo: https://github.com/crispycret/web_excavator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    PyLucid is a Open Source web content management system written in Python using the Django Framework. fully customizable output, expandable with plugins, i18n, L10n, Revision controled, support many database engines, WSGI conform and many more...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    huhamhire-hosts

    huhamhire-hosts

    hosts for Internet Freedom

    Since the governments of some countries are using the blocking the internet access to several websites and web service providers which includes some world famous sites like Google, YouTube, twitter, Facebook, and Wikipedia etc., we designed this tiny utility in order to help people getting through the Internet blockade. Hosts Setup Utility provides basic tools to manage the hosts file on current operating systems. It also provides both support for Graphical Desktop environment with Graphical...
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 8

    Roxy Fileman

    Web file manager

    Free file browser for CKEditor and TinyMCE. Can be used in any web application. Ready for use with ASP .NET and PHP - all scripts are included. User unterface completely separated from the server side logic. Language independant server side scipts - can be implemented in any programming language - PHP, ASP .NET, Python etc. Latest versions is available at http://www.roxyfileman.com/download
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    The Burster 3D
    Web browser plug-in for hosting real 3D on the websites.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    AMPPS

    AMPPS

    AMPPS is a LAMP/MAMP/WAMP stack

    AMPPS is a stack of Apache, MySQL, PHP, Perl & Python. AMPPS enables you to focus more on using applications rather than maintaining them. With AMPPS you can create a website by installing any of the 330+ Apps, customizing it, and then simply publishing it on the internet via a wide choice of hosting service providers. A complete package on your desktop, same like the server that provides many open source web applications. Application Management, Domain management, Database management, etc...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11

    Igorx

    Modular Web-/Application server framework

    Igorx is a modular Web-/Application server written in Python and C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The goal of this project is to make possible to access Progress database from any external program that can use sockets. The server (broker and agents) are written in Progress 4GL and made use of sockets capabilities of Progress V9.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Sight Reading Trainer

    Sight Reading Trainer

    Scrolling notes in any key for any instrument

    The MIDI Sight Reader Java Applet works again! To use it you should use Firefox with Java Applet Plug-in installed. Developed using Firefox 25.0.1 and Java 7, Update 51. So what is it? Well it scrolls random notes within a selected key, playing them aloud for audio confirmation, while you, the user, play the notes on your instrument. There is no MIDI interface, the name refers to MIDI because it uses the Java Midi API and organizes everything numerically, according to MIDI numbers. The...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Addons for IOSEC - DoS HTTP Security

    Addons for IOSEC - DoS HTTP Security

    IOSec Addons are enhancements for web security and crawler detection

    IOSEC PHP HTTP FLOOD PROTECTION ADDONS IOSEC is a php component that allows you to simply block unwanted access to your webpage. if a bad crawler uses to much of your servers resources iosec can block that. IOSec Enhanced Websites: https://www.artikelschreiber.com/en/ https://www.unaique.net/en/ https://www.unaique.com/ https://www.artikelschreiber.com/marketing/ https://www.paraphrasingtool1.com/ https://www.artikelschreiben.com/ https://www.buzzerstar.com/ https...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    CFPanel

    Web panel for keeping track of crowdfunding projects

    A web panel where to keep track of crowdfunding projects in which you have taken part.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    ReorJS

    Distributed Computing with JavaScript

    Create your own distributed computer that can distributed javascript based applications to any computer with a web browser, headless browser or node.js installation. For more information and updates please see our website - http://reorjs.com.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Penetration-Testing-Toolkit v1.0

    A web interface for various penetration testing tools

    Penetration-Testing-Toolkit is a web based project to automate Scanning a network,Exploring CMS, Generating Undectable metasploit payload, DNS-Queries, IP related informations, Information Gathering, Domain related info etc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19

    iFlux

    Tornado Web Based Framework

    iFlux was rebooted!!! We are developing a Tornado Web Based Framework. Still in planning phase. Check us on openhub too: https://www.openhub.net/p/iflux Used to be: PHP Framework project based on Model-View-Controller MVC Desing Patern. iFlux is a MyFuses extension that makes easy build applications with i18n, form mapping, template engine like smarty and other advanced features.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20

    gooseweb

    A lightweight, no bullshit web browser written in python

    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Job Crawler

    Job Data Collection - Web Crawler

    Job data collection bases on the Web Crawler’s concept. In the context of the World Wide Web, Web crawler is program use the crawling process in order to gathering data from web pages includes hyperlinks and content. Web crawler is also to be called a Web spider, an ant, an automatic indexer. Job data collection system is a web crawler program is used to gather job information and supply for user an overview about the list of jobs in their location. Moreover, program is going to reply...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PHP Mailman allows the integration of Mailman into a dynamic website without using Python or requiring permission to Mailman binaries.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    Domain Analyzer Security Tool

    Finds all the security information for a given domain name

    Domain analyzer is a security analysis tool which automatically discovers and reports information about the given domain. Its main purpose is to analyze domains in an unattended way.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    MovieGrabber

    MovieGrabber

    MovieGrabber is a fully automated way of downloading

    MovieGrabber has now moved to GitHub! https://github.com/binhex/moviegrabber
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Simple Blogger
    New version was released! Visit https://github.com/jsnjack/simpleblogger to download Simple Blogger is a small and very easy to use application that enables you to quickly write and publish a new post of your blog. Available in Windows and Linux. Simple Blogger has written on Python with wxPython.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.