Showing 139 open source projects for "python web crawler"

  • 1
    Ascoos Web Extended Studio

    A portable web server suite for 64-bit Windows, for web development.

    Ascoos Web Extended Studio is a special 64-bit freeware web server package for web developers and designers, based on Apache, PHP, MariaDB, MongoDB, FileZilla, and others. It gives the user the option of running different versions of PHP and MariaDB. It is structured for easy upgrading. Each new version of Ascoos Web Extended Studio includes the latest versions of the individual programs without removing earlier versions. So, you have the opportunity for experiments...
    Downloads: 2 This Week
  • 2
    Ascoos Web Server

    A web server for all web developers and web designers

    For PHP 5.6 - 8.4.X, see Ascoos Web Extended Studio (AWES): https://sourceforge.net/projects/ascoos-web-extended-studio/ ASCOOS Web Server is a rich package designed as a versatile web server for development purposes. It incorporates third-party components such as PHP, MySQL, pgSQL, MongoDB and FileZilla and stands out through a compact setup and a well-built administrative panel. ASCOOS Web Server allows you to work with multiple versions of PHP and MySQL without having to re...
    Downloads: 0 This Week
  • 3
    elFinder
    elFinder is a file manager for the web, similar to the one you use on your computer. Written in JavaScript using jQuery UI, it just works in any modern browser. Its creation was inspired by the simplicity and convenience of the Finder.app program used in Mac OS X.
    Downloads: 6 This Week
  • 4
    Liferay Portal

    The world's leading open source portal

    Liferay Portal is the world's leading enterprise open source portal framework, offering integrated Web publishing and content management, an enterprise service bus and service-oriented architecture, and compatibility with all major IT infrastructure. Check GitHub for our latest releases: https://github.com/liferay/liferay-portal/releases https://github.com/liferay/liferay-ide/releases
    Downloads: 272 This Week
  • 5
    magnetW

    Magnet link aggregation search

    ... such advertisements. This application is open source and free, and is only used for crawler technology exchange and learning. The search results are all from the source site, and no responsibility is assumed. The project complies with GNU General Public License v3.0. Online playback is performed in conjunction with the webtorrent desktop version. It needs to be downloaded separately. After clicking the online play, it will jump to webtorrent to add tasks.
    Downloads: 1 This Week
  • 6

    gdpr

    Tool to maintain GDPR data protection declarations

    Admins often maintain multiple web pages, each of which requires a privacy statement under the EU GDPR. To keep these statements coherent and up to date while avoiding repeated work, this project provides a tool that automatically creates the appropriate statement for each page from a single source. The project is currently available in PHP; however, anyone willing to provide a version in Python, Perl, or another language is more than welcome. The project...
    Downloads: 1 This Week
  • 7

    Ganglia

    Scalable, distributed monitoring system for high-performance computing

    Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. It supports clusters of up to 2000 nodes.
    Downloads: 19 This Week
  • 8
    PyLucid is an open source web content management system written in Python using the Django framework. Features: fully customizable output, expandable with plugins, i18n, L10n, revision controlled, support for many database engines, WSGI conformant, and many more...
    Downloads: 1 This Week
  • 9

    Roxy Fileman

    Web file manager

    Free file browser for CKEditor and TinyMCE. Can be used in any web application. Ready for use with ASP.NET and PHP; all scripts are included. The user interface is completely separated from the server-side logic. The server-side scripts are language independent and can be implemented in any programming language: PHP, ASP.NET, Python, etc. The latest version is available at http://www.roxyfileman.com/download
    Downloads: 2 This Week
  • 10
    AMPPS

    AMPPS is a LAMP/MAMP/WAMP stack

    AMPPS is a stack of Apache, MySQL, PHP, Perl & Python. AMPPS enables you to focus more on using applications rather than maintaining them. With AMPPS you can create a website by installing any of the 330+ apps, customizing it, and then simply publishing it on the internet via a wide choice of hosting service providers. It is a complete package on your desktop, the same as on the server, that provides many open source web applications. Application management, domain management, database management, etc...
    Downloads: 9 This Week
  • 11

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 2 This Week
  • 12

    iFlux

    Tornado Web Based Framework

    iFlux was rebooted! We are developing a Tornado Web Based Framework. It is still in the planning phase. Check us out on Open Hub too: https://www.openhub.net/p/iflux It used to be a PHP framework project based on the Model-View-Controller (MVC) design pattern. iFlux is a MyFuses extension that makes it easy to build applications with i18n, form mapping, a template engine like Smarty, and other advanced features.
    Downloads: 1 This Week
  • 13

    SE Auditor

    Free SEO audit software.

    SE Auditor is a program for analyzing web pages for search engines. It is an application that you can use to view statistical data about your website in order to improve its position in web search results. SE Auditor is aimed at SEO professionals, website designers, developers, website testers and owners. It enables you to check the meta description, keywords, sitemap, the number of links and keyword consistency, the text/HTML ratio and many more ranking...
    Downloads: 1 This Week
  • 14
    A collection of pure Python algorithms and data structures
    Downloads: 1 This Week
  • 15
    APMP
    Another Apache distribution that contains Apache, MySQL, Perl, and PHP, and comes with the following modules: mod_perl, mod_php, mod_python, and more. It also contains an FTP server and a mail server. It will be ready to run right after installation and a reboot.
    Downloads: 0 This Week
  • 16

    LinkChecker

    check links in web documents or full websites

    New Homepage: http://wummel.github.io/linkchecker/
    Linkchecker features:
    - recursive and multithreaded checking and site crawling
    - output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats
    - HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support
    - restrict link checking with regular expression filters for URLs
    - proxy support
    -...
    Downloads: 11 This Week
  • 17

    Spondulas

    Spondulas is a browser emulator designed to retrieve web pages for hunti

    Spondulas is a browser emulator and parser designed to retrieve web pages for hunting malware. It supports generation of browser user agents, GET/POST requests, and SOCKS5 proxy. It can be used to parse HTML files sent via e-mail. Monitor mode allows a website to be monitored at intervals to discover changes in DNS or content over time. Autolog mode creates an investigation file that documents redirection chains. The retrieved web pages are parsed for links, which are reported to an output file. More...
    Downloads: 1 This Week
  • 18

    SaWALi Web Application Library

    SaWALi is a website management tool written in Python.

    The SaWALi Web Application Library is a Python application that aims to provide a reasonably complete set of components for operating a multi-purpose website. Taking advantage of the Pylons Framework, SaWALi is fully customisable and inherently extensible. All of SaWALi's administrative and public interfaces can be modified to suit a website's userbase, from its document editors and server error pages down to its public-facing pages and site maps. Being a Python module, SaWALi can also...
    Downloads: 0 This Week
  • 19
    tomato's Web Application Common Components: a project aimed at building commonly used components for web applications.
    Downloads: 1 This Week
  • 20
    TurboDiesel

    Platform for building web applications

    Downloads: 0 This Week
  • 21
    Strawberry
    Strawberry allows you to store, manage, classify and consult your favorite links. Because it is a web application (running on the Mother framework), you can access your bookmarks from anywhere when it is run on a server.
    Downloads: 1 This Week
  • 22
    Crawler
    Crawler is a bare-bones spider designed to quickly and effectively build an index of all files and pages on a given Web site as well as the link relationship (both incoming and outgoing) between each page. More open source at https://github.com/fcc.
    Downloads: 0 This Week
  • 23
    htc-py converts XML data into HTML web-pages.
    Downloads: 1 This Week
  • 24
    The project is based on OSTIS (Open Semantic Technology for Intelligent Systems) technology. It has developed tools that enable the use of OSTIS for developing web applications for the Internet.
    Downloads: 0 This Week
  • 25
    The web lint checks HTML and XHTML pages for possible markup problems. It attempts to find problems with your code that an HTML validator does not.
    Downloads: 0 This Week