105 projects for "python web crawler" with 2 filters applied:

  • Dun and Bradstreet Risk Analytics - Supplier Intelligence Icon
    Dun and Bradstreet Risk Analytics - Supplier Intelligence

    Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.

    Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
    Learn More
  • WinMan ERP Software Icon
    WinMan ERP Software

    For companies of all sizes and enterprises in need of a solution to improve their operations

    WinMan ERP is an all-encompassing solution designed to manage the operational, quality, commercial, and financial processes of manufacturers and distributors. It is particularly well-suited for companies embracing Lean strategies.
    Learn More
  • 1
    Liferay Portal

    Liferay Portal

    The world's leading open source portal

    Liferay Portal is the world's leading enterprise open source portal framework, offering integrated Web publishing and content management, an enterprise service bus and service-oriented architecture, and compatibility with all major IT infrastructure. Check GitHub for our latest releases: https://github.com/liferay/liferay-portal/releases https://github.com/liferay/liferay-ide/releases
    Leader badge
    Downloads: 129 This Week
    Last Update:
    See Project
  • 2

    Ganglia

    Scalable, distributed monitoring system for high-performance computing

    Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. Supports clusters up to 2000 nodes in size.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 3
    PyLucid is a Open Source web content management system written in Python using the Django Framework. fully customizable output, expandable with plugins, i18n, L10n, Revision controled, support many database engines, WSGI conform and many more...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    iFlux

    Tornado Web Based Framework

    iFlux was rebooted!!! We are developing a Tornado Web Based Framework. Still in planning phase. Check us on openhub too: https://www.openhub.net/p/iflux Used to be: PHP Framework project based on Model-View-Controller MVC Desing Patern. iFlux is a MyFuses extension that makes easy build applications with i18n, form mapping, template engine like smarty and other advanced features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-First Supply Chain Management Icon
    AI-First Supply Chain Management

    Supply chain managers, executives, and businesses seeking AI-powered solutions to optimize planning, operations, and decision-making across the supply

    Logility is a market-leading provider of AI-first supply chain management solutions engineered to help organizations build sustainable digital supply chains that improve people’s lives and the world we live in. The company’s approach is designed to reimagine supply chain planning by shifting away from traditional “what happened” processes to an AI-driven strategy that combines the power of humans and machines to predict and be ready for what’s coming. Logility’s fully integrated, end-to-end platform helps clients know faster, turn uncertainty into opportunity, and transform the supply chain from a cost center to an engine for growth.
    Learn More
  • 5

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A collection of pure Python algorithms and data structures
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    LinkChecker

    check links in web documents or full websites

    New Homepage: http://wummel.github.io/linkchecker/ Linkchecker features: - recursive and multithreaded checking and site crawling - output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats - HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support - restrict link checking with regular expression filters for URLs - proxy support -...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8

    Spondulas

    Spondulas is browser emulator designed to retrieve web pages for hunti

    Spondulas is browser emulator and parser designed to retrieve web pages for hunting malware. It supports generation of browser user agents, GET/POST requests, and SOCKS5 proxy. It can be used to parse HTML files sent via e-mail. Monitor mode allows a website to be monitored at intervals to discover changes in DNS or content over time. Autolog mode creates an investigation file that documents redirection chains. The retrieved web pages are parsed for links and reported to an output file. More...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    SaWALi Web Application Library

    The SaWALi is a website management tool written in Python.

    The SaWALi Web Application Library is a Python application that aims to provide a reasonably complete set of components for operating a multi-purpose website. Taking advantage of the Pylons Framework, SaWALi is fully-customisable and inherently-extensible. All of SaWALi's administrative and public interfaces can be modified to suit a website's userbase— from its document editors and server error pages down to its public-facing pages and site maps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • D&B Hoovers is Your Sales Accelerator Icon
    D&B Hoovers is Your Sales Accelerator

    For sales teams that want to accelerate B2B sales with better data

    Speed up sales prospecting with the rich audience targeting capabilities of D&B Hoovers so you can spend more sales time closing.
    Learn More
  • 10
    tomato's Web Application Common Components -- project aimed on building commonly used components for web applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Strawberry
    Strawberry allows you to store, manage, classify and consult your preferred links. As a web application (running with Mother framework), you can access your bookmarks from everywhere if runned on a server.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    htc-py converts XML data into HTML web-pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    This is a Django based application for creating database backed or email forms. It features an innovative drag and drop interface for building forms, a workflow for reviewing forms, and infinite customization. As of 12/1/2013, primary development has moved to http://github.com/carsongee/formunculous
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The goal of this project is to write a software which can create out of simple stylesheet the templates of common templatesystems like the ones in vbulletin, Drupal, Wordpress, Joomla, ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The product for Plone (Archetypes based) designed for management of web presentations of conferences. It has tools for conferee registration, article insertion and reviewing, conference activities management, conference calendar, and reservation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ** Guys I have built a much more powerful Fully Featured CMS system at: https://github.com/MacdonaldRobinson/FlexDotnetCMS Macs CMS is a Flat File ( XML and SQLite ) based AJAX Content Management System. It focuses mainly on the Edit In Place editing concept. It comes with a built in blog with moderation support, user manager section, roles manager section, SEO / SEF URL
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    jSEO -- Pluggable SEO (Search Engine Optimization) for dynamic JEE web applications
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Sphene Community Tools is a collection of django applications which are designed to be easily pluggable into any django project. It currently consists of a full featured board and wiki application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    vieW4 (vienna World Wide Web Workgroups) is a collaboration server based on Zope. This program allows a group of people to share documents, discuss them and search for previous released documents in major formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This is an ***old archive*** of tools developed for facilitating the use of Creative Commons licenses and metadata. --- For the most up to date representation of any of the projects listed here, please see: http://creativecommons.org/project/Developer.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    The ADGen application was developed in order to simplify the generation of web site using AsciiDoc file format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Txt2tags converts a text file with minimal markup to HTML, XHTML, SGML, LaTeX, Lout, UNIX Man Page, Wikipedia, Google Code Wiki, DokuWiki, MoinMoin, MagicPoint(mgp), PageMaker. Features: simple, fast, automatic TOC, macros, filters, include, GUI/CLI/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Plone is a ready-to-run content management system that is built on Python and the Zope application server. Plone is easy, flexible, and gives you a system for web content that is ideal for projects, communities, websites and intranets. http://plone.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    0.9.5stable and beyond... USE THE EGGS! get CalendarX via egg or tarball at Pypi or dist.plone.org. CalendarX is a feature-rich calendar and event CMS built in python using Plone and Archetypes. Event categories (metacalendar) and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Nanon is a web page manager, ideal for simple sites with no server generated content. It lets you define templates and reusable html code blocks like menus and ad sections. Nanon is a spiffy command line tool that you can learn how to use in minutes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next