Showing 17 open source projects for "python web crawler"

  • 1
    Archivematica

    Free and open-source digital preservation system

    Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic, and reliable digital content. Our target users are archivists, librarians, and anyone working to preserve digital objects. You are free to copy, modify, and distribute Archivematica with attribution under the terms of the AGPLv3 license. Archivematica is an open-source application based on recognized standards that makes it possible to...
    Downloads: 5 This Week
  • 2
    Zero Install
    Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! With dependency handling and automatic updates, full support for shared libraries, and integration with native package managers
    Downloads: 3,663 This Week
  • 3
    The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accessible content.
    Downloads: 8 This Week
  • 4
    MarcXimiL is a flexible multi-platform bibliographic similarity analysis framework. Features: deduplication, information monitoring, visual analysis, plagiarism detection. Supported: MARCXML, OAI-PMH2 harvesting, and importation of text MARC.
    Downloads: 0 This Week
  • 5
    iCamp is a research and development project funded by the European Commission. The project aims at creating an infrastructure for collaboration and networking in Higher Education across systems.
    Downloads: 1 This Week
  • 6
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
  • 7
    Trident Project
    A digital repository and metadata editing initiative of Duke University Libraries
    Downloads: 0 This Week
  • 8
    Make AsciiDoc part of your literate programming tool set. With eWEB you can weave and tangle literate programs written as AsciiDoc documents, using embedded WEB code snippets.
    Downloads: 1 This Week
  • 9
    Museum portal based on Plone and PostgreSQL presenting archive-, photo-, subject matter and book materials in addition to online articles. Supports importing of data from museum systems in CIDOC XML format.
    Downloads: 0 This Week
  • 10
    Whiki is a hierarchic data structure. This project contains a web content management system to handle a whiki database.
    Downloads: 0 This Week
  • 11
    The Open Archive Cataloguer (zOAC) project applies the OAI-PMH protocol to automatically harvest metadata and aggregate bibliographic records. It is built on the Zope web application server and based on Pentila's ZOpenArchives Zope Product.
    Downloads: 0 This Week
  • 12
    A web-based search interface tailored to the New Zealand Gazette PDF archive for the NZ library community. A generic Python-based Swish-e search interface.
    Downloads: 0 This Week
  • 13
    CiteULike is a free service to help academics share, store, and organise the papers they're reading. This open source project contains the code to scrape citations from publishers' web sites.
    Downloads: 0 This Week
  • 14
    A collection of useful code tips gathered into a utility package, especially for Chinese users, together with a mail-based blog system in which posts are published by email. As an open project, it lets the Chinese Python community share code tips in one uniform package and enjoy open source...
    Downloads: 0 This Week
  • 15

    open-tamil

    Tamil Tools, Tamil Library for Python 2, 3

    Open-Tamil is a full-featured Tamil text processing library in Python. It works in both Python 2 and Python 3 and is published on the Python Package Index, so it can be installed with pip. See: https://pypi.python.org/pypi/Open-Tamil/0.67
    Downloads: 0 This Week
  • 16
    cosmos

    Algorithms that run our universe | Your personal library of every algo

    Cosmos (by OpenGenus Foundation) is your personal offline collection of every algorithm and data structure one will ever encounter and use in a lifetime. This provides solutions in various languages spanning C, C++, Java, JavaScript, Swift, Python, Go and others. This work is maintained by a community of hundreds of people and is a massive collaborative effort to bring the readily available coding knowledge offline.
    Downloads: 0 This Week
  • 17
    PyShelf

    FOSS Ebook Server, With no windowing requirements

    PyShelf is an open-source, Python-based ebook server that does not and never will require a windowing system.
    Downloads: 0 This Week