Showing 22 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    yz-next-apyref

    yz-next-apyref

    Arno-Can's Python based API Reference Generator.

    Multi-Syntax-Version and Multi-Implementation API reference documentation generator. Supports local and remote sources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    etalon

    Standards Document Redaction System

    The best process for creating standards is that of the IETF’s RFCs. Standard documents are written, submitted for publication, reviewed, and finally approved. Then they get a official number and hence can be referenced. Some can even be blessed to be an official standard. Etalon will implement this whole process, web-based and with markdown documents. Create your company’s standards the RFC way!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Leo is an outlining-oriented project manager and browser. With Leo you can organize programs, web sites, URL's, pictures or any other kind of data. Leo is fully scriptable in Python. Leo is 100% pure Python. It requires the Qt gui toolkit.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 5
    DirDiff

    DirDiff

    Python 2.7 script, analyzing 2 directories for differences.HTML output

    Comparing two different versions of a directory with text files of any kind has never been easier. Detects deleted files, new files, files with changed name but same content and of course a side-by-side / line-by-line comparison of changed files. Number of context-lines is configurable. Here's, how to call it: >python DirDiff.py dir1 dir2 output.html or >python.exe DirDiff.py -l 3 -v olddir newdir dirdiff.html -l is the number of context lines in the side-by-side comparison. -v...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Poor Http

    Poor Http

    WSGI Server, WSGI Connector, Python doc generator

    Poor Http Server is standalone wsgi server, which is designed for using python web applications. Unlike other projects, this is not framework, but single server, light wsgi connector, and python doc generator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DoCookBook

    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CodeInvestigator is a tracing tool for Python programs. All the run time information is recorded. Read your code together with its run time details. Use it to visualize what happened when you program ran.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Pynta - Flexible Web Framework in Python
    Pynta is flexible web framework written in Python. All development going on https://github.com/lig/pynta
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 10
    ARREA

    ARREA

    Collaborative editor, requirements management tool and much more

    ARREA (A Reasonable Requirements Engineering Application) is a simple yet powerful requirements management tool that uses arbitrary files to store information and Subversion to allow team collaboration. Its simple conventions make it suitable for much other purposes. In fact, for any document-oriented tasks focused on collaborative work and exhaustive change and traceability management. At its heart, ARREA provides a thin layer on top of SVN that allows it to be used as a powerful...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Make AsciiDoc part of your literate programming tool set. With eWEB you can weave and tangle literate programs written as AsciiDoc documents, using embedded WEB code snippets.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 12
    Txt2tags converts a text file with minimal markup to HTML, XHTML, SGML, LaTeX, Lout, UNIX Man Page, Wikipedia, Google Code Wiki, DokuWiki, MoinMoin, MagicPoint(mgp), PageMaker. Features: simple, fast, automatic TOC, macros, filters, include, GUI/CLI/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    uListen is a TTS(Text To Speech) application. It can TALK you the web pages, chm files, pdf files and word files and plain text files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The pydoc module, distributed bundled with python, proved extremely helpful in handling documentation strings included in python scripts. This project proposes a clean up of the code as well as add new capabilities, with python version 2.6 as a target.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A small, effective, extendable and customizable personal Wiki developed for pocketPC systems. Can run on all python supported platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    EurekA is a knowldege base system primarily developed in python with MySQL as the database, SQLObject module as the DB API and CherryPy+Genshi as the frontend. The main aim of this project is to provide a dynamic Knowldege Base system that caters to the
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Fudge is a documentation system for software projects inspired by Pudge. It is currently targeted at providing integration with current Python software developement tools and internationalization, but support for other programming languages is planned.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Django-driven web-based book-authoring content-management-system (ddwbbacms) is a program that allows authors to work on their books, articles, or topics without having to have their hard drives nearby. Access to the web is all that is required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    System to assist with the management of a local Buddhist study library. Will download and install chosen documentation (sutras, etc), index them, and provide a web interface. Python, HTML. Needs a webserver, namazu. Can be used for other documents, too.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Wenost simplifies the process of building and maintaining larger web sites by hand. Wenost is used in conjunction with PPWizard, a free HTML pre-processor. For examples of websites built with wenost see www.csem.org.uk and the wenost project website.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    POST (Python Obviously Simple Text) provides support for simple, flexible dynamic document generation in multiple output formats. Supports inputs in text or XML, outputs in HTML, PDF, RTF, LaTeX source, nroff source, postscript, and plain text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Gathering requirements should be fun. It's the brainstorming process refined. It shouldn't get in the way of productivity or creativity. Harvest is about gathering requirements in the most intuitive, least intrusive way possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.