pydocrawl automatically downloads pdf-, ps- and doc- files from web sites. An initial URL and a wordlist must be given. Multithreaded information mining (harvesting) tool written entirely in Python. Version 0.1 successfully runs on Linux and Cygwin.

Project Activity

See All Activity >

Follow pydocrawl

pydocrawl Web Site

Other Useful Business Software
Ship AI Apps Faster with Vertex AI Icon
Ship AI Apps Faster with Vertex AI

Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
Try Vertex AI Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of pydocrawl!

Additional Project Details

Registered

2004-09-12