pydocrawl is a multithreaded information-mining (harvesting) tool, written entirely in Python, that automatically downloads PDF, PS, and DOC files from websites. It requires an initial URL and a wordlist. Version 0.1 runs successfully on Linux and Cygwin.
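To illustrate the general idea, here is a minimal sketch of how a crawler of this kind might pick document links out of a fetched page. It is not pydocrawl's actual code: the names `LinkCollector`, `find_documents`, and `DOC_EXTENSIONS` are hypothetical, and matching the wordlist against the link URL is an assumption, since the description does not say how the wordlist is applied.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# File types named in the project description.
DOC_EXTENSIONS = (".pdf", ".ps", ".doc")


class LinkCollector(HTMLParser):
    """Collects absolute URLs from the href attributes of <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, value))


def find_documents(html, base_url, wordlist):
    """Return document links whose URL contains any word from the wordlist.

    Assumption: the wordlist filters on the URL text. The real tool may
    instead match page content or link text.
    """
    collector = LinkCollector(base_url)
    collector.feed(html)
    words = [w.lower() for w in wordlist]
    hits = []
    for link in collector.links:
        path = urlparse(link).path.lower()
        if path.endswith(DOC_EXTENSIONS) and any(w in link.lower() for w in words):
            hits.append(link)
    return hits
```

In a multithreaded design, the URLs returned by `find_documents` could be handed to a pool of worker threads (for example `concurrent.futures.ThreadPoolExecutor`) for downloading, while the remaining links are queued for further crawling.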