HAipproxy is a distributed proxy IP pool that collects, manages, and serves large numbers of proxy addresses for web crawling. It crawls proxy resources from the internet automatically and aggregates them into a centralized pool that distributed spiders and scraping systems can query. The project is written in Python: Scrapy handles crawling, and Redis handles data storage, communication, and task coordination between components. Those components are crawlers that discover proxy servers, validators that test each proxy's availability and performance, and schedulers that manage crawling and validation tasks. The goal is a high-availability, low-latency pool, so that scraping frameworks can rotate proxies efficiently and avoid being blocked during large-scale data collection. The architecture supports distributed deployment, with multiple crawler workers and validators running on different machines.
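
The collect, validate, and serve cycle described above can be illustrated with a minimal sketch. This is not HAipproxy's actual code or API: the `ProxyPool` class, its method names, and the `max_failures` threshold are hypothetical, and plain dictionaries stand in for the Redis store so the example runs without a server. The probe function is injected and is assumed to return a latency in seconds, or `None` when the proxy fails.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# Hypothetical probe signature: returns latency in seconds, or None on failure.
Probe = Callable[[str], Optional[float]]


@dataclass
class ProxyPool:
    """In-memory stand-in for the shared pool (the real project uses Redis)."""

    max_failures: int = 3  # assumed eviction threshold, not taken from HAipproxy
    _latency: Dict[str, float] = field(default_factory=dict)
    _failures: Dict[str, int] = field(default_factory=dict)

    def add(self, proxy: str) -> None:
        # Crawler side: register a newly discovered proxy as untested.
        self._failures.setdefault(proxy, 0)

    def validate(self, proxy: str, probe: Probe) -> None:
        # Validator side: run one probe and record the outcome.
        latency = probe(proxy)
        if latency is not None:
            self._latency[proxy] = latency
            self._failures[proxy] = 0
            return
        # A failed proxy is no longer served; evict it after repeated failures.
        self._latency.pop(proxy, None)
        self._failures[proxy] = self._failures.get(proxy, 0) + 1
        if self._failures[proxy] >= self.max_failures:
            del self._failures[proxy]

    def best(self, n: int = 1) -> List[str]:
        # Client side: return validated proxies, lowest latency first.
        return sorted(self._latency, key=self._latency.get)[:n]
```

In a distributed setup the two dictionaries would live in a shared store, so that crawlers, validators, and clients on different machines see the same pool. A sorted set keyed by latency is one natural fit, though that choice is an assumption here.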

Features

  • Distributed crawler architecture powered by Scrapy and Redis
  • Automatic discovery and collection of proxy IP resources
  • Proxy validation system to ensure availability and reliability
  • High availability design for crawler and scheduler components
  • Flexible task routing and scheduling for proxy collection jobs
  • Support for HTTP, HTTPS, and SOCKS5 proxy protocols
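
On the consuming side, a scraper might rotate through pool entries of the three listed protocols roughly as sketched below. The helper names `to_proxies_mapping` and `rotate` are hypothetical and not part of HAipproxy. The returned dictionary follows the `proxies` mapping convention used by the Requests library, and routing traffic through SOCKS5 entries with Requests would additionally need its optional SOCKS support installed.

```python
from itertools import cycle
from typing import Dict, Iterable, Iterator
from urllib.parse import urlsplit

# Protocols named in the feature list above.
SUPPORTED_SCHEMES = {"http", "https", "socks5"}


def to_proxies_mapping(proxy_url: str) -> Dict[str, str]:
    """Check a proxy URL and build a Requests-style proxies mapping."""
    parts = urlsplit(proxy_url)
    if parts.scheme not in SUPPORTED_SCHEMES:
        raise ValueError(f"unsupported proxy scheme: {parts.scheme!r}")
    if not parts.hostname or parts.port is None:
        raise ValueError(f"proxy URL needs a host and a port: {proxy_url!r}")
    # Send both plain and TLS traffic through the same proxy.
    return {"http": proxy_url, "https": proxy_url}


def rotate(proxy_urls: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield proxies mappings in round-robin order, indefinitely."""
    for url in cycle(list(proxy_urls)):
        yield to_proxies_mapping(url)
```

Each value yielded by `rotate` could be passed as the `proxies` argument of a request. Round-robin is only the simplest rotation policy; weighting by measured latency is an equally plausible choice.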

Categories

Web Scrapers

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python Web Scrapers

Registered

2026-03-10