Requests-HTML

This library intends to make parsing HTML (e.g. scraping the web) as simple and intuitive as possible. When using this library you automatically get full JavaScript support (using Chromium, thanks to pyppeteer), CSS selectors (a.k.a. jQuery-style, thanks to PyQuery), XPath selectors, a mocked user-agent (like a real web browser), automatic following of redirects, and connection pooling with cookie persistence. It is the Requests experience you know and love, with magical parsing abilities and async support. Asynchronous code operates the same way as the synchronous version, except that the results come back as a list containing multiple response objects; the same basic steps can be applied to each one to extract the data you want.
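The asynchronous workflow described above can be sketched roughly as follows. This is a minimal illustration rather than the project's official example: the helper names (make_fetcher, fetch_all, page_titles) and the URLs are made up for this sketch, and it assumes the requests-html package is installed and that the network is reachable when fetch_all is called.

```python
import asyncio  # requests-html's async session is built on asyncio


def make_fetcher(session, url):
    """Return a coroutine function that fetches one URL with the given session.

    Hypothetical helper: AsyncHTMLSession.run() expects coroutine *functions*,
    so each URL is wrapped in its own zero-argument async function.
    """
    async def fetch():
        return await session.get(url)
    return fetch


def fetch_all(urls):
    """Fetch several pages concurrently and return a list of responses."""
    # Imported lazily so the helpers above stay importable without the package.
    from requests_html import AsyncHTMLSession

    asession = AsyncHTMLSession()
    fetchers = [make_fetcher(asession, url) for url in urls]
    # run() returns a list of response objects; their order is not guaranteed
    # to match the order of the URLs, so identify each one by its .url.
    return asession.run(*fetchers)


def page_titles(results):
    """Apply the same parsing step to every response in the list."""
    titles = {}
    for response in results:
        title = response.html.find("title", first=True)
        titles[response.url] = title.text if title is not None else None
    return titles


if __name__ == "__main__":
    # Example URLs are placeholders chosen for illustration.
    pages = fetch_all(["https://python.org/", "https://www.pypi.org/"])
    print(page_titles(pages))
```

Because the results may arrive in any order, the sketch keys the titles by each response's URL instead of relying on list position.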

Features

Full JavaScript support! (Using Chromium, thanks to pyppeteer)
CSS Selectors (a.k.a. jQuery-style, thanks to PyQuery)
XPath Selectors, for the faint of heart
Mocked user-agent (like a real web browser)
Connection pooling and cookie persistence
Automatic following of redirects
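
As a small illustration of the CSS and XPath selector features listed above, the sketch below parses an in-memory document instead of a live page. The sample markup and the summarize helper are invented for this example; it assumes the requests-html package is installed and uses its HTML class to parse a string directly.

```python
# Sample markup invented for this sketch; no network access is needed.
SAMPLE_HTML = (
    "<html><head><title>Demo</title></head><body>"
    '<a class="ext" href="https://example.org/docs">Docs</a>'
    '<a href="/about">About</a>'
    "</body></html>"
)


def summarize(markup):
    """Return the text and href of the first 'a.ext' link, plus all hrefs."""
    # Imported lazily so this module can be loaded without the package.
    from requests_html import HTML

    doc = HTML(html=markup)
    # CSS selector (jQuery-style): first anchor carrying the "ext" class.
    first_link = doc.find("a.ext", first=True)
    # XPath selector: the href attribute of every anchor, as plain strings.
    hrefs = doc.xpath("//a/@href")
    return first_link.text, first_link.attrs["href"], hrefs


if __name__ == "__main__":
    print(summarize(SAMPLE_HTML))
```

The same find and xpath calls are available on the html attribute of a response fetched through a session, so the parsing step does not change when the markup comes from the network.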

License

MIT License

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Web Scrapers

Registered

2023-04-10

Pythonic HTML Parsing for Humans
