Lightcrawler is a command-line tool designed to crawl a website and run automated audits on the discovered pages using Google Lighthouse. It works by starting from a given URL and recursively exploring linked pages to collect a set of pages that should be analyzed. Each discovered page is then evaluated using Lighthouse, which performs checks related to performance, accessibility, and web development best practices. This allows developers to audit multiple pages of a site automatically instead of manually running Lighthouse on each individual page. Lightcrawler supports configuration through a JSON configuration file, enabling users to customize how the crawler operates and which Lighthouse audits should be executed. Settings such as crawl depth and the number of concurrent browser instances can be configured to control how aggressively the crawler scans a site. It was created as a developer utility to help identify issues across an entire website more efficiently.
Features
- Crawls a website starting from a specified URL and discovers linked pages
- Runs Lighthouse audits automatically on each crawled page
- Supports configuration through a JSON configuration file
- Allows limiting crawl depth and number of Chrome instances
- Enables selecting specific Lighthouse categories or audits
- Provides automated multi-page website analysis for developers