RobotsDisallowed is a public catalog that tracks websites and organizations explicitly blocking AI and web-scraping crawlers in their robots.txt files or related mechanisms. It documents the growing trend of content owners asserting control over how their data is used for model training and automated harvesting. The project aggregates domains, notes the targeted bots or user agents, and surfaces patterns for researchers, policymakers, and tool builders.

It serves both as a transparency effort and as a resource for people designing allow/deny strategies for automated access. The dataset invites community contributions to keep the picture current as new bots emerge and policies shift. It also highlights the intersection of web standards, ethics, and AI governance by showing how site owners operationalize consent and restriction at scale.

Features

  • Curated list of domains that disallow AI or scraping bots
  • Identification of targeted user agents and blocking patterns
  • Community-updated dataset reflecting policy changes
  • Reference for researchers and builders of crawl-aware tools
  • Snapshot of evolving norms around data usage and consent
  • Lightweight format for analysis and reuse
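As a sketch of how a crawl-aware tool might use this kind of data, the snippet below checks which user agents a site's robots.txt disallows, using only the Python standard library. The robots.txt rules and user-agent names here are illustrative assumptions, not entries from the RobotsDisallowed dataset itself.

```python
# Minimal sketch: detect which AI/scraper user agents are blocked from the
# site root by a robots.txt policy. Rules and agent names are hypothetical.
from urllib.robotparser import RobotFileParser

ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

AGENTS = ["GPTBot", "CCBot", "Googlebot"]

def blocked_agents(robots_txt: str, agents: list[str]) -> list[str]:
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # An agent counts as "blocked" if it may not fetch the site root.
    return [a for a in agents if not parser.can_fetch(a, "/")]

print(blocked_agents(ROBOTS_TXT, AGENTS))  # → ['GPTBot']
```

Running this over many domains' robots.txt files would yield the kind of per-domain, per-agent blocking patterns the project aims to surface.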


Categories

Libraries



Additional Project Details

Registered: 2025-10-28