LLM Scraper

LLM Scraper is a TypeScript library designed to extract structured data from webpages using large language models. Instead of relying on fragile HTML selectors or manual parsing rules, the tool interprets webpage content with language models and converts it into structured data according to a defined schema. Developers can specify the data structure using tools such as Zod or JSON Schema, enabling the model to extract relevant information directly into typed objects. LLM Scraper integrates browser automation through Playwright, allowing it to load webpages and process their content before sending it to a language model for interpretation. Multiple content processing modes are supported, including raw HTML, cleaned HTML, Markdown, extracted text, screenshots, and custom inputs, making it adaptable to a wide range of scraping scenarios. LLM Scraper also provides streaming output and code generation capabilities that help developers build reusable scraping workflows.

Features

Extracts structured data from webpages using large language models
Supports multiple LLM providers including GPT, Gemini, Llama, Qwen, and Sonnet
Schema-based data extraction using Zod or JSON Schema
Built on Playwright for automated webpage loading and interaction
Streaming mode for receiving partial structured outputs in real time
Code generation for creating reusable Playwright scraping scripts

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow LLM Scraper

LLM Scraper Web Site

Other Useful Business Software

Gemini 3 and 200+ AI Models on One Platform

Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free

Rate This Project

User Reviews

Be the first to post a review of LLM Scraper!

Additional Project Details

Programming Language

TypeScript

Related Categories

TypeScript Artificial Intelligence Software

Registered

19 hours ago

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
Vertex AI

Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery...

See Software
Google Cloud BigQuery

BigQuery is a serverless, multicloud data warehouse that simplifies the process of working with all types of data so you can focus on getting valuable business insights quickly. At the core of Google’s data cloud, BigQuery allows you to simplify data integration, cost effectively and securely...

See Software
Process Street

Process Street is an AI-powered compliance operations platform that automates complex workflows, enforces standards, and tracks audit data in real time. Teams use it to create structured SOPs, assign tasks, collect data, and monitor execution with intelligent oversight. From onboarding and...

See Software
Retool

Retool is an AI-powered platform that enables teams to build internal software, agents, and workflows faster using natural language and composable building blocks. It allows users to go from a simple prompt to a fully deployed application that works with their existing data, systems, and...

See Software
Teradata VantageCloud

Teradata VantageCloud: The complete cloud analytics and data platform for AI. Teradata VantageCloud is an enterprise-grade, cloud-native data and analytics platform that unifies data management, advanced analytics, and AI/ML capabilities in a single environment. Designed for scalability and...

See Software

Report inappropriate content

LLM Scraper

Extract structured data from webpages using LLM-powered scraping

Get an email when there's a new version of LLM Scraper

Features

Project Samples

Project Activity

Categories

License

Follow LLM Scraper

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered