web capture free download

Showing 26 open source projects for "web capture"

1

Live API Web Console

A React-based starter app for using the Live API over WebSockets

Live API Web Console is a React starter that demonstrates how to use Gemini’s Live API over WebSockets to build real-time, multimodal experiences. The app includes modules for streaming audio playback, recording user media from the microphone, webcam, or even screen capture, and it surfaces a unified event log so you can debug the session as it flows.

Downloads: 0 This Week

Last Update: 2025-10-14
See Project
2

MCP Server Playwright

MCP server for browser automation using Playwright

An MCP (Model Context Protocol) server that leverages Playwright to provide browser automation capabilities, enabling large language models (LLMs) to interact with web pages, take screenshots, and execute JavaScript within a real browser environment.

Downloads: 3 This Week

Last Update: 2025-04-07
See Project
3

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand

...The system supports multiple AI models and integrates seamlessly into agent workflows, making it suitable for applications such as web scraping, testing, and intelligent automation. It also includes advanced capabilities such as screenshot capture, DOM analysis, and session persistence, enabling complex interactions across multiple browsing sessions.

Downloads: 1 This Week

Last Update: 2026-03-31
See Project
4

BrowserTools MCP

Monitor browser logs directly from Cursor

Browser Tools MCP is an MCP server and Chrome extension that gives AI agents safe, structured access to your live browser for debugging and automation. It can capture console/network logs, DOM snapshots, and screenshots, and expose them as typed resources the agent can query or act on. The design aims to make IDE agents (e.g., Cursor, Claude Desktop) more “web-aware,” enabling workflows like reproducing a bug, collecting evidence, and proposing fixes without copy-pasting. Documentation and community guides outline a quick setup, including the extension, the MCP server process, and common troubleshooting steps. ...

Downloads: 3 This Week

Last Update: 2025-10-08
See Project
5

Browser MCP

Browser MCP is a Model Context Protocol (MCP) server

...By adapting a Playwright-style approach to control the running browser profile, it reuses logged-in sessions and cookies, which reduces re-authentication friction and helps avoid some bot-detection heuristics. The server exposes structured tools for navigation, element interaction, and artifact capture (DOM, screenshots, logs), all discoverable via MCP schemas. Because it runs against the user’s primary browser, it’s well-suited to repetitive web tasks, authenticated dashboards, and debugging workflows inside MCP-capable IDEs. A public website and extension streamline installation and connect the local server to clients like Claude, Cursor, VS Code, and Windsurf. ...

Downloads: 1 This Week

Last Update: 2025-10-08
See Project
6

clone-voice

A sound cloning tool with a web interface, using your voice

...It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control cloning and synthesis. It does not require an NVIDIA GPU to run basic tasks, although GPU acceleration can be used when available, making it accessible on modest machines. The tool supports around sixteen languages, including Chinese, English, Japanese, Korean, French, German, Italian, and others, and can capture reference voices directly from a microphone or from uploaded audio.

Downloads: 6 This Week

Last Update: 2025-11-28
See Project
7

comfyui-mixlab-nodes

Workflow and speech recognition app

comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. ...

Downloads: 9 This Week

Last Update: 2025-11-28
See Project
8

Claude-Mem

Claude Code plugin that automatically captures everything Claude does

Claude-Mem is a persistent memory compression system built specifically for Claude Code to preserve context across coding sessions. It automatically captures Claude’s tool usage, observations, and decisions, then compresses them into semantic memories that carry forward into future sessions. By enabling long-term continuity, Claude-Mem helps Claude “remember” project history, past fixes, and prior reasoning even after restarts or reconnects. Its progressive disclosure approach intelligently...

Downloads: 16 This Week

Last Update: 22 hours ago
See Project
9

Deep Chat

Customizable AI chat component for websites with API support

Deep Chat is a highly customizable web component designed to simplify the integration of AI-powered chat interfaces into websites. It allows developers to embed a fully functional chatbot using minimal setup, while still offering extensive control over behavior, appearance, and integrations. Deep Chat supports connections to a wide range of AI services as well as custom backends, enabling flexible deployment for different use cases. It is built as a framework-agnostic solution, meaning it...

Downloads: 6 This Week

Last Update: 2026-03-18
See Project
10

Rocketnotes

AI-powered markdown editor - leverage LLMs with your documents

RocketNotes is an open-source note-taking application designed to combine traditional knowledge management with artificial intelligence features that enhance how users capture and organize information. The project focuses on providing a fast, lightweight environment where users can create structured notes, manage personal knowledge bases, and interact with AI tools to summarize or expand their content. Instead of functioning purely as a document editor, RocketNotes integrates AI capabilities...

Downloads: 5 This Week

Last Update: 2026-03-09
See Project
11

Pal

A personal context-agent that learns how you work

Pal is an open-source AI personal agent built within the Agno ecosystem that functions as an intelligent digital assistant designed to learn from user activity over time. The system acts as an AI-powered “second brain” capable of capturing, organizing, and retrieving personal knowledge such as notes, bookmarks, research findings, people, and meeting information. Instead of acting as a simple chatbot, Pal continuously builds a structured database of a user’s knowledge and context so it can...

Downloads: 0 This Week

Last Update: 2026-04-03
See Project
12

Integuru v0

The first AI agent that builds permissionless integrations

Integuru is an open-source AI agent designed to automatically create integrations between software platforms by reverse-engineering their internal APIs. Instead of relying on official developer documentation or publicly available APIs, the system analyzes network traffic generated by user interactions within a web application. Developers capture browser requests and authentication data, which the agent then uses to infer the structure of the platform’s internal API endpoints. Based on this information, the system generates executable code that can replicate the original action programmatically. This approach allows developers to automate workflows and build integrations with services that do not provide official APIs or developer tools. ...
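The capture-then-infer idea described above can be illustrated with a small sketch. This is not Integuru's implementation: it only assumes the captured browser traffic is available as a HAR-style dictionary, and the function name `summarize_endpoints` and the sample host `app.example.com` are hypothetical. It groups requests by method and path and records which query parameters and authentication headers were seen.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlsplit


def summarize_endpoints(har):
    """Group captured requests by (method, host + path).

    For each group, record how often it was called, which query
    parameter names appeared, and which auth-related headers were sent.
    """
    endpoints = defaultdict(
        lambda: {"count": 0, "query_keys": set(), "auth_headers": set()}
    )
    for entry in har["log"]["entries"]:
        request = entry["request"]
        parts = urlsplit(request["url"])
        key = (request["method"].upper(), parts.netloc + parts.path)
        info = endpoints[key]
        info["count"] += 1
        # Collect only parameter names; values vary from call to call.
        info["query_keys"].update(
            name for name, _ in parse_qsl(parts.query, keep_blank_values=True)
        )
        for header in request.get("headers", []):
            if header["name"].lower() in ("authorization", "cookie"):
                info["auth_headers"].add(header["name"].lower())
    return dict(endpoints)


# Hypothetical capture with three requests against an internal API.
sample_har = {
    "log": {
        "entries": [
            {"request": {
                "method": "GET",
                "url": "https://app.example.com/api/v1/invoices?page=1",
                "headers": [{"name": "Cookie", "value": "session=abc"}],
            }},
            {"request": {
                "method": "GET",
                "url": "https://app.example.com/api/v1/invoices?page=2&sort=date",
                "headers": [{"name": "Cookie", "value": "session=abc"}],
            }},
            {"request": {
                "method": "POST",
                "url": "https://app.example.com/api/v1/invoices",
                "headers": [{"name": "Authorization", "value": "Bearer token"}],
            }},
        ]
    }
}

summary = summarize_endpoints(sample_har)
```

A summary like this is only the first step; generating code that replays the action would additionally need request bodies and the dependencies between calls.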

Downloads: 0 This Week

Last Update: 4 days ago
See Project
13

Large Concept Model

Language modeling in a sentence representation space

Large Concept Model is a research codebase centered on concept-centric representation learning at scale, aiming to capture shared structure across many categories and modalities. It organizes training around concepts (rather than just raw labels), encouraging models to understand attributes, relations, and compositional structure that transfer across tasks. The repository provides training loops, data tooling, and evaluation routines to learn and probe these concept embeddings, typically...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
14

Automated Interpretability

Code for the "Language models can explain neurons in language models" paper

The automated-interpretability repository implements tools and pipelines for automatically generating, simulating, and scoring explanations of neuron (or latent feature) behavior in neural networks. Instead of relying purely on manual, ad hoc interpretability probing, this repo aims to scale interpretability by using algorithmic methods that produce candidate explanations and assess their quality. It includes a “neuron explainer” component that, given a target neuron or latent feature,...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
15

Ainee

Ainee - AI Notetaking and Learning Companion

Ainee is your ultimate AI-powered notetaking and learning companion. Capture lecture notes in real-time and effortlessly transform audio, text, files, and YouTube videos into formatted notes, mindmaps, quizzes, flashcards, podcasts, and more. Explore our AI meeting note taker, AI notes, video transcript generator, PDF to AI converter, and AI flashcard maker. Enhance your learning with our AI voice recorder, article summarizer AI, and AI quiz generator. Additionally, share your knowledge...

1 Review

Downloads: 0 This Week

Last Update: 2025-05-23
See Project
16

bitfarm-Archiv Document Management - DMS

bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best way you can help is to write a short statement about your scenario and your use of the DMS, along with your experiences, and put it on your own website or in a blog or forum. It would help us most if you could also add a hyperlink to our site http://www.bitfarm-archiv.com. By this...

11 Reviews

Downloads: 15 This Week

Last Update: 2 days ago
See Project
17

Knowledge + Chat

Knowledge is a tool for saving, searching, accessing, and chatting

...A built-in Chromium-based browser enables users to capture and analyze web content directly within the application environment.

Downloads: 0 This Week

Last Update: 2026-03-07
See Project
18

Dissapearing-People

Removing people from complex backgrounds in real time

Person removal from complex backgrounds over time. Removing people from complex backgrounds in real-time using TensorFlow.js in the web browser using JavaScript. This code attempts to learn over time the makeup of the background of a video such that I can attempt to remove any humans from the scene. This is all happening in real-time, in the browser, using TensorFlow.js. This is an experiment. It may not be perfect in all situations. Go ahead and try it right now in your own web browser....
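The idea of learning the background over time can be sketched in a few lines. The project itself runs in the browser with TensorFlow.js; the Python below is only an illustration that assumes a per-pixel person mask is already available from some segmentation model. The function name `update_background` and the tiny one-row grayscale frames are hypothetical.

```python
def update_background(background, frame, person_mask):
    """Return a new background estimate.

    Pixels covered by a person keep their previous background value;
    all other pixels are refreshed from the current frame.
    """
    return [
        [bg if is_person else px
         for bg, px, is_person in zip(bg_row, frame_row, mask_row)]
        for bg_row, frame_row, mask_row in zip(background, frame, person_mask)
    ]


# Hypothetical 1x4 grayscale video: the true background is 10, 20, 30, 40
# and a "person" (value 255) moves from the first pixel to the second.
frames = [
    [[255, 20, 30, 40]],
    [[10, 255, 30, 40]],
]
masks = [
    [[True, False, False, False]],
    [[False, True, False, False]],
]

# Start from the first frame, then refine as the person moves away.
background = frames[0]
for frame, mask in zip(frames, masks):
    background = update_background(background, frame, mask)
```

Once the person has moved off a pixel at least once, that pixel holds a clean background value, which is why results improve the longer the video runs.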

Downloads: 5 This Week

Last Update: 2021-11-22
See Project
19

OCR Web based

Web-based OCR for the Firefox browser and PC

Optical Character Recognition in JS for Browser is based on ocrad.js. OCR for Browser is a free extension that you can use to extract text from any image you supply. Just upload your image files. OCR for Browser accepts JPG, GIF, TIFF, BMP, or PNG images. ========= Get OCR for Android (Beta release) - https://play.google.com/store/apps/details?id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1 For safety reasons, I disabled...

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05
See Project
20

FormRead

Free OMR - OCR web software based on JavaScript and PHP

https://formread.org FormRead is a completely free OMR (optical mark recognition) web software for scanning and grading user-filled, multiple choice forms. Create your formats with any of your office or drawing tools, scan them and parameterize their coordinates in an easy way. Once you have parameterized your form, you can print many copies, give them to your students/respondents, scan and recognize them with FormRead, and finally export the data in your preferred formats...

Downloads: 9 This Week

Last Update: 2022-03-04
See Project
21

Vision2u

Free image processing software

Vision2u offers free image processing software for personal use and research. Basic image processing tasks can be carried out through simple operation of the software. Any webcam owner can perform simple measuring, counting, or monitoring tasks without high capital outlay.

Downloads: 0 This Week

Last Update: 2015-05-01
See Project
22

lavalamp>3

Screen colors get changed by a neural net

The colors of the full screen or a frame are changed dynamically by an image recognition neural net.

Downloads: 0 This Week

Last Update: 2014-05-08
See Project
23

phpSANE

Web-Based Frontend for SANE

phpSANE is a web-based frontend for SANE written in HTML/PHP so you can scan with your web-browser. It also supports OCR.

13 Reviews

Downloads: 3 This Week

Last Update: 2013-10-24
See Project
24

Fish4Knowledge Project

Analysis of undersea fish videos

...A combination of computer vision, database storage, workflow and human-computer interaction methods was used to achieve this. The project used live video feeds from 10 underwater cameras as a testbed for investigating more generally applicable methods for capture, storage, analysis and querying of multiple video streams. We collated a public database from 3 years containing video summaries of the observed fish and associated descriptors. Expert web-based interfaces were developed for use by marine researchers, allowing unprecedented access to live and previously stored videos, or previously extracted information.

Downloads: 0 This Week

Last Update: 2016-11-29
See Project
25

opendias

NB: openDIAS is moving away from SF.net. Please visit the homepage link for the most up-to-date information, support and files. Document Imaging Archive System. Home document imaging, with OCR. Scan documents (with SANE) or import ODF documents, and assign tags. Use openDIAS to store all your letters, bills, statements, etc. in a convenient, safe and easily retrievable way.

Downloads: 0 This Week

Last Update: 2013-04-22
See Project