54 projects for "webkit-browser" with 2 filters applied:

  • 1
    Browser MCP

    Browser MCP is a Model Context Protocol (MCP) server

    Browser MCP is an MCP server plus browser integration that lets AI apps automate the user’s real browser instead of launching a separate automation session. By adapting a Playwright-style approach to control the running browser profile, it reuses logged-in sessions and cookies, which reduces re-authentication friction and helps avoid some bot-detection heuristics.
    Downloads: 0 This Week
  • 2
    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    ...Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. Although the web demo is free to explore, production use of the underlying API requires an OpenAI API key and may incur costs based on usage.
    Downloads: 418 This Week
  • 3
    Secret Llama

    Fully private LLM chatbot that runs entirely within a browser

    Secret Llama is a privacy-first large-language-model chatbot that runs entirely inside your web browser, meaning no server is required and your conversation data never leaves your device. It focuses on open-source model support, letting you load families like Llama and Mistral directly in the client for fully local inference. Because everything happens in-browser, it can work offline once models are cached, which is helpful for air-gapped environments or travel.
    Downloads: 1 This Week
  • 4
    Scribe.js

    JavaScript OCR and text extraction for images and PDFs

    ...In addition to simple text extraction, Scribe.js supports writing or injecting a high-quality invisible text layer back into PDFs, effectively making them searchable and improving usability for indexing or accessibility. It is written in modern ECMAScript Modules (ESM), so it can be imported in both browser and Node.js environments without a build step, though browser usage requires same-origin hosting of the files.
    Downloads: 4 This Week
  • 5
    BrowserAI

    Run local LLMs like llama, deepseek, kokoro etc. inside your browser

    BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines.
    Downloads: 1 This Week
  • 6
    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    ...At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. The project includes components like a dedicated backend server for encrypted sync, a rich front-end experience across web and native apps, and support for push notifications when your coding agent encounters permission requests or errors. ...
    Downloads: 19 This Week
  • 7
    Dafthunk

    A workflow execution platform built on top of the fantastic Cloudflare

    Dafthunk is a browser-first visual workflow automation platform that lets you build, run, and monitor workflows without standing up a traditional server stack. It’s designed around durable, multi-step execution so workflows can keep going (and recover) even when individual steps fail, time out, or need retries. The platform is closely aligned with Cloudflare’s ecosystem, using edge-native building blocks for execution, orchestration, and storage so workflows can run near users with low latency. ...
    Downloads: 3 This Week
  • 8
    Reader 3

    Quick illustration of how one can easily read books together with LLMs

    ...While it lacks advanced features like built-in annotations or rich media support, its simplicity is intentional, enabling users to quickly load EPUBs, view them in a browser, and even repurpose text for downstream tasks.
    Downloads: 2 This Week
  • 9
    PasteGuard

    Masks sensitive data and secrets before they reach AI

    ...PasteGuard supports two primary modes: mask mode, which anonymizes data and still uses external APIs; and route mode, which forwards sensitive requests to a local LLM inference engine while sending the rest to the cloud. It can be self-hosted via Docker, works with a wide range of SDKs and tools, and includes a browser extension for automatic protection in everyday AI chats.
    Downloads: 2 This Week
  • 10
    FlowLens MCP

    Open-source MCP server that gives your coding agent

    FlowLens MCP Server is an open-source tool designed to give AI-powered coding agents (like Claude Code, Cursor, GitHub Copilot / Codex, and others) full, replayable browser context to dramatically improve debugging, bug reporting, and regression testing for web applications. It works together with a companion browser extension: when a user reproduces a bug or a complicated UI interaction, the extension captures a rich session log, including screen/video recording, network traffic, console logs, DOM events, storage changes, and more, and exports it. ...
    Downloads: 2 This Week
  • 11
    xgplayer

    An HTML5 video player with a parser that saves traffic

    ...It abstracts many of the lower-level complexities of HTML5 media, providing a consistent API for playback control, custom UI overlays, adaptive streaming, plugin hooks, and cross-browser compatibility. Because of its emphasis on modularity and extensibility, xgplayer can be embedded into modern web projects and customized — developers can add controls, custom buffering strategies, subtitle handling, adaptive bitrate streaming, or integrate with other web-based video infrastructures. It seeks to provide a smooth, stable viewing experience even on varied devices or network conditions, and is particularly appealing for web apps that need more control than vanilla video tags offer.
    Downloads: 1 This Week
  • 12
    OpenManus

    No fortress, purely open ground. OpenManus is Coming

    OpenManus is an open‑agent AI framework focused on building versatile general-purpose agents capable of autonomously executing complex workflows — such as planning, browsing, tool invocation — all via a pluggable prompts and tools interface. It's being extended with reinforcement learning‑based tuning modules and designed for researchers and developers building custom AI agents.
    Downloads: 7 This Week
  • 13
    NextJS Ollama LLM UI

    Fully-featured web interface for Ollama LLMs

    ...Its goal is to remove the complexity of setting up and managing UI components for local or offline LLM usage by providing a straightforward chat experience with support for responsive layouts, light and dark themes, and local chat history storage in the browser. The interface stores conversations in local storage, so no separate backend database is required, making it ideal for hobbyists, experimenters, and developers who want a simple, web-accessible portal to their models. It includes usability enhancements like code syntax highlighting and easy code block copying, plus basic controls to download and manage models directly from the web UI.
    Downloads: 0 This Week
  • 14
    Supertonic

    Lightning-fast, on-device TTS, running natively via ONNX

    ...Supertonic is designed to handle real-world text gracefully, including numbers, dates, currency symbols, abbreviations, and technical units, without requiring heavy pre-processing or custom text normalization. The repository provides complete reference implementations across many programming ecosystems—Python, Node.js, browser (WebGPU/WASM), Java, C++, C#, Go, Swift, iOS, Rust, and Flutter.
    Downloads: 1 This Week
  • 15
    Llama Coder

    Open source Claude Artifacts – built with Llama 3.1 405B

    ...It’s framed as an open-source “Claude Artifacts”-style experience: you describe the app you want, the tool calls an LLM hosted on Together.ai, and you get back a runnable code artifact. The project includes a web interface where you can enter prompts, see generated code, and run or tweak the result directly in the browser. Technically, it is built using a modern TypeScript/Next.js stack and integrates with Together’s API, making it a good blueprint for building your own AI-powered developer tools. By focusing on small self-contained apps or components, it keeps scope manageable while still showcasing the power of code generation. Developers can fork the repo to plug in different models, change the UI, or integrate it into their own IDE-adjacent workflows.
    Downloads: 16 This Week
  • 16
    WhisperLive

    A nearly-live implementation of OpenAI's Whisper

    WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...
    Downloads: 9 This Week
  • 17
    OpenAI Realtime Agents

    This is a simple demonstration of more advanced, agentic patterns

    ...The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. The demo includes a Next.js frontend for browser interaction and likely a backend component to orchestrate realtime sessions and agent logic. It also supports a “Chat-Supervisor” pattern where a lightweight realtime chat agent handles user interactions and delegates more complex reasoning or tool usage to a stronger textual model (e.g. GPT-4). Because realtime agents are still a beta feature, the code and API surface are subject to change.
    Downloads: 0 This Week
  • 18
    Groq TypeScript / Node.js

    The official Node.js / TypeScript library for the Groq API

    ...The library also supports passing different input types (file streams, blobs, fetch responses) for media-related endpoints, making it flexible for diverse environments (backend, browser, serverless). With this SDK, developers can call Groq’s models, transcribe audio, perform file uploads — all with minimal boilerplate — which streamlines creation of AI-enabled applications in the JavaScript/TypeScript ecosystem.
    Downloads: 0 This Week
  • 19
    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common formats like MP3 or WAV. ...
    Downloads: 10 This Week
  • 20
    Speech-AI-Forge

    Speech-AI-Forge is a project developed around TTS generation model

    Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it...
    Downloads: 7 This Week
  • 21
    Actionbook

    Browser action engine for AI agents. 10× faster, resilient by design

    Actionbook is an AI-centric automation framework that equips intelligent agents with the ability to interact with live web pages in a reliable and scalable way, eliminating the guesswork involved in navigating modern dynamic sites. Instead of having agents blindly scrape HTML or guess where to click, Actionbook supplies up-to-date action manuals and verified DOM structure, letting agents know exactly how to click, type, and navigate complex interfaces such as SPAs or streaming...
    Downloads: 4 This Week
  • 22
    agentation

    The visual feedback tool for agents

    Agentation is a visual annotation and feedback tool designed to make interacting with AI coding agents more intuitive and precise by letting developers visually click on frontend elements in a browser and annotate them with context before sending structured feedback to an agent. Instead of describing UI elements in text — like “the blue button in the sidebar” — users click directly on elements to automatically capture selectors, positions, and contextual metadata that can be consumed by AI agents to locate exact code references. ...
    Downloads: 2 This Week
  • 23
    GitHub Copilot SDK

    Multi-platform SDK for integrating GitHub Copilot Agent into apps

    ...Instead of being limited to editors like VS Code, this SDK lets teams embed Copilot-style code suggestions, natural language assistance, and predictive completions anywhere they see fit—such as internal IDEs, browser extensions, documentation portals, or bespoke tools tailored to specific languages or frameworks. It provides a structured API surface for invoking the Copilot model in context with the surrounding user state, capturing document content, cursor position, and invocation triggers so suggestions are relevant and responsive. The SDK includes helpers for streaming completions, managing rate limits, handling authentication, and integrating with telemetry and analytics pipelines.
    Downloads: 2 This Week
  • 24
    rep+

    Burp-style HTTP Repeater for Chrome DevTools with built‑in AI

    rep+ is a lightweight browser extension for Chrome DevTools that brings a Burp Suite-style HTTP repeater directly into the developer console, enhanced with built-in AI to help explain requests and suggest tests. It captures HTTP traffic from the inspected page without needing a proxy, allowing users to replay, modify, and analyze individual requests with fine-grained control over headers, bodies, and methods.
    Downloads: 1 This Week
  • 25
    Grok CLI

    An open-source AI agent that brings the power of Grok

    Grok CLI is a command-line interface built around the Grok AI model that brings programmatic and conversational AI capabilities directly to developer terminals. It lets you run Grok queries from your shell, scripting environment, or automation workflows without switching to a browser, enabling utility in scripting, quick data exploration, code generation, and assistant-guided tasks directly where you write code. The CLI supports streaming responses, so outputs appear in real time as the Grok model generates them, making interactions feel responsive and fluid in terminal contexts. Grok CLI is designed to integrate with existing terminal habits—aliases, pipes, editors, and tooling—so you can combine AI assistance with native command-line workflows like grep, awk, and git. ...
    Downloads: 2 This Week