44 projects for "page" with 2 filters applied:

  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    The Hundred-Page Machine Learning Book

    The Hundred-Page Machine Learning Book

    The Python code to reproduce illustrations from Machine Learning Book

    The Hundred-Page Machine Learning Book is the official companion repository for The Hundred-Page Machine Learning Book written by machine learning researcher Andriy Burkov. The repository contains Python code used to generate the figures, visualizations, and illustrative examples presented in the book. Its purpose is to help readers better understand the concepts explained in the text by allowing them to run and experiment with the underlying code themselves.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    KeepChatGPT

    KeepChatGPT

    Browser userscript that enhances ChatGPT reliability and usability

    ...KeepChatGPT also introduces a variety of enhancements that improve the overall interface and user experience, including page cleanup, expanded display layouts, conversation cloning, and detailed chat information.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    MemFree

    MemFree

    Hybrid AI Search Engine & AI Page Generator

    ...Beyond search, memfree includes an AI Page Generator that can transform text or images into production-ready frontend code using modern web stacks. The platform emphasizes productivity by automatically organizing information and reducing the need for manual bookmarking or note management. Overall, memfree positions itself as an all-in-one knowledge assistant and rapid UI generation tool for developers and power users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    canvas-editor

    canvas-editor

    Canvas-based WYSIWYG rich text editor with advanced layout tools

    ...It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. Its architecture is modular, allowing developers to extend functionality through plugins, custom commands, and event hooks. It includes support for page-based layouts with headers, footers, pagination, and print-ready output, including PDF generation. It also provides interactive components such as form controls and context menus, making it suitable for building complex document editing systems.
    Downloads: 12 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    BruteForceAI

    BruteForceAI

    Advanced LLM-powered brute-force tool combining AI intelligence

    ...The repository emphasizes features such as threaded execution, logging, and notification integrations, which position it as an automation-oriented project for controlled security assessment environments. From a software design perspective, its distinguishing idea is the use of language models as a front-end analysis layer that interprets a target page before the rest of the workflow proceeds.
    Downloads: 107 This Week
    Last Update:
    See Project
  • 6
    Puck

    Puck

    Open source visual editor for building React drag-and-drop pages

    Puck is an open source visual editor designed for React applications that enables developers to build customizable drag-and-drop page editing experiences. It allows teams to create their own page builders by defining React components that can be arranged and configured through a visual interface. Puck is component-based and configuration-driven, meaning developers specify how components render and which editable fields control their properties. Puck integrates directly into existing React environments and works well with frameworks such as Next.js, enabling developers to embed editing capabilities directly inside their applications. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Gemma 4 Browser Assistant

    Gemma 4 Browser Assistant

    On-device AI agent Chrome extension powered by Transformers.js

    ...The extension includes a side panel interface that allows users to interact with the AI while browsing, enabling tasks such as summarizing pages and answering questions. It can access and analyze page content, browsing history, and tab state to provide contextual assistance. The architecture follows modern browser extension standards, with separate components for background processing, content scripts, and UI rendering. It also supports tool-calling capabilities, allowing the AI to perform actions such as navigating tabs or highlighting elements. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    GLM-4.6V

    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    ...Its architecture supports a very large context window (on the order of 128K tokens during training), which lets it handle complex multimodal inputs like long documents, multi-page reports, or video transcripts, while maintaining coherence across extended content. In benchmarks and internal evaluations, GLM-4.6V achieves state-of-the-art (SoTA) performance among models of comparable parameter scale on multimodal reasoning.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Index

    Index

    The SOTA Open-Source Browser Agent

    ...The system enables developers to instruct an AI agent to interact with web pages using natural language rather than traditional automation scripts. Instead of writing detailed browser automation code, users can describe the desired task and allow the agent to interpret the page structure, interact with elements, and complete multi-step workflows automatically. The project is built to integrate easily with applications through a simple programming interface, allowing developers to embed browser automation capabilities directly into their software systems. Index can perform tasks such as navigating pages, filling forms, collecting data, and analyzing web content without requiring manual scripting for each website.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    Guizang Social Card Skill

    Guizang Social Card Skill

    Claude Code / Codex skill — generate Xiaohongshu carousels

    ...The skill turns articles, scripts, screenshots, product notes, subtitles, or photos into structured social card outputs. It supports editorial magazine layouts and Swiss-style visual systems, giving agents rules for typography, spacing, covers, and page rhythm. The project is self-contained and meant to be used by AI coding or agent tools as a repeatable creative workflow. It is useful for creators, marketers, designers, and content teams producing platform-ready social visuals.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    HivisionIDPhoto

    HivisionIDPhoto

    HivisionIDPhotos: a lightweight and efficient AI ID photos tools

    ...The software analyzes portrait images, performs background removal, aligns the face according to ID photo standards, and produces images in various official size formats. It also allows the generation of layout sheets such as six-inch photo arrangements for printing multiple ID photos on a single page. The project focuses on building a practical pipeline for automated ID photo production using AI-based segmentation and image processing techniques.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    effective-html

    effective-html

    Agent skill for elegant and simple html plans, architecture diagrams

    effective-html is an agent skill package for generating polished, self-contained HTML visual artifacts. It focuses on pragmatic outputs such as plans, architecture diagrams, system diagrams, and general-purpose HTML pages. The repository includes separate skills for standard HTML artifacts, full-screen HTML diagrams, and HTML plan pages. It is designed to work with agent tooling such as Skills, Claude Code plugins, and Codex plugins. The skill emphasizes strong visual structure, high-quality...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    PowerPoint-ist

    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. PPTist supports a wide...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    rep+

    rep+

    Burp-style HTTP Repeater for Chrome DevTools with built‑in AI

    rep+ is a lightweight browser extension for Chrome DevTools that brings a Burp Suite-style HTTP repeater directly into the developer console, enhanced with built-in AI to help explain requests and suggest tests. It captures HTTP traffic from the inspected page without needing a proxy, allowing users to replay, modify, and analyze individual requests with fine-grained control over headers, bodies, and methods. The tool offers hierarchical grouping, tagging, and filtering of captured requests so that developers and security testers can manage complex traffic flows efficiently. It includes AI-assisted insights, where contextual explanations and attack vector suggestions help interpret request outcomes or propose modifications. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Depth Pro

    Depth Pro

    Sharp Monocular Metric Depth in Less Than a Second

    ...Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. The repo and research page emphasize boundary fidelity and crisp geometry, addressing a common weakness in monocular depth where edges can blur. Community integrations (e.g., inference wrappers and UI nodes) have sprung up around the model, reflecting practical interest in video, AR, and generative pipelines. As a general-purpose monocular depth backbone, Depth Pro slots into 3D reconstruction, relighting, and scene understanding workflows that benefit from metric predictions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Ralph Wiggum Marketer

    Ralph Wiggum Marketer

    A Claude Code Plugin that provides an autonomous AI copywriter

    Ralph Wiggum Marketer is a Claude Code plugin that serves as an autonomous AI copywriter tailored for SaaS content marketing, enabling automated generation of marketing copy such as landing pages, taglines, feature summaries, and promotional messaging. It leverages the Ralph Wiggum loop concept — a continuous iteration pattern named after the iconic character that symbolizes persistent, repeated refinement — to let Claude Code keep iterating on content until predefined completion criteria...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services. It is built for developers who need high-quality parsing from scans, photos, PDFs, office files,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    PokeeResearch-7B

    PokeeResearch-7B

    Pokee Deep Research Model Open Source Repo

    PokeeResearchOSS provides an open-source, agentic “deep research” model centered on a 7B backbone that can browse, read, and synthesize current information from the web. Instead of relying only on static training data, the agent performs searches, visits pages, and extracts evidence before forming answers to complex queries. It is built to operate end-to-end: planning a research strategy, gathering sources, reasoning over conflicting claims, and writing a grounded response. The repository...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Pinchtab

    Pinchtab

    High-performance browser automation bridge and orchestrator

    Pinchtab is a lightweight browser automation backend built specifically for AI agents that need efficient, programmatic web control. Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. It launches...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    trench

    trench

    Open-Source Analytics Infrastructure

    ...It was originally developed to solve scaling challenges in product analytics systems where traditional relational databases become inefficient as event tables grow. The platform enables developers to collect events such as page views, user actions, and behavioral metrics while storing them in a column-oriented analytics database optimized for time-series workloads. By combining streaming ingestion with fast analytical queries, the system supports use cases such as product analytics dashboards, observability pipelines, and machine learning data preparation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Reader LLM

    Reader LLM

    Convert any URL to an LLM-friendly input with a simple prefix

    ...The system works by transforming a webpage into a clean text or Markdown representation that removes unnecessary formatting and highlights the core information within the page. Developers can use a simple URL prefix to retrieve a version of a webpage that has been optimized for machine consumption, making it suitable for use in AI agents or retrieval-augmented generation pipelines. In addition to converting individual pages, the service can perform web searches and return relevant content that can be ingested directly by AI systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    StreamSpeech

    StreamSpeech

    StreamSpeech is a seamless model for offline speech recognition

    StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    NativeMind Extension

    NativeMind Extension

    Your fully private, open-source, on-device AI assistant

    NativeMindExtension is an open-source browser extension that provides a private, on-device AI assistant designed to run without cloud dependencies. The project is built around a privacy-first model in which conversations, document analysis, translations, and writing assistance stay on the user’s device rather than being sent to external servers. It integrates with local model back ends such as Ollama and also supports WebLLM for quick in-browser trials, giving users a choice between stronger...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Actionbook

    Actionbook

    Browser action engine for AI agents. 10× faster, resilient by design

    ...Instead of having agents blindly scrape HTML or blindly try to click things, Actionbook supplies up-to-date action manuals and verified DOM structure, letting agents know exactly how to click, type, and navigate complex interfaces such as SPAs or streaming UIs. This design makes browsing up to 10× faster and far more resilient than ad-hoc approaches that break on minor page changes, because the action manuals codify expected flows and DOM targets. It provides multiple integration paths — a Rust-based CLI, MCP server support for AI IDEs, and a JavaScript SDK — so developers can plug it into a wide range of agent pipelines and toolchains.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Groq AppGen

    Groq AppGen

    Project showcasing Llama 3.3 70B HTML codegen abilities

    Groq AppGen is an interactive web application (built with Next.js and TypeScript) that uses Groq’s LLM API to generate or modify web application code based on natural-language prompts. Essentially, you tell the app what kind of web app or page you want (in plain English), and groq-appgen will produce HTML/JSX code scaffolding, layout, and optionally application logic accordingly. It supports iterative feedback: you can refine your prompt, adjust parameters or requirements, and have the app regenerate or update the code — facilitating rapid prototyping and experimentation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo