Showing 87 open source projects for "show%20mouse"

View related business solutions
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    deepface

    deepface

    A Lightweight Face Recognition and Facial Attribute Analysis

    ...It is a hybrid face recognition framework wrapping state-of-the-art models: VGG-Face, FaceNet, OpenFace, DeepFace, DeepID, ArcFace, Dlib, SFace and GhostFaceNet. Experiments show that human beings have 97.53% accuracy on facial recognition tasks whereas those models already reached and passed that accuracy level.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 2
    AskUI Vision Agent

    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    ...It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. The broader AskUI documentation covers the Python Vision Agent along with suite services and inference APIs, indicating a productized ecosystem rather than a single library. Community-curated lists also recognize Vision Agent as part of the broader “GUI agents” landscape, placing it among other computer-use agents.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    MiniMax-M2.1

    MiniMax-M2.1

    MiniMax M2.1, a SOTA model for real-world dev & agents.

    ...MiniMax-M2.1 excels in real-world software engineering tasks, including multilingual development and complex workflow automation. It demonstrates strong generalization across agent frameworks and consistently improves upon its predecessor, MiniMax-M2. Benchmarks show that it rivals or approaches top proprietary models while remaining fully open for local deployment and customization.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    MCP Proxy

    MCP Proxy

    A TypeScript SSE proxy for MCP servers that use stdio transport

    ...The tool can multiplex multiple named STDIO servers behind one proxy instance, simplifying fleet deployments or local development with many tools. It ships prebuilt artifacts and a Homebrew formula for quick install on macOS and Linux, with container images published for broader environments. Releases show steady improvements focused on developer experience and operational flexibility. Overall, it lowers the friction of composing diverse MCP tools into a single reachable endpoint.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    EvoBot

    EvoBot

    EvoBot is a Discord Music Bot built with TypeScript + Discord.js

    EvoBot is a Discord Music Bot built with TypeScript + Discord.js, includes Docker builds & localization in 20+ languages. After installation finishes follow the configuration instructions then run npm run start to start the bot. For those who would prefer to use our Docker container, you may provide values from config.json as environment variables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    JMusicBot

    JMusicBot

    A Discord music bot that's easy to set up and run yourself

    ...Playlist support (both web/youtube and local) This bot (and the source code here) might not be easy to edit for inexperienced programmers. The main purpose of having the source public is to show the capabilities of the libraries, to allow others to understand how the bot works, and to allow those knowledgeable about java, JDA, and Discord bot development to contribute.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    pyttsx3

    pyttsx3

    Offline Text To Speech synthesis for python

    ...It supports both a high-level speak convenience function and a lower-level engine object with event hooks, queuing, and saving output to audio files. The repository includes examples and documentation that show how to adjust properties dynamically, persist synthesized output, and integrate pyttsx3 into GUIs or background services.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Orpheus TTS

    Orpheus TTS

    Towards Human-Sounding Speech

    ...The project ships both pretrained and finetuned English models, as well as a family of multilingual models released as a research preview, and includes data-processing scripts so users can train or finetune their own variants. Inference is provided through a Python package that uses vLLM under the hood for high-throughput, low-latency generation, including streaming examples that show how to generate audio chunks in real time. The maintainers provide Colab notebooks, a standardized prompting format, and one-click deployment via Baseten for production-grade, FP8/FP16 optimized inference with ~200 ms streaming latency.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    windsurf.vim

    windsurf.vim

    Free, ultrafast Copilot alternative for Vim and Neovim

    windsurf.vim is a plugin for Vim and Neovim by Exafunction (formerly part of the Codeium project) that brings in AI-driven code completion and assistance capabilities. The aim is to provide a “free, ultrafast” alternative to other AI code assistants (such as GitHub Copilot) directly within Vim/Neovim. Once installed and configured, windsurf.vim can suggest code completions, generate multi-line snippets based on comments or invitation in code, and make the editing experience more predictive...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Total Network Visibility for Network Engineers and IT Managers Icon
    Total Network Visibility for Network Engineers and IT Managers

    Network monitoring and troubleshooting is hard. TotalView makes it easy.

    This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
    Learn More
  • 10
    Live Agent Studio

    Live Agent Studio

    Open source AI Agents hosted on the oTTomator Live Agent Studio

    ...Each agent in the collection is designed for a specific use case — such as content summarization, task automation, travel planning, or RAG workflows — and is provided with the code or configuration needed to explore and extend it on your own, making the repository both a learning resource and a practical starting point for real projects. The repository is community focused, with sample agents like tweet generators, smart selectors, research assistants, and multi-tool workflows that show how agents can integrate with tools like n8n or custom Python code. Because it’s tied to the broader Live Agent Studio ecosystem, users can experiment with deploying and using these agents in a hosted environment.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Open Model Zoo

    Open Model Zoo

    Pre-trained Deep Learning models and demos

    ...It includes hundreds of models covering object detection, classification, segmentation, pose estimation, speech recognition, text-to-speech, and more, many of which are already converted into formats optimized for inference on CPUs, GPUs, VPUs, and other accelerators supported by OpenVINO. In addition to model files, Open Model Zoo provides demo applications that show realistic usage patterns and help developers quickly prototype and understand inference pipelines in C++, Python, or via the OpenCV Graph API. Tools in the repository also help automate model downloads and other tasks, making it easier to incorporate these models into production systems or custom solutions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    OpenAI Cookbook

    OpenAI Cookbook

    Examples and guides for using the OpenAI API

    ...It also includes deployment recipes, integration snippets (e.g. with GitHub Actions), and production considerations. Because OpenAI’s API evolves rapidly, the Cookbook acts as a living, community-curated reference to show “how to do X with the API” rather than only reprinting documentation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Deep Learning Is Nothing

    Deep Learning Is Nothing

    Deep learning concepts in an approachable style

    ...It typically begins with linear algebra, calculus, and optimization refreshers before moving to perceptrons, multilayer networks, and gradient-based training. Implementations favor small, readable examples—often NumPy first—to show how forward and backward passes work without depending solely on high-level frameworks. Once the fundamentals are clear, the material extends to CNNs, RNNs, and attention mechanisms, explaining why each architecture suits particular tasks. Practical sections cover data pipelines, regularization, and evaluation, emphasizing reproducibility and debugging techniques. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    VisualDL

    VisualDL

    Deep Learning Visualization Toolkit

    VisualDL, a visualization analysis tool of PaddlePaddle, provides a variety of charts to show the trends of parameters and visualizes model structures, data samples, histograms of tensors, PR curves , ROC curves and high-dimensional data distributions. It enables users to understand the training process and the model structure more clearly and intuitively so as to optimize models efficiently. VisualDL provides various visualization functions, including tracking metrics in real-time, visualizing the model structure, displaying the data sample, visualizing the relationship between hyperparameters and model metrics, presenting the changes of distributions of tensors, showing the pr curves, projecting high-dimensional data to a lower dimensional space and more. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Agents Towards Production

    Agents Towards Production

    Code-first tutorials covering every layer of GenAI agents

    ...The repository is built around runnable tutorials, each in its own folder, often sponsored by or built in collaboration with infrastructure providers like LangChain, Redis, Bright Data, Contextual AI, Tavily, Runpod, Portia, and others. These tutorials show how to implement things like secure tool calling with OAuth, dual-memory architectures, production RAG agents, multi-agent communication protocols, GPU deployment, containerization with Docker, FastAPI endpoints, and Streamlit chat UIs. The architecture diagram and accompanying material provide a mental model for how production-grade agents should be wired together.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    MiniMax-M2

    MiniMax-M2

    MiniMax-M2, a model built for Max coding & agentic workflows

    ...It is also optimized for multi-step agent tasks, planning and executing long toolchains that span shell commands, browsers, retrieval systems, and code runners. Benchmarks show that it achieves highly competitive scores on a wide range of intelligence and agent benchmarks, including SWE-Bench variants, Terminal-Bench, BrowseComp, GAIA, and several long-context reasoning suites.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    The Web MCP

    The Web MCP

    A powerful Model Context Protocol (MCP) server

    ...Bright Data also advertises a getting-started tier with a free monthly allotment, plus options for remote or self-hosted operation depending on governance needs. Ecosystem materials and examples show how it plugs into MCP-capable runtimes and agent frameworks. Overall, the project is aimed at making web intelligence a reliable building block for agent workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    firerpa LAMDA

    firerpa LAMDA

    The most powerful Android RPA agent framework

    ...The project emphasizes low-friction setup and powerful control primitives so teams can move from interactive validation to repeatable automation. A public wiki, releases, and issue tracker show active development across areas like connectivity, instrumentation compatibility, and robustness under detection. Together with companion projects (e.g., a device hub), lamda is positioned as a next-generation mobile automation stack rather than a single tool. Its focus on remote control plus RPA primitives makes it useful for QA, operations, and large-scale device orchestration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Scrapling

    Scrapling

    An undetectable, powerful, flexible, high-performance Python library

    ...The library targets the full scraping pipeline: session handling, fetching, rendering when needed, parsing, and export—while keeping ergonomics front and center. Community posts and guides show active usage patterns, packaging tips, and frequent releases that iterate on speed and resilience. The repository positions Scrapling as a batteries-included alternative to stitching together many small libraries. In short, it aims to make tough targets tractable while keeping scripts readable and maintainable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Supertonic

    Supertonic

    Lightning-fast, on-device TTS, running natively via ONNX

    ...It focuses on running entirely locally, eliminating the need for cloud APIs and providing low latency and strong privacy guarantees, even on constrained devices like Raspberry Pi boards and e-readers. The core model is highly compact at around 66 million parameters, yet benchmarks show it can generate speech up to 167× faster than real time on modern consumer hardware and significantly outpace popular cloud TTS APIs in throughput and real-time factor. Supertonic is designed to handle real-world text gracefully, including numbers, dates, currency symbols, abbreviations, and technical units, without requiring heavy pre-processing or custom text normalization. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Advanced RAG Techniques

    Advanced RAG Techniques

    Advanced techniques for RAG systems

    ...The repository organizes techniques into categories such as foundational RAG, query enhancement, context enrichment, and advanced retrieval, making it easier to navigate specific areas of interest. It includes hands-on Jupyter notebooks and runnable scripts that show how to implement ideas like optimizing chunk sizes, proposition chunking, HyDE/HyPE query transformations, fusion retrieval, reranking, and ensemble retrieval. There is also an evaluation section that demonstrates how to measure RAG performance and compare different configurations in a systematic way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ArXiv MCP Server

    ArXiv MCP Server

    A Model Context Protocol server for searching and analyzing arXiv

    ...With simple tools like “search” and “fetch,” an agent can find papers, pull abstracts, and download PDFs for downstream summarization or analysis. The project includes packaging and CI to publish to PyPI, plus tests and linting for reliability. Issue threads show feature requests such as extracting embedded LaTeX and improving markdown conversion, reflecting active community use in research flows. It’s designed to be drop-in for MCP clients, giving them typed inputs/outputs and predictable errors around a well-known academic corpus. For developers building research copilots, it removes the glue work of wiring arXiv APIs into an agent toolchain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MGIE

    MGIE

    Guiding Instruction-based Image Editing via Multimodal Large Language

    ...The project focuses on making edits explainable and controllable: the model interprets text guidance, reasons over image content, and outputs edits aligned with user intent. It’s positioned as an ICLR 2024 Spotlight work, with code and references that show how to connect language planning to concrete image operations. This bridges a gap between free-form prompts and precise edits by letting users describe “what” and “where” in everyday language. The repo includes instructions, examples, and links that situate MGIE within Apple’s broader line of multimodal research. For practitioners, MGIE provides a blueprint for text-to-edit systems that are more semantically grounded than naive prompt-only pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ML Ferret

    ML Ferret

    Refer and Ground Anything Anywhere at Any Granularity

    ...The core idea is a hybrid region representation that mixes discrete coordinates with continuous visual features, so the model can fluidly handle “any-form” referring while maintaining precise spatial localization. The repo presents the vision-language pipeline, model assets, and paper resources that show how Ferret answers questions, follows instructions, and returns grounded outputs rather than just text. In practice, this enables tasks like “find that small red icon next to the chart and describe it” where both the linguistic reference and the visual region are ambiguous without fine spatial reasoning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next