Showing 37 open source projects for "vision"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    UI.Vision RPA

    UI.Vision RPA

    Open-Source RPA Software (formerly Kantu)

    The UI Vision RPA software is the tool for visual process automation, codeless UI test automation, web scraping and screen scraping. Automate tasks on Windows, Mac and Linux. The UI Vision RPA core is open-source with enterprise security. The free and open-source browser extension can be extended with local apps for desktop UI automation. UI.Vision RPA's computer-vision visual UI testing commands allow you to write automated visual tests with UI.Vision RPA - this makes UI.Vision RPA the first and only Chrome and Firefox extension (and Selenium IDE) that has "👁👁 eyes". ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Open Generative AI

    Open Generative AI

    Uncensored, open-source alternative to Higgsfield AI

    ...The repository organizes information about models, libraries, datasets, and learning materials, making it easier for developers to navigate the rapidly evolving AI landscape. It includes references to tools for natural language processing, computer vision, and multimodal systems. The project is designed as a knowledge hub, helping users discover technologies and best practices for building generative AI applications. It is particularly useful for beginners who need a structured overview as well as for experienced developers looking for new tools. The repository is continuously updated to reflect the latest developments in the field. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 3
    Midscene

    Midscene

    Vision-based AI framework for cross-platform UI automation tasks

    Midscene.js is an open source AI-driven UI automation framework designed to control user interfaces across multiple platforms using natural language instructions. Instead of relying on traditional selectors, DOM structures, or accessibility attributes, it uses a vision-first approach where screenshots are analyzed by visual-language models to identify interface elements and perform actions. It allows developers to automate interactions on web applications, desktop software, and mobile devices without needing platform-specific automation logic. Developers can describe tasks such as clicking buttons, filling forms, or extracting information, and the system interprets these commands to interact with the interface accordingly. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Magnitude

    Magnitude

    Vision AI browser agent for automation, testing, and extraction

    Browser Agent by Magnitude is an open source, vision-first browser automation framework that enables users to control web interfaces using natural language instructions. It leverages visually grounded AI models to interpret and interact with web pages based on what is seen on the screen rather than relying solely on the DOM structure. This approach allows the agent to generalize better across complex and modern websites, making it more robust than traditional selector-based automation tools. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    MCiSEE

    MCiSEE

    All of Minecraft, EASILY get Minecraft resources

    MCiSEE is an open-source project designed to integrate Minecraft with computer vision and artificial intelligence experiments. The system focuses on capturing visual information from the game environment and exposing it to external programs for analysis or machine learning research. By converting gameplay data into visual or structured formats, MCiSEE enables researchers and developers to build AI agents capable of interacting with the Minecraft environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AI Deadlines

    AI Deadlines

    AI conference deadline countdowns

    AI Deadlines is an open-source project that provides a centralized system for tracking important submission deadlines for major artificial intelligence and machine learning conferences. The repository powers a website that displays countdown timers and structured information for top research conferences across subfields such as computer vision, natural language processing, machine learning, and robotics. The project maintains a curated dataset of conferences that includes metadata such as submission deadlines, abstract deadlines, event dates, conference locations, and related information. Researchers and students use the platform to plan their paper submissions and manage academic schedules without manually tracking multiple conference announcements. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    PrimeReact

    PrimeReact

    The Most Complete React UI Component Library

    Elevate your web applications with PrimeReact's comprehensive suite of customizable, feature-rich UI components. With PrimeReact, turning your development vision into reality has never been easier. The ultimate set of UI Components to assist you with 80+ impressive React Components. Choose from a variety of pre-built themes or implement your design systems with the CSS library of your choice like TailwindCSS. Connect with the other open-source community members, collaborate, and have a voice in the project roadmap. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    HSD

    HSD

    Handshake Daemon & full node

    ...Handshake is an experiment that seeks to explore those new ways in which the necessary tools to build a more decentralized internet. Services on the internet have become more centralized beginning in the 1990s, but do not fulfill the original decentralized vision of the internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Chromatone

    Chromatone

    Chromatone is a digital garden of visual music theory

    Cards and short overviews on the physics and physiology of vision and hearing and their intersection at visual music research, exploration, practice, and self-expression. Useful tools to have in the pocket like a pack of interactive cards to learn and use in everyday music practice. These are open source web experiments with different aspects of sound and color. Chromatone is an open source research and design project to explore, develop and implement the scientific way of visual music education, communication and performance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    clipwise
    Search through images and videos backed by local LLMs. Private, fast, offline-first — powered by bge-large embeddings, moondream vision, and Whisper transcription.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Screen Recorder Studio

    Screen Recorder Studio

    Screen Recorder Studio Screen Recorder Studio is a modern, lightweigh

    ...Key Features 🎥 High-quality screen recording 🎮 Gameplay recording with minimal performance impact 🖥️ Full-screen, window, and custom-region capture 🎙️ Microphone and system audio recording ⚡ Fast startup and lightweight resource usage 📁 Easy video export and file management 🎨 Modern, clean, user-friendly interface Vision join our discord in order to suggest or to communicate: https://discord.gg/YRhcgEGVjN
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    LibrePlan

    LibrePlan

    Open Web Planning

    LibrePlan is a collaborative tool to plan, monitor and control projects and has a rich web interface which provides a desktop alike user experience. All the team members can take part in the planning and this makes possible to have a real-time planning. LibrePlan is open source and you can download, install and customize it for free. Highlights: * Open source solution * Collaborative & web based software * Multiproject focus * Real time planning * Collaborate with other...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 13
    Entropy Linux

    Entropy Linux

    Arch based, Modern, Midweight, Practical, Experimental, AMD, Szmelc

    ...Power & Flexibility: Designed for power users, sysadmins, and DevOps pros. Advanced Toolset: Custom utilities to boost productivity, AMD™ Optimized: Engineered for peak performance on AMD hardware. Part of a Vision: A key piece of an evolving ecosystem, connecting projects seamlessly. Entropy is a constantly evolving system—designed to be explored, tweaked, and refined. Some setup may be required, but that’s part of the fun. What to Expect? A robust, adaptable distro with powerful tools and limitless customization. Occasional bugs, experimental features, and hidden easter eggs because pushing limits isn’t always smooth!
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    pipeless

    pipeless

    A computer vision framework to create and deploy apps in minutes

    Pipeless is an open-source computer vision framework to create and deploy applications without the complexity of building and maintaining multimedia pipelines. It ships everything you need to create and deploy efficient computer vision applications that work in real-time in just minutes. Pipeless is inspired by modern serverless technologies. It provides the development experience of serverless frameworks applied to computer vision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    TuiCss

    TuiCss

    Text-based user interface CSS library

    ...This kind of interface is very legible because the ultra-contrast colors used and because the reduced effects used on the components in the view. The base of this project is Turbo Vision Framework, but some other frameworks were also checked to introduce some features to TuiCss, like curses, ncurses, Newt, etc. Check the examples page in the wiki to stay on top of some creations, or check the getting started page to start using this library! To start to use TuiCss in your project, you can just download the repository content and import the files that are in the dist folder with the html directives. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ClassyVision

    ClassyVision

    An end-to-end PyTorch framework for image and video classification

    Classy Vision is a PyTorch-based framework designed for large-scale training and deployment of state-of-the-art image and video classification models. Developed by Facebook Research, it serves as an end-to-end system that simplifies the process of training at scale, reducing redundancy and friction in moving from research to production. Unlike traditional computer vision libraries that focus solely on modular components, Classy Vision provides a complete and unified framework, featuring distributed training, reproducible experiments, and flexible configuration tools. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Advanced REST Client application

    Advanced REST Client application

    Advanced REST Client - Desktop application

    ...ARC was built as an open-source and free for everyone API tool out of a passion for giving the developer community tools they need. The application and related projects (like API Console) are created and distributed to our users for free. Our vision is that API tools are available for every developer and organization, regardless of their size, for free and without forced relationship. API tools are using open standards to communicate with other applications so developers can build integrations. Originally the application was created by the author for own convenience while developing APIs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Deep Learning 500 Questions

    Deep Learning 500 Questions

    500 Questions on Deep Learning using a question-and-answer format

    ...The first sections focus on essential mathematics, machine learning basics, and deep learning foundations, establishing the groundwork for more advanced topics. Later chapters explore classic neural network structures such as CNNs, RNNs, and GANs, as well as key applications in computer vision like object detection and image segmentation. The resource also delves into optimization methods, including transfer learning, network architecture design, hyperparameter tuning, model compression, and acceleration techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CYBR-SUITE

    CYBR-SUITE

    Remote Scrum Communication & Collaboration Suite :: Scrum Board & co

    The CYSU CYBR-SUITE is a remote Scrum solution with advanced, unique communication capabilities - the CFLX. 1. Sprint Vision Boards → clear OKRs 2. User-Story-Templates 3. Requirement-Templates, 4. Product-Backlog-Item's & Task's assessments, 5. Personality assessments, ...these mandatory ways to add more context & breaking-down complexity enable a better overview and a better understanding - for man and machine... ...based on the standardized structure of the DATA WE ARE NOW CREATING AND STORING, these values can now be used directly for the inputs and targets of our artificial neuronal networks and generate our competitive advantage based on SUPERIOR MACHINE LEARNING capabilities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    NativeScript Documentation

    NativeScript Documentation

    Documentation, API reference, and code snippets for NativeScript

    NativeScript provides platform APIs directly to the JavaScript runtime (with strong types) for a rich TypeScript development experience. Building Web, iOS, Android, and Vision Pro apps with a shared codebase (aka, cross-platform apps) Building native platform apps with portable JavaScript skills. Augmenting JavaScript projects with platform API capabilities. AndroidTV and Watch development watchOS development. Learning native platforms through JavaScript understanding. Exploring platform API documentation by trying APIs directly from a web browser without requiring a platform development machine setup.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Bot Libre

    Bot Libre

    free open artificial intelligence for everyone

    Bot Libre is a free open source platform for artificial intelligence, chat bot, virtual agents, live chat, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Show Facebook Computer Vision Tags

    Show Facebook Computer Vision Tags

    Chrome Extension that displays automated image tags from Facebook

    Show Facebook Computer Vision Tags is a Chrome (and Firefox) browser extension created to expose and overlay the automatically generated image tags that Facebook applies to photos in users’ feeds. Since Facebook uses a computer-vision model to analyse user-uploaded images and generate alt-text tags for accessibility (e.g., “Image may contain: golf, grass, outdoor and nature”), this extension surfaces those hidden tags directly in the UI—revealing what kind of information Facebook infers about images (objects present, activities being done, environment). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    survol

    RDF-based framework monitoring business systems activity

    ...all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    tracking.js

    tracking.js

    A modern approach for Computer Vision on the web

    ...Test out the web server by loading the finished version of the project. The main goal of tracking.js is to provide those complex techniques in a simple and intuitive way on the web. We believe computer vision is important to improve people's life, bringing it to the web will make this future a reality a lot faster.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    One Web Forum

    One Web Forum

    One Web Forum is tool for expressing your free speech in todays world

    ...You can install our plugin for Mozilla Firefox. We hired best software engineers and researchers from companies such as Microsoft, Facebook, Pacific Biosciences and Microsoft Research to help continue vision of a great man, Martin Luther King, Jr.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo