Showing 19 open source projects for "visual"

  • 1
    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    ...The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak, gesture, point) or query, and a cross-modal reasoning layer that fuses perception with semantic context. The research prototype includes components for visual grounding (understanding when a user references something in view), gesture recognition and synthesis, and turn-taking mechanisms that mirror human conversational timing. Because latency and synchronization are critical, the codebase invests in asynchronous scheduling, overlap of perception and reasoning, and fast fallback responses.
    Downloads: 0 This Week
  • 2
    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible...
    Downloads: 0 This Week
  • 3
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Downloads: 3 This Week
  • 4
    Mozilla JPEG Encoder Project

    Improved JPEG encoder

    MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard, and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI. It is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All MozJPEG's improvements can be disabled at run time, and in that case it behaves exactly like libjpeg-turbo. ...
    Downloads: 2 This Week
  • 5
    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 7 This Week
  • 6
    NeuralCoref

    Fast Coreference Resolution in spaCy with Neural Networks

    NeuralCoref is a pipeline extension for spaCy 2.1+ which annotates and resolves coreference clusters using a neural network. NeuralCoref is production-ready, integrated in spaCy's NLP pipeline and extensible to new training datasets. For a brief introduction to coreference resolution and NeuralCoref, please refer to our blog post. NeuralCoref is written in Python/Cython and comes with a pre-trained statistical model for English only. NeuralCoref is accompanied by a visualization client...
    Downloads: 0 This Week
  • 7
    A disk scheduling simulator with an intuitive GUI and command-line functionality, built using both VC and VB. Currently supports the SSTF, LOOK, CLOOK, SCAN and CSCAN algorithms. Rotational latency is also optionally supported.
    Downloads: 0 This Week
  • 8

    NeuroSolutions: Formula Generator

    Utility converts the weights file of a MLP Breadboard into a formula

    The NeuroSolutions: Formula Generator utility converts the weights file of a default MLP breadboard (1-hidden layer with a TanhAxon in the hidden layer and either a TanhAxon or BiasAxon in the output layer) into a usable formula that can be copied and pasted into your own programs to compute the output of the trained neural network.
    Downloads: 0 This Week
  • 9
    Azul OS

    Azul OS dev version (Linux) with AI

    Azul OS dev version is a Linux-based version of Azul for developers. It includes an AI: a program named Azul voice, a speech recognition system that understands what you say and responds with sensations. Azul Dev is a Linux distribution that includes tools and libraries for developers, with a GNOME interface. # Azul voice sensation system: Windows & Linux. In progress. # Azul voice Windows version, Azul interface: available. # Azul dev...
    Downloads: 3 This Week
  • 10
    Interface and example projects for CLIPS on NT using the CLIPS DLL. Includes Web/CGI interface.
    Downloads: 0 This Week
  • 11
    This project shares my work on visual recognition of gestures. It shows how I have implemented person detection, color adaptation and tracking of the user's right hand, gesture feature extraction, etc. using a single video camera.
    Downloads: 0 This Week
  • 12
    A Windows version of WordNet 3.0 (not provided on the author's site), with C source code for compiling with Visual Studio.
    Downloads: 1 This Week
  • 13
    Computer vision workshop project: real-time video processing and simulation of musical instruments.
    Downloads: 0 This Week
  • 14
    RobotFlow is a mobile robotics toolkit based on FlowDesigner. The visual programming interface provided with FlowDesigner helps people visualize and understand what is really happening in the robot's control loops, sensors and actuators.
    Downloads: 0 This Week
  • 15
    DTW is intended to be a "voice in, pictures + text out" program written in Java using Sphinx from CMU. It is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
    Downloads: 0 This Week
  • 16
    An attempt at developing artificial intelligence software, based on the notion that intelligence is a set of conditional choices (in programming terms, a whole lot of if statements).
    Downloads: 0 This Week
  • 17
    AIBuddy is an artificially intelligent chatbot. Unlike other chatbots, which pattern-match the entered phrase against a database of hard-coded responses, AIBuddy actually remembers the conversation, and can respond in context.
    Downloads: 0 This Week
  • 18
    Pangaea will be a robust, feature-rich game engine built using Allegro (http://alleg.sourceforge.net). It will be similar to Final Fantasy 1-3, etc., complete with a map editor/world builder.
    Downloads: 0 This Week
  • 19
    A collection of user-contributed applications that use the Open Computer Vision Library.
    Downloads: 0 This Week