Showing 19 open source projects for "visual"

View related business solutions
  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 2
    Seamless Communication

    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    ...The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak, gesture, point) or query, and a cross-modal reasoning layer that fuses perception with semantic context. The research prototype includes components for visual grounding (understanding when a user references something in view), gesture recognition and synthesis, and turn-taking mechanisms that mirror human conversational timing. Because latency and synchronization are critical, the codebase invests in asynchronous scheduling, overlap of perception and reasoning, and fast fallback responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Leader badge
    Downloads: 38 This Week
    Last Update:
    See Project
  • 4
    Mozilla JPEG Encoder Project

    Mozilla JPEG Encoder Project

    Improved JPEG encoder

    MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard, and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI. It is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All MozJPEG's improvements can be disabled at run time, and in that case it behaves exactly like libjpeg-turbo. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • The Original Buy Center Software. Icon
    The Original Buy Center Software.

    Never Go To The Auction Again.

    VAN sources private-party vehicles from over 20 platforms and provides all necessary tools to communicate with sellers and manage opportunities. Franchise and Independent dealers can boost their buy center strategies with our advanced tools and an experienced Acquisition Coaching™ team dedicated to your success.
    Learn More
  • 5
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    NeuralCoref

    NeuralCoref

    Fast Coreference Resolution in spaCy with Neural Networks

    NeuralCoref is a pipeline extension for spaCy 2.1+ which annotates and resolves coreference clusters using a neural network. NeuralCoref is production-ready, integrated in spaCy's NLP pipeline and extensible to new training datasets. For a brief introduction to coreference resolution and NeuralCoref, please refer to our blog post. NeuralCoref is written in Python/Cython and comes with a pre-trained statistical model for English only. NeuralCoref is accompanied by a visualization client...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    A Disk Scheduling Simulator tool. With an intuitive GUI interface and command line functionality. Built using both VC and VB. Currently supports SSTF,LOOK,CLOOK,SCAN and CSCAN algorithms. Rotational latency is also optionally supported.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    NeuroSolutions: Formula Generator

    Utility converts the weights file of a MLP Breadboard into a formula

    The NeuroSolutions: Formula Generator utility converts the weights file of a default MLP breadboard (1-hidden layer with a TanhAxon in the hidden layer and either a TanhAxon or BiasAxon in the output layer) into a usable formula that can be copied and pasted into your own programs to compute the output of the trained neural network.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Azul OS

    Azul OS

    Azul OS version dev(Linux) IA

    Azul OS version dev , est une version de Azul pour les developpeurs basé sur Linux , doté d'une IA un programme nommé Azul voice et qui est un système de reconnaissance vocale qui comprend ce que vous dites et réponds par des sensations . Azul Dev est une distribution linux , qui comporte des outils et des lib pour les developpeurs avec une Interface Gnome # Azul voice système sensation . Windows & linux. En cours .. # Azul voice version windows Azul interface . Disponible # Azul dev...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Turn more customers into advocates. Icon
    Turn more customers into advocates.

    Fight skyrocketing paid media costs by turning your customers into a primary vehicle for acquisition, awareness, and activation with Extole.

    The platform's advanced capabilities ensure companies get the most out of their referral programs. Leverage custom events, profiles, and attributes to enable dynamic, audience-specific referral experiences. Use first-party data to tailor customer segment messaging, rewards, and engagement strategies. Use our flexible APIs to build management capabilities and consumer experiences–headlessly or hybrid. We have all the tools you need to build scalable, secure, and high-performing referral programs.
    Learn More
  • 10
    Interface and example projects for CLIPS on NT using the CLIPS DLL. Includes Web/CGI interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    This project is to share my work in visual recognition of gestures. It shows how I have implemented person detection, color adaptation and tracking of the user's right-hand, gesture feature extraction, etc. using a single videocamera.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    It's Windows version of WordNet 3.0 (not provided on author's site) with c code source for compiling with visual studio
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    computer vision workshop project - realtime video processing and simulation of musical instruments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    RobotFlow is a mobile robotics tookit based on FlowDesigner. The visual programming interface provided with FlowDesigner will help people to better visualize & understand what is really happening in the robot's control loops, sensors and actuators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    An attempt at developing an Artificial Inteligence software, based on the notion that inteligence is a set of conditional choices (in programming terms, it means a whole lot of if statements).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AIBuddy is an artificially intelligent chatbot. Unlike other chatbots, which pattern-match the entered phrase against a database of hard-coded responses, AIBuddy actually remembers the conversation, and can respond in context.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Pangaea will be a robust and feature filled game engine built using Allegro (http://alleg.sourceforge.net) It will be similar to Final Fantasy 1-3, etc., complete with map editor/world builder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A collection of user contributed applications which use the Open Computer Vision Library
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next