Showing 20 open source projects for "visual%20scraper"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    Seamless Communication

    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    ...The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak, gesture, point) or query, and a cross-modal reasoning layer that fuses perception with semantic context. The research prototype includes components for visual grounding (understanding when a user references something in view), gesture recognition and synthesis, and turn-taking mechanisms that mirror human conversational timing. Because latency and synchronization are critical, the codebase invests in asynchronous scheduling, overlap of perception and reasoning, and fast fallback responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    AI File Sorter

    AI File Sorter

    Local AI file organization with image-based rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI to organize all files and suggest better file names for image files, based on their visual content. The app can analyze picture files locally and suggest meaningful, human-readable names. For example, a generic file like IMG_2048.jpg can be renamed to clouds_over_lake.jpg. All rename and categorization suggestions are optional and must be reviewed and approved before anything is applied. AI File Sorter helps tidy up cluttered folders such as Downloads, external drives, or NAS storage by grouping files based on filenames, extensions, folder context, and learned organization patterns. ...
    Downloads: 268 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    Mozilla JPEG Encoder Project

    Mozilla JPEG Encoder Project

    Improved JPEG encoder

    MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard, and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI. It is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All MozJPEG's improvements can be disabled at run time, and in that case it behaves exactly like libjpeg-turbo. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    NeuralCoref

    NeuralCoref

    Fast Coreference Resolution in spaCy with Neural Networks

    NeuralCoref is a pipeline extension for spaCy 2.1+ which annotates and resolves coreference clusters using a neural network. NeuralCoref is production-ready, integrated in spaCy's NLP pipeline and extensible to new training datasets. For a brief introduction to coreference resolution and NeuralCoref, please refer to our blog post. NeuralCoref is written in Python/Cython and comes with a pre-trained statistical model for English only. NeuralCoref is accompanied by a visualization client...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    A Disk Scheduling Simulator tool. With an intuitive GUI interface and command line functionality. Built using both VC and VB. Currently supports SSTF,LOOK,CLOOK,SCAN and CSCAN algorithms. Rotational latency is also optionally supported.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    NeuroSolutions: Formula Generator

    Utility converts the weights file of a MLP Breadboard into a formula

    The NeuroSolutions: Formula Generator utility converts the weights file of a default MLP breadboard (1-hidden layer with a TanhAxon in the hidden layer and either a TanhAxon or BiasAxon in the output layer) into a usable formula that can be copied and pasted into your own programs to compute the output of the trained neural network.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 10
    Azul OS

    Azul OS

    Azul OS version dev(Linux) IA

    Azul OS version dev , est une version de Azul pour les developpeurs basé sur Linux , doté d'une IA un programme nommé Azul voice et qui est un système de reconnaissance vocale qui comprend ce que vous dites et réponds par des sensations . Azul Dev est une distribution linux , qui comporte des outils et des lib pour les developpeurs avec une Interface Gnome # Azul voice système sensation . Windows & linux. En cours .. # Azul voice version windows Azul interface . Disponible # Azul dev...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Interface and example projects for CLIPS on NT using the CLIPS DLL. Includes Web/CGI interface.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    This project is to share my work in visual recognition of gestures. It shows how I have implemented person detection, color adaptation and tracking of the user's right-hand, gesture feature extraction, etc. using a single videocamera.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    It's Windows version of WordNet 3.0 (not provided on author's site) with c code source for compiling with visual studio
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    computer vision workshop project - realtime video processing and simulation of musical instruments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    RobotFlow is a mobile robotics tookit based on FlowDesigner. The visual programming interface provided with FlowDesigner will help people to better visualize & understand what is really happening in the robot's control loops, sensors and actuators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    An attempt at developing an Artificial Inteligence software, based on the notion that inteligence is a set of conditional choices (in programming terms, it means a whole lot of if statements).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    AIBuddy is an artificially intelligent chatbot. Unlike other chatbots, which pattern-match the entered phrase against a database of hard-coded responses, AIBuddy actually remembers the conversation, and can respond in context.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Pangaea will be a robust and feature filled game engine built using Allegro (http://alleg.sourceforge.net) It will be similar to Final Fantasy 1-3, etc., complete with map editor/world builder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A collection of user contributed applications which use the Open Computer Vision Library
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next