Showing 6 open source projects for "speech engine"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Dicio assistant

    Dicio assistant

    Dicio assistant app for Android

    Dicio is a free and open source voice assistant for Android that focuses on strong privacy by running its understanding and response generation directly on the device whenever possible. It supports multiple input and output methods, including hotword-based voice input using the Vosk speech-to-text engine and a graphical interface for users who prefer to tap instead of talk. The assistant is built around a flexible “skills” system that lets it respond to a wide variety of requests such as search, weather, navigation, calculator, timers, media control, and more. It offers rich multilingual support and is already available in a wide range of languages including English, Spanish, German, French, Italian, Russian, and others. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Rhino

    Rhino

    On-device Speech-to-Intent engine powered by deep learning

    Rhino is Picovoice's Speech-to-Intent engine. It directly infers intent from spoken commands within a given context of interest, in real-time. The end-to-end platform for embedding private voice AI into any software in a few lines of code. Design with no limits on top of a modular platform. Create use-case-specific voice AI models in seconds. Develop voice features with a few lines of code using intuitive and cross-platform SDKs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Cactus

    Cactus

    Low-latency AI inference engine optimized for mobile devices

    Cactus is a low-latency, energy-efficient AI inference framework designed specifically for mobile devices and wearables, enabling advanced machine learning capabilities directly on-device. It provides a full-stack architecture composed of an inference engine, a computation graph system, and highly optimized hardware kernels tailored for ARM-based processors. Cactus emphasizes efficient memory usage through techniques such as zero-copy computation graphs and quantized model formats, allowing large models to run within the constraints of mobile hardware. It supports a wide range of AI tasks including text generation, speech-to-text, vision processing, and retrieval-augmented workflows through a unified API interface. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    TTS Server

    TTS Server

    Android system TTS application with Microsoft demo interface

    tts-server-android is an Android system TTS application that acts both as a powerful local text-to-speech engine and as a flexible TTS “server” for other apps via HTTP. It includes a built-in Microsoft TTS demo interface and lets users configure custom HTTP requests, making it possible to route TTS through various cloud providers or local servers. The app can import other local TTS engines, giving Android devices a unified interface to multiple voices and providers, and it features simple narration/dialogue detection based on Chinese quotation marks so it can read text with different styles for narration and dialogue. ...
    Downloads: 22 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    VoR ~ Voice Recognition

    VoR ~ Voice Recognition

    VoR is a voice recognition tool for Android.

    It is an Open-Source project that acquire the word spoken and open the its page in Wikipedia. It include a VoR Engine ~ Voice Recognition Engine make by CMU Sphinx tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next