Search Results for "speech processing" - Page 6

Showing 152 open source projects for "speech processing"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    de-ess

    de-ess

    De-essing software to reduce sibilance in speech using TSP

    This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Java application for training and deploying text processing applications such as part-of-speech taggers, based on a re-implementation of Brill's algorithm in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Open Pandora's Box

    Open Pandora's Box

    Pandora is an artificial intelligent web based bot

    Pandora is an artificial intelligent web based bot written in Java. Pandora is a component based AI architecture including, database memory, XML, voice, voice rec, chat, IRC, HTTP, Wiktionary, Freebase, consciousness, language, GUI, applet, web, jsp, Android
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    KALIMAT Multipurpose Arabic Corpus

    A corpus that could be of help for researchers working on Arabic NLP

    KALIMAT a Multipurpose Arabic Corpus We are pleased to announce the immediate availability of KALIMAT 1.0, KALIMAT is an Arabic natural language resource that consists of: 1) 20,291 Arabic articles collected from the Omani newspaper Alwatan by (Abbas et al. 2011). 2) 20,291 Extractive Single-document system summaries. 3) 2,057 Extractive Multi-document system summaries. 4) 20,291 Named Entity Recognised articles. 5) 20,291 Part of Speech Tagged articles. 6) 20,291...
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    DPRK pull is a script that pulls the English language North Korean news articles from the KCNA website and puts them into one file for reading by a Text to Speech program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    BINVB Speech Processing Toolbox

    Toolbox for speech processing. Realization of Voicebox interface.

    This is a toolbox for speech processing written in C. It realizes interface of Voicebox toolbox (http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    PHN/LAB file format converter

    Convert the format of speech label files

    Research in speech processing requires labeling and segmentation of speech signals, i.e., label files. Two useful formats for label files are the CSLU format (.phn extension) and the HTK format (.lab extension). This program converts speech label files from any of these two formats to the other one. Do this: 1- Provide the source directory (directory where the original label files are). 2- Provide the destination directory (directory where the new label files will be placed). 3- Select the operation, either phn->lab or lab->phn. 4- Press the Process! ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Advanced Speech Signal Analysis library provides a structure to handle various file formats and a variety of analysis functions commonly used in speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 10
    OpenImage
    Java image viewer, an open source java learning application. Currently in alpha stage.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    IUT-SimDSP
    This is a simple DSP simulator for educational purposes: developed as a course supplement of CIT-4617 (Digital Signal Processing) at Islamic University of Technology (IUT). Written in C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    VoiceMan is the speech processing daemon designed to collect output from screen reading software used by blind users in one central place and translate it into speech with configured set of speech synthesizers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The open source, multimodal interactive "Sensitive Artificial Listener" dialogue system created by the EU project SEMAINE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part of speech tagging in Natural Language Processing. Several example applications using maxent can be found in the OpenNLP Tools Library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The purpose of HSVT is the collection and setup of OSS audio tools for the processing of speech and vocals. The end result will be something between jack-rack and ardour, with partial emulation or co-operation with hardware-rack voice processors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CSharpPOSTagger
    POS Tagger , Part of speech tagger, Hidden Markov Model , written with C#. Natural language Processing .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project'll be the core engine of many voice based platforms,which can be implemented into your projects,websites...etc to provide an Arabic speech service, where your servers can interact with the clients through Arabic Speech Recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Automatically translate english/french/german text to german/french/english text and output speech in appropriate language. All Automagically with the power of the inter-webs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The Open Mind Speech project is part of the Open Mind Initiative and aims to develop free(GPL) speech recognition and signal processing (DSP) tools and applications, as well as collect speech data from "e-citizens" using the Internet.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo