Join/Login
Open Source Software
Business Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Open Source Software

Business Software

SourceForge Podcast

Articles
Case Studies
Learn
Blog

Menu

Help
Create
Join
Login

Home
Browse Open Source
Search Results

Search Results for "audio recognition"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 20
Windows 18
Mac 13
More...
BSD 11
ChromeOS 6
Desktop Operating Systems 5
Mobile Operating Systems 1
Server Operating Systems 1

Category

Multimedia 21
Artificial Intelligence 15
Scientific/Engineering 8
Software Development 3
Education 2
Games 2
Security 1
System 1

License

OSI-Approved Open Source 19
Other License 2
Creative Commons Attribution License 1
Public Domain 1

Translations

English 8
Polish 1
Swedish 1

Programming Language

C++ 24
C 3
C# 3
Java 3
Python 3
More...
Assembly 1
JavaScript 1
MATLAB 1
Perl 1

Status

Production/Stable 8
Beta 5
Planning 3
Alpha 2

Showing 24 open source projects for "audio recognition"

View related business solutions

C++ Clear Filters & Widen Search

The Secure Workspace for Remote Work
Venn isolates and protects work from any personal use on the same computer, whether BYO or company issued.

Venn is a secure workspace for remote work that isolates and protects work from any personal use on the same computer. Work lives in a secure local enclave that is company controlled, where all data is encrypted and access is managed. Within the enclave – visually indicated by the Blue Border around these applications – business activity is walled off from anything that happens on the personal side. As a result, work and personal uses can now safely coexist on the same computer.

Learn More
Recruit and Manage your Workforce
Evolia makes it easier to hire, schedule and track time worked by frontline in medium and large-sized businesses.

Evolia is a web and mobile platform that connects enterprises with 1000’s of local shift workers and offers free workforce scheduling and time and attendance solutions. Is your business on Evolia?

Learn More
1

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 1 This Week

Last Update: 3 hours ago
See Project
2

VoodooHDA

VoodooHDA is an open source audio driver for devices compliant with the Intel High Definition Audio specification. It is intended as a replacement for AppleHDA on Mac OS X with support for a wide range of audio controllers and codecs.

20 Reviews

Downloads: 493 This Week

Last Update: 2022-09-07
See Project
3

wav2letter++

Facebook AI research's automatic speech recognition toolkit

... export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by space (or tabs).

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
4

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 0 This Week

Last Update: 2020-04-22
See Project
Cyber Risk Assessment and Management Platform
ConnectWise Identify is a powerful cybersecurity risk assessment platform offering strategic cybersecurity assessments and recommendations.

When it comes to cybersecurity, what your clients don’t know can really hurt them. And believe it or not, keep them safe starts with asking questions. With ConnectWise Identify Assessment, get access to risk assessment backed by the NIST Cybersecurity Framework to uncover risks across your client’s entire business, not just their networks. With a clearly defined, easy-to-read risk report in hand, you can start having meaningful security conversations that can get you on the path of keeping your clients protected from every angle. Choose from two assessment levels to cover every client’s need, from the Essentials to cover the basics to our Comprehensive Assessment to dive deeper to uncover additional risks. Our intuitive heat map shows you your client’s overall risk level and priority to address risks based on probability and financial impact. Each report includes remediation recommendations to help you create a revenue-generating action plan.

Learn More
5

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki...

2 Reviews

Downloads: 1 This Week

Last Update: 2020-03-15
See Project
6

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 22 This Week

Last Update: 2018-05-11
See Project
7

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research...

Downloads: 0 This Week

Last Update: 2019-08-21
See Project
8

My Music Recognition

This application can help you quickly identify the name of any song.

My Music Recognition uses a powerful audio recognition engine in order to help you get the name of the song you are listening to. It can capture sound from radio streams, the installed music player or any other source and display the name of the song in seconds.

3 Reviews

Downloads: 5 This Week

Last Update: 2016-11-29
See Project
9

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
Innovate faster with enterprise-ready generative AI—enhanced by Gemini
Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case.

Vertex AI offers everything you need to build and use generative AI—from AI solutions, to Search and Conversation, to 130+ foundation models, to a unified AI platform.

Try for free
10

LeapInto

Simplified interface to Leap Motion designed for art and music apps

LeapInto provides a simplified interface to the Leap Motion hand sensor input device. Multiple hand recognition is simplified to several stable categories and coordinates are normalised. The interface comes two flavours at present, an open broadcast system using the OSC protocol and a plugin for the Csound audio/music programming language.

Downloads: 1 This Week

Last Update: 2016-05-05
See Project
11

avimmir

(audio, video, image) Multimedia Multimodal Information Retrieval

audio classification; speaker segmentation; speaker clustering; speaker recognition; spoken document retrieval; image retrieval; video retrieval; etc.

Downloads: 0 This Week

Last Update: 2013-11-23
See Project
12

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 1 This Week

Last Update: 2013-10-03
See Project
13

RNNLIB

RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. full installation and usage instructions given at http://sourceforge.net/p/rnnl/wiki/Home/

2 Reviews

Downloads: 1 This Week

Last Update: 2016-11-28
See Project
14

openSMILE

SMILE = Speech & Music Interpretation by Large Space Extraction openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research developed originally at TUM in the scope of the EU-project SEMAINE, now maintained and supported by audEERING.

Downloads: 0 This Week

Last Update: 2014-11-27
See Project
15

Sound Pitch Recognition

A set of Qt/C++ classes enabling cross-platform sound recording and pitch recognition. Can be used in software (e.g. instrument tuners, sound dictation, music teaching and tests) as a user input method. Includes a guitar-tuner example.

Downloads: 0 This Week

Last Update: 2013-05-21
See Project
16

CJ7

CJ7 is an open-source speech recognition engine.

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
17

openEAR

openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.

4 Reviews

Downloads: 28 This Week

Last Update: 2015-08-06
See Project
18

VisAmp

VisAmp is a visually controlled mp3 player. It was initially developed during the "Softwarepraktikum" at the Chair for Image Processing and Pattern Recognition of the University of Freiburg, Germany in 2001.

Downloads: 0 This Week

Last Update: 2016-11-13
See Project
19

Advanced Sphinx TRAiner

Graphical User Interface and advanced facilities for training the speech recognition system Sphinx-III (using SphinxTrain).

1 Review

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
20

Bird Species Classifier

Program performs bird species recognition by their voices. In early stage of development but working well with some popular species and good samples quality.

Downloads: 0 This Week

Last Update: 2013-04-18
See Project
21

KLearnNotes2

A software for teaching the names of music notes. *Intelligent questioning *Gradual learning of successive notes *Bass and treble clefs *A game *Voice recognition, sound. In future:rhythm, scales, key signatures, chords with focus on playing the guitar.

Downloads: 0 This Week

Last Update: 2013-10-10
See Project
22

Kainoa Biometric User Authentication

The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support using AES to protect data. Please read forums for if interested.

Downloads: 0 This Week

Last Update: 2015-08-03
See Project
23

Ebba

EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3d facial expressions, speech synthesizer, speech recognition and an iq-test solving functionality.

2 Reviews

Downloads: 2 This Week

Last Update: 2016-06-01
See Project
24

ROSA is an Open Source Agent (ROSA)

ROSA is an open source agent implementation. It will contain a speech engine, a speech recognition engine and many more...

Downloads: 0 This Week

Last Update: 2013-02-25
See Project

Previous
You're on page 1
Next

Related Searches

chromebook audio driver

opensmile-2.3.0

license plate recognition using java

5.1 surround audio

arabic audio transcription

pattern recognition

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
225 Broadway Suite 1600
San Diego, CA 92101
+1 (858) 454-5900

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2024 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: