Search Results for "audio recognition"



Showing 86 open source projects for "audio recognition"

1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 36 This Week

Last Update: 6 days ago
2

Buster

Captcha solver extension for humans

Save time by asking Buster to solve captchas for you. Buster is a Firefox extension that helps you solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking the extension button at the bottom of the reCAPTCHA widget. Challenges are not guaranteed to be solved every time; the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome backers...

Downloads: 38 This Week

Last Update: 2024-06-04
3

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 1 This Week

Last Update: 3 hours ago
4

audioFlux

A library for audio and music analysis, feature extraction

A library for audio and music analysis and feature extraction. It can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. audioFlux is a deep learning tool library that supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training and is used...

Downloads: 1 This Week

Last Update: 2024-08-09
5

Transformers

State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX

... classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets, and then share them with the community on our model hub. At the same time, each Python module defining an architecture is fully standalone and can be modified to enable quick research experiments.

Downloads: 0 This Week

Last Update: 2024-09-26
6

TorchAudio

Data manipulation and transformation for audio signal processing

The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...

Downloads: 0 This Week

Last Update: 2024-08-22
7

hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

(Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.

Downloads: 0 This Week

Last Update: 5 days ago
8

WPPConnect

WPPConnect is an open source project

WPPConnect is an open-source project developed by the JavaScript community with the aim of exporting functions from WhatsApp Web to Node.js. These functions can be used to build any kind of interaction, such as customer service, media sending, artificial-intelligence recognition based on phrases, and many other things; use your imagination. We are the best WhatsApp automation solution you have been looking for. We are a team that started an open-source project that performs automation...

Downloads: 0 This Week

Last Update: 2024-09-27
9

VideoSrt

Windows-GUI

... to generate subtitle files (supports Chinese-English translation and bilingual subtitles). Extracts speech text from video/audio. Batch translation, filter processing, and encoding of SRT subtitle files. It uses the Alibaba Cloud speech recognition interface, so accuracy is high: the recognition rate for standard Mandarin/English is over 95%. Video recognition does not require uploading the original video, which is convenient, fast, and time-saving.

Downloads: 15 This Week

Last Update: 2023-01-13
10

ml5.js

Friendly machine learning for the web

A neighborly approach to creating and exploring artificial intelligence in the browser. ml5.js aims to make machine learning approachable for a broad audience of artists, creative coders, and students. The library provides access to machine learning algorithms and models in the browser, building on top of TensorFlow.js with no other external dependencies.

Downloads: 2 This Week

Last Update: 2024-08-01
11

OpenClinic GA

Open Source Integrated Hospital Information Management System

OpenClinic GA is an open source integrated hospital information management system covering management of administrative, financial, clinical, lab, x-ray, pharmacy, meals distribution and other data. Extensive statistical and reporting capabilities.

29 Reviews

Downloads: 243 This Week

Last Update: 6 days ago
12

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

... speech recognition and audio classification. Faster autoregressive decoding, TFLite support, and simple creation of TFRecords. Auto-batching of tf.data.dataset or tf.ragged tensors. Everything is a dictionary (inputs and outputs). Multiple mask modes such as causal, user-defined, and prefix. tensorflow-text tokenizer support. Supports GPU, TPU, and a multi-GPU trainer with wandb, multiple callbacks, and auto tensorboard.

Downloads: 0 This Week

Last Update: 2023-03-23
13

CerberusCMS5

Cerberus Content Management System

Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-like model. It is a custom-written Web Application Framework (W.A.F.) with a consistent, custom-written Pre-Hyper-Text-Post-Processor Programming Code Framework (P.C.F.). This Web Application Software Project's aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...

Downloads: 47 This Week

Last Update: 13 hours ago
14

PlateCatcher

Free Automatic Number Plate Recognition (ANPR) for Windows

"Plate Catcher" is designed to offer Windows users robust ANPR functionality with a user-friendly interface. Below are the key features of the application: "Plate Catcher" offers comprehensive alert options when number plates in its database are detected: Visual Alerts: Pop-up windows with time-stamped vehicle registrations. Email Alerts: Email notifications with detected number plates and optional vehicle snapshots. Audio Alerts: Customizable audio alerts for individual number plates...

Downloads: 2 This Week

Last Update: 2024-07-08
15

Vosk Desktop

Desktop software for controlling the Vosk Speech Recognition Toolkit

Downloads: 0 This Week

Last Update: 2023-08-10
16

cerberuscms2

Cerberus Content Management System

Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-like model. It is a custom-written Web Application Framework (W.A.F.) with a consistent, custom-written Pre-Hyper-Text-Post-Processor Programming Code Framework (P.C.F.). This Web Application Software Project's aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...

1 Review

Downloads: 6 This Week

Last Update: 2024-06-10
17

Common Resource Grep - crgrep

Common Resource Grep

CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command-line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies and web resources. CRGREP will search resources nested within resources in any combination and to any depth, for example text within a document within a zip archive, and so on. Here you...

3 Reviews

Downloads: 3 This Week

Last Update: 2023-04-23
18

Meihu-FaceBeauty-Live

Beauty can be applied to live broadcasts, short videos, and selfies

Meihu beauty SDK is a mobile SDK with face recognition technology at its core. It provides professional-grade real-time beauty effects, eye enlargement and face slimming, beauty filters, dynamic stickers and other filters for building multi-functional video beauty software. The goal is to fully meet the beautification needs of customers in many audio and video software business scenarios, such as live beauty and short video beauty. The open source version is now available for iOS, and the Android open source...

Downloads: 0 This Week

Last Update: 2022-05-24
19

wav2letter++

Facebook AI Research's automatic speech recognition toolkit

... export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step. wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by spaces (or tabs).

Downloads: 0 This Week

Last Update: 2022-05-27
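
The wav2letter++ description above mentions list files with one sample per line and 4 whitespace-separated columns. The listing does not name the columns, so the following is only a minimal sketch under stated assumptions: the field names (`sample_id`, `audio_path`, `size`, `transcript`) and the helper `parse_list_line` are hypothetical, and the fourth column is assumed to be free text that may itself contain spaces.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    # All four field names are assumptions made for illustration;
    # the listing only says that a sample has 4 columns.
    sample_id: str
    audio_path: str
    size: float
    transcript: str


def parse_list_line(line: str) -> Sample:
    """Parse one line of a 4-column, space- or tab-separated list file."""
    # split(None, 3) splits on any run of whitespace (spaces or tabs)
    # and stops after 3 splits, so the last column keeps its inner spaces.
    parts = line.strip().split(None, 3)
    if len(parts) != 4:
        raise ValueError(f"expected 4 columns, got {len(parts)}: {line!r}")
    sample_id, audio_path, size, transcript = parts
    return Sample(sample_id, audio_path, float(size), transcript)


def parse_list_text(text: str) -> list[Sample]:
    """Parse a whole list file's contents, skipping blank lines."""
    return [parse_list_line(ln) for ln in text.splitlines() if ln.strip()]
```

Each dataset split (test/valid/train) would be read from its own file and passed through `parse_list_text`; a malformed line raises `ValueError` rather than being silently dropped.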
20

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments...

Downloads: 0 This Week

Last Update: 2021-05-24
21

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 0 This Week

Last Update: 2020-04-22
22

JNIZ music notation audio to midi

music composition and notation software, audio to midi converter

Jniz is a piece of software designed for musicians as a support tool for musical composition. It allows you to build and harmonize several voices according to the rules of classical harmony. Sound/audio-to-MIDI converter: real-time conversion of any monophonic sound (voice, instrument, etc.) into notes/tones. Jniz is a free proprietary piece of software. You do not have the right to sell or distribute Jniz or use its sources, under penalty of law. You will infringe on the Jniz staff...

2 Reviews

Downloads: 6 This Week

Last Update: 2020-05-02
23

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki...

2 Reviews

Downloads: 1 This Week

Last Update: 2020-03-15
24

chatbot_chung

chatbot chung is a simple entertainment chatbot based on a keyword-probability algorithm, with 3D talking OpenGL avatars, written in FreeBASIC. It can import AIML simple question/answer, question/random/answers, or single star/multi srai data saved from the "AIML_chung" open source application. An online HTML5/JavaScript version with multilingual auto-detection for 44 languages is available on the website (source included in the zip file). A SORT gentext text generation algorithm option has been added (desktop version).

Downloads: 0 This Week

Last Update: 2020-06-27
25

CMU Sphinx

Speech Recognition Toolkit

Thank you for visiting! Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. CMUSphinx is a speaker-independent large-vocabulary continuous speech recognizer released under a BSD-style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

58 Reviews

Downloads: 762 This Week

Last Update: 2024-01-11