artificial intelligence python free download

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 21 This Week

Last Update: 8 hours ago

See Project

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples....

Downloads: 525 This Week

Last Update: 2025-10-15

See Project

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish. More to come. Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API,...

Downloads: 66 This Week

Last Update: 2024-04-22

See Project

OnnxStream

Lightweight inference library for ONNX files, written in C++

The challenge is to run Stable Diffusion 1.5, which includes a large transformer model with almost 1 billion parameters, on a Raspberry Pi Zero 2, which is a microcomputer with 512MB of RAM, without adding more swap space and without offloading intermediate results on disk. The recommended minimum RAM/VRAM for Stable Diffusion 1.5 is typically 8GB. Generally, major machine learning frameworks and libraries are focused on minimizing inference latency and/or maximizing throughput, all of which...

Downloads: 4 This Week

Last Update: 2024-08-14

See Project

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...

Downloads: 16 This Week

Last Update: 2021-04-08

See Project

raspicam

C++ library for controlling Raspberry Pi Camera (with/without OpenCV)

This library allows to use the Raspberry Pi Camera. Main features: - Provides class RaspiCam for easy and full control of the camera - Provides class RaspiCam_Cv for easy control of the camera with OpenCV. - Easy compilation/installation using cmake. - No need to install development file of userland. Implementation is hidden. - Many examples

4 Reviews

Downloads: 4 This Week

Last Update: 2020-06-15

See Project

Search Results for "artificial intelligence python"

Showing 6 open source projects for "artificial intelligence python"

sherpa-onnx

whisper.cpp

Vosk Speech Recognition Toolkit

OnnxStream

DeepSpeech

raspicam

Search Results for "artificial intelligence python"

Showing 6 open source projects for "artificial intelligence python"

sherpa-onnx

whisper.cpp

Vosk Speech Recognition Toolkit

OnnxStream

DeepSpeech

raspicam

Related Searches

Related Categories