Build multimodal AI applications with cloud-native stack
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Improve human sleep through scientifically
Various plugins for Logitech Media Server
Action-adventure game originally released in 2013
Software that uses AI to perform real-time voice conversion
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
An AI for Music Generation
TorchMultimodal is a PyTorch library
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A set of Rust crates for interacting with the Matrix chat network
Go bindings for raylib, a simple and easy-to-use library
This is a simple demonstration of more advanced, agentic patterns
Data Lake for Deep Learning. Build, manage, and query datasets
(Golang) Go bindings for Discord
Telegram Bot API for NodeJS
Audio codecs extracted from Android Open Source Project
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Build cross-modal and multimodal applications on the cloud
An advanced drum machine with pattern-based programming
AI Multi-Agent Framework in .NET
Discover pretrained models for deep learning in MATLAB
Toolkit for audio, music, and speech generation