A gallery that showcases on-device ML/GenAI use cases
A Python library for audio
An open-source implementation of NotebookLM with more flexibility
Build multimodal AI applications with a cloud-native stack
A general fine-tuning kit geared toward image/video/audio diffusion
The Triton Inference Server provides an optimized cloud
An AI for Music Generation
Deep Learning API and server in C++14 with support for Caffe, PyTorch
Build cross-modal and multimodal applications on the cloud
A timeline of the latest AI models for audio generation
A collection of efficient approximate nearest neighbor search algorithms
Facebook AI Research's automatic speech recognition toolkit