Instructions on how to use the Realtime API on Microcontrollers
A Powerful Native Multimodal Model for Image Generation
Open-source multi-speaker long-form text-to-speech model
An HTML5 video player with a parser that saves traffic
C++ inference library for multiple SVC/TTS models
A Model Context Protocol server for searching and analyzing arXiv
Google AI Studio Starter Apps
Blazeface is a lightweight model that detects faces in images
A CNN model that predicts human joints from RGB images of a person
SSD-based object detection model trained on Open Images V4
Detect faces in an image
A computer vision framework to create and deploy apps in minutes
Chinese text-to-speech engine
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2
Experimental AI language for multiple agents
Natural Language Processing (NLP) for the Masses
Technologies for automating food production on various scales
The easiest C++ way to deal with constraints!