Framework for building realtime multimodal voice AI agent apps
Open-source abilities for OpenHome agents
A Telegram RSS bot that cares about your reading experience
Large language model for live-stream sales anchors
Synchronized Translation for Videos
Controllable and fast Text-to-Speech for over 7000 languages
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Generate high-definition story short videos with one click using AI
VideoRAG: Chat with Your Videos
Foundational model for human-like, expressive TTS
Open-source Video Translation Skill
Multi-source content processor for NotebookLM
Instill Core is a full-stack AI infrastructure tool for data
Build multimodal AI applications with cloud-native stack
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Improve human sleep through scientifically
WhatsApp MCP server enabling AI access to chats and messaging
Omnilingual ASR: Open-Source Multilingual Speech Recognition
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Context data platform for building observable, self-learning AI agents
Data Lake for Deep Learning. Build, manage, and query datasets
The Triton Inference Server provides an optimized cloud
Hub of ready-to-use datasets for ML models