Clone a voice in 5 seconds to generate arbitrary speech in real-time
Real time face swap and one-click video deepfake
Framework for building real-time voice and multimodal AI agents
A high-quality rapid TTS voice cloning model
RF-DETR is a real-time object detection and segmentation
Fast multimodal LLM for real-time voice interaction and AI apps
State-of-the-art TTS model under 25MB
Python & JS/TS SDK for running AI-generated code/code
Connect any LLM to your internal knowledge sources
Develop software autonomously
AI tool for real-time monitoring and analysis of Goofish listings
Python framework for building scalable multi-agent systems
A nearly-live implementation of OpenAI's Whisper
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Software that uses AI to perform real-time voice conversion
Talk to Your AI Agents from Anywhere
The behavior guidance framework for customer-facing LLM agents
Large Audio Language Model built for natural interactions
TTS with kokoro and onnx runtime
Python inference and LoRA trainer package for the LTX-2 audio–video
Execute SQL queries and manage databases seamlessly with Timeplus
Capable of understanding text, audio, vision, video
Synthetic data generators for tabular and time-series data
Ready-to-run cloud templates for RAG
Advancing Open-source World Models