AgentHandover observes, learns and teaches agents with skills
A sound cloning tool with a web interface, using your voice
NVR with realtime local object detection for IP cameras
An open-source, modern-design AI training tracking and visualization
A nearly-live implementation of OpenAI's Whisper
LLM-based agent for general purpose software engineering tasks
Speech recognition module for Python
Fast backend for long-term AI user memory via structured profiles
LLM-based Reinforcement Learning audio edit model
Open-source MCP server that gives your coding agent
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Separate audio recordings into individual sources
General Speech Restoration
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
fastNLP: A Modularized and Extensible NLP Framework