Knowledge Graph Generation from Any Text
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
OpenRecall is a fully open-source, privacy-first alternative
A sound cloning tool with a web interface, using your voice
Extract schema, statistics and entities from datasets
Semantic search and document parsing tools for the command line
Structured data extraction and instruction calling with ML, LLM
Framework for building real-time voice and multimodal AI agents
Autonomous agents for everyone
Web-based tool converts GitHub repository contents
Build AI-powered semantic search applications
Improve your resumes with Resume Matcher
Fast multimodal LLM for real-time voice interaction and AI apps
Diffusion Transformer with Fine-Grained Chinese Understanding
Large-language-model & vision-language-model based on Linear Attention
Build Vision Agents quickly with any model or video provider
Visual Causal Flow
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
AI tool that turns Hacker News posts into daily podcast updates
AutoGluon: AutoML for Image, Text, and Tabular Data
OCR expert VLM powered by Hunyuan's native multimodal architecture
Weaviate is a cloud-native, modular, real-time vector search engine
Edit videos with Claude Code