AI assistant based on large models that can actively think and plan
A lightweight, powerful framework for multi-agent workflows
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
Build and run agents you can see, understand and trust
Transforming Multimodal Content into Captivating Multilingual Audio
Framework for building real-time multimodal voice AI agent apps
Flowly is 100x faster than OpenClaw
AI Slack bot for reading, summarizing, and chatting with content
MARS5 speech model (TTS) from CAMB.AI
PyTorch3D is FAIR's library of reusable components for deep learning
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Run a full local LLM stack with one command using Docker
Generate audiobooks from e-books
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A specialized Claude Code workspace for creating long-form
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Framework for building AI-powered interactive digital humans and agents
Machine learning on FPGAs using HLS
SDG is a specialized framework
Open Source Deep Research Alternative to Reason and Search
Voice Recognition to Text Tool
Fully Local Manus AI. No APIs, No $200 monthly bills
Management of Yandex Station and other smart home devices
Multilingual speech recognition and audio understanding model
Package manager and build abstraction tool for FPGA/ASIC development