AI assistant based on large models that can actively think and plan
A lightweight, powerful framework for multi-agent workflows
AI-Researcher: Autonomous Scientific Innovation
A generative speech model for daily dialogue
AI-powered tool for efficient abstract and PDF screening
Context-aware desktop AI assistant that understands screen content
Run a full local LLM stack with one command using Docker
Sharp Monocular View Synthesis in Less Than a Second
Framework for building real-time multimodal voice AI agent apps
SDG is a specialized framework
Flowly is 100x faster than OpenClaw
Transforming Multimodal Content into Captivating Multilingual Audio
A fast TTS architecture with conditional flow matching
AI Slack bot for reading, summarizing, and chatting with content
MARS5 speech model (TTS) from CAMB.AI
PyTorch3D is FAIR's library of reusable components for deep learning
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A specialized Claude Code workspace for creating long-form
Voice Recognition to Text Tool
Generate audiobooks from e-books
A Python library for audio
Machine learning on FPGAs using HLS
Open Source Deep Research Alternative to Reason and Search
Fully Local Manus AI. No APIs, No $200 monthly bills
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)