Transforming Multimodal Content into Captivating Multilingual Audio
Flowly is 100x faster than OpenClaw
PyTorch3D is FAIR's library of reusable components for deep learning
AI Slack bot for reading, summarizing, and chatting with content
MARS5 speech model (TTS) from CAMB.AI
Management of Yandex Station and other smart home devices
SoTA open-source TTS
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A Python library for audio
Voice Recognition to Text Tool
Machine learning on FPGAs using HLS
Open Source Deep Research Alternative to Reason and Search
Generate audiobooks from e-books
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Generate high-definition story short videos with one click using AI
DeepCode: Open Agentic Coding
Package manager and build abstraction tool for FPGA/ASIC development
A Web UI for easy subtitle using whisper model
Synchronized Translation for Videos
Multilingual speech recognition and audio understanding model
An open source python library for automated feature engineering
Framework for building AI-powered interactive digital humans and agent
High-Fidelity and Controllable Generation of Textured 3D Assets