An open-source text-to-speech system built by inverting Whisper
Generate audiobooks from e-books
A Telegram RSS bot that cares about your reading experience
Official PyTorch Implementation
Private AI platform for agents, enterprise search and RAG pipelines
Get your documents ready for gen AI
VMZ: Model Zoo for Video Modeling
High-resolution models for human tasks
Official repository for LTX-Video
Document Image Parsing via Heterogeneous Anchor Prompting
Large Multimodal Models for Video Understanding and Editing
Multi-lingual large voice generation model, providing inference
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The data structure for multimodal data
Instill Core is a full-stack AI infrastructure tool for data
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
A large language model for live-stream sales hosts
A simple native web interface that uses ChatTTS to synthesize text
Hub of ready-to-use datasets for ML models
WhatsApp MCP server enabling AI access to chats and messaging
Open-source abilities for OpenHome agents
Generate high-definition story short videos with one click using AI
Private chat with local GPT with documents, images, video, etc.
Controllable and fast Text-to-Speech for over 7000 languages