World's first open-source, agentic video production system
Large-language-model & vision-language-model based on Linear Attention
A sound cloning tool with a web interface, using your voice
The most powerful local music generation model
Knowledge Graph Generation from Any Text
End-to-end speech processing toolkit
Open Source Document Management System for Digital Archives
A Web UI for easy subtitle using whisper model
A theme for Sublime Text 3 by Mattia Astorino
A speech-text foundation model for real time dialogue
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A high-quality PDF to Markdown tool based on large language model
A Python toolbox for gaining geometric insights
Autoregressive Model Beats Diffusion
Free, high-quality text-to-speech API endpoint to replace OpenAI
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Enhances Tesseract OCR output using LLMs (local or API)
Fast stable diffusion on CPU and AI PC
Real-time voice interactive digital human
Multilingual sentence & image embeddings with BERT
General-purpose image editing model that delivers high-fidelity
The most accurate natural language detection library for Python
A python library that makes AMR parsing, generation and visualization
A modular graph-based Retrieval-Augmented Generation (RAG) system
Build Vision Agents quickly with any model or video provider