Powerful AI language model (MoE) optimized for efficiency/performance
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Open-source, high-performance AI model with advanced reasoning
Ready-to-use OCR with 80+ supported languages
Stable Diffusion web UI
A gradio web UI for running Large Language Models like LLaMA
A deep learning toolkit for Text-to-Speech, battle-tested in research
Open-Sora: Democratizing Efficient Video Production for All
Image inpainting tool powered by SOTA AI Model
NVR with realtime local object detection for IP cameras
Speech recognition module for Python
Machine learning in Python
Powerful tool that lets you create and run intelligent agents
Central interface to connect your LLM's with external data
Image polygonal annotation with Python
Simple and powerful voice changer for Linux, written with Python & GTK
Open Source Document Management System for Digital Archives
Create UIs for your machine learning model in Python in 3 minutes
A Lightweight Face Recognition and Facial Attribute Analysis
Awesome multilingual OCR toolkits based on PaddlePaddle
A high-throughput and memory-efficient inference and serving engine
A lightweight audio-to-MIDI converter with pitch bend detection
Stable Diffusion built-in to Blender
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Generate short videos with one click using AI LLM