A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Structured outputs for llms
Magnetoencephalography (MEG) and Electroencephalography EEG in Python
RGBD video generation model conditioned on camera input
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
NVR with realtime local object detection for IP cameras
Stable Diffusion web UI
Open-Sora: Democratizing Efficient Video Production for All
Portia Labs Python SDK for building agentic workflows
The official Python SDK for Model Context Protocol servers and clients
A deep learning toolkit for Text-to-Speech, battle-tested in research
Ready-to-use OCR with 80+ supported languages
A Powerful Native Multimodal Model for Image Generation
Qwen3 is the large language model series developed by Qwen team
TensorFlow is an open source library for machine learning
Qwen3-Coder is the code version of Qwen3
An experimental version of DeepSeek model
Provides convenient access to the Anthropic REST API from any Python 3
Comprehensive Gradio WebUI for audio processing
Awesome multilingual OCR toolkits based on PaddlePaddle
Generating Immersive, Explorable, and Interactive 3D Worlds
A community-supported supercharged version of paperless
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Image polygonal annotation with Python