FastAPI framework, high performance, easy to learn, fast to code
clangd language server
Qwen3 is the large language model series developed by Qwen team
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Label Studio is a multi-type data labeling and annotation tool
Comprehensive Gradio WebUI for audio processing
Azure command-line interface
Ready-to-use OCR with 80+ supported languages
Open source personal AI Assistant for Linux, Windows and Mac
Speech recognition module for Python
InvokeAI is a leading creative engine for Stable Diffusion models
Open-Sora: Democratizing Efficient Video Production for All
Models for the spaCy Natural Language Processing (NLP) library
Universal Radio Hacker: Investigate Wireless Protocols Like A Boss
Open Source Document Management System for Digital Archives
Qwen-Image is a powerful image generation foundation model
A deep learning toolkit for Text-to-Speech, battle-tested in research
Generating Immersive, Explorable, and Interactive 3D Worlds
Awesome multilingual OCR toolkits based on PaddlePaddle
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Ark pixel font - Open source Pan-CJK pixel font
Library for OCR-related tasks powered by Deep Learning
Web interface for generating images using Stable Diffusion models
Open source machine learning framework to automate text conversations
Open-Source Python3 tool for recognizing layouts, tables, and math