kaldi-asr/kaldi is the official location of the Kaldi project
Stanford NLP Python library for many human languages
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR software, free and offline
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Asynchronous multi-platform robot framework written in Python
Multimodal embedding and reranking models built on Qwen3-VL
UI-TARS-desktop version that can operate on your local personal device
Foundational model for human-like, expressive TTS
InvokeAI is a leading creative engine for Stable Diffusion models
Access large language models from the command-line
Chinese XLNet pre-trained model
Datasets, transforms and models specific to Computer Vision
AutoML library for deep learning
Composable building blocks to build Llama Apps
Library for OCR-related tasks powered by Deep Learning
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Security Scanner for Agent Skills
Document Image Parsing via Heterogeneous Anchor Prompting”
4M: Massively Multimodal Masked Modeling
3D reconstruction software
Agent S: an open agentic framework that uses computers like a human
Deep universal probabilistic programming with Python and PyTorch
Effortless data labeling with AI support from Segment Anything