From Vibe Coding to Agentic Engineering
Renderer for the harmony response format to be used with gpt-oss
kaldi-asr/kaldi is the official location of the Kaldi project
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
UI-TARS-desktop version that can operate on your local personal device
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Accurate × Fast × Comprehensive
4M: Massively Multimodal Masked Modeling
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Stanford NLP Python library for many human languages
InvokeAI is a leading creative engine for Stable Diffusion models
Multimodal embedding and reranking models built on Qwen3-VL
Open source template for AI-powered code generation apps w/ sandboxes
Foundational model for human-like, expressive TTS
Asynchronous multi-platform robot framework written in Python
A generic, simple and fast implementation of Deepmind's AlphaZero
Chinese XLNet pre-trained model
Access large language models from the command-line
Sharp Monocular Metric Depth in Less Than a Second
Library for OCR-related tasks powered by Deep Learning
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Security Scanner for Agent Skills
Document Image Parsing via Heterogeneous Anchor Prompting”
AutoML library for deep learning
Composable building blocks to build Llama Apps