OpenAI gpt-image-2 API
Modular AI image and video generation web UI with extensible tools
Advanced Electron-based frontend for yt-dlp
A new DSL and server for AI agents and multi-step tasks
Automated translation solution for visual novels
Examples and guides for using the Gemini API
Implementing large models into scenario-based applications
Public opinion analysis system
End-to-end speech processing toolkit
Qwen3-ASR is an open-source series of ASR models
A multimodal model for brain response prediction
PostgreSQL extension for BM25 relevance-ranked full-text search
Stanford NLP Python library for many human languages
95% token savings. 155x faster queries. 16 languages
Docker image used to run data processing workloads
Markdown parser, done right. 100% CommonMark support, extensions
Open source NLP guide with models, methods, and real use cases
Goverlay is an easy graphical interface to configure MangoHud
General-purpose image editing model that delivers high-fidelity
Audiocraft is a library for audio processing and generation
AI-assisted storyboard and video generation tool
Python library for scraping and analyzing online news articles easily
Using AI models to automatically provide commentary and edit videos
Framework for building realtime multimodal voice AI agent apps