Integrating large models into scenario-based applications
Advanced Electron-based frontend for yt-dlp
A multimodal model for brain response prediction
Qwen3-ASR is an open-source series of ASR models
Automated translation solution for visual novels
PostgreSQL extension for BM25 relevance-ranked full-text search
Audiocraft is a library for audio processing and generation
Goverlay is an easy graphical interface to configure MangoHud
95% token savings. 155x faster queries. 16 languages
AI-assisted storyboard and video generation tool
Public opinion analysis system
Docker image used to run data processing workloads
End-to-end speech processing toolkit
Open-source NLP guide with models, methods, and real use cases
Stanford NLP Python library for many human languages
Semantic search and document parsing tools for the command line
Python library for scraping and analyzing online news articles easily
General-purpose image editing model that delivers high-fidelity
Framework for building real-time multimodal voice AI agent apps
Structured data extraction and instruction calling with ML and LLMs
Using AI models to automatically provide commentary and edit videos
A simple and easy-to-use library for interacting with the Ollama API
Sora AI Video Generator by Sora.FM
Running large language models on a single GPU
Framework for building real-time voice and multimodal AI agents