An opinionated CLI to transcribe Audio files w/ Whisper on-device
Easy-to-use and high-performance NLP and LLM framework
Deep Research framework, combining language models with tools
Modular AI image and video generation web UI with extensible tools
FastFlix is a free GUI for H.264, HEVC and AV1 hardware and software
Your CrewAI Powered Video Editing Assistant
Automated translation solution for visual novels
A new DSL and server for AI agents and multi-step tasks
Public opinion analysis system
Qwen3-ASR is an open-source series of ASR models
95% token savings. 155x faster queries. 16 languages
Docker image used to run data processing workloads
End-to-end speech processing toolkit
Audiocraft is a library for audio processing and generation
Open source NLP guide with models, methods, and real use cases
General-purpose image editing model that delivers high-fidelity
Stanford NLP Python library for many human languages
Python library for scraping and analyzing online news articles easily
Structured data extraction and instruction calling with ML, LLM
Using AI models to automatically provide commentary and edit videos
Check code for common misspellings
Framework for building realtime multimodal voice AI agents apps
Running large language models on a single GPU
Framework for building real-time voice and multimodal AI agents
Open Source Speech Language Model