Flet enables developers to easily build real-time web and mobile apps
Data Infrastructure providing an approach to multimodal AI workloads
Google Gen AI Python SDK provides an interface for developers
ComfyUI wrapper nodes for WanVideo and related models
High-Resolution Image Synthesis with Latent Diffusion Models
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
An opinionated CLI to transcribe audio files with Whisper on-device
Qwen3-ASR is an open-source series of ASR models
LLM abstractions that aren't obstructions
Open source libraries and APIs to build custom preprocessing pipelines
ImageBind: One Embedding Space to Bind Them All
A speech-text foundation model for real-time dialogue
The Markdown Editor for Linux
A Python utility / library to sort imports
Conversational voice AI agents
Open source terminal session recorder
Flexible Photo Recrafting While Preserving Your Identity
Multi-tool for semantic search
Flowly is 100x faster than OpenClaw
Check code for common misspellings
Bailing is a voice dialogue robot similar to GPT-4o
Towards Human-Sounding Speech
Multimodal AI chat app with dynamic conversation routing
Parse files for optimal RAG
MARS5 speech model (TTS) from CAMB.AI