Document Image Parsing via Heterogeneous Anchor Prompting”
Build Vision Agents quickly with any model or video provider
Clean network diagrams, One-time setup, zero upkeep
Abstraction layer over YouTube's internal API
Framework for building realtime multimodal voice AI agents apps
Large Multimodal Models for Video Understanding and Editing
Rust binding generator, feature-rich, but seamless and simple
LLM Large Model of Selling Anchor
Nyquist is a language for sound synthesis and music composition.
Discover pretrained models for deep learning in MATLAB
A Conversational Speech Generation Model
MIDI libraries for Qt/C++
Video mixer for mixing live and recorded video and audio feeds
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
The ultimate cataloguer
Booking/reservation of meeting rooms/equipment with e-mail invitations
Chinese text-to-speech engine
A distributed workflow engine
Some functional modules developed by Qt on a daily basis or demos
Common Resource Grep
Real-time music generation using stable diffusion techniques AI
Graph-oriented live coding language and music/audio DSP library