Recognition and resolution of numbers, units, date/time, etc.
Implementation of Make-A-Video, a new SOTA text-to-video generator
A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
Multi-Agent daTa geneRation Infra and eXperimentation framework
Diversity-driven optimization and large-model reasoning ability
MetricFlow allows you to define, build, and maintain metrics in code
Open-source framework for conversational voice AI agents
Django-friendly finite state machine support
A library for deep learning end-to-end dialog systems and chatbots
Visual Studio extension for assembly syntax highlighting
Efficiently diff rows across two different databases
Usage-based pricing and billing for developers
Frame profiler
A blazing fast multi-language serialization framework
A cross-platform, portable, linkable Git implementation library
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Controllable & emotion-expressive zero-shot TTS
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
One-click deployment (including offline integration package)
A single Gradio + React WebUI with extensions for ACE-Step
A TTS model capable of generating ultra-realistic dialogue
A collection of learning resources for curious software engineers