Multimodal Diffusion with Representation Alignment
A simple app to get songs from YouTube in mp3 format with artist name
Miso TTS is an 8 billion, highly emotive text-to-speech model
The most powerful local music generation model
Toloka-Kit is a Python library for working with Toloka API
Multi-lingual large voice generation model, providing inference
Official inference repo for FLUX.1 models
A ranked list of awesome python developer tools and libraries
The awesome document factory
Reverse engineering Gemini's SynthID detection
Multi-agent autonomous startup system for Claude Code
Streaming Real-time Audio-Driven Avatar Generation
Dataset Management Framework, a Python library and a CLI tool to build
Collaborative & Open-Source Quality Assurance for all AI models
The interactive graphing library for Python
Automatically Visualize any dataset, any size
HY-Motion model for 3D character animation generation
Node repackaging(wrapping) of the LLVM Clang's clang-format
A fun, new monospaced font that includes programming ligatures
Animation engine for explanatory math videos
A lightning fast audio upsampler
PPTAgent: Generating and Evaluating Presentations
Agentic, Reasoning, and Coding (ARC) foundation models
Open image model at the forefront of design
Audio Normalization for Python/ffmpeg