A general fine-tuning kit geared toward image/video/audio diffusion
A cycle-accurate Nintendo Game Boy Advance emulator
Qwen3-TTS is an open-source series of TTS models
The official Allegro 5 Git repository. Pull requests welcome
AI tool converting video/audio into structured documents instantly
Code and models for the ICML 2024 paper NExT-GPT
GenAI Processors is a lightweight Python library
Python inference and LoRA trainer package for the LTX-2 audio–video
Spring AI Alibaba examples for building and testing AI apps
Python library and CLI tool to interface with Google Translate
High-Quality Voice Cloning TTS for 600+ Languages
Cross-platform, customizable ML solutions
A high-quality rapid TTS voice cloning model
A blazing fast AI Gateway with integrated guardrails
Network-transparent, client/server audio transport system
Online video editor built with Next.js, Remotion, and FFmpeg
High-resolution models for human tasks
Automated YouTube Shorts pipeline
Phaser is a free and fast 2D game framework for making HTML5 games
The official Node.js / TypeScript library for the Groq API
A suite of advanced multi-modal LLMs
Industrial-level controllable zero-shot text-to-speech system
Instantly generate AI-powered subtitles on your device
A lightweight text-to-speech model with zero-shot voice cloning
The .NET library to build AI agents with 30+ built-in connectors