Text and image to video generation: CogVideoX and CogVideo
A generative speech model for daily dialogue
Automatic Speech Recognition with Word-level Timestamps
A Family of Open Sourced Music Foundation Models
Edit PDF files with Nano Banana
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Official inference repo for FLUX.2 models
A robust, efficient, low-latency speech-to-text library
FastAPI framework, high performance, easy to learn, fast to code
A simple tool for reading in poorly redacted documents
Open source annotation tool for machine learning practitioners
Python library and CLI tool to interface with Google Translate
Converts text to speech in realtime
Label Studio is a multi-type data labeling and annotation tool
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Open source no-code system for text annotation and building of text
Official MiniMax Model Context Protocol (MCP) server
Cut videos with a text editor
A simple, high-quality voice conversion tool focused on ease of use
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Offline Text To Speech synthesis for python
A free & open-source 2D sprite editor, made with the Godot Engine
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Video-based AI memory library. Store millions of text chunks in MP4
AI-powered tool for generating, optimizing, and translating subtitles